Post-training neural architecture optimization for small language models

Artificial intelligence & Data intelligence; Computer science and software; Engineering sciences; Technological challenges

Abstract

Generative AI, and large language models (LLMs) in particular, has sparked a new revolution in AI, with applications across all domains. However, LLMs are highly resource-intensive and therefore difficult to deploy on autonomous embedded systems. LLMs can be optimized by modifying their architecture, replacing heavy Transformer layers with lighter alternatives. Given the difficulty of training LLMs from scratch, this thesis aims to develop post-training neural architecture optimization methods applicable to small LLMs (SLMs). The thesis also seeks to propose metrics that measure the performance of the individual layers of an SLM and of their candidate alternatives, in order to guide the replacement, and thereby to build a comprehensive methodology for optimizing SLMs under hardware constraints. The results will be disseminated through publications in major AI conferences and journals, and the resulting code and methods could be integrated into tools developed at CEA.

Laboratory

Département Systèmes et Circuits Intégrés Numériques (LIST)

DSCIN

Laboratoire Intelligence Intégrée Multi-capteurs

Practical information

Pre-requisite:

Master's degree (M2) or engineering degree in computer science, AI, or embedded systems

University - graduate school:

Starting date:

01-10-2026

Place:

Grenoble

Contact Person

Manon DAMPFHOFFER

CEA

DRT/DSCIN/DSCIN/LIIM

Tel: 0438789747

Email: manon.dampfhoffer@cea.fr