Internalisation of external knowledge by foundation models

Artificial intelligence & Data intelligence Computer science and software Engineering sciences Technological challenges 

Abstract

To perform an unknown task, a subject (human or robot) has to consult external information, which involves a cognitive cost. After several similar experiments, it masters the situation and can act automatically. The 1980s and 1990s saw explorations in AI using conceptual graphs and schemas, but their large-scale implementation was limited by the technology available at the time.

Today's neural models, including transformers and LLM/VLMs, learn universal representations through pre-training on huge amounts of data. They can be used with prompts to provide local context. Fine-tuning allows these models to be specialised for specific tasks.

RAG and GraphRAG methods can be used to exploit external knowledge, but their use for inference is resource-intensive. This thesis proposes a cognitivist approach in which the system undergoes continuous learning. It consults external sources during inference and uses this information to refine itself regularly, as it does during sleep. This method aims to improve performance and reduce resource consumption.

In humans, these processes are linked to the spatial organisation of the brain. The thesis will also study network architectures inspired by this organisation, with dedicated but interconnected “zones”, such as the vision-language and language models.

These concepts can be applied to the Astir and Ridder projects, which aim to exploit foundation models for software engineering in robotics and the development of generative AI methods for the safe control of robots.

Laboratory

Département Intelligence Ambiante et Systèmes Interactifs (LIST)

Service Intelligence Artificielle pour le Langage et la Vision

Laboratoire Analyse Sémantique Textes et Images

Caen

Back

Share this thesis topic

Practicle information

Pre-requisite:

Master 2 Intelligence artificielle ou Diplôme d'Ingénieur à forte composante IA.

University - graduate school:

Caen

Starting date:

01-10-2025

Place:

Saclay

Contact Person

Gaël

DE CHALENDAR

CEA

DRT/DIASI//LASTI

Tel : 01.69.08.01.50

Email : Gael.de-Chalendar@cea.fr

Thesis supervisor

Romain

HERAULT

Université de Caen Normandie

Département de Mathématiques/Informatique (Laboratoire GREYC - Équipe IMAGE)