



Scene understanding is a major challenge in computer vision, where recent approaches are dominated by transformer-based models (ViT, LLMs, MLLMs) that achieve high performance at significant computational cost. This thesis proposes an alternative that combines lightweight convolutional neural networks (lightweight CNNs) with causal graph neural networks (causal GNNs) for efficient spatio-temporal scene analysis under constrained computational budgets. The lightweight CNNs extract visual features efficiently, while the causal GNNs model dynamic relationships between objects in a scene graph, addressing object detection and relationship prediction in complex environments. Unlike current transformer-based models, this approach aims to reduce computational complexity while maintaining competitive accuracy, with potential applications in embedded vision and real-time systems.
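
The pipeline described above can be illustrated with a minimal sketch, assuming a PyTorch implementation: a small depthwise-separable CNN produces per-object feature vectors, and a simple message-passing layer refines them over a scene graph whose adjacency matrix encodes pairwise relations. All module names, dimensions, and the hand-written adjacency are hypothetical placeholders, and the causal component of the GNN is not represented here; this is not the thesis's actual architecture.

```python
import torch
import torch.nn as nn


class DepthwiseSeparableConv(nn.Module):
    """Depthwise + pointwise convolution, the building block of many lightweight CNNs."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, 3, stride, 1, groups=in_ch, bias=False)
        self.pointwise = nn.Conv2d(in_ch, out_ch, 1, bias=False)
        self.bn = nn.BatchNorm2d(out_ch)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.act(self.bn(self.pointwise(self.depthwise(x))))


class LightweightBackbone(nn.Module):
    """Small CNN mapping an object crop to a feature vector (illustrative placeholder)."""
    def __init__(self, feat_dim=128):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            DepthwiseSeparableConv(32, 64, stride=2),
            DepthwiseSeparableConv(64, feat_dim, stride=2),
            nn.AdaptiveAvgPool2d(1),
        )

    def forward(self, crops):                       # crops: (N, 3, H, W)
        return self.features(crops).flatten(1)      # (N, feat_dim)


class SceneGraphLayer(nn.Module):
    """One round of message passing over the scene graph's adjacency matrix."""
    def __init__(self, feat_dim=128):
        super().__init__()
        self.msg = nn.Linear(feat_dim, feat_dim)
        self.update = nn.Linear(2 * feat_dim, feat_dim)

    def forward(self, node_feats, adj):             # adj: (N, N) relation mask
        messages = adj @ self.msg(node_feats)       # aggregate neighbour messages
        fused = torch.cat([node_feats, messages], dim=-1)
        return torch.relu(self.update(fused))


if __name__ == "__main__":
    # Five detected objects, each a 64x64 crop, with a hand-written chain-like adjacency.
    crops = torch.randn(5, 3, 64, 64)
    adj = torch.eye(5) + torch.diag(torch.ones(4), 1)
    feats = LightweightBackbone()(crops)            # per-object CNN features
    refined = SceneGraphLayer()(feats, adj)         # relation-aware node features
    print(refined.shape)                            # torch.Size([5, 128])
```

In a full system, the adjacency would come from a relationship-prediction head rather than being fixed by hand, and several graph layers could be stacked to propagate information across the whole scene.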

