



In many complex environments, such as industrial sites, disaster-stricken buildings, or public spaces, it is necessary to automatically detect and localize sound events (falls, alarms, voices, mechanical failures). Mobile platforms equipped with cameras and microphones are a promising solution, but a single platform remains limited: its microphone array provides an approximate direction towards the source but not a precise position in space, and its camera may be obstructed. This thesis proposes to study how a network of mobile platforms, each carrying a calibrated audio-visual unit, can collaborate to localize and classify such events in 3D. Each platform analyses its own audio-visual observations and shares an estimate of the source direction with its neighbours; the network then combines these estimates to reconstruct the position of the event and to identify it. The expected outcome is a cooperative localization system that is robust to occlusions and to partial platform failures.
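
To make the fusion step concrete, the sketch below shows one standard way the shared direction estimates could be combined: a least-squares triangulation that finds the 3D point closest to the bearing rays reported by the platforms. The numpy-based formulation, the function name, and the example geometry are illustrative assumptions, not the method developed in the thesis.

```python
import numpy as np

def triangulate_event(positions, directions):
    """Least-squares 3D point closest to a set of bearing lines.

    positions  : (N, 3) array of platform positions p_i
    directions : (N, 3) array of direction-of-arrival vectors d_i
    Returns the point x minimizing sum_i ||(I - d_i d_i^T)(x - p_i)||^2,
    i.e. the sum of squared distances from x to each platform's ray.
    """
    A = np.zeros((3, 3))
    b = np.zeros(3)
    for p, d in zip(positions, directions):
        d = d / np.linalg.norm(d)           # guard against non-unit bearings
        P = np.eye(3) - np.outer(d, d)      # projector orthogonal to the ray
        A += P
        b += P @ p
    # Singular A (e.g. all rays parallel) means the geometry cannot
    # resolve a unique point; np.linalg.solve then raises LinAlgError.
    return np.linalg.solve(A, b)

# Hypothetical example: three platforms observing a source at (2, 3, 1).
src = np.array([2.0, 3.0, 1.0])
pos = np.array([[0.0, 0.0, 0.0],
                [5.0, 0.0, 0.0],
                [0.0, 6.0, 2.0]])
dirs = src - pos                            # noiseless bearings for the demo
print(triangulate_event(pos, dirs))         # ~ [2. 3. 1.]
```

With exact bearings the recovered point coincides with the source; with noisy or missing estimates the same linear system still yields a best-fit position from whatever directions remain, which is the property that would make such a fusion step tolerant of occlusions and partial platform failures.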

