New Reliable Strategies for Optimizing Predictive Thermodynamics Models
Predictive thermodynamic models, developed by the Calphad method, are essential for designing new materials by anticipating their behavior without resorting to costly and time-consuming experiments. These models allow for the extrapolation of the properties of complex materials, predicting their behavior in extreme environments, and linking energy properties to in-service performance. However, current methods for developing these models are complex, and uncertainties are not quantified in existing software. Scientists still rely on their expertise to adjust and validate these models, which is time-consuming and poorly suited to the era of automation.
To address this, it is proposed to develop a reliable, autonomous, and fast digital tool capable of optimizing thermodynamic models based solely on experimental data provided by users. The goal is to provide simple, reliable, validated, and modular models, enabling users to make strategic decisions with confidence, such as evaluating new process conditions or optimizing manufacturing without risking uncertain extrapolations. This project aims to bridge the gap between specific experimental data and modern nonlinear programming methods, using advanced optimization approaches.
Scalable Network Digital Twins through Adaptive Fidelity Management
Future communication systems such as 6G networks are evolving toward highly distributed, autonomous, and heterogeneous infrastructures integrating cloud-edge continuum architectures, Open RAN (O-RAN), massive IoT deployments, edge computing, and highly dynamic wireless environments.
These systems are expected to support demanding services such as mission-critical communications, industrial automation, autonomous mobility, and immersive applications, operating under highly dynamic traffic conditions, frequent topology changes, fluctuating resource availability, and stringent latency and reliability requirements.
Managing such systems through risk-free configuration, optimization, and evolution operations is becoming increasingly challenging. This is particularly true when performing real-time network optimization, operational what-if analysis, network troubleshooting, or planning network upgrades and extensions.
To address these challenges, recent research initiatives have investigated the application of the Digital Twin paradigm to communication networks, commonly referred to as Network Digital Twins (NDTs).
An NDT is a virtual representation of a communication network that remains sufficiently aligned with the physical infrastructure to reproduce its operational state and behavior, support predictive analysis, and evaluate hypothetical scenarios before applying decisions to the real system.
However, maintaining an accurate and temporally consistent NDT in large-scale and highly dynamic networks remains a major challenge.
Current NDTs predominantly rely on explicit synchronization mechanisms to maintain fidelity between the physical and virtual systems. Although recent works have introduced AI-assisted prediction mechanisms to reduce synchronization overhead, these approaches do not fully address the problem of dynamically adapting the fidelity of the NDT according to predictive uncertainty, information value, network dynamics, and operational requirements. Adaptive fidelity can be interpreted as a multi-resolution representation mechanism, where the NDT dynamically adjusts its observation granularity, synchronization overhead, and reconstruction accuracy according to information value, predictive uncertainty, network dynamics, and available resources. The main objective of this PhD thesis is to design, develop, and validate an Adaptive Fidelity Management framework enabling scalable and resource-efficient Network Digital Twins for future communication systems.
Systemic validation of fuzzy rule bases: accounting for data availability and the specific characteristics of fuzzy inference
This PhD topic lies within the field of symbolic artificial intelligence. Unlike approaches based on neural networks, these methods rely on explicit rules, often provided by experts or learned from limited data, making them interpretable but potentially imperfect.
The central problem is therefore the validation of fuzzy rule bases: the goal is to ensure that the rules produce consistent, useful, and reliable results. Existing methods use global metrics (overall system performance) and local metrics (the quality of each rule), but they do not sufficiently account for certain important specificities. For example, interactions between rules can strongly influence the final behavior.
The thesis proposes to develop a comprehensive and systematic approach to validate these rule bases, whether data is available or not. In particular, it aims to design new metrics capable of capturing these interactions, drawing inspiration, for example, from graph-based approaches (such as FinGrams or reputation systems).
The work will include the definition of a methodological framework, the proposal of new validation measures, as well as their implementation and experimental evaluation.
The expected outcomes are more precise tools for detecting problematic rules, and an overall improvement in the performance and reliability of fuzzy inference systems.
Post-training neural architecture optimization for small language models
Generative AI, and particularly language models (LLM), have sparked a new revolution in AI with applications across all domains. However, LLMs are highly resource-intensive and, hence, difficult to implement on autonomous embedded systems. LLMs can be optimized by modifying their architecture to replace heavy Transformer layers with lighter alternatives. Given the difficulty of training LLM "from scratch," this thesis aims to develop post-training neural architecture optimization methods applicable to small LLM (SLM). Additionally, the thesis seeks to propose performance metrics of different layers of an SLM and their alternatives, to guide the replacement, and thus propose a comprehensive methodology for optimizing SLMs while considering hardware constraints. The work will be valorized through publications in major AI conferences and journals, and the developed codes and methods could be integrated into the tools developed at CEA.
Reconciling predictability and performance in processor architectures for critical systems
Critical systems have both functional and timing requirements, the latter ensuring that deadlines are always met during operation; failure to do so may lead to catastrophic consequences. The critical nature of such systems demands specialized hardware and software solutions. This PhD thesis topic focuses on the development of computer architecture designs for critical systems, known as predictable architectures, capable of providing the necessary timing guarantees. Several such architectures exist, typically based on in-order pipelines and incorporating behavioral restrictions (e.g., disabling complex speculation mechanisms) or structural specializations (e.g., redesigned caches or deterministic arbitration for shared resources). These restrictions and specializations inevitably impact performance, and the design of predictable architectures must therefore address the predictability–performance tradeoff directly. This PhD thesis aims to explore this tradeoff in a novel way, by adapting a high-performance variant of an in-order processor (CVA6) and developing top-down techniques to make it predictable. Performance in such processors is usually achieved through mechanisms like branch prediction, prefetching, and value prediction, implemented via specialized storage elements (e.g., buffers) and supported by control mechanisms such as rollback on misprediction. Within this context, the goal of the thesis is to define a general predictability scheme for speculative execution, covering both storage organization and rollback behavior.
Learning Mechanisms for Detecting Abnormal Behaviors in Embedded Systems
Embedded systems are increasingly used in critical infrastructures (e.g., energy production networks) and are therefore prime targets for malicious actors. The use of intrusion detection systems (IDS) that dynamically analyze the system's state is becoming necessary to detect an attack before its impacts become harmful.
The IDS that interest us are based on machine learning anomaly detection methods and allow learning the normal behavior of a system and raising an alert at the slightest deviation. However, the learning of normal behavior by the model is done only once beforehand on a static dataset, even though the embedded systems considered can evolve over time with updates affecting their nominal behavior or the addition of new behaviors deemed legitimate.
The subject of this thesis therefore focuses on studying re-learning mechanisms for anomaly detection models to update the model's knowledge of normal behavior without losing information about its prior knowledge. Other learning paradigms, such as reinforcement learning or federated learning, may also be studied to improve the performance of IDS and enable learning from the behavior of multiple systems.
Investigation of polytopal methods apllied to CFD and optimized on GPU architecture
This research proposal focuses on the study and implementation of polytopal methods for solving the equations of fluid mechanics. These methods aim to handle the most general meshes possible, overcoming geometric constraints or those inherited from CAD operations such as extrusions or assemblies that introduce non-conformities. This work also falls within the scope of high-performance computing, addressing the increase in computational resources and, in particular, the development of massively parallel computing on GPUs.
The objective of this thesis is to build upon existing polytopal methods already implemented in the TRUST software, specifically the Compatible Discrete Operator (CDO) and Discontinuous Galerkin (DG) methods. The study will be extended to include convection operators and will investigate other methods from the literature, such as Hybrid High Order (HHO), Hybridizable Discontinuous Galerkin (HDG), and Virtual Element Method (VEM).
The main goals are to evaluate:
1. The numerical behavior of these different methods on the Stokes/Navier-Stokes equations;
2. The adaptability of these methods to heterogeneous architectures such as GPUs.
Implementation of TFHE on RISC-V based embedded systems
Fully Homomorphic Encryption (FHE) is a technology that allows computations to be performed directly on encrypted data, meaning that we can process information without ever knowing its actual content. For example, it could enable online searches where the server never sees what you are looking for, or AI inference tasks on private data that remain fully confidential. Despite its potential, current FHE implementations remain computationally intensive and require substantial processing power, typically relying on high-end CPUs or GPUs with significant energy consumption. In particular, the bootstrapping operation represents a major performance bottleneck that prevents large-scale adoption. Existing CPU-based FHE implementations can take over 20 seconds on standard x86 architectures, while custom ASIC solutions, although faster, are prohibitively expensive, often exceeding 150 mm² in silicon area. This PhD project aims to accelerate the TFHE scheme, a more lightweight and efficient variant of FHE. The objective is to design and prototype innovative implementations of TFHE on RISC-V–based systems, targeting a significant reduction in bootstrapping latency. The research will explore synergies between hardware acceleration techniques developed for post-quantum cryptography and those applicable to TFHE, as well as tightly coupled acceleration approaches between RISC-V cores and dedicated accelerators. Finally, the project will investigate the potential for integrating a fully homomorphic computation domain directly within the processor’s instruction set architecture (ISA).
Exploration and optimization of RAID architectures and virtualization technologies for high-performance data servers
Given the ever-increasing demands of numerical simulation, supercomputers
must constantly evolve to improve their performance and thus maintain a
high quality of service for users. These demands are reflected on storage
systems, which, to be performant, reliable, and capacitive, must contain
cutting-edge technologies concerning the optimization of data placement
and the scheduling of I/O accesses. The objective of this thesis is to
study these technologies such as GPU-based RAID and I/O virtualization,
to evaluate them, and to establish optimizations that can improve the
performance of HPC storage systems.
High-Fidelity Monte Carlo Simulations of Neutron Noise in Nuclear Power Reactors
Operating nuclear reactors are subject to a variety of perturbations. These can include vibrations of the fuel pins and fuel assemblies due to fluid-structure interactions with the moderator, or even vibrations of the core barrel, baffle, and pressure vessel. All of these perturbations can lead to small periodic fluctuations in the reactor power about the stable average power level. These power fluctuations are referred to as “neutron noise”. Being able to simulate different types of in-core perturbations allows reactor designers and operators to predict how the neutron flux could behave in the presence of such perturbations. In recent years, many different research groups have worked to develop computational models to simulate these sources of neutron noise, and their resulting effects on the neutron flux in the reactor. The primary objective of this PhD thesis will be to bring Monte Carlo neutron noise simulations to the scale of real-world industry calculations of nuclear reactor cores, with a high-fidelity continuous-energy physics representation. As part of this process, the student will add novel neutron noise simulation capabilities to TRIPOLI-5, the next-generation production Monte Carlo particle-transport code jointly developed by CEA and ASNR, with the support of EDF.