Systemic validation of fuzzy rule bases: accounting for data availability and the specific characteristics of fuzzy inference

This PhD topic lies within the field of symbolic artificial intelligence. Unlike approaches based on neural networks, these methods rely on explicit rules, often provided by experts or learned from limited data, making them interpretable but potentially imperfect.

The central problem is therefore the validation of fuzzy rule bases: the goal is to ensure that the rules produce consistent, useful, and reliable results. Existing methods use global metrics (overall system performance) and local metrics (the quality of each rule), but they do not sufficiently account for certain important specificities. For example, interactions between rules can strongly influence the final behavior.

The thesis proposes to develop a comprehensive and systematic approach to validate these rule bases, whether data is available or not. In particular, it aims to design new metrics capable of capturing these interactions, drawing inspiration, for example, from graph-based approaches (such as FinGrams or reputation systems).

The work will include the definition of a methodological framework, the proposal of new validation measures, as well as their implementation and experimental evaluation.

The expected outcomes are more precise tools for detecting problematic rules, and an overall improvement in the performance and reliability of fuzzy inference systems.

Post-training neural architecture optimization for small language models

Generative AI, and particularly language models (LLM), have sparked a new revolution in AI with applications across all domains. However, LLMs are highly resource-intensive and, hence, difficult to implement on autonomous embedded systems. LLMs can be optimized by modifying their architecture to replace heavy Transformer layers with lighter alternatives. Given the difficulty of training LLM "from scratch," this thesis aims to develop post-training neural architecture optimization methods applicable to small LLM (SLM). Additionally, the thesis seeks to propose performance metrics of different layers of an SLM and their alternatives, to guide the replacement, and thus propose a comprehensive methodology for optimizing SLMs while considering hardware constraints. The work will be valorized through publications in major AI conferences and journals, and the developed codes and methods could be integrated into the tools developed at CEA.

Reconciling predictability and performance in processor architectures for critical systems

Critical systems have both functional and timing requirements, the latter ensuring that deadlines are always met during operation; failure to do so may lead to catastrophic consequences. The critical nature of such systems demands specialized hardware and software solutions. This PhD thesis topic focuses on the development of computer architecture designs for critical systems, known as predictable architectures, capable of providing the necessary timing guarantees. Several such architectures exist, typically based on in-order pipelines and incorporating behavioral restrictions (e.g., disabling complex speculation mechanisms) or structural specializations (e.g., redesigned caches or deterministic arbitration for shared resources). These restrictions and specializations inevitably impact performance, and the design of predictable architectures must therefore address the predictability–performance tradeoff directly. This PhD thesis aims to explore this tradeoff in a novel way, by adapting a high-performance variant of an in-order processor (CVA6) and developing top-down techniques to make it predictable. Performance in such processors is usually achieved through mechanisms like branch prediction, prefetching, and value prediction, implemented via specialized storage elements (e.g., buffers) and supported by control mechanisms such as rollback on misprediction. Within this context, the goal of the thesis is to define a general predictability scheme for speculative execution, covering both storage organization and rollback behavior.

Sofware support for computing accelerators and memory transferts accelerators

For energy reasons, future computers will have to use accelerators for both computation and memory access (GPUs, TPUs, NPUs, smart DMAs). AI applications have intensive computational requirements in terms of both computing power and memory throughput.

These accelerators are not based on a simple instruction set (ISA), they break the Von Neuman model: they require specialized code to be written manually.

Furthermore, it is difficult to compare the use of these accelerators with code using a non-specialized processor, as the initial source codes are very different.

HybroLang is a hardware-close programming language that allows programs to be written using all of a processor's computing capabilities, while also allowing code to be specialized based on data known at runtime.

The HybroGen compiler has already demonstrated its ability to program in-memory computing accelerators, as well as to optimize code on conventional CPUs by performing innovative optimizations.

This thesis proposes to extend the HybroLang language in order to

- facilitate the programming of AI applications by providing support for complex data: stencils, convolution, sparse computing

- enable code generation both on CPUs and with hardware accelerators currently under development at the CEA (sparse computing, in-memory computing, memory access)

- allow to benchmark different computing architectures with the same initial source code

Ideally, a candidate should have knowledge of computer architecture, programming language implementation, code optimization and compilation.

Learning Mechanisms for Detecting Abnormal Behaviors in Embedded Systems

Embedded systems are increasingly used in critical infrastructures (e.g., energy production networks) and are therefore prime targets for malicious actors. The use of intrusion detection systems (IDS) that dynamically analyze the system's state is becoming necessary to detect an attack before its impacts become harmful.
The IDS that interest us are based on machine learning anomaly detection methods and allow learning the normal behavior of a system and raising an alert at the slightest deviation. However, the learning of normal behavior by the model is done only once beforehand on a static dataset, even though the embedded systems considered can evolve over time with updates affecting their nominal behavior or the addition of new behaviors deemed legitimate.
The subject of this thesis therefore focuses on studying re-learning mechanisms for anomaly detection models to update the model's knowledge of normal behavior without losing information about its prior knowledge. Other learning paradigms, such as reinforcement learning or federated learning, may also be studied to improve the performance of IDS and enable learning from the behavior of multiple systems.

Vulnerability analysis of protocols on hardware devices

The Information Technology Security Evaluation Facility (ITSEF) conducts activities in the field of security evaluation of electronic systems, embedded software components, either within the framework of certification schemes, for example the one led by the l’Agence Nationale de la Sécurité des Systèmes d’information (ANSSI), or at the direct request of developers.
In the context of security evaluations conducted by the ITSEF, evaluators are required, among other things, to test the resistance of cryptographic mechanisms embedded on smart cards against physical attacks, such as chip tampering attacks or attacks by observing compromising signals. In an application context (banking, healthcare, identity), these mechanisms are used within cryptographic protocols, such as key exchanges or authentications. When a vulnerability is detected in a product, the evaluator must analyze its impact on the protocol. Currently, this analysis relies on the evaluator's expertise, but the use of formal methods would be advantageous for tracing attack paths or for providing greater assurance that the vulnerability will not be exploited.
Initially, this thesis will focus on studying existing verification tools (e.g., Tamarin [1]) in order to test them on the protocols used in commonly evaluated applications. The thesis will then aim to examine the different ways in which a vulnerability can be expressed within the protocol, and to evaluate the tool's ability to formally analyze its impacts by identifying attack paths. Finally, the PhD student will be required to enhance the tool with new components to address the identified needs.
References
[1] Tamarin : https://github.com/tamarin-prover/tamarin-prover

Implementation of TFHE on RISC-V based embedded systems

Fully Homomorphic Encryption (FHE) is a technology that allows computations to be performed directly on encrypted data, meaning that we can process information without ever knowing its actual content. For example, it could enable online searches where the server never sees what you are looking for, or AI inference tasks on private data that remain fully confidential. Despite its potential, current FHE implementations remain computationally intensive and require substantial processing power, typically relying on high-end CPUs or GPUs with significant energy consumption. In particular, the bootstrapping operation represents a major performance bottleneck that prevents large-scale adoption. Existing CPU-based FHE implementations can take over 20 seconds on standard x86 architectures, while custom ASIC solutions, although faster, are prohibitively expensive, often exceeding 150 mm² in silicon area. This PhD project aims to accelerate the TFHE scheme, a more lightweight and efficient variant of FHE. The objective is to design and prototype innovative implementations of TFHE on RISC-V–based systems, targeting a significant reduction in bootstrapping latency. The research will explore synergies between hardware acceleration techniques developed for post-quantum cryptography and those applicable to TFHE, as well as tightly coupled acceleration approaches between RISC-V cores and dedicated accelerators. Finally, the project will investigate the potential for integrating a fully homomorphic computation domain directly within the processor’s instruction set architecture (ISA).

Investigation of polytopal methods apllied to CFD and optimized on GPU architecture

This research proposal focuses on the study and implementation of polytopal methods for solving the equations of fluid mechanics. These methods aim to handle the most general meshes possible, overcoming geometric constraints or those inherited from CAD operations such as extrusions or assemblies that introduce non-conformities. This work also falls within the scope of high-performance computing, addressing the increase in computational resources and, in particular, the development of massively parallel computing on GPUs.

The objective of this thesis is to build upon existing polytopal methods already implemented in the TRUST software, specifically the Compatible Discrete Operator (CDO) and Discontinuous Galerkin (DG) methods. The study will be extended to include convection operators and will investigate other methods from the literature, such as Hybrid High Order (HHO), Hybridizable Discontinuous Galerkin (HDG), and Virtual Element Method (VEM).

The main goals are to evaluate:
1. The numerical behavior of these different methods on the Stokes/Navier-Stokes equations;
2. The adaptability of these methods to heterogeneous architectures such as GPUs.

Exploration and optimization of RAID architectures and virtualization technologies for high-performance data servers

Given the ever-increasing demands of numerical simulation, supercomputers
must constantly evolve to improve their performance and thus maintain a
high quality of service for users. These demands are reflected on storage
systems, which, to be performant, reliable, and capacitive, must contain
cutting-edge technologies concerning the optimization of data placement
and the scheduling of I/O accesses. The objective of this thesis is to
study these technologies such as GPU-based RAID and I/O virtualization,
to evaluate them, and to establish optimizations that can improve the
performance of HPC storage systems.

High-Fidelity Monte Carlo Simulations of Neutron Noise in Nuclear Power Reactors

Operating nuclear reactors are subject to a variety of perturbations. These can include vibrations of the fuel pins and fuel assemblies due to fluid-structure interactions with the moderator, or even vibrations of the core barrel, baffle, and pressure vessel. All of these perturbations can lead to small periodic fluctuations in the reactor power about the stable average power level. These power fluctuations are referred to as “neutron noise”. Being able to simulate different types of in-core perturbations allows reactor designers and operators to predict how the neutron flux could behave in the presence of such perturbations. In recent years, many different research groups have worked to develop computational models to simulate these sources of neutron noise, and their resulting effects on the neutron flux in the reactor. The primary objective of this PhD thesis will be to bring Monte Carlo neutron noise simulations to the scale of real-world industry calculations of nuclear reactor cores, with a high-fidelity continuous-energy physics representation. As part of this process, the student will add novel neutron noise simulation capabilities to TRIPOLI-5, the next-generation production Monte Carlo particle-transport code jointly developed by CEA and ASNR, with the support of EDF.

Top