About us
Espace utilisateur
Education
INSTN offers more than 40 diplomas from operator level to post-graduate degree level. 30% of our students are international students.
Professionnal development
Professionnal development
Find a training course
INSTN delivers off-the-self or tailor-made training courses to support the operational excellence of your talents.
Human capital solutions
At INSTN, we are committed to providing our partners with the best human capital solutions to develop and deliver safe & sustainable projects.
Thesis
Home   /   Thesis   /   Analysis et optimisation of internode exchanges in MSA context

Analysis et optimisation of internode exchanges in MSA context

Abstract

The race to exascale has caused the emergence of highly heteorgenous supercomputers. The "MSA" model (Modular Supercomputing Architecture) is the result of this evolution. An MSA system is composed of several "modules", each module being in itself a smaller supercomputer with a specific architecture to address specific computation needs. These modules are linked together with a fast interconnect and a common software stack, providing the possibility ot launch a unique job on multiple modules. It is possible that the computationel units and the interconnect link inside each module is different from the other modules.

Having mulitple interconnection network impose an increased pressure on internode communication libraries, such as MPI implementations. Indeed, if an application can run on multiple modules at the same time, the MPI implementation needs to be able to make messages travel between two MPI processes located on two distinct modules. It is then necessary for the MPI implementation to 1) support all networks involved and 2) make a unique message go through several networks.

The purpose of the thesis is to analyze the features and constraints required by the MSA model, and to offer solutions for efficient runs on such supercomputer.

The Ph.D. candidate will rely on the expertise of the advising team, along with the network support in the MPC framework offering multi-rail and multi-networks capabilities.

Through this thesis, we aim to study and provide solution for:

- A user interface providing a multi-module job launcher

- Gathering cross-module network topology information, and providing a comprehensive and useful representation of such network

- Analyze, develop and implements hierarchical algorithms adapted to MSA, and aknowledging the underlying network topology of the allocated resources

Laboratory

DSSI
DSSI
Bordeaux
Top pencilenvelopegraduation-hatlicensebookuserusersmap-markercalendar-fullbubblecrossmenuarrow-down