Measuring event impact and propagation in the internet

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : LIP6
Durée : 4 to 6 months
Contact : lionel.tabourier@lip6.fr
Date limite de publication : 2024-03-31

Contexte :
Understanding the impact of internet anomalous events at internet scale, such as performance degradation, outages, or attacks, is a challenging problem. If techniques and systems have been designed to detect outages at some particular internet facilities, or detect congestion between interdomain links, there exists no internet scale system to monitor events across all autonomous systems (ASes), and thus, we have no clear understanding on the impact of an event on the internet.

The BGP protocol allows ASes to interconnect, so that each AS can reach the prefixes containing the IP addresses of another AS via the routes received with BGP. As most internet events rarely last more than tens of minutes, to capture them, we need to run traceroutes towards each BGP prefix announced by all the ASes very frequently. And in addition to these background measurements, we need to be able to run even more targeted measurements during an event, in order to have a precise understanding of the behavior of the internet paths before and after this event.

Unfortunately, public measurement systems, such as RIPE Atlas and CAIDA Ark do not offer such measurements or the possiblity to run them. They either perform meshed traceroutes between hundreds of sources and destinations at short intervals (15 minutes), or perform traceroutes to one destination per BGP prefix from a hundred of vantage points every day. This is neither sufficient to have an internet scale coverage nor to cover most internet events.

Sujet :
We propose to design this missing measurement system, that will run background traceroutes at high speed every 15 minutes from a few vantage points to one destination in each BGP prefix announced by any AS. When an event is detected, we will run targeted measurements using
propagation algorithms to understand how this event spreads on the internet.

Profil du candidat :
This internship is directed at Master students (preferably Master 2 students) with a background in computer science. Good coding skills are requested for the internship, knowledge of a widely-used language in learning, such as python, is preferable but not mandatory.

Formation et compétences requises :
Background in computer networking, system building, and graph theory are at the heart of the internship, so a background in those areas is an asset, but not mandatory.

Adresse d’emploi :
LIP6, Sorbonne University (4 place Jussieu, 75005, Paris)

Document attaché : 202312201034_Measuring_Event_Impact.pdf

Gaussian Process Prior Variational Autoencoders for Earth Data Time Series Anlaysis

Offre en lien avec l’Action/le Réseau : RoCED/– — –

Laboratoire/Entreprise : INRAE Toulouse
Durée : 6 mois
Contact : mathieu.fauvel@inrae.fr
Date limite de publication : 2024-02-28

Contexte :
Over the last ten years, Earth Observation (EO) has made enormous advances in terms of spatial and temporal resolutions, data
availability and open policies for end-users. The increasing availability of complementary imaging sensors allows land ecosystems
state variables and processes to be observed at different spatio-temporal scales. Big EO data can thus enable the design of new
land monitoring systems providing critical information in order to guide climate change monitoring, mitigation and adaptation.
Conventional machine learning methods are not well adapted to the complexity of multi-modal, multi-resolution satellite image
time series (SITS) with irregular sampling. Therefore they are not suitable for extracting and processing all the relevant infor-
mation. On the other hand, methods based on deep neural networks have shown to be very effective to learn low-dimensional
representations of complex data for several tasks and come with high potential for EO data. However, they often emerge from the
computer vision (CV) and natural language processing (NLP) communities and need to be extended and properly instantiated to
handle the very specificities of Earth Observation data.
Previous works at the CESBIO-lab have shown that generative encoder-decoder architectures such as the Variational Auto-
Encoder (VAE) or the U-NET models perform very well for a variety of EO tasks : estimation of biophysical parameters or
Sentinel-1 to Sentinel-2 translations, to cite a few.
However, such approaches appear to be inadequate to handle data coming from more than 2 sources and acquired at different
time and spatial resolutions, as prioritized in the RELEO chair within ANITI. In particular, the generative capability of these
models may generalize poorly to unseen regions or temporal periods. Processing such streams of data requires to jointly encode
all sources into a structured latent space where each complementary information carried by each source can be embedded while
ensuring long-term encoding of newly acquired data (from possibly new sensors).

Sujet :
The objective of this internship is to investigate Gaussian process (GP) prior for Variational Auto-Encoders (VAEs). Usually,
VAEs assume independence between samples. This assumption is generally made for sake of simplicity and computational ef-
ficiency of the training and inference steps. However, assuming independence of samples amount to ignoring the correlation
between adjacent pixels in the temporal domains. Furthermore, because of the very deterministic nature of such neural networks
architectures, they do not properly encode uncertainty related to missing/noisy data.
Adopting this GP prior is expected to model correlations between times. However, due to the irregular and unaligned nature of
SITS and their massive volume, approximation are required to maintain fast training and inference.
The work-plan of this Master internship is as follows :
1. Define GP prior VAEs for pixel satellite time series with different approximations,
2. Implement the models in PyTorch,
3. Perform experiments on massive SITS and compare with others VAEs on downstream tasks.

Profil du candidat :
Master or Engineering school students with major in applied mathematics, computer science or electrical engineering.

Formation et compétences requises :
The knowledge needed for this work includes a strong background in machine learning or data science, signal & image
processing or remote sensing data processing. Good scientific programming skills (e.g., Python) and good communication skills
in English, both written and oral are also expected. Interests in Earth observation will be appreciated.

Adresse d’emploi :
Centre d’Etudes Spatiales de la Biosph`ere (CESBIO) & INRAE

Document attaché : 202312190628_proposal_MSc_2024_ANITI.pdf

Explicabilité des décisions d’un GNN, application à la chémoinformatique

Offre en lien avec l’Action/le Réseau : DSChem/– — –

Laboratoire/Entreprise : Groupe de recherche en informatique, image, automa
Durée : 6 mois
Contact : jean-luc.lamotte@unicaen.fr
Date limite de publication : 2024-02-28

Contexte :
Ce stage prend place dans une collaboration de longue date entre trois laboratoires :
1. Le GREYC (UMR 6072, Caen), a développé au cours des années une expertise forte en fouille de données et apprentissage appliqué à la Chemoinformatique,
2. Le CERMN (UR 4258, Caen) est le centre d’études et de recherche sur le médicament de Normandie et entretient depuis de nombreuses années une collaboration avec le GREYC sur l’analyse informatique de molécules pour créer de nouveaux médicaments,
3. Le LITIS (UR 4108, Rouen), a acquis une solide expérience en apprentissage machine qu’il applique notamment sur des bases de graphes.
Le GREYC et le LITIS collaborent activement au sein de la fédération NormaSTIC.

Des membres des trois laboratoires participeront à l’encadrement du stage en y apportant leurs compétences respectives.

Ce stage peut être considéré comme une étape préparatoire à une thèse de doctorat sur le même sujet.

Sujet :
Le stage commencera par une étude des méthodes GNN permettant de prédire les propriétés de nos jeux de données. L’étude sera ciblée sur la prédiction des interactions protéines/ligands à partir des structures des molécules. A
cette occasion, des méthodes basées GCN, au sens large, et GCN+pooling seront étudiées.

Si cette étape est validée, nous aborderons une comparaison des méthodes de la littérature permettant d’expliquer les résultats de ces GNNs. Les résultats produits (en termes d’explication) seront évalués en utilisant plusieurs critères tels que l’accuracy, l’aire sous la courbe, la fidélité, la parcimonie,. . .. Nous espérons identifier à partir de cette étude des sous structures pharmacophoriques pertinentes pour les propriétés à prédire.

Nous essaierons, dans un troisième temps, d’appliquer ces méthodes sur les graphes moléculaires squelettiques. Il s’agira de comparer sous l’angle de l’explicabilité les descriptions moléculaires ”brutes” et celles produites en intégrant une expertise du domaine via le graphe pharmacophorique.

Profil du candidat :
Le candidat doit être inscrit en dernière année d’un Master ou d’un diplôme d’ingénieur dans un domaine lié à l’informatique ou aux mathématiques appliquées, et posséder de solides compétences en programmation. Une expérience en informatique pour la Science des Données, apprentissage profond, notamment sur graphes, sera un plus.

Formation et compétences requises :
Le candidat doit être inscrit en dernière année d’un Master ou d’un diplôme d’ingénieur dans un domaine lié à l’informatique ou aux mathématiques appliquées, et posséder de solides compétences en programmation.

Adresse d’emploi :
Le stage sera effectué au GREYC (Caen) ou au LITIS (Rouen) en fonction du lieu de recrutement de l’étudiant. Il débutera en février ou mars 2024 pour une durée de 6 mois et bénéficiera d’une gratification au tarif minimum réglementaire pour les stages.

Document attaché : 202312181731_stageMasterGNN-chemo.pdf

Job Recommendation From A Heterogeneous Graph

Offre en lien avec l’Action/le Réseau : DOING/– — –

Laboratoire/Entreprise : SAMOVAR/Télécom SudParis
Durée : 6 mois
Contact : romerojulien34@gmail.com
Date limite de publication : 2024-02-28

Contexte :
Job recommendation is the task of associating candidates with
jobs. This can be useful for candidates who would like to find to best possible
jobs, for companies that want to find the rarest talents in the vast pool of
candidates, but also for independent recruiters who need to be as precise as
possible when they send a resume to a company.
In this internship, you will work on a new dataset for job recommendations.
Its particularity is that it contains much additional information about candi-
dates and jobs we can represent as a graph. Besides, it is very sensitive to the
cold start problem: We have many new candidates and new jobs, and it restricts
a lot of the algorithms we can use.
If we consider video recommendations on Youtube, an average viewer watches
many videos, and each video is viewed many times. Therefore, when recom-
mending new videos to a specific user, we can look at what other similar view-
ers watched and recommend the most relevant video. This is the principle of
collaborative filtering. In our case, our users are likely to get a job and never
come back. Likewise, jobs are associated with one person, and then, we are
done with it. Therefore, we need to exploit extra information to make the
recommendation.
For our dataset, we can represent our pool of candidates and jobs with a
heterogeneous graph, connecting candidates and jobs, but also additional node
types like skills, cities, or employment types. Because we have this expressive
representation, we must adapt the existing algorithms. During the internship,
we will see how graph neural networks can be used to make recommendations,
and we will propose a new architecture to solve our specific problem.
The goal of this internship will be to publish a paper at an international
conference. The intern will work together with a Ph.D. student.

Sujet :
Job recommendation is the task of associating candidates with
jobs. This can be useful for candidates who would like to find to best possible
jobs, for companies that want to find the rarest talents in the vast pool of
candidates, but also for independent recruiters who need to be as precise as
possible when they send a resume to a company.
In this internship, you will work on a new dataset for job recommendations.
Its particularity is that it contains much additional information about candi-
dates and jobs we can represent as a graph. Besides, it is very sensitive to the
cold start problem: We have many new candidates and new jobs, and it restricts
a lot of the algorithms we can use.
If we consider video recommendations on Youtube, an average viewer watches
many videos, and each video is viewed many times. Therefore, when recom-
mending new videos to a specific user, we can look at what other similar view-
ers watched and recommend the most relevant video. This is the principle of
collaborative filtering. In our case, our users are likely to get a job and never
come back. Likewise, jobs are associated with one person, and then, we are
done with it. Therefore, we need to exploit extra information to make the
recommendation.
For our dataset, we can represent our pool of candidates and jobs with a
heterogeneous graph, connecting candidates and jobs, but also additional node
types like skills, cities, or employment types. Because we have this expressive
representation, we must adapt the existing algorithms. During the internship,
we will see how graph neural networks can be used to make recommendations,
and we will propose a new architecture to solve our specific problem.
The goal of this internship will be to publish a paper at an international
conference. The intern will work together with a Ph.D. student.

Profil du candidat :
The intern should be involved in a master’s program and have
a good knowledge of machine learning, deep learning, natural language processing, and graphs. A good understanding of Python and the standard libraries
used in data science (scikit-learn, PyTorch, pandas, transformers) is also expected. In addition, a previous experience with graph neural networks would be appreciated.

Formation et compétences requises :
The intern should be involved in a master’s program and have
a good knowledge of machine learning, deep learning, natural language processing, and graphs. A good understanding of Python and the standard libraries
used in data science (scikit-learn, PyTorch, pandas, transformers) is also expected. In addition, a previous experience with graph neural networks would be appreciated.

Adresse d’emploi :
Télécom Sudparis, Palaiseau

Document attaché : 202312181300_internship_job_recommandation-2.pdf

Financial Forecasting With Deep Learning

Offre en lien avec l’Action/le Réseau : DOING/– — –

Laboratoire/Entreprise : SAMOVAR/Télécom SudParis
Durée : 6 mois
Contact : romerojulien34@gmail.com
Date limite de publication : 2024-02-28

Contexte :
In this internship, we propose to study the problem of financial forecasting, i.e., predicting the future
variation of the price of a financial instrument, using deep learning. The student will work on a new
data source with a finer granularity than existing datasets. Because of the difficulty of obtaining
data, previous works focused on price prediction at the scale of a day, a week, or a month. Our
new dataset contains intraday information. Therefore, we can predict the price within a day and
use multi-scale analysis. Besides, our new dataset contains different kinds of financial instruments
(FOREX, crypto, options, futures) and additional information about the companies (description,
financial reports, dividends).

Sujet :
In this internship, we propose to study the problem of financial forecasting, i.e., predicting the future
variation of the price of a financial instrument, using deep learning. The student will work on a new
data source with a finer granularity than existing datasets. Because of the difficulty of obtaining
data, previous works focused on price prediction at the scale of a day, a week, or a month. Our
new dataset contains intraday information. Therefore, we can predict the price within a day and
use multi-scale analysis. Besides, our new dataset contains different kinds of financial instruments
(FOREX, crypto, options, futures) and additional information about the companies (description,
financial reports, dividends).

Profil du candidat :
The intern should be involved in a master’s program and have a good knowledge of machine learning,
deep learning, and data processing. A good understanding of Python and the standard libraries used
in data science (scikit-learn, PyTorch, pandas) is also expected. A previous experience with finance
is appreciated but not required for this internship.

Formation et compétences requises :
The intern should be involved in a master’s program and have a good knowledge of machine learning,
deep learning, and data processing. A good understanding of Python and the standard libraries used
in data science (scikit-learn, PyTorch, pandas) is also expected. A previous experience with finance
is appreciated but not required for this internship.

Adresse d’emploi :
Télécom SudParis, Palaiseau

Document attaché : 202312181259_stage_finance.pdf

Emploi de Chef-fe de projet ou expert-e en Ingéniérie logicielle oiur mobilité NOEMIE

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : CALMIP Toulouse
Durée : CDI
Contact : jean-luc.estivalezes@imft.fr
Date limite de publication : 2023-12-15

Contexte :
Le mésocentre régional de calcul CALMIP – https://www.calmip.univ-toulouse.fr – situé à Toulouse, propose un poste CNRS de type Noémie ouvert à la mobilité pour un chef.fe de projet ou expert en ingénierie logicielle. Cette personne évoluera dans un contexte de valorisation de la donnée issue du HPC. Il/elle sera placé(e) sous la responsabilité hiérarchique du directeur.

Sujet :
Ses missions s’articuleront autour de la gestion et de la valorisation de la donnée scientifique : il/elle contribuera au support et à l’évolution de l’offre de service PADIRE développée à CALMIP, avec comme élément central le portail Callisto ( https://callisto.calmip.univ-toulouse.fr/ ) basé sur DataVerse et l’accompagnement des utilisateurs pour la FAIRisation de leurs données.

Le détail du poste et les modalités de candidature sont accessibles à l’adresse suivante :

https://mobiliteinterne.cnrs.fr/ords/afip/owa/consult.affiche_fonc?code_fonc=Y56026&type_fonction=&code_dr=14&code_corps=IR&code_bap=E&nbjours=&page=1&colonne_triee=1&type_tri=ASC

Profil du candidat :
Agent CNRS en mobilité interne
IIngénieur de recherche en informatique

Formation et compétences requises :
Thèse en informatique ou niveau équivalent

Adresse d’emploi :
Unité d’accueil
UAR3667 http://www.calmip.univ-toulouse.fr

TOULOUSE

Post-doctorant(e) pour travailler sur une approche innovante pour le partitionnement de réseaux complexes (réseaux télécom, d’énergie, réseaux sociaux, …) centré sur les besoins des utilisateurs.

Offre en lien avec l’Action/le Réseau : Musiscale/– — –

Laboratoire/Entreprise : IMT Atlantique
Durée : 12 mois
Contact : cecile.bothorel@imt-atlantique.fr
Date limite de publication : 2023-12-15

Contexte :
IMT Atlantique est une grande école d’ingénieurs française de renom, formant les futurs professionnels de l’innovation, des technologies et du management, dont les thématiques d’enseignement se situent à l’intersection de l’ingénierie, de l’informatique, des télécommunications et de l’énergie. Le laboratoire Lab-STICC est une unité de recherche interdisciplinaire reconnue, traitant de questions liées à l’informatique,
l’électronique et les sciences de l’information. Fortement axé sur l’innovation, ce laboratoire favorise la collaboration entre chercheurs pour résoudre des problématiques complexes.

L’équipe DECIDE du Lab-STICC et du département Data Science d’IMT Atlantique vise à exploiter des synergies entre l’aide à la décision et la science des données pour répondre aux enjeux scientifiques, industriels et sociétaux émergeant de problèmes de décision dans les systèmes complexes (environnement, transport, énergie, réseaux sociaux, santé, défense).

Dans ce cadre, l’équipe recherche un(e) post-doctorant(e) pour un projet de recherche innovant sur le partitionnement de réseaux complexes centré sur les besoins des utilisateurs.

Sujet :
Vous participerez au développement d’une nouvelle approche de partitionnement de réseaux complexes (social, transport, énergie, etc.) guidée par les préférences de l’utilisateur. Votre rôle sera de :
– Définir les propriétés caractérisant les partitions selon les besoins des utilisateurs
– Intégrer la dimension temporelle dans les partitions
– Explorer l’espace des solutions candidates via des algorithmes d’optimisation
– Prendre en compte les préférences non-monotones des utilisateurs
– Générer des explications sur les partitions obtenues Cette approche pluridisciplinaire fait appel à l’optimisation multi-objectif, l’aide à la décision et l’analyse de réseaux complexes.

Profil du candidat :
Doctorat en informatique, mathématiques appliquées ou aide à la décision.

Formation et compétences requises :
Compétences en optimisation, algorithmique et analyse de réseaux complexes. Bonnes compétences en Python et idéalement en utilisation de solveurs de programmation mathématique (Gurobi, Cplex, …).

Adresse d’emploi :
IMT atlantique
Technopôle Brest-Iroise CS 83818
29238 Brest Cedex 3

Document attaché : 202312151310_annonce-en.pdf

Generative Model for multivariate time series. Application on aircraft engine

Offre en lien avec l’Action/le Réseau : BigData4Astro/– — –

Laboratoire/Entreprise : DAVID Lab – UVSQ – Versailles
Durée : 6 mois
Contact : mustapha.lebbah@uvsq.fr
Date limite de publication : 2024-01-31

Contexte :
In this research internship, we aim to test the feasibility of a modern neural methodology based on the generative model, which has been successfully applied to text/image processing. The field of video generation technology has seen significant advancements, with modern models capable of producing highly realistic videos [1, 4, 5]. Drawing an analogy to this, studying the life cycle of an aircraft engine can be viewed similarly to creating a video. In this analogy, each frame represents a distinct flight undertaken by the aircraft, during which multiple continuous parameters forming multivariate time series data. Each multivariate time series can be compared to a frame in a video, reflecting the dynamic states of the aircraft engine during the respective flight.

Sujet :
The aim of this research internship is to strengthen collaboration with Safran.

-Study the current state of the art in deep generative model and multivariate time series,
-By sequentially analyzing this collection of parameters flight after flight, akin to stringing together video frames, we can create a detailed and comprehensive depiction of the aircraft engine’s life cycle, allowing for the identification of behavioral patterns, anomalies and providing predictive insights into the engine’s performance and longevity.
-Based on previous studies [2, 3], implement one or more algorithms/architectures. The results obtained during the internship may lead to contributions to open-source software, or even a scientific publication, depending on the intern’s skills and motivation.

Profil du candidat :
End of engineering degree, M1/M2 in data science, statistics, artificial intelligence, or computer science. Excellent understanding of machine learning basics, particularly deep learning models. Excellent programming skills, especially with tensorflow/keras.

Formation et compétences requises :
End of engineering degree, M1/M2 in data science, statistics, artificial intelligence, or computer science. Excellent understanding of machine learning basics, particularly deep learning models. Excellent programming skills, especially with tensorflow/keras.

Adresse d’emploi :
The internship will be in the DAVID Lab at the University of Versailles

Document attaché : 202312150838_DAVID-UVSQ-Research_Internship_GenerativeMTS.pdf

Position on data-driven weather forecasting

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : Centre National de Recherches Météorologiques
Durée : 21 months
Contact : laure.raynaud@meteo.fr
Date limite de publication : 2024-01-31

Contexte :
In the past months several studies have proposed a new paradigm for weather prediction, demonstrating that data-driven forecasting models are now competitive with traditional physics-based models, on a number of variables and at a relatively coarse scale.
In this context, the main mission of the open position is to carry out innovative work to develop an emulator of numerical weather prediction model on a regional scale, based on state-of-the-art artificial intelligence methods (transformers, graph networks, generative models).
Application on CNRS portal : https://emploi.cnrs.fr/Offres/CDD/UMR3589-LAURAY-005/Default.aspx

Sujet :
In the current state of knowledge and progress, the areas to explore are:
1. the creation of learning datasets, based on simulations and/or observations;
2. the choice of a neural network architecture adapted to the problem, based on recent literature on the subject (transformers, graph networks, etc.);
3. implementation and training of the selected architecture on the available datasets;
4. evaluation of the emulator, using diagnostics and objective forecast scores, and analysis of high-impact weather situations.
The developments will be carried out in priority for high-resolution forecasting on a regional scale, over France and neighboring countries, but could be extended to forecasting on overseas areas or for forecasting on a global scale. The work will use the most innovative aspects of deep learning, and will require scientific watch on these techniques. The position holder will also be required to collaborate with other Météo-France services, scientists and industrials in the mathematical and IT fields, and European partners. He or she will be encouraged to participate in international institutional or scientific meetings, to contribute to national or European projects, and to participate to the publication of results in the form of scientific articles.

Profil du candidat :
We are seeking for a highly motivated candidate with a strong experience in deep learning with large-scale data.

Formation et compétences requises :
The ideal candidate will have the following qualifications:
– A master’s or engineering degree in computer science or statistics
– Recognized expertise in statistical learning methods (convolutional neural networks, Transformers, graph networks, GANs, etc.)
– Excellent capabilities in Python programming and classic development environments for deep learning (PyTorch, Tensorflow)
– Experience in using GPU processors and code optimization (parallelization)
– Experience in handling very large datasets
– An ability to work scientifically in a team, with internal and external collaborators, and to communicate results in English, both written and orally
– Scientific curiosity, rigor, autonomy
– An interest in the application to atmospheric sciences; a first experience in this field will be an advantage.

Adresse d’emploi :
CNRM, Toulouse, France.
The National Center for Meteorological Research (CNRM) is a Joint Research Unit (UMR) with dual supervision from Météo-France and CNRS. The CNRM conducts research in conjunction with the missions of Météo-France and other organizations, notably the CNRS and universities. The CNRM participates in major international or European scientific programs in its field of expertise.
The position holder will be part of the Predictability team, located on the Météopole site in Toulouse. The team’s research focuses on the development of operational ensemble forecasting systems at Météo-France and the integration of AI methods in atmospheric modeling.

Detection of electric arcs using physics-informed neural networks

Offre en lien avec l’Action/le Réseau : BigData4Astro/– — –

Laboratoire/Entreprise : DAVID Lab – UVSQ – Versailles
Durée : 36
Contact : mustapha.lebbah@uvsq.fr
Date limite de publication : 2024-01-31

Contexte :
An electric arc is a high-current disruptive discharge capable of self-sustaining at low voltage. Its study belongs to the field of plasma physics.

While legacy networks have learned to live with the non-zero probability of this fault occurring, since the causes and consequences are known, the same cannot be said for networks dedicated to propulsion, whether all-electric or hybrid. The criticality of the damage is often equated with the energy deployed by the fault compared with the susceptibility to damage of the materials in direct or indirect contact with the arc. We can therefore imagine the impact that a high-power arc could have in such a network (KV/MW) if its duration were to exceed a few milliseconds! Given the increase in voltage and the DC waveform, the risk and consequences of arc faults are increased. This observation, combined with a constrained environment (confined areas, high-risk zones such as FFLZ, severe environment zones, etc.), prompted the EWIS engineers to carry out high-power electric arc tests, leading to the following conclusions:

When an electric arc is generated whose power exceeds several hundred kW and whose lifetime is not controllable, it is no longer possible to control it.

When an electric arc is generated, the power of which exceeds several hundred kW and the lifetime of which cannot be controlled, it is no longer possible to mitigate the consequences solely by choosing ‘arc-resistant’ materials and design guides in the confined environment and safety constraints inherent in civil aeronautics.

This is why the detection and elimination of this fault is inevitable. Arc detection systems, which were not mandatory on legacy networks, will certainly be required on the propulsion networks envisaged.

Arc detection is important for electrical safety, because arcs can cause fires, damage electrical equipment and pose a risk to people. To detect arcs, several technologies and methods are used, including: monitoring by current sensors, light sensors, electrical signal analysis, thermal cameras, etc.

Sujet :
Objectives
The aim of the proposed thesis will be to couple AI and physics for the modeling and early detection of electric arcs. The AI tools targeted are Physics-Informed Neural Networks (PINNs). Experimental data will be available to feed these models. In addition, the physical equations to describe the evolution of arcs to be taken into account are well-known (Maxwell, Faraday, Navier-Stokes, etc).

The major challenges of the thesis will be as follows:

• Modeling electric arcs: Developing PINNs models to describe electric arcs accurately, taking into account the physical equations that govern them. As a check on the calculations (or as a learning tool), physical simulations called MHD (Magneto-Hydro-Dynamics) could be carried out for all the geometries envisaged in the SafranTech E&E team.
• Network training: Train neural networks to predict the presence of electric arcs using observation data and the relevant physical equations.
• Early detection: Develop techniques for the early detection of electric arcs based on PINNs models.
• Experimental validation: Test PINN models and detection methods on real experimental data from Safran electrical systems. This data will be available for several “typical” geometries and will enable the PINNs models to be tested on reproducible and controllable cases.

Profil du candidat :
● End of engineering degree / Master’s degree in a relevant field (e.g., computer science, ML/AI, Statistics …)
● Excellent understanding of machine learning and physics basics. Familiar with recent Artificial Intelligence: transformers, diffusion
model, auto-encoder…etc.
● Excellent programming skills, especially with Python, Pytorch,
● Autonomous and able to quickly adapt to recent scientific literature / technologies.

This PhD topic is participating to the Université Paris-Saclay EU COFUND DeMythif.AI program : https://www.dataia.eu/actualites/cofund-demythifai-appel-sujets-de-these. It is reserved to international students who have spent less than 12 months in France in the last 3 years. The candidates will be evaluated by a jury that will select 15 PhD to start in fall 2024. The successful candidates will be fully funded for 3 years, have access to specific scientific and non-scientific training, and be fully part of the Université Paris-Saclay AI community. The aim of this Ph.D. research is to strengthen collaboration with Safran group. The thesis is accompanied by a collaborative contract with Safran, ensuring the environment, interaction with experts, and data availability.

Formation et compétences requises :
● End of engineering degree / Master’s degree in a relevant field (e.g., computer science, ML/AI, Statistics …)
● Excellent understanding of machine learning and physics basics. Familiar with recent Artificial Intelligence: transformers, diffusion
model, auto-encoder…etc.
● Excellent programming skills, especially with Python, Pytorch,
● Autonomous and able to quickly adapt to recent scientific literature / technologies.

This PhD topic is participating to the Université Paris-Saclay EU COFUND DeMythif.AI program : https://www.dataia.eu/actualites/cofund-demythifai-appel-sujets-de-these. It is reserved to international students who have spent less than 12 months in France in the last 3 years. The candidates will be evaluated by a jury that will select 15 PhD to start in fall 2024. The successful candidates will be fully funded for 3 years, have access to specific scientific and non-scientific training, and be fully part of the Université Paris-Saclay AI community. The aim of this Ph.D. research is to strengthen collaboration with Safran group. The thesis is accompanied by a collaborative contract with Safran, ensuring the environment, interaction with experts, and data availability.

How to apply
https://adum.fr/as/ed/voirproposition.pl?site=PSaclay&matricule_prop=51640&langue=en

Adresse d’emploi :
David Lab /UVSQ Versailles

Document attaché : 202312150810_Proposal-PHD.pdf