Stage de M2 : Algorithmes asynchrones pour l’inférence bayésienne

When:
15/09/2022 – 16/09/2022 all-day
2022-09-15T02:00:00+02:00
2022-09-16T02:00:00+02:00

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : CRIStAL UMR 9189, Lille
Durée : 6 mois
Contact : pierre.chainais@centralelille.fr
Date limite de publication : 2022-09-15

Contexte :
Centre de Recherche en Informatique, Signal et Automatique de Lille (UMR 9189 CRIStAL), Villeneuve d’Ascq, France.

Dates: Starting date between March and April 2022, 4 to 6 months internship.

Pierre-Antoine THOUVENIN (pierre-antoine.thouvenin@centralelille.fr), € https://pthouvenin.github.io/
Pierre CHAINAIS (pierre.chainais@centralelille.fr),
€ pierre.chainais@centralelille.fr

The intern will be jointly supervised by Pierre Chainais, professor at Centrale Lille, and Pierre-Antoine Thouvenin, assitant professor at Centrale Lille. The internship will take place in the Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL, UMR 9189), France, within the SigMA team.
This project is part of the ANR Chaire IA SHERLOCK (Fast inference with controlled uncertainty: application to astrophysical observations) led by Pierre Chainais (co-funded by Agence Nationale de la Recherche (ANR), ISITE, Centrale Lille Institut and Région Haut-de-France). Participation to a national or international workshop is considered.
Period and continuation in PhD
This 4 to 6-months internship will start between March and April 2022. The precise start and end dates will be adjusted depending on the availability of the candidate. The intern will be granted the usual stipend of ∼ 600 euros/month (3,90 euros/hour).
This M.Sc. project may be continued as a PhD thesis, for which a grant is already secured for the period 2022-2025 thanks to the ANR Chaire IA SHERLOCK.

Keywords. Bayesian inference, continuous optimization, distributed asynchronous algorithms, MCMC methods.

Sujet :
Bayesian inference is a usual approach to estimate parameters from a dataset, a typical setting underlying the resolution of inverse problems. An inverse problem consists in estimating a collection of parameters involved in a physical model from degraded and noisy observations, e.g., reconstucting an image from noisy incomplete observations of the sky in radio- astronomy. In many signal and image processing applications, especially in astronomy (Abdulaziz et al. 2019; Cai et al. 2018) and remote sensing (Ghamisi et al. 2019), no ground truth is available. Fast parameter inference under controlled uncertainty is thus critical to guarantee the quality of the resulting predictions. Indeed, different values of a parameter can be associated to different physical processes, for instance in remote sensing source separation in presence of outliers.
Inference cost can be large, and increases significantly with both the number of observations (large dataset) and param- eters to be inferred (high dimensional problem). Typical signal and image processing applications lead to the resolution of high-dimensional inverse problems, relying on large datasets. Asynchronous (parallel or distributed) optimization al- gorithms have recently regained interest due to their potential of acceleration to form an estimator, in comparison with their synchronous counterparts (Hannah et al. 2017).
The project is aimed at investigating the potential of asynchrony to accelerate distributed optimization algorithms amenable to a Single Program Multiple Data (SPMD) implementation. We will study several aspects, such as the algo- rithm convergence, the resulting estimation quality and inference time. Applications to the resolution of inverse problems in remote sensing or astronomy will be considered.
Depending on the evolution of the project, the study will be extended to a few selected Markov-chain Monte Carlo (MCMC) methods (Durmus et al. 2018; Simsekli et al. 2018; Terenin et al. 2020) to provide estimators with quantified uncertainty, beyond the point estimate provided by optimization algorithms.
This M.Sc. project may be continued as a PhD thesis, for which a grant is already secured for the period 2022-2025 thanks to the ANR Chaire IA SHERLOCK.

Profil du candidat :
Master 2 or last year engineering school students with major in applied mathematics, computer science or electrical engineering. The project requires a strong background in data science and/or machine learning (statistics, optimization), signal & image processing. Very good Python coding skills are expected.

Application procedure
Applicants are invited to send the following documents in .pdf format to both co-advisors:
• a detailed curriculum;
• official transcripts from the institutions you have attended over the last 2 years (in French or in English);
• references: letters of recommendation or names of two researchers/professors willing to recommend your applica- tion.
For further information, please contact both co-advisors of the project:
• Pierre-Antoine Thouvenin, pierre-antoine.thouvenin@centralelille.fr • Pierre Chainais, pierre.chainais@centralelille.fr.

Formation et compétences requises :
A B2 English level is mandatory.
Knowledge in C++ programming, as well as experience or interest in parallel/distributed code development (MPI, OpenMP, CUDA, …) will be appreciated.

Adresse d’emploi :
CRIStAL, Cité Scientifique, 59651 Villeneuve d’Ascq Cedex

Document attaché : 202201242012_msc_proposal_2022_CRIStAL.pdf