PhD Position – A multi-modal language model for Earth observation [INRIA – Team EVERGREEN, Montpellier, France]

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : INRIA – EVERGREEN
Durée : 3 ans
Contact : diego.marcos@inria.fr
Date limite de publication : 2024-06-30

Contexte :
This PhD offer is funded by the GEO-ReSeT ANR project, representing a collaboration between Inria (team EVERGREEN, Montpellier) and Université de Paris Cité (team LIPADE, Paris).

Leveraging the large amounts of available geo-spatial data from different sources, the GEO-ReSeT (Generalized Earth Observation with Remote Sensing and Text) project has the objective to learn a rich representation of any geo-spatial location and convey a semantic representation of the information, by improving on existing models and providing a better experience to the end users. By using location on the Earth’s surface as the common link between different modalities, a geo-spatial foundation model would be able to incorporate a variety of data sources, including remote sensing imagery, textual descriptions of places, and other generic features.

Such a foundation model has the potential to open a set of all new possibilities in terms of Earth observation applications, by allowing for few or zero-shot solutions to classical problems such as land-cover and land-use mapping, target detection, and visual question answering. It will also be useful for a wide range of applications with a geo-spatial component, including environmental monitoring, urban planning and agriculture.
By leveraging several data modalities, this foundation model could provide a comprehensive and accurate understanding of the Earth’s surface, enabling informed decisions and actions. This will be particularly valuable for new potential users in sectors such as journalism, social sciences or environmental monitoring, who may not have the resources or expertise to collect their own training datasets and develop their own methods, thus moving beyond open Earth observation data and democratizing the access to Earth observation information.

Sujet :
The work to be conducted during the proposed PhD thesis will contribute to the ambition of the GEO-ReSeT ANR project by linking textual descriptions of places (e.g., collected from heterogeneous online sources, such as news articles or search engine results), to their approximate geo-location, a task known as geoparsing.

This text-location link will then be used in combination with other geospatial data modalities, with a focus on remote sensing data from sensors such as Sentinel-1 and -2, in order to train multi-modal models that are aware about the way in which people describe locations.

This will be done by first combining information stemming from different databases containing geographic named entities, such as Open Street Map, Wikipedia and gazetteers, such that geographic points or polygons can be linked to each named entity.

In a second step, a Natural Language Processing (NLP) pipeline will be developed to obtain the most likely geographic named entities that are referred to in any piece of text that describes a place.

With respect to existing Named Entity Recognition (NER) methodologies, in order to avoid restricting us to cases where entities’ names appear exactly as in the databases or gazetteers, we will leverage pre-trained Large Language Models (LLM) to resolve ambiguities and gather evidence towards the most likely entities that are being described in the text. Such an approach will be trained and validated by using the cases that do match the names in the gazetteer.

We will then move on, in collaboration with the rest of the GEO-ReSeT consortium, to train a multi-modal large language model (MMLLM) that will serve as a foundation model for Earth observation tasks.

This model will finally be evaluated on several agro-environmental tasks.

Application must be sent through the following link : https://recrutement.inria.fr/public/classic/en/offres/2024-07756

Profil du candidat :
Main activities
Description of the state-of-the-art in unstructured text geoparsing, with a focus on approaches leveraging LLMs.
Collection of a database of geographic named entities linked to their geographic footprint (e.g. point or polygon). Collection of a database of unstructured online text that is likely to contain a reference to a geographic location.
Development of an NLP pipeline to link each piece of geographic text to its likely geographic footprint.
Participate in the design and training of a multi-modal large language model (MMLLM) using remote sensing and geoparsed text.
Evaluation of the final model on two of the following case studies at a national or continental scale: ecosystem type mapping, crop type mapping or land-use mapping.

Formation et compétences requises :
Skills
Python programming.
Deep Learning with Python (preferably with Pytorch).
Experience with NLP.
Experience with GIS would be a plus.

Adresse d’emploi :
Montpellier, France

2024 Summer School on AI Technologies for Trust, Interoperability, Autonomy and Resilience in Industry 4.0

Date : 2024-07-22 => 2024-07-26
Lieu : Saint-Étienne

*Call for Participation 2024 Summer School on AI Technologies for Trust, Interoperability, Autonomy and Resilience in Industry 4.0*

We invite PhD students, academics, and researcher staff from academia and industry to apply to the Summer School on AI Technologies for Trust, Interoperability, Autonomy and Resilience in Industry 4.0 (henceforth AI4Industry).

The summer school aims to teach the state of the art of the use of AI technologies and models to tackle the challenges of data revolution and to increase automation of cognitive tasks to develop a trustful and resilient Industry 4.0 (or Industry of the Future). The summer school is organized around concrete industry problems structured as use cases. These use cases aim to stimulate the discussion at the academic institutions toward addressing real-world problems and to showcase innovative solutions to industrial partners.

During the summer school, participants attend lectures of AI technologies and carry out practical exercises dedicated to applying these technologies to solve concrete industry problems. The summer school addresses various aspects of AI in Industry with a particular emphasis on:

* Web of Things
* Knowledge Graphs
* Multi-Agent Systems
* Trustworthy and Responsible AI

Half of the summer school is dedicated to practical work (through a hackathon) while the other half is meant to introduce the theoretical foundations underlying the hackathon’s framework.

The AI4Industry Summer School series website provides more information about the summer school.

*Important Dates*

* *Application on a rolling basis*: until June 15, 2024
* *Notification*: 2 weeks after application submission
* *Summer School*: July 22-26, 2024

*Contact *

contact-ai4industry@listes.emse.fr

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

Deep Learning with Generative Models for Anomaly Detection

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : LITIS Lab, Rouen
Durée : 3 years
Contact : paul.honeine@univ-rouen.fr
Date limite de publication : 2024-06-30

Contexte :
Keywords: Deep learning, generative (probabilistic) models, diffusion probabilistic models, normalizing flows, anomaly detection, time series

Sujet :
The broad interest in deep neural networks has driven recent advances in anomaly detection, also called out-of-distribution or novelty detection. Deep anomaly detection methods fall within three major categories: Deep one-class, variational autoencoders and generative adversarial networks [1, 2]. While these methods do not allow an exact evaluation of the probability density of new samples, they also suffer from notorious training instability (mode collapse, posterior collapse, vanishing gradients and non-convergence), as corroborated by many research studies [3]. For these reasons, we aim to go beyond the weaknesses of these methods, by investigating novel classes of generative models with deep learning to address anomaly detection.

The goal of this PhD thesis is to explore novel generative models, such as diffusion probabilistic models (DPM) and normalizing flows (NF). These classes consist of models that can generate, through a deep latent space, a probability distribution for a given dataset from which we can then sample. With solid theoretical foundations and often interconnections with Optimal Transport, several variants of generative models have been proposed based on different definitions of their main components, namely the forward and backward processes and the sampling procedure. Of particular interest are NF and DPM. NF are generative models where both sampling and density evaluation are efficient and exact, and where the latent representation is learned through an invertible transformation, thus providing explainable models [4, 5]. DPM rely on diffusion processes, inspired from nonequilibrium thermodynamics, with their flagship being denoising diffusion probabilistic models [6]. Diffusion models have been demonstrating record-breaking performance in many applications in computer vision, mainly for image synthesis [7, 8] and medical imaging [9].

The proposed PhD research program aims to investigate these recent advances in generative models with deep learning for anomaly detection. Recent studies have explored generative probabilistic models for anomaly detection, mainly in images [10-12] with some attempts in signal processing [13, 14], demonstrating preliminary results on their relevance in anomaly detection and bringing out new research questions [15]. The PhD candidate will investigate such generative probabilistic models in a more in-depth research study, in order to take full advantage of their underlying theory. Moreover, the PhD candidate will go beyond image processing, with a focus on anomaly detection in time series, by considering the specificities of time series. The proposed framework and devised methods will be assessed in a variety of scenarios and real-world time series datasets.

Research Environment

The PhD candidate will conduct her/his research within the Machine Learning group in the LITIS Lab, under the supervision of Prof. Paul Honeine, Dr. Fannia Pacheco and Dr. Maxime Berar. This PhD thesis is within a research project gathering 9 permanent researchers of the LITIS Lab and the PhD candidate will also interact with several PhD students and interns also working on deep anomaly detection with a focus on time series.

Application

Applicants are invited to send their CV and grade transcripts by email to:
paul.honeine@univ-rouen.fr, fannia.pacheco@univ-rouen.fr, maxime.berar@univ-rouen.fr.

References

[1] L. Ruff et al., “A unifying review of deep and shallow anomaly detection,” Proceedings of the IEEE, 2021.
[2] G. Pang et al., “Deep learning for anomaly detection: A review,” ACM Computing Surveys, 2021.
[3] D. Saxena and J. Cao, “Generative adversarial networks (GANs) challenges, solutions, and future directions,” ACM Computing Surveys, 2021.
[4] I. Kobyzev et al., “Normalizing flows: An introduction and review of Current Methods,” IEEE T PAMI, 2021.
[5] G. Papamakarios et al., “Normalizing flows for probabilistic modeling and inference,” JMLR, 2021.
[6] J. Ho et al., “Denoising diffusion probabilistic models,” NeurIPS, 2020.
[7] L. Yang et al., “Diffusion models: A comprehensive survey of methods and applications,” ACM Computing Surveys, 2023.
[8] F.-A. Croitoru et al., “Diffusion models in vision: A survey,” IEEE T PAMI, 2023.
[9] A. Kazerouni et al., “Diffusion models in medical imaging: A comprehensive survey,” Medical Image Analysis, 2023.
[10] J. Wolleb et al., “Diffusion models for medical anomaly detection.” MICCAI, 2022.
[11] W.H. Pinaya et al., “Fast unsupervised brain anomaly detection and segmentation with diffusion models,” MICCAI, 2022.
[12] A. Kascenas et al., “The role of noise in denoising models for anomaly detection in medical images.” Medical Image Analysis, 2023.
[13] Y. Chen et al., “ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection,” in Proc. of the VLDB Endowment, 2023.
[14] R. Hu et al., “Unsupervised Anomaly Detection for Multivariate Time Series Using Diffusion Model,” IEEE ICASSP, 2024.
[15] P. Kirichenko et al., “Why normalizing flows fail to detect out-of-distribution data,” NeurIPS, 2020.

Profil du candidat :
– Master or Engineering degree, in data science, AI, applied mathematics, or related fields.
– Strong skills in advanced statistics and Machine Learning, including Deep Learning
– Good programming experience in Python

Formation et compétences requises :

Adresse d’emploi :
LITIS Lab, Rouen

Document attaché : 202405242231_PhD – Deep learning with Generative Models for Anomaly Detection.pdf

DAAfrica – Workshop on Data Science for Agriculture in Africa

Date : 2024-11-23
Lieu : Bejaia, Algeria (hybrid)

==========================================================
Call for short papers
DAAfrica – Workshop on Data Science for Agriculture in Africa

November 23, 2024
Bejaia, Algeria (hybrid)

Event affiliated with CARI’2024

Workshop supported by ASDS, #DigitAg and the MOOD project
==========================================================

=== SCOPE ===

Data science in agriculture has evolved with the accessibility of data by farmers that allow them to analyze and facilitate decision making. Today new technology like Internet of Things (IoT) enables us to collect and store farm and environmental data (e.g. soil data, water data, etc.) in dedicated databases and/or data warehouses. This agricultural data can be combined with other data sources (e.g. remote sensing, weather stations, web and social media, etc.) that need to address new challenges like ingestion of heterogeneous data.
Data science in agriculture aims to explore and mine agricultural data using different techniques like machine learning, deep learning, computer vision, text-mining, large language models (LLM), etc. For instance, data science can predict crop yields and plant and animal diseases with different variables, including rainfall, temperature fluctuations, and soil conditions by using a variety of data (e.g. sensor data, texts, satellite images, plant images, etc.).
So, agriculture professionals and decision-makers can use data science to provide information and knowledge in order to make decisions about agricultural activities in Africa.

=== TOPICS OF INTEREST ===

The topics of the workshop encompass all aspects concerning the intersection of data science and agriculture in Africa with different applications:
– Smart Farming
– Yield and production
– Plant specie identification
– Land cover monitoring
– Crop recommendation
– Crop monitoring & forecasting
– Animal and plant health monitoring
– Water management
– Food safety & security
– Agroecology
– etc.

=== SUBMISSIONS ===

Researchers, academics, and students working on the field of data science with application in agriculture in Africa are invited to submit short papers for oral presentations or posters. Submitted abstracts must be in English and will be reviewed by the workshop committees for suitability and interest to the DAAfrica audience. The authors can submit papers of unpublished work reporting original and early results, introducing new ideas or describing prototypes.
Every accepted submission must have at least one author registered for the workshop. All submitted extended abstracts must follow the LNCS format (https://www.springer.com/gp/computer-science/lncs/conference-proceedings-guideline) with a page limit of up to 6 pages including the title page, figures, references, and an optional appendix. The abstracts should be submitted electronically in PDF format via EasyChair via the following link: https://easychair.org/conferences/?conf=daafrica2024
Accepted extended abstract will be published as CEUR proceedings. Selected contributions will be invited to submit full papers to ARIMA Journal (indexed by DBLP and DOAJ) for a peer-review according to its usual reviewing process.

=== IMPORTANT DATES ===

– Submission deadline: July 31, 2024
– Notification to authors: September 23, 2024
– Workshop date: November 23, 2024

=== PARTICIPATION ===

The workshop will be held in Bejaia, Algeria, as an event affiliated to CARI’2024 (https://www.cari-info.org). This workshop will be a hybrid event that combines a “live” in-person event with a “virtual” online component.
Further information related to registration is available on the ASDS website: https://asds.africa/daafrica2024

=== WORKSHOP CHAIRS ===

– Paulin Melatagia, University of Yaoundé I, Cameroun
– Mathieu Roche, CIRAD, UMR TETIS, France

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

Post-doc / IGR – Brest, Reims – Analyse de données hétérogènes pour l’estimation de trajectoires neurodéveloppementales chez le nouveau-né

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : LaTIM (Brest) ou CReSTIC (Reims)
Durée : 2 ans
Contact : nicolas.passat@univ-reims.fr
Date limite de publication : 2024-12-31

Contexte :
L’imagerie médicale en néonatologie est un domaine complexe et crucial pour le diagnostic et le suivi des nouveau-nés prématurés. Cependant, l’exploitation des données hétérogènes issues de multiples sources (IRM, échographie, EEG) est un défi majeur

Ce projet financé par l’ANR implique les laboratoires du CReSTIC (Reims), LaTIM (Brest) et GRAMFC (Amiens) ainsi que l’entreprise Kitware SAS (Lyon). Les postes à pouvoir seront localisés à Reims (université de Reims Champagne-Ardenne) et Brest (IMT Atlantique).

Sujet :
Le projet vise à développer des méthodes numériques et des outils logiciels pour:
• Comprendre les trajectoires de neurodéveloppement.
• Surmonter les défis liés à l’hétérogénéité des données.
• Mettre en correspondance les données multiphysiques et multidimensionnelles.
• Aider à l’analyse d’information en signal et imagerie par des approches d’IA.
• Développer des outils de visualisation multiphysique.

Plus spécifiquement, il s’agira de s’intéresser aux objectifs suivant :
1. Calcul de biomarqueurs du neurodéveloppement anatomique. Nous considérerons les informations complémentaires fournies par l’échographie et par l’IRM afin de déterminer la géométrie (ex. plissement cortical, épaisseur) et le volume des structures d’intérêt
(cervelet, cortex, corps calleux, thalami) et leur évolution.
2. Définition des trajectoires neurodéveloppementales. En agrégeant les biomarqueurs structurels et fonctionnels préalablement affinés, l’objectif sera de développer des modèles et des outil s dédiés à la comparai son entre t rajectoires neurodéveloppementales « de référence » et trajectoires « pathologiques ». Une telle modélisation de la maturation cérébrale permettrait la reconnaissance précoce des
troubles du développement neurologique chez les prématurés et l’évaluation d’interventions et de neuroprotection adaptées.

Les méthodes développées seront utilisées pour analyser des cohortes de données néonatologiques et contribuer à l’amélioration des soins aux nouveau-nés prématurés.

Profil du candidat :
Les compétences requises pour mener à bien ce travail concernent l’apprentissage machine, le traitement d’images, et les mathématiques appliquées. Des connaissances en informatique et en programmation (Python) seront également requises afin de développer les algorithmes associés.

Formation et compétences requises :

Adresse d’emploi :
Brest – LaTIM
Reims CReSTIC

Nicolas Passat – email : nicolas.passat@univ-reims.fr
François Rousseau – email : francois.rousseau@imt-atlantique.fr

Document attaché : 202405220755_2024-HINT.pdf

Modèles hiérarchiques pour l’analyse multi-échelle de données de très haute résolution en imagerie synchrotron

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : CReSTIC & MEDyC
Durée : 36 mois
Contact : nicolas.passat@univ-reims.fr
Date limite de publication : 2024-12-31

Contexte :
Projet ANR MODELAGE
Cette thèse est proposée dans le cadre du projet ANR MODELAGE (Modélisation de l’évolution des microstructures
vasculaires par imagerie synchrotron à très haute résolution – Prédiction du vieillissement
normal vs accéléré), mené en partenariat entre l’Université de Reims Champagne-Ardenne, l’Université de
Technologie de Troyes et le Synchrotron SOLEIL (Saclay).

Contexte
Le vieillissement vasculaire se caractérise par des altérations lentes et asymptomatiques des microstructures
vasculaires. Parmi celles-ci, les lamelles élastiques de la paroi vasculaire sont les premières concernées.
Néanmoins, les événements précoces prévoyant ces altérations restent pour la plupart non documentés.
En effet, les méthodes d’exploration actuelles n’atteignent pas une résolution suffisante. L’exploration des
caractéristiques vasculaires à l’aide de la microtomographie à rayons X synchrotron haute résolution (μCT) a
révélé l’existence d’un réseau en forme de treillis construit à l’intérieur des lamelles élastiques chez la souris.
Les images μCT acquises sur synchrotron peuvent ainsi fournir de nouveaux indices pour comprendre le
processus de vieillissement vasculaire [1]. En effet, leur résolution, leur contraste et leur champ de vision sont
si élevés qu’ils révèlent de nouveaux détails structurels fins dans la paroi aortique. Cependant,
la recherche, l’extraction et l’analyse de ces données massives et riches en informations constituent un
véritable défi.

Sujet :
Objectifs
Les images μCT sont des données 3D de très haute résolution (voxels de 0.65 μm de côté) de très grande
taille (4000 x 4000 x 2000 voxels) pouvant de plus être empilées jusqu’à former des structure de l’ordre du
tera-octet. Il est, en l’état, impossible de naviguer dans ces données et de les analyser dans leur globalité.
Les solutions actuellement développées les manipulent par coupes 2D et/ou par tranches 3D épaisses [2].
Le premier but de cette thèse est de développer de nouvelles structures de données hiérarchiques (arbres)
qui permettent de modéliser les images à différents niveaux d’échelle en adaptant le niveau d’échelle local
au niveau de détail dans les images. Une telle politique repose de manière conjointe sur deux paradigmes :
les espaces d’échelles [3] et les modèles de décomposition de type quadtree/octree [5]. Contrairement aux
stratégies usuellement considérées pour les espaces d’échelles (approximation gaussienne) et pour les octrees
(subdivisions régulières), l’idée est ici de tirer parti de la connaissance a priori sur le contenu des images pour
développer un modèle hiérarchique morphologique [4] qui puisse représenter les images avec un minimum
de perte d’information, tout en maximisant la compacité des structures, afin de permettre leur gestion en
mémoire et une navigation complète sans recours à des architectures matérielles lourdes.

Les objectifs de cette thèse seront ainsi d’explorer des stratégies pour :
• définir de tels modèles hiérarchiques ;
• les construire de manière efficace ;
• développer de nouveaux descripteurs d’images dédiés aux images μCT ;
• développer des politiques de calcul efficaces de ces descripteurs sur les modèles hiérarchiques développés.

Ces travaux viendront s’interfacer avec des méthodes et outils récemment développés pour l’analyse des
images synchrotron, dans le cadre du projet ANR MODELAGE. Le(la) candidat(e) aura aussi l’opportunité
de participer aux campagnes d’acquisition des images lors des expériences synchrotron.

Profil du candidat :
Compétences requises
Le(la) candidat(e) sera titulaire d’un diplôme de Master 2 et/ou d’un diplôme d’ingénieur. Il(elle) aura
des compétences solides en informatique, mathématiques, et une capacité à travailler dans un contexte
collaboratif et pluridisciplinaire.
Compétences impératives :
• Programmation C++ et Python
• Traitement et analyse d’images
Compétences souhaitées mais non-indispensables :
• Imagerie

Formation et compétences requises :
Compétences requises
Le(la) candidat(e) sera titulaire d’un diplôme de Master 2 et/ou d’un diplôme d’ingénieur. Il(elle) aura
des compétences solides en informatique, mathématiques, et une capacité à travailler dans un contexte
collaboratif et pluridisciplinaire.
Compétences impératives :
• Programmation C++ et Python
• Traitement et analyse d’images
Compétences souhaitées mais non-indispensables :
• Imagerie

Adresse d’emploi :
Lieu d’exercice
Université de Reims Champagne-Ardenne, Campus Moulin de la Housse
Laboratoires CReSTIC et MEDyC

Document attaché : 202405211157_MODELAGE_PhD.pdf

Paris Generative AI Autumn School (21 – 25 October 2024)- Call for Participation

Date : 2024-10-21 => 2024-10-25
Lieu : Paris-Saclay University, France

The GenAI-School aims to provide an overview of the latest advances in generative AI, covering theoretical foundations, methodologies, and practical industrial applications beyond NLP. The program will introduce fundamental methodological tools and presentations from experts across diverse scientific fields.

The basic methodological tools will be presented, and then speakers from different scientific fields will present the most recent applications and developments. The aim is to bring together various speakers to cover a broad range of fields of study and offer participants an overview of generative AI and its applications.

Topics will include :
Foundations of generative models: VAE, GAN, and diffusion models,
Large Language Models and advanced techniques (e.g., RAG),
Multimodal generation,
Frugal models,
Ethics of generative AI,
Applications of generative AI to :
Audio synthesis,
Climate change,
Image and video generation,
Medical data,
Meteorology,
Telecommunications and networking,
Robotics
…

During poster sessions and flash talk sessions, you will have the opportunity to share recent research results and open problems.

Registration is mandatory before 16 June 2024 for the first round.

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

Séminaire thématique LASTIG 2024

Date : 2024-05-28
Lieu : Visio-conférence par Teams

Bonjour à tou.te.s,

Nous vous annonçons le lancement de séminaires thématiques organisés par le Laboratoire en sciences et techniques de l’information géographique (LASTIG, co-tutelles IGN, Université Gustave Eiffel, École des Ingénieurs de la Ville de Paris).

Nous prévoyons 2 à 3 séminaires par an, sur maximum une demi-journée, pour vous présenter nos travaux récents, ceux qui se terminent comme ceux qui débutent, afin que vous puissiez avoir un aperçu au fil de l’eau de nos avancées. Ces séminaires sont ouverts à tou.te.s.

Le premier séminaire est dédié aux NeRFs (Neural Radiance Fields). Les NeRFs désignent une technique d’intelligence artificielle pour créer rapidement une scène 3D à partir d’images 2D. Parmi les applications envisagées figurent l’analyse par des robots et/ou des voitures autonomes de leur environnement, mais également la création rapide d’avatars ou de selfies 3D (appelés aussi “nerfies”) à partir de photos. Au LASTIG, ces techniques sont expérimentées sur des données géographiques comme des images satellites.

Si vous êtes disponibles et curieux, le séminaire a lieu le mardi 28 mai de 10h30 à 12h30 via Teams (cf. ci-dessous).
Les présentations seront mises à disposition par la suite.

Venez nombreux & bonne journée !

Arnaud Le Guilcher, Clément Mallet & Bruno Vallet

——————————————————————–
Programme détaillé

10h30 – 11h : Arnaud Le Guilcher & Bruno Vallet: “Présentation du LASTIG et des séminaires thématiques”

11h-11h30 : Camille Billouard: “NeRF et SDF (Signed Distance Field): introduction et applications aux images satellites multi-dates pour la reconstruction 3D”

11h30-12h: Karim Kassab: “Exploring 3D-aware Latent Spaces for Efficiently Learning Numerous Scenes”

12h-12h30 : Lulin Zhang: ” Physics-based NeRF for Optical Satellite Images”

——————————————————————–
Informations de connexion (Teams)

Rejoindre la réunion maintenant

ID de réunion : 373 467 695 512

Code secret : 7knPaY

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

Exploration de la dynamique des nanoparticules dans les environnements terrestres et extraterrestres à l’aide de la spectrométrie spICP-ToF-MS : avancées et défis méthodologiques

Offre en lien avec l’Action/le Réseau : DSChem/Doctorants

Laboratoire/Entreprise : Institut de physique du globe de Paris
Durée : 3 ans
Contact : chaussidon@ipgp.fr
Date limite de publication : 2024-07-01

Contexte :
Les nanoparticules (NPs) jouent un rôle crucial dans les systèmes naturels, que ce soit la lithosphère, l’hydrosphère, l’atmosphère et les environnements extraterrestres. La compréhension de leur comportement, de leur composition et de leur distribution est essentielle pour divers domaines, notamment la nano géochimie et la cosmochimie. Cependant, les techniques analytiques existantes ont du mal à caractériser avec précision les NPs, ce qui empêche d’avoir une vision globale de leur rôle dans les processus naturels terrestres et extraterrestres.

L’émergence de la spectrométrie de masse à plasma inductif et temps de vol de particules élémentaires (spICP-ToF-MS) a révolutionné la caractérisation des NPs en permettant des investigations inégalées. Cependant, des défis analytiques et méthodologiques persistent, nécessitant des approches avancées axées sur les données afin de maximiser les informations fournies par les analyses.

Environnement de travail

Le projet proposé sera réalisé à l’Institut de physique du globe de Paris en collaboration étroite avec le Prof Leonard Seydoux (équipe de sismologie). L’IPGP héberge une plateforme de géochimie de premier plan (PARI), équipée d’un instrument spICP-ToF-MS opérationnel. Nous disposons d’un vaste ensemble de données de mesures spICP-ToF-MS pour des NPs artificielles et naturelles, ainsi que pour des NPs de verres silicatés standards. Cet ensemble de données servira de base au développement et au test d’algorithmes. De plus, grâce à diverses collaborations en France (LiPADE, Université Gustave Eiffel) et à l’étranger (Université de Graz, Colorado School of Mines…), nous pouvons simuler des séries temporelles de spICP-ToF-MS pour l’entraînement et la validation de la méthodologie développée.

Sujet :
Défis du sujet

1. Amélioration de la sensibilité : améliorer la sensibilité de la méthode spICP-ToF-MS pour la détection précise des nanoparticules.
2. Réduction du bruit : s’attaquer au bruit de fond pour révéler des signaux clairs de nanoparticules.
3. Recherche des sources des NPs : développer des méthodologies polyvalentes pour caractériser les nanoparticules provenant de différentes sources et identifier leur composition minéralogique.
4. Exploitation des données : Explorer plus profondément le comportement des nanoparticules à l’aide de techniques d’analyse de données avancées.

Objectifs de recherche proposée

Les progrès des techniques d’analyse sont essentiels pour élucider les processus complexes dans les systèmes extraterrestres et terrestres. À l’heure actuelle, l’une des principales difficultés est de pouvoir décrire la chimie et la minéralogie (et dans certains cas la composition isotopique) de milliers de nanoparticules provenant d’un échantillon donné.

La détection et la caractérisation précises des nanoparticules dans le bruit instrumental constituent un défi majeur pour la caractérisation des nanoparticules par spICP-ToF-MS. En procédant à une analyse complète et à l’optimisation des paramètres d’acquisition des données spICP-ToF-MS et des réglages de l’instrument, nous proposons d’améliorer la précision et la sensibilité de la détection et de la caractérisation des nanoparticules. Grâce à un étalonnage et à un réglage méticuleux, nous minimiserons le bruit de fond et maximiserons la netteté du signal, améliorant ainsi notre capacité à reconstruire les distributions en composition et taille des nanoparticules dans un échantillon donné.

Les méthodologies ainsi développées seront utilisées pour analyser les nanoparticules extraites de divers échantillons. Quelques études de cas seront sélectionnées pour (i) comprendre la dynamique des nanoparticules dans les systèmes environnementaux et fournir des informations sur des processus tels que le transport des éléments, l’altération et la dispersion des polluants, contribuant ainsi aux efforts de surveillance et d’assainissement de l’environnement, et (ii) obtenir une caractérisation statistiquement significative des phases sub-micrométriques dans la matrice des météorites chondritiques primitives afin de contraindre l’origine et la dynamique de la poussière dans le disque d’accrétion au cours de l’évolution précoce du système solaire.

Méthodologie et contributions attendues

1. Recherche en chimie analytique : analyse complète et optimisation des paramètres d’acquisition des données spICP-ToF-MS et des réglages de l’instrument afin d’améliorer la précision et la sensibilité de la détection et de la caractérisation des NPs.
2. Approches avancées basées sur les données : développer de nouvelles méthodologies intégrant l’inférence bayésienne, les réseaux neuronaux et les algorithmes de regroupement pour une modélisation robuste du bruit, la détection du signal et la caractérisation des NPs dans les données spICP-ToF-MS.
3. Caractérisation complète des NP : appliquer les méthodologies développées pour analyser les NPs extraites de diverses sources, en fournissant des informations sur leur composition, leur distribution de taille et leur distribution spatiale.
4. Cadre analytique amélioré : faire progresser les cadres statistiques et d’apprentissage automatique conçus pour faciliter une meilleure compréhension du comportement des NPs et de leurs interactions dans les différents systèmes.
5. Perspectives géochimiques et cosmochimiques : explorer la composition, la distribution des tailles et la distribution spatiale des NPs dans les matériaux terrestres et météoritiques, contribuant ainsi à notre compréhension de la formation et de l’évolution du système solaire et de la Terre.

Profil du candidat :
Pour réussir, le candidat doit avoir une formation de chimie analytique avec des connaissances en science des données et en géochimie. Il/elle doit démontrer sa capacité à travailler en équipe et avoir de bonnes compétences de communication en anglais (écrites et orales).

Formation et compétences requises :

Adresse d’emploi :
Institut de physique du globe de Paris
1, rue Jussieu, 75005 Paris.

Document attaché : 202405161414_ED_STEP_UP_2024_PropositionTE_F_Chaussidon.pdf

ML4Oceans school 2024

Date : 2024-10-07 => 2024-10-11
Lieu : SCAI/ISCD, Sorbonne Université, 4 place Jussieu, 75005 Paris, France

ML4Oceans 2024 – 2nd edition!

The use of artificial intelligence is now crucial for environmental and climate issues, including oceanography. In this context, SCAI, ISCD and the Institut de l’Océan of Sorbonne Université joined their forces, together with the support of EUR-IPSL, to offer a transdisciplinary school for PhD candidates and post-doctoral researchers. 20 candidates will be selected according to their academic background, their research subject & their motivation to participate in this school.

The objective: to acquire knowledge and skills in AI applied to oceanography (marine biology – omics, biogeochemistry, acoustics, imagery).

Practical information

The school will take place in the seminar room of SCAI, the Sorbonne Center for Artificial Intelligence. It is located on the Pierre and Marie Curie Campus of Sorbonne University, 4 place Jussieu (métro Jussieu), 75005 Paris, France. Lectures will be given in English.

Programme

The school will be from 9h to 17h or 18h, from Monday to Friday. Each day, you will have two classes of 1h30 each in the morning, by experts in their field: one on an oceanography topic and one on a machine learning topic. In the afternoon, you will have a tutored practical lab, in a dedicated computing environment, to apply a machine learning algorithm on real oceanic data. This direct application of the morning class will encompass coding, training, and applying a model.

Tentative programme:

Monday 7th October: Introductions to the school, the programming language (Python and PyTorch) and the coding environment.
Tuesday 8th October: Oceano: Marine omics. ML: Tree based models. Lab: Predicting carbon concentration genomic potential using multivariate gradient boosted trees.
Wednesday 9th October: Oceano: Biogeochemical Argo floats. ML: Multilayer Perceptron. Lab: Deriving scarcely observed biogeochemical variables from profiles of common Argo ones using an MLP.
Thursday 10th October: Oceano: Passive acoustics. ML: Convolutional neural networks. Lab: Classifying marine mammals sounds using CNNs.
Friday 11th October: Oceano: Plankton imaging. ML: CNNs for object detection and segmentation. Lab: Instance segmentation of planktonic organisms from images using CNNs.

Application procedure

Please apply online before the 27th of June 2024:

LINK TO THE REGISTRATION FORM : https://framaforms.org/ml4oceans-school-2024-1713121567

Candidates will be selected using the following criteria: adequation between the school’s thematics and the candidate’s research project, minimum skill requirement, motivation for the ML4Oceans school. Candidates from different backgrounds are encouraged to apply.

The outcome of the selection procedure will be communicated to the candidates mid-July.

Speakers

Olivier Adam, IJLRDA, SU
Lucie Bittner, ISYEB, SU
Dorian Cazau, LabSTICC, ENSTA Bretagne
Anastase Charantonis, LaMME/LOCEAN, ENSIIE/INRIA
Hervé Claustre, LOV, SU/CNRS
Jean-Olivier Irisson, LOV, SU
Jérémy Fix, CentraleSupélec, LORIA
Raphaëlle Sauzède, LOV-IMEV, SU

Organising committee

Sakina-Dorothée Ayata, LOCEAN, SU
Lucie Bittner, ISYEB, SU
Anastase Charantonis, LaMME/LOCEAN, ENSIIE/INRIA
Jean-Olivier Irisson, LOV, SU

Please contact sakina-dorothee.ayata@locean.ipsl.fr for any questions.

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

MaDICS

Masses de Données, Informations et Connaissances en Sciences

Big Data - Data Science

Archives

PhD Position – A multi-modal language model for Earth observation [INRIA – Team EVERGREEN, Montpellier, France]

2024 Summer School on AI Technologies for Trust, Interoperability, Autonomy and Resilience in Industry 4.0

Deep Learning with Generative Models for Anomaly Detection

DAAfrica – Workshop on Data Science for Agriculture in Africa

Post-doc / IGR – Brest, Reims – Analyse de données hétérogènes pour l’estimation de trajectoires neurodéveloppementales chez le nouveau-né

Modèles hiérarchiques pour l’analyse multi-échelle de données de très haute résolution en imagerie synchrotron

Paris Generative AI Autumn School (21 – 25 October 2024)- Call for Participation

Séminaire thématique LASTIG 2024

Exploration de la dynamique des nanoparticules dans les environnements terrestres et extraterrestres à l’aide de la spectrométrie spICP-ToF-MS : avancées et défis méthodologiques

ML4Oceans school 2024