Call for Participation – XAIDATA: Spring School on Explainable AI for Data-Intensive Systems, 28-29 of May 2026, ETIS lab, Cergy

Date : 2026-05-28 => 2026-05-29
Lieu : ETIS lab, Cergy, France

Chers collègues,

Je vous partage, l’appel à participation pour notre école thématique XAI Spring Sc hool on Explainable Artificial Intelligence, les 28/29 Mai 2026.

Si vous êtes en thèse : cette école est pour vous ! Inscrivez vous dès maintenant !

Si vous êtes encadrant.e. : invitez vos étudiant.e.s à y s’inscrire !

La participation sera limitée à 25 personnes.

L’inscription est gratuit (repas et pauses cafes compris) mais obligatoire. Lien inscription :

https://docs.google.com/forms/d/e/1FAIpQLSdJkblxmDX-vaFRNPQ_bhQzKicOP7OxbCvehOVCIUaTp8FsK g/viewform

———————-
Dear colleagues,

We are pleased to invite you to participate – and/or invite your graduate students to participate – in the XAI Spring School on Explainable Artificial Intelligence, hosted by the ETIS laboratory, at Cergy Paris University (Cergy, France).

This spring school brings together researchers, students, and practitioners interested in the foundations and applications of explainable AI (XAI), with a focus on systems with different data modalities.

Topics include (but are not limited to):
Explainable and interpretable machine learning
Trustworthy and responsible AI
Data-centric AI and evaluation
Applications of XAI in real-world and inconsistent systems
Explainability with RAGs and LLMs

Target audience:
Graduate students (Master/PhD), researchers, and industry practitioners working in AI, data science, or related fields.

Location: CYU Maisons SHS, Cergy Paris Université
Dates: 28-29 May 2026
Website: https://xaietis.github.io/

Participation is free of charge, including coffee breaks and lunches.
⚠️ Registration is mandatory (max 25 participants).

The program will include keynote talks, tutorials, and interactive sessions led by leading researchers in the field.

We kindly encourage you to share this announcement with colleagues and students who may be interested.

Best regards,
The organising committee,

Vassilis Christophides (ETIS, CNRS, ENSEA, CYU, France, IPAL Singapour)
Evi Pitoura (University of Ioannina, Greece)
Dimitris Kotzinos (ETIS, CNRS, ENSEA, CYU, France)
Katerina Tzompanaki (ETIS, CNRS, ENSEA, CYU, France)

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

AI FOR THE PROCESS INDUSTRIES

Date : 2026-06-11 => 2026-06-12
Lieu : Webinar

Paris Dauphine – PSL and SECF join their expertise to contribute to increase efficiency and competitiveness

Europe’s process industries sits at the center of the energy transition, circular economy, and strategic autonomy. Between now and 2050, competitiveness will depend on turning sustainability requirements into engineering advantage: designing circular products, deploying low carbon manufacturing at scale, and operating plants as data driven, resilient systems. Using AI tools becomes imperative.

Submission deadline – 15 April 2026

Acceptance notification – 30 April 2026

Early bird registration – 15 May 2026

Conference: 11 & 12 June 2026

We welcome submissions on the following topics and related topics as one page abstracts:

1. AI for Process Optimization & Control
• Reinforcement learning for advanced process control (APC)
• AI-enhanced Model Predictive Control (MPC)
• Real-time optimization (RTO) using machine learning
• Hybrid first-principles + ML models
• Nonlinear and multivariable process modeling
• Soft sensors and virtual analyzers
• AI for batch vs. continuous process optimization
2. Digital Twins & Simulation
• AI-enabled digital twins for plants and assets
• Physics-informed neural networks (PINNs)
• Surrogate modeling for computational fluid dynamics (CFD)
• Hybrid simulation environments
• Closed-loop digital twins for operations
• Digital twins for operator training
3. Predictive Maintenance & Asset Reliability
• AI for predictive maintenance in rotating equipment
• Anomaly detection in process data
• Remaining Useful Life (RUL) estimation
• Root cause analysis using ML
• Reliability-centered maintenance with AI
• Edge AI for condition monitoring
4. Monte Carlo Tree Search
• Retrosynthesis
• Parameters optimization
• Molecule design
• Modeling complex systems
• Formula optimization
• Material Design
5. Energy Efficiency & Sustainability
• AI for energy optimization in refineries and plants
• Carbon footprint monitoring and reduction
• AI for electrification and decarbonization strategies
• Process integration and heat recovery optimization
• AI for hydrogen production and storage
• Waste minimization and circular economy optimization
6. Safety, Risk & Compliance
• AI for process safety monitoring
• Early warning systems for abnormal situations
• Hazard identification using NLP
• Risk modeling and probabilistic AI
• AI for regulatory compliance and reporting
• Cyber-physical risk detection
7. AI in Chemical & Materials Development
• AI for catalyst discovery
• Machine learning in reaction optimization
• Autonomous laboratories (self-driving labs)
• AI-guided formulation development
• Materials informatics
• Generative models for molecular design
8. AI & Industrial Data Infrastructure
• Data governance in process industries
• Time-series modeling for industrial data
• Data quality and sensor fusion
• Edge vs. cloud AI architectures
• Industrial IoT and AI integration
• MLOps for manufacturing environments
9. Generative AI & Knowledge Systems
• Large Language Models (LLMs) for plant documentation
• AI copilots for operators and engineers
• Automated report generation
• AI for troubleshooting and knowledge retrieval
• Conversational interfaces for control rooms
10. Supply Chain & Planning
• AI for production scheduling
• Demand forecasting in process industries
• Inventory optimization
• Supply chain resilience modeling
• AI-driven logistics optimization
11. Human–AI Collaboration
• AI adoption strategies in industrial environments
• Change management for AI transformation
• Trust and explainability in industrial AI
• Operator-in-the-loop AI systems
• Workforce upskilling for AI-enabled plants
12. Advanced Methods & Research Frontiers
• Causal AI for industrial systems
• Uncertainty quantification in ML models
• Transfer learning across plants
• Federated learning for industrial sites
• Multi-agent systems for plant-wide optimization
• Graph neural networks for process networks
13. Industry Case Studies
• AI deployment in refineries
• AI in specialty chemicals production
• AI in food and beverage manufacturing
• AI in pharmaceutical manufacturing
• AI in pulp & paper operations

Kind regards,

Tristan Cazenave

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

GreenFieldData EU Project – PhD positions – women’s applications welcome

Offre en lien avec l’Action/le Réseau : – — –/Innovation

Laboratoire/Entreprise : LIRIS & University of Milano
Durée : 3 ans
Contact : genoveva.vargas@gmail.com
Date limite de publication : 2026-04-15

Contexte :
**********************************************************************************
GreenFieldData: agricultural practices of the future
14 EU-funded Double Diploma PhDs
https://www.eu4greenfielddata.eu/phd-positions-application/list-of-phds
**********************************************************************************

Sujet :
GreenFieldData Marie Skłodowska-Curie Project is offering 14 EU-funded double-diploma PhD positions in digital agriculture at the intersection of IoT and robotics, data engineering, data management, and data analysis. The network brings together academic and non-academic partners across several countries and offers an interdisciplinary doctoral environment with joint supervision, international collaboration, and research connected to important challenges such as climate change and low-input agricultural systems.

We would particularly appreciate your support in encouraging women to apply. We are looking for scientists who can widen how agriculture is imagined, studied, and innovated. Different bold perspectives and skills matter, and we want to help build a research community that is both vibrant and inclusive.

The positions are also framed by supportive working conditions, attention to work–life balance, and room for different personal situations, which we believe are essential for meaningful and sustainable research careers.

The application deadline is 15 April 2026 (https://www.eu4greenfielddata.eu/phd-positions-application/how-to-apply)

Please feel free to circulate the poster and the call within your networks, mailing lists, associations, and communities. Your support would be extremely valuable in helping us reach more potential candidates.

With many thanks in advance for your help and solidarity,

GreenFieldData Project Publicity Board

Profil du candidat :
We are looking for scientists who can widen how agriculture is imagined, studied, and innovated. Different bold perspectives and skills matter, and we want to help build a research community that is both vibrant and inclusive.

Formation et compétences requises :

Adresse d’emploi :
University Claude Bernard Lyon 1, Lyon, France
University of Milano, Milano, Italy

Document attaché : 202603211717_w-greenfielddata.pdf

POSTE MCF 27 – Université d’Orléans – Profil Gestion de Données

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : LIFO -Université d’Orléans
Durée : permanent
Contact : mirian@univ-orleans.fr
Date limite de publication : 2026-04-04

Contexte :
Dans le cadre de la campagne synchronisée de recrutement des enseignants-chercheurs 2026, l’université d’Orléans ouvre un poste de MCF avec un rattachement recherche au laboratoire LIFO

Sujet :
La personne recrutée devra renforcer en priorité l’axe Bases de données et s’intégrer au projet commun de l’équipe PAMDA (Parallélisme et Bases de Données)

Profil du candidat :

Formation et compétences requises :
Contacts recherche :

Sophie Robert (responsable de l’équipe PAMDA) : sophie.robert@univ-orleans.fr

Mirian Halfeld Ferrari (directrice du LIFO) : direction.lifo@listes.univ-orleans.fr

Contacts enseignement : Laure Kahlem (directrice du département informatique de l’UFR ST) : lkahlem@univ-orleans.fr

Adresse d’emploi :
Fiche du poste: https://www.univ-orleans.fr/upload/public/2026-03/260886_Poste_MCF_2026%2027%20UFR%20ST%20LIFO.pdf

Robust-to-noise information extraction, unifying challenges of optical character recognition (OCR) and automatic speech recognition (ASR)

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : La Rochelle Université – Laboratoire l3i
Durée : 36
Contact : mickael.coustaty@univ-lr.fr
Date limite de publication : 2026-04-04

Contexte :
The growing digitization of written and oral content has made Optical Character Recognition (OCR) and Automatic Speech Recognition (ASR) essential in cultural heritage preservation, media accessibility, legal documentation, knowledge management and information retrieval. However, the outputs generated by these systems are inherently noisy: OCR is affected by document degradation, layout complexity or poor scanning quality, while ASR suffers from background noise, overlapping speech or non-standard oral expressions. Despite significant progress, it remains pervasive, and imperfections directly impact natural language downstream tasks where data quality is a key prerequisite. Although OCR and ASR face many similar error phenomena, their correction has mostly been studied in isolation, resulting in a lack of unified methodologies.

Sujet :
Objectives:
• Compare and analyse existing post-correction methods in OCR and ASR and potential for cross-domain adaptation.
• Develop unified approaches for post-correction that leverage the shared error patterns between OCR and ASR.
• Enable robust information extraction from noisy OCR and ASR outputs by designing strategies that mitigate the propagation of recognition errors into downstream NLP tasks.
Scientific challenges:
• Heterogeneity of noise sources: OCR errors are generated from visual artifacts while ASR errors are acoustic, a unified framework must generalize across modalities.
• Domain adaptation: OCR/ASR models often struggle on domain-specific datasets (e.g., historical texts, administrative documents, technical reports, scientific papers…) requiring correction methods that adapt to varying contexts.
• Complex error structures: beyond character and subword substitution, OCR/ASR introduce higher-level disruptions (mis-segmentation, overlapping text blocs or speech, layout misinterpretation) that complicate correction.
• Evaluation difficulties: classical metrics such as Character Error Rate (CER) or Word Error Rate
(WER) fail to fully capture the impact of errors on downstream information extraction, that
necessitate new evaluation methods.
• Scalability: correction methods must be applicable to large-scale corpora and adaptable to new
data without full retraining.
To tackle these challenges, the thesis will explore a combination of:
• Comparative state-of-the-art analysis: systematic benchmarking of existing OCR and ASR
post-correction methods on heterogeneous corpora.
• Unified modeling approaches: leveraging neural architectures (e.g., sequence-to-sequence
models, transformers, multilingual pre-trained LLMs) that can learn correction patterns across
both modalities.
• Hybrid methods: integrating symbolic rules, edit distance algorithms, and domain-specific
lexicons with machine learning models to improve robustness.
• Error modeling and simulation: designing artificial noise injection techniques to train models on
synthetic but realistic OCR/ASR-like errors, thus improving generalization.
• Evaluation frameworks: extending standard CER/WER with task-oriented metrics reflecting the
quality of downstream information extraction and retrieval.
This thesis helps to overcome the current limitations of automatic correction of texts produced by OCR
and ASR systems by proposing a unified approach, which represents a significant scientific advance. In
fact, in-depth analysis of the similarities and differences between OCR and ASR errors will provide a
better understanding of how these two fields can intersect. This project will enable the development of
more robust methods based on multidisciplinary knowledge from natural language processing, signal
processing, and image processing. The expected results will thus offer new perspectives in the
development and use of multimodal language models, contributing to the evolution of generative AI in
both language processing and signal processing. With the rise of multimodal databases (text, image,
audio, video), this thesis could inspire the creation of tools capable of simultaneously exploiting data
from various sources to extract more relevant information. The thesis is expected to deliver a
contribution to the bridging of OCR and ASR research communities and opening new research avenues
in multimodal NLP.

Profil du candidat :
The highly motivated candidate should hold a master’s degree in computer science or a related field. She/he should have
a strong background in NLP with an interest in text processing and multimodal data (text, speech,
document images). Familiarity with generative AI methods (e.g., large language models, text-to-text
generation, deep learning, fine-tuning strategies) will form a strong asset.

Formation et compétences requises :
Master of science in computer science, ai or applied mathemics, or any equivalent diploma

Adresse d’emploi :
mickael.coustaty@univ-lr.fr

Document attaché : 202603191652_Alloc_Doc_AI_DH_Coustaty_Suire_Public.pdf

Synthèse de la parole multilingue appliquée aux langues régionales

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : Inria
Durée : 24 mois
Contact : emmanuel.vincent@inria.fr
Date limite de publication : 2026-04-04

Contexte :
Ce poste d’ingénieur.e se place dans le cadre du Défi Inria COLaF “Corpus et Outils pour les Langues de France”, dont l’objectif est de créer des corpus, des modèles et des logiciels ouverts et inclusifs pour les langues de France. Cela inclut les langues régionales (alsacien, breton, corse, occitan, picard, etc.) et d’outre-mer (créoles, langues polynésiennes, langues kanakes, mahorais, etc.) et les langues d’immigration non-territoriales (arabe dialectal, arménien occidental, berbère, judéo-espagnol, romani, yiddish).

Le/la ingénieur.e sera co-encadré.e par Vincent Colotte et Emmanuel Vincent et lorsque nécessaire par Pascale Erhart. Il/elle bénéficiera de l’expertise de l’équipe Multispeech en traitement de la parole et celle de LiLPa en dialectologie, en phonétique de corpus et en TAL. Il/elle collaborera avec les ingénieurs chargés de la création et la distribution des corpus et des briques logicielles et avec les autres partenaires du projet.

Sujet :
La synthèse vocale est une technologie-clé pour la valorisation des langues régionales et d’immigration. Ces langues restent cependant largement ignorées des fournisseurs de technologies linguistiques [1], qui entraînent classiquement les systèmes de synthèse vocale sur des jeux de données monolingues de haute qualité enregistrés en studio par un petit nombre d’acteurs professionnels. Cette méthode induit un coût élevé pour chaque langue et limite le nombre de voix et leur expressivité.

L’objectif de ce poste est de concevoir un système de synthèse vocale multilingue et multi-voix applicable aux langues régionales et de le distribuer pour qu’il soit largement utilisé. Parmi les systèmes de synthèse vocale multilingue existants, IMS Toucan [2] est le seul à couvrir plus de 7000 langues. Il s’appuie sur le phonétiseur multilingue transphone [3], l’encodeur articulatoire PanPhon [4], le synthétiseur FastSpeech 2 [5] conditionné sur des plongements de locuteur et de langue et le vocodeur HiFi-GAN [6] entraînés sur un corpus de 18000 heures de parole en 462 langues. Deux défis subsistent: réduire le caractère haché de la voix et les erreurs de phonétisation, tous deux plus prononcés pour les langues peu dotées non vues à l’apprentissage. L’ingénieur collaborera avec une doctorante travaillant sur cette thématique, qui considère aussi CMS [7] et StyleTTS 2 [8].

Pour réduire le caractère haché, la doctorante financée par le projet exploite les enregistrements vocaux disponibles pour les langues régionales et d’immigration visées. Nous exploiterons donc d’autres langues proches sur le plan phonétique et/ou morphologique. Pour améliorer la phonétisation, nous exploiterons aussi les connaissances phonologiques disponibles, avec une attention particulière à la variabilité des prononciations. L’ingénieur.e travaillera sur ces deux sujets et, plus généralement, sur l’évaluation des systèmes de TTS développés par le Défi COLaF sur d’autres langues régionales de France, selon ses compétences et ses souhaits, et sur la mise à disposition concrète et l’utilisabilité de ces logiciels de TTS. L’approche développée sera validée notamment pour l’alsacien, qui est la deuxième langue régionale parlée en France en nombre de locuteurs tout en restant une langue sous-dotée [9]. Le travail de recherche s’appuiera sur les jeux de données collectés et enregistrés par les ingénieurs du Défi COLaF.

[1] DGLFLF, Rapport au Parlement sur la langue française 2023, https://www.culture.gouv.fr/Media/Presse/Rapport-au-Parlement-sur-la-langue-francaise-2023
[2] F. Lux, S. Meyer, L. Behringer, F. Zalkow, P. Do, M. Coler, E.A.P. Habets, N.T. Vu, “Meta learning text-to-speech synthesis in over 7000 languages”, in Interspeech, 2024, pp.4958-4962.
[3] X. Li, F. Metze, D. Mortensen, S. Watanabe, and A. Black, “Zero-shot learning for grapheme to phoneme conversion with language ensemble”, in Findings of ACL, 2022, pp.2106-2115.
[4] D.R. Mortensen, P. Littell, A. Bharadwaj, K. Goyal, C. Dyer, L.S. Levin, “PanPhon: A resource for mapping IPA segments to articulatory feature vectors”, in 26th International Conference on Computational Linguistics (COLING), 2016, pp.3475-3484.
[5] Y. Ren, C. Hu, X. Tan, T. Qin, S. Zhao et al., “FastSpeech 2: Fast and high-quality end-to-end text to speech”, in 9th International Conference on Learning Representations (ICLR), 2021.
[6] J. Kong, J. Kim, J. Bae, “HiFi-GAN: Generative adversarial networks for efficient and high fidelity speech synthesis”, in NeurIPS, 2020, pp.17022-17033.
[7] J. Schalkwyk, A. Kumar, D. Lyth, S. E. Eskimez, Z. Hodari, C. Resnick, R. Sanabria, R. Jiang, and the Sesame team., “CSM: A conversational speech generation model”, https://github.com/SesameAILabs/csm, 2025.
[8] Y. A. Li, C. Han, V. Raghavan, G. Mischler, N. Mesgarani, “StyleTTS 2: Towards human-level text-to-speech through style diffusion and adversarial training with large speech language models”, in NeurIPS, 2023, pp.19594-19621.
[9] D. Bernhard, A.-L. Ligozat, M. Bras, F. Martin, M. Vergez-Couret, P. Erhart, J. Sibille, A. Todirascu, P. Boula de Mareüil, D. Huck, “Collecting and annotating corpora for three under-resourced languages of France: Methodological issues”, Language Documentation & Conservation, 2021, 15, pp.316-357.

Profil du candidat :
Master ou Diplôme d’ingénieur en traitement de la parole, TAL, machine learning, linguistique informatique ou dans un domaine lié.
Solides compétences en programmation Python/Pytorch.
Une expérience préalable en traitement de la parole ou en TAL sera un atout.
La connaissance d’une langue régionale, d’outre-mer ou non-territoriale de France est un plus.

Formation et compétences requises :
Merci de postuler en ligne à l’adresse: https://jobs.inria.fr/public/classic/fr/offres/2026-09871

Adresse d’emploi :
615, rue du Jardin Botanique
54600 Villers-lès-Nancy

AI4EO spring school 2026

Date : 2026-04-08 => 2026-04-10
Lieu : Vannes

Following the successful AI4EO 2025 symposium in Rennes last Fall, we are organizing a Spring School on AI4EO in Vannes from April 8 to 10, 2026 with the support of Erasmus Mundus Joint Master Copernicus Master in Digital Earth, Cluster SequoIA, IRISA – Institut de Recherche en Informatique et Systèmes Aléatoires, University of South Brittany, and European Space Agency – ESA.
The program features lectures and hands-on sessions on foundation models, MLOps, Responsible AI and generative models, as well as a data-driven project and some social events! The list of speakers includes Pedram Ghamisi and Weikang Yu from Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Adam Stewart from Technical University of Munich, Stéphane May from Centre National d’Études Spatiales, Iris de Gélis from Observatoire de Paris | PSL, Nicolas Audebert from IGN – French Mapping Agency (Institut Géographique National), and Pierre Adorni from IRISA – Institut de Recherche en Informatique et Systèmes Aléatoires
If you are interested, register ASAP since the number of seats is limited!
All details here: https://www-obelix.irisa.fr/ai4eo-spring-school-2026/

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

SI-DH @ INFORSID 2026 : Appel à communication

Date : 2026-05-26
Lieu : Toulouse

3ème édition Atelier Systèmes d’Information et Humanités Numériques (SI-HN)

Thématique Spéciale : Le cycle de la temporalité : Extraction, Modélisation et Interrogation

Présentation de l’Atelier
Cet atelier explore la relation bidirectionnelle entre les systèmes d’information (SI) et les humanités numériques (HN). Cette année, nous plaçons au cœur de nos échanges le défi de la temporalité.
En sciences humaines, le temps est une donnée plurielle, incertaine et souvent non linéaire. L’objectif est de réunir des chercheurs pour discuter de la manière dont les SI peuvent capturer, structurer et restituer cette complexité temporelle, du document brut jusqu’à l’interface de recherche.

Axe Thématique : Le défi du temps dans les SI pour les HN
Nous sollicitons des contributions originales portant sur l’ensemble de la chaîne de traitement de la donnée temporelle :
1. Extraction de la temporalité
• Analyse de textes : Reconnaissance automatique d’entités temporelles (NER) dans des corpus historiques.
• Interprétation du flou : Passage d’expressions naturelles (« à l’aube du XVIIe », « sous le règne de… ») à des données exploitables.
• Corrélation de sources : Extraire et réconcilier des dates issues de documents hétérogènes (archives, registres, récits).
2. Modélisation et Représentation
• Temporalités multiples : Coexistence du temps historique (l’événement), narratif (le récit) et documentaire (la production de l’objet).
• Structures complexes : Modélisation temporelle dans les graphes de connaissances et les ontologies.
• Incertitude : Représentation formelle de l’imprécision, des intervalles flous et des événements à durée indéterminée.
• Discordances et Autorité des Sources : les conflits de temporalités avec une gestion des contradictions entre gisements de données hétérogènes (ex: une date estimée dans un registre médiéval devenant une “vérité” factuelle par sédimentation dans les encyclopédies modernes) et l’arbitrage et traçabilité, en abordant le questionnement sur la primacité de la source directe (le document d’époque) versus la source compilée, et modélisation de la preuve pour trancher ou maintenir la divergence.
3. Interrogation et Exploitation
• Langages de requêtes temporels : Comment interroger des relations de type « avant », « pendant » ou « contemporain de » dans des bases de données HN ?
• Visualisation et navigation : Interfaces pour explorer des chronologies non linéaires ou des réseaux spatio-temporels.
• Alignement d’ontologies : Interroger des systèmes ayant des référentiels temporels hétérogènes.

Thématiques Générales sollicitées
Outre le focus sur la temporalité, l’atelier reste ouvert aux travaux portant sur :
• SI et Disciplines : Applications à l’Histoire, l’Art, la Philosophie, le Droit…
• Méthodes SHS et SI : Amélioration des modèles SI par l’apport des sciences humaines.
• Études de cas : Retours d’expérience sur des projets concrets mêlant ingénierie des données et recherche patrimoniale.

Nous sollicitons des travaux originaux présentant des avancées théoriques, des développements méthodologiques ou des retours d’expériences critiques. Les contributions peuvent prendre la forme de communications de recherche (présentant des résultats consolidés), de notes de projets (exposant des travaux en cours ou des protocoles expérimentaux) ou de démonstrations logicielles mettant en avant des outils d’extraction ou de visualisation temporelle. Une attention particulière sera portée aux propositions démontrant une réelle collaboration interdisciplinaire, où les défis posés par les sources humaines ont conduit à une innovation ou une adaptation des modèles de systèmes d’information classiques. Les articles peuvent être rédigés en français ou en anglais et d’une longueur de 8 à 12 pages au format (peu compact !) de la conférence INFORSID (Springer LNCS). Les articles sont à soumettre sur le site suivant :
https://easychair.org/conferences/?conf=sihn2026

Dates importantes :
17/04 à 12h clôture de l’envoi des soumissions
04/05 notification aux auteurs
26/05 atelier SIDH
Pour vos questions ou remarques vous pouvez vous adresser aux organisateurs de l’atelier :
Stéphane Lamassé stephane.lamasse@univ-paris1.fr
Cédric du Mouza dumouza@cnam.fr

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

FUSION-KG: Framework for Unified multimodal Semantic extractION for Knowledge Graphs construction

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : ICube Strasbourg
Durée : 3 ans
Contact : franco.giustozzi@insa-strasbourg.fr
Date limite de publication : 2026-05-11

Contexte :
Environmental restoration projects generate large volumes of heterogeneous documentation, including technical reports, project plans, cartographic materials, engineering drawings, and photographic records. These materials contain valuable but fragmented knowledge describing intervention strategies, environmental contexts, technical constraints, and outcomes.
Within the TETRA project (ANR-22-FAI2-0006), previous research efforts primarily concentrated on text-based knowledge extraction using Large Language Models (LLMs), enabling the structuring of restoration knowledge from technical and narrative reports. While this approach demonstrated the potential of large language models for semantic modeling and ontology enrichment, it remained largely confined to textual sources. However, restoration documentation increasingly includes rich visual materials, such as maps, technical drawings, aerial imagery, and photographic records that contain complementary and sometimes critical information not explicitly described in text. This PhD builds upon the foundations established in TETRA by extending the extraction paradigm toward a unified multimodal framework. The central hypothesis is that integrating textual and visual understanding through advanced Vision-Language Models (VLMs) can substantially improve the completeness, semantic consistency, and interpretability of structured environmental knowledge graphs.

Sujet :
The FUSION-KG PhD aims to design a unified multimodal semantic extraction framework capable of transforming heterogeneous environmental documentation into structured, interpretable, and queryable knowledge graphs. The ambition is not only to extract information from text and images, but to develop a coherent framework in which multimodal understanding and structured external knowledge
jointly contribute to reliable and semantically consistent knowledge graph construction.
The work involves the systematic modeling and characterization of heterogeneous documentary sources, including technical reports, maps, engineering drawings, aerial and satellite imagery, and photographic
records of restoration interventions. These materials provide complementary yet often fragmented accounts of intervention types, spatial configurations, temporal phases, environmental parameters, constraints, and outcomes. A major challenge lies in ensuring that information extracted from visual and textual modalities is semantically aligned and represented within a shared conceptual framework.

Profil du candidat :
The doctoral contract is awarded by the doctoral school’s selection committee through a competitive process in which the candidates’ merit is a key factor

Formation et compétences requises :
Education: Student about to graduate a Master or Engineer (Bac + 5) with a specialization in Computer Science.

Specific knowledge: Knowledge on data science methods, knowledge representation and reasoning, knowledge graphs.
Languages: Python, java, owl/sparql.
Ability to work with experts who are not computer scientists. Interest in the application domain would be appreciated.

Adresse d’emploi :
ICube laboratory (CNRS UMR 7357),
300 boulevard Sebastien Brant
BP 10413
67412 ILLKIRCH cedex

Document attaché : 202603151916_Sujet_These_ED_VLM.pdf

Détection d’Anomalies dans l’Environnement (DAE) : Un Atelier Interdisciplinaire – 2e édition

Annonce en lien avec l’Action/le Réseau : DAE

Thème :

Détection d\’anomalies, IA symbolique, apprentissage automatique

Présentation :

Cet atelier fait suite à une première édition organisée lors du congrès INFORSID 2025 à Pau. Il vise à rassembler chercheurs, industriels et décideurs afin de promouvoir l’intégration de méthodologies d\’IA symbolique et d\’apprentissage automatique afin de relever les défis environnementaux actuels grâce à une meilleure exploitation des masses de données disponibles.

En mettant l\’accent sur la détection et l\’analyse des anomalies environnementales (pollution, catastrophes naturelles et d\’origine anthropique, etc.), l\’atelier cherche à mettre en lumière de nouvelles approches innovantes pour caractériser ces phénomènes atypiques, identifier leurs causes, prédire leur évolution et évaluer leurs impacts sur la santé humaine et les écosystèmes.

L\’atelier DAE se veut avant tout un espace d\’échange. Son objectif principal est de favoriser l\’émergence de collaborations interdisciplinaires visant à proposer des solutions impactantes basées sur l\’exploitation de jeux de données complexes et hétérogènes. En complément des présentations scientifiques, l’atelier encouragera des discussions et des échanges autour de solutions concrètes pour l’analyse des systèmes d’information environnementaux. Des exemples pratiques et des retours d\’expérience seront partagés.

Du : 2026-05-26

Au : 2026-05-26

Lieu : Toulouse

Site Web : https://inforsid2026.sciencesconf.org/resource/page/id/44

MaDICS

Masses de Données, Informations et Connaissances en Sciences

Big Data - Data Science

Archives

Call for Participation – XAIDATA: Spring School on Explainable AI for Data-Intensive Systems, 28-29 of May 2026, ETIS lab, Cergy

AI FOR THE PROCESS INDUSTRIES

GreenFieldData EU Project – PhD positions – women’s applications welcome

POSTE MCF 27 – Université d’Orléans – Profil Gestion de Données

Robust-to-noise information extraction, unifying challenges of optical character recognition (OCR) and automatic speech recognition (ASR)

Synthèse de la parole multilingue appliquée aux langues régionales

AI4EO spring school 2026

SI-DH @ INFORSID 2026 : Appel à communication

FUSION-KG: Framework for Unified multimodal Semantic extractION for Knowledge Graphs construction

Détection d’Anomalies dans l’Environnement (DAE) : Un Atelier Interdisciplinaire – 2e édition