DOING@ADBIS 2022: 3rd workshop on Intelligent Data – from data to knowledge

Annonce en lien avec l’Action/le Réseau : DOING

Thème :

Databases, Natural Language Processing, Artificial Intelligence

Présentation :

The workshop focuses on transforming data into information and then into knowledge. The idea is to gather researchers to discuss two main problems :

++ how to extract information from textual data and represent it in knowledge bases;
++ how to propose intelligent methods for handling and maintaining these databases with new forms of requests, including efficient, flexible, and secure analysis mechanisms, adapted to the user, and with quality and privacy preservation guarantees.

Du : 2022-09-05

Au : 2022-09-05

Lieu : Politenico de Torino

Site Web : https://www.univ-orleans.fr/lifo/evenements/doing/?page_id=820

Poste MCF – Science de données

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : LIS – Aix-Marseille Université
Durée : indéterminée
Contact : nicolas.durand@univ-amu.fr
Date limite de publication : 2022-03-31

Contexte :
Un poste MCF en Informatique (section 27) est ouvert au concours pour la rentrée 2022 à l’Université d’Aix-Marseille.

Enseignement : Polytech Marseille, département Informatique.
Recherche : LIS (UMR 7020), Pôle Science des données.

Mots-clés : science des données, apprentissage statistique, data mining, web sémantique, intelligence artificielle.

Référence GALAXIE : 1071

Sujet :
* Enseignement : affectation au département Informatique de Polytech Marseille, basé à Luminy. La personne recrutée renforcera le projet du département en Science de données en assurant des cours généraux ou plus spécifiques : algorithmique, systèmes de recommandation, BD NoSQL, etc. Les enseignements seront effectués aussi bien en formation initiale qu’en formation par apprentissage.

* Recherche : la recherche s’effectuera au Laboratoire d’Informatique et Systèmes (UMR 7020), au sein de l’une des équipes du pôle Science des Données. Dans le cadre de la mise en avant de la thématique IA-santé par l’université et la création d’un institut dédié, le laboratoire souhaite renforcer son potentiel recherche sur le sujet dans une équipe du pôle SD. Une expérience avérée des relations avec le monde médical sera particulièrement appréciée, ainsi que la volonté de développer des pistes de recherche conjointement avec les partenaires hospitaliers et universitaires de l’institut Laënnec.

* Contacts :
– Enseignement : Stéphane Ayache, directeur du département Informatique de Polytech Marseille : stephane.ayache@univ-amu.fr
– Recherche : Frédéric Béchet, directeur du LIS : frederic.bechet@lis-lab.fr (et les responsables d’équipes) .

Profil du candidat :
Doctorat Informatique ou équivalent.

Formation et compétences requises :
Qualification section 27.

Adresse d’emploi :
Laboratoire LIS – Polytech Marseille.

Poste MCF – Aix-Marseille – Recherche d’information / Fouille de données

Offre en lien avec l’Action/le Réseau : SimpleText/– — –

Laboratoire/Entreprise : LIS – Aix-Marseille Université
Durée : indéterminée
Contact : patrice.bellot@univ-amu.fr
Date limite de publication : 2022-03-31

Contexte :
un poste MCF (ref. Galaxie 1067) est ouvert au concours à l’IUT
d’Aix-Marseille (site Aix-en-Provence) / laboratoire LIS dans le pôle
« Science des données ».

Côté recherche, il concerne plus particulièrement les équipes R2I
(Recherche d’information et Interactions) et DANA (Data miNing at
scAle).

Date limite de soumission : 31 mars à 16:00

Plus d’information ci-dessous et en suivant le lien Galaxie :
https://www.galaxie.enseignementsup-recherche.gouv.fr/ensup/ListesPostesPublies/ANTEE/2022_1/0134009M/FOPC_0134009M_1067.pdf

NB : D’autres postes MCR et PR sont ouverts au LIS, certains
selon des profils proches de celui-ci. Voir :

Offre de poste PR ou MCF

Sujet :
Candidature avant : 31/03/2022

Modalité :
Les candidatures seront à déposer sur Galaxie, l’application sera
ouverte le 24 février 10:00 et la date limite de soumission est le
31 mars à 16:00 (heure de Paris).

Contacts :
Recherche : Frédéric Béchet frederic.bechet@lis-lab.fr
Enseignement : Vincent Risch vincent.risch@univ-amu.fr

Une attention particulière doit être prêtée aux recommandations
générales formulées par la CNU 27 quant à la rédaction du dossier :
https://cnu27.univ-lille.fr/qualification-note.html

Résumé :

Compétences particulières requises :
Une compréhension de l’organisation de la pédagogie que sous-tend
l’Approche Par Compétences telle qu’implémentée dans le nouveau
Bachelor Universitaire de Technologie sera bienvenue.

Enseignement :
Filière de formations concernées : Bachelor Universitaire de
Technologie (BUT) Informatique.
Objectifs pédagogiques : Prise en charge et suivi de cours, TD, TP,
projets, tout au long des trois années du BUT Informatique. Sont
plus particulièrement concernés les enseignements d’Architecture,
Systèmes et Réseaux, Conception et Développement d’Applications, mais
aussi Algorithmique et Programmation.
Les informations relatives à ces enseignements se rattachent au
Programme National du Bachelor Universitaire de Technologie dans la
spécialité Informatique.

Recherche :
La personne recrutée s’intégrera au pôle Science des Données du
Laboratoire Informatique et Systèmes (LIS). Une préférence sera donnée
aux thèmes de recherche des membres du pôle enseignant sur le site
aixois, et plus particulièrement au Département Informatique de l’IUT.
Ces thèmes relèvent essentiellement des équipes DANA (Data miNing at
scAle) et R2I (Recherche d’Information et Interactions) — sans que cela
n’exclue les autres thématiques du pôle.
Équipe DANA : la personne recrutée développera, en collaboration avec
les autres membres de l’équipe, des méthodes et algorithmes pour
l’extraction de motifs qui représentent des relations entre items, des
modèles thématiques, et plus généralement l’analyse de données
spatio-temporelles. Les recherches menées peuvent aussi concerner
l’analyse formelle de concepts.
Équipe R2I : la personne recrutée devra mener des recherches autour des
thèmes de la recherche d’information dans le contexte des interactions,
et notamment : le traitement, l’exploitation et l’analyse de documents
hétérogènes (articles, pages Web, réseaux sociaux, transcriptions de
l’oral depuis des vidéos) leur couplage avec des données multimodales
(voix, suivi visuel, données physiologiques), porteuses d’informations
émotionnelles, internes aux documents ou acquises auprès d’utilisateurs
de systèmes de recherche interactifs. Un point important est la prise
en compte du contexte et de composantes personnalisées et subjectives
pour des modèles non strictement informationnels de la pertinence. La
rareté des données et la nécessaire protection des données personnelles
demandent de favoriser les recherches vers des algorithmes peu
gourmands. En outre, les recherches pourront se diriger vers des
algorithmes proposant des modèles joints ou hybrides permettant
d’obtenir une meilleure expressivité et une meilleure diversité.

Mots clés : masse de données, science des données, fouille de données,
recherche d’information, analyse de sentiments et d’émotions.

Profil du candidat :
Doctorat Informatique ou équivalent

Formation et compétences requises :
Qualification section 27

Adresse d’emploi :
Laboratoire LIS – IUT Aix-Marseille

Autonomous Database Management System

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : ORANGE- INRIA LILLE
Durée : 3 ans
Contact : patrick.royer@orange.com
Date limite de publication : 2022-03-31

Contexte :
These CIFRE ORANGE-INRIA

Sujet :
Database administration requires a high level of expertise. Its automation in distributed cloud environments can be improved through autonomous systems (ADBMS).

The ADBMS aims to manage without or with a minimum of human intervention, the operations of the life cycle of a database in production or non-production environments. It covers the self-driving of maintenance operations, self-security, self-healing.

The full description can be found here https://orange.jobs/jobs/offer.do?joid=110965&lang=EN

Profil du candidat :
The main objective of the thesis will be to study an autonomous function of a database management system (DBMS) used at Orange, able of maintaining a required level of QoS and QoE while optimizing the use of infrastructure resources and the need for human skills.

Formation et compétences requises :
The candidate must be fluent in French.

Adresse d’emploi :
The thesis will take place mainly in Guyancourt (France, 78) with regular trips to Villeneuve-d’Asq (France, 59) for 3 years.

Explication et les circuits pour l’évaluation de requêtes et le raisonnement logique

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : INRIA Lille
Durée : 3 ans
Contact : pierre.bourhis@univ-lille.fr
Date limite de publication : 2022-03-31

Contexte :
Une offre de doctorant co-encadrée, par Antoine Amarilli et Pierre Bourhis dans le cadre d’une offre de thèse à Inria Lille, sur l’explication et les
circuits pour l’évaluation de requêtes et le raisonnement logique.

Sujet :
L’objectif de la thèse est d’étudier de façon fine les problèmes liés à
l’explication des raisonnements logiques faits dans plusieurs cadres :
l’explication d’interrogation de bases de données par des requêtes,
l’inférence de données par des règles logiques et l’exécution de
programmes définis par des règles logiques comme les data-centric
workflows.

Profil du candidat :
Le lien de candidature et plus d’informations sont disponibles sur le
portail Inria https://jobs.inria.fr/public/classic/fr/offres/2022-04484
et la date limite pour candidater est le 8 avril.

Formation et compétences requises :
Le candidate devra connaitre une formation en information théorique proche des bases de données ou de la logique ou de la compilation des connaissances

Adresse d’emploi :
pierre.bourhis@inria.fr

link prediction in distributed knowledge graphs

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : LORIA UMR7503 CNRS-Universtié de Lorraine
Durée : 6 mois
Contact : sabeur.aridhi@loria.fr
Date limite de publication : 2022-03-31

Contexte :
Today, vast and diverse sources of data exist for almost every scientific domain, making their integration and intelligent exploitation challenging. Indeed, complex data require expressive data representation models such as graph representation. The Linked Open Data (LOD) movement along with the FAIR (Findability, Accessibility, Interoperability, Reusability) data principles are intended to facilitate heterogeneous data integration and analyses. In the LOD context, graphs are called knowledge graphs as they encompass domain ontologies for typing objects and describing their relationships. Semantic web languages (RDFS, OWL, SPARQL) have reached an interesting level of maturity on which ambitious machine learning techniques can rely. Interestingly, big data and NoSQL solutions make possible web-scale data analyses. So far, such analyses on dedicated big-data architectures are often limited to MapReduce scenarios on rather simple data models (key-value oriented, homogeneous graphs with only one type of nodes and one type of edges). Graph databases, as one NoSQL approach, allow for rich representation of multi-typed attributed nodes and edges. This better expressivity comes with a cost as graph and program distribution is not an easy task.

The objective of this Master project is to make progress to the state-of-the-art of link prediction problem in knowledge graphs in a distributed setting [1][2][3]. We will mainly focus on link prediction approaches proposed by the CAPSID team to solve biological problems like drug discovery.
The proposed distributed approaches will be evaluated using web-scale knowledge graphs for inferring missing links (data completion). YAGO, DBpedia, and synthetic benchmarks are usable for such evaluation and validation purposes [4].

Sujet :
This Master thesis project aims to develop scalable link prediction methods in large and complex graphs. More specifically, the aims of this project are:

– to design scalable implementations of the studied approaches for distributed architectures. In this context, the use of big graph processing frameworks such as Pregel, Trinity, GraphLab and BLADYG need to be studied [5];
– to define evaluation and validation protocols for the proposed algorithms in the context of web-scale knowledge graphs;

This project will be carried out mainly within the Capsid team at INRIA Nancy which combines expertise in knowledge graphs and distributed graph computing (https://capsid.loria.fr).

Profil du candidat :
Candidates must have a bachelor degree in computer science, mathematics, or one of the physical sciences.

Formation et compétences requises :
Good programming skills in an object-oriented programming language such as JAVA or C++ are essential. Experience of NoSQL solutions (Neo4j, Titan, MongoDB), parallel/distributed programming (Spark, Hadoop, Flink) and graph processing frameworks (Pregel, GraphLab, GraphX) is also desirable but not essential.

Adresse d’emploi :
Laboratoire Lorrain de Recherche en Informatique et ses Applications
LORIA
Campus Scientifique
BP239
54500 Vandoeuvre les Nancy

Special issue Text Complexity and Simplification in Frontiers in Artificial Intelligence / Natural Language Processing

Date : 2022-07-02

Web site: https://www.frontiersin.org/research-topics/34050/text-complexity-and-simplification

Submission Deadlines

02 July 2022 Manuscript

Context
Text complexity assessment is one of the urgent problems of our time. Many modern texts, including classroom books and legislative acts, prove to be too difficult and as such cannot cater to readers’ needs. This also applies to legal, financial, banking documents. Although the first methods of measuring text complexity were suggested over 70 years ago, the problem is far from being solved. The diversity of languages, text types and genres, as well as their audience, are major challenges for researchers. Despite the constant growth in the number of scientific publications, their complex language or the lack of scientific acculturation of users creates a tendency to avoid these sources by favoring commercial or political incentives rather than accuracy and informational value. This difficulty in reading scientific documents also exists when scientists are interested in scientific documents from disciplines other than those in which they are experts. Text simplification aims to reduce these barriers. Text simplification is used in the field of translation (pre-editing), localization and technical writing. Simplified texts are also more accessible to non-native speakers, young readers, people with reading disabilities, or with lower levels of education.

We are looking for contributions in the form of Review, Original Research, Brief Research Report, Perspective, Technology and Code etc. in the following areas, including, but not limited to:

application of state-of-the-art models of neural architectures to text simplification and complexity
understanding which features neural networks extract from texts for text simplification and complexity
compiling corpora annotated with complexity labels for training and testing
model evaluation and validation
description of linguistic features relevant to the assessment of the difficulty of various classes of texts
complex word Identification
evaluating the dependence on subject areas, types and genres of texts
text readability for foreign language learners
complexity of web content
text adaptation
scientific multi-document summarization
visualization as text simplification
identification of difficulties preventing the simplification and summarization of texts
metrics of text difficulty
applications in education, law, etc.

Keywords:
neural networks, text, machine learning, complexity, simplification, readability

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

Appel à participation à la 8ème journée de l’AFIA : PDIA 2022 – IA et Créativité, 07 avril 2022, Paris

Date : 2022-04-07
Lieu : CNAM, Paris 75003

L’Association Française pour l’Intelligence Artificielle (AFIA) organise sa huitième journée PERSPECTIVES ET DEFIS DE l’IA sur le thème de « IA et créativité » le 07 avril 2022, au CNAM, Amphi Georges Friedmann, 2 rue Conté – Paris 75003.

Le programme détaillé de la journée est accessible via le lien : https://afia.asso.fr/pdia-2022/

Pour des raisons logistiques, il est fortement conseillé de s’inscrire avant le 01 avril 2022 via le lien : https://www.linscription.com/pro/activite.php?P1=91152

Cet événement est organisé par : Fayçal HAMDI (CEDRIC, CNAM Paris), Engelbert MEPHU NGUIFO (LIMOS, Université Clermont Auvergne), Davy MONTICOLO (ERPI, Université de Lorraine), Fatiha SAIS (LISN, Université Paris Saclay)

—— —— — ——
Programme

9h-9h15 – Accueil
9h15 – 9h30 : Présentation de l’AFIA et introduction de la journée

Session 1 : Texte, IHM et IA

9h30 – 10h30 : Baptiste Caramiaux, Chercheur CNRS au laboratoire ISIR, Sorbonne Paris Université, membre du HCI Sorbonne group [page-web].
Titre: Repenser l’Interaction avec les Technologies d’Apprentissage

10h30 – 11h30 : Alex Gabriel, Chercheur post-doctoral au laboratoire ERPI, Université de Lorraine [page-web]
Titre : Intelligence artificielle pour assister l’idéation et la conception amont

11h30 – 11h45 : Pause café

11h45 – 12h45 : Anne-Gwenn BOSSER, Maîtresse de Conférences au laboratoire STICC, ENIB, Université de Brest Bretagne Loire, [page-web]
Titre : Machines à écrire: créer des programmes qui créent pour apprendre à se servir de l’IA

12h45 – 14h : Pause déjeuner

Session 2 – Arts et IA

14h – 15h : Jean-Claude Heudin, Chercheur en IA, écrivain et compositeur [page-web].
Titre : Angelia – une Intelligence Artificielle pour la musique électronique.

15h -16h : Jérôme Nika, Chercheur à l’IRCAM, [page-web].
Titre : Musique et “IA” pour “Instruments Artificiels”

16h-16h15 – pause café

16h15 – 17h15 : François Pachet, directeur du Spotify Creator Technology Research Lab, [page-web].
Exposé autour de « La créativité computationnelle lié à la musique »

17h15 – Clôture de la journée – discussion

Lien direct

Notre site web : www.madics.fr
Suivez-nous sur Tweeter : @GDR_MADICS
Pour vous désabonner de la liste, suivre ce lien.

Coverage Measures for Machine Learning Enabled Cyber-Physical Systems

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : Université Grenoble Alpes (UGA), Verimag Laborato
Durée : 3 ans
Contact : thao.dang@univ-grenoble-alpes.fr
Date limite de publication : 2022-08-31

Contexte :
The thesis is fully funded for three years by a grant from region Auvergne-Rhône-Alpes starting in 2022.
The description of the topic can be found at https://www.decyphir.com/PhD_Position_DETAI22.html

Cyber Physical Systems (CPS) are systems mixing software and hardware (cyber) components in interaction with their (physical) environment. Typical examples includ autonomous cars, robots, medical devices. Mathematically, they are modelled with so-called hybrid systems, which are dynamical systems with multiple modes, which can be continuous or discrete in nature. Since the modelisation includes the physical/biological environment, the models can be of arbitrary complexity, from trivial (not all models need be complex to be useful) to untractable for nowadays computational resources due to the infiniteness of input and state spaces of these systems. Hence new methods and tools are always needed to manage and handle the type of heterogeneous computations and data generated by the analysis and design of hybrid systems.

Sujet :
In this thesis, we want to tackle this issue from the angle of coverage measures. Given a CPS problem and some data and/or models (e.g., a hybrid system) associated to it, the question is: what is the mathematical domain that can represent all possible data that can be observed, and can we measure how well the given data represent this domain? This question is of primordial theoretical and practical interest in many contexts. One popular contemporaneous instance is that of machine learning (ML). It is well-known that ML-based algorithms, which are more and more used for CPS design, are only as good as the data used to train them. However it is much less well understood how to formally define the “goodness” of the data at our disposal. Hence there is a need for meaningful measures that can be computed and used not only to quantify the quality of a set, but also to fix it by, e.g., shrinking or augmenting it to better represent a domain to learn.
The questions of coverage, sampling, data augmentation, ML, CPS, etc are not new and topics that have attracted a lot of interest recently. The originality of this thesis will be to tackle these problems from the perspective of hybrid systems and formal methods, which are two research directions in which Verimag and Decyphir are specialized into and internationally recognized for. The intrinsic hybrid nature of data and systems considered in machine learning for CPS is often overlooked and we believed there is a need to study it in a more systematic and explicit way. Formal methods makes it possible to derive more rigorous guarantees and the hope is also that through the use of specification languages such as, e.g., Signal Temporal Logics (STL), they can help in the development of “explainable” measures, i.e., measures that are directly related to precisely formulated requirements as opposed to some hard to interpret mean squared error quantity as is the most frequent practice.

Profil du candidat :
We are looking for candidates with a Master degree in computer science or control engineering interested in CPS, artificial intelligence and machine learning. The thesis is expected to feature a strong experimental and development component but opportunities to developping theoretical contributions will also be likely. As a consequence, candidates with both theoretical and practical inclinations are welcome to apply.

Formation et compétences requises :
Master degree in computer science or control engineering

Adresse d’emploi :
Verimag Laboratory, Université Grenoble Alpes (UGA),
700 avenue centrale
38400 Saint Martin D’Hères

L’Epita recrute des Enseignants-Chercheurs

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : Laboratoires de recherche de l’EPITA (LRDE/LSE)
Durée : CDI
Contact : thierry.geraud@epita.fr
Date limite de publication : 2022-04-15

Contexte :
L’EPITA ouvre plusieurs postes d’enseignant·e·s-chercheur·e·s en informatique à temps complet, pour un recrutement au plus tard en début d’année scolaire 2022-2023,
en particulier sur les thématiques de la cybersécurité et des systèmes d’exploitation.

Afin d’accompagner la dynamique de développement de l’École à l’échelle nationale, les postes sont à pourvoir sur les sites de :

– Paris (Kremlin-Bicêtre et Campus Cyber à la Défense),
– Lyon,
– Rennes,
– Strasbourg,
– Toulouse.

Sujet :
Vous viendrez consolider nos équipes et axes de recherche sur les thématiques suivantes :

– Science et ingénierie des données, extraction de connaissances,
– Apprentissage automatique et autres sous-domaines de l’IA,
– Traitement d’images, reconnaissance des formes et vision,
– Automates et leurs applications (dont vérification et synthèse),
– Logiciel et performance (dont HPC, GPU),
– Sécurité des logiciels et des architectures : identification, protection, détection et réaction,
– Système bas-niveau (noyau, assembleur), systèmes d’exploitation, machines virtuelles et informatique en nuage,
– Système embarqué (dont robotique).

Profil du candidat :
Les informations précises concernant ces postes et le lien pour nous transférer votre dossier de candidature sont disponibles ici :

– https://www.lrde.epita.fr/~theo/postes_EPITA_MCF_2022.pdf pour les profils MCF,
– https://www.lrde.epita.fr/~theo/postes_EPITA_HDR_2022.pdf pour les profils HDR ou très bientôt HDR.

La date limite de candidature est le 15 avril 2022.

(La procédure de recrutement est lisible ici :
https://tinyurl.com/ProcedureRecrutementEPITA2022)

Formation et compétences requises :
Il n’est pas formellement nécessaire d’avoir la qualification aux postes de maître·sse de conférences ou de professeur·e des universités pour pouvoir postuler.

Document joint: profil McF.
Le descriptif du profil HDR est dans le lien ci-dessus.

Adresse d’emploi :
– Paris (Kremlin-Bicêtre et Campus Cyber à la Défense),
– Lyon,
– Rennes,
– Strasbourg,
– Toulouse.

Document attaché : 202203142010_postes_EPITA_MCF_2022.pdf

MaDICS

Masses de Données, Informations et Connaissances en Sciences

Big Data - Data Science

Archives

DOING@ADBIS 2022: 3rd workshop on Intelligent Data – from data to knowledge

Poste MCF – Science de données

Poste MCF – Aix-Marseille – Recherche d’information / Fouille de données

Autonomous Database Management System

Explication et les circuits pour l’évaluation de requêtes et le raisonnement logique

link prediction in distributed knowledge graphs

Special issue Text Complexity and Simplification in Frontiers in Artificial Intelligence / Natural Language Processing

Appel à participation à la 8ème journée de l’AFIA : PDIA 2022 – IA et Créativité, 07 avril 2022, Paris

Coverage Measures for Machine Learning Enabled Cyber-Physical Systems

L’Epita recrute des Enseignants-Chercheurs