PhD at LabHC: Data analysis with incomplete or uncertain relational data

When:
08/05/2021 – 09/05/2021 all-day
2021-05-08T02:00:00+02:00
2021-05-09T02:00:00+02:00

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : Laboratoire Hubert Curien ( LabHC)
Durée : 36 mois
Contact : christine.largeron@univ-st-etienne.fr
Date limite de publication : 2021-05-08

Contexte :
In many applications, the data to be studied is relational, modeled in the form of a network and represented by graphs. This representation makes possible the study of interactions between people, especially in social sciences. Although network analysis is an active branch of data min-ing and machine learning, a majority of works in other fields focuses also graphs where the nodes correspond to the entities of the network and the links (edges or arcs) to their relation-ships, for instance in biology or image processing.
Among the main tasks studied on this data, one can mention supervised and unsupervised cluste-ring, link prediction or, recommendation.

Sujet :
One of the main limitations of the existing methods allowing to solve these issues is that they usually consider the network to be fully known so that it can be perfectly defined by a graph or by a sequence of graphs for a dynamic network. However, in practice, the graph is often only partially observable, or the data can be observed with a degree of uncertainty. This uncertainty is derived from the presence or absence of a node (or link) at a given moment and, if applicable, the lack of weight or orientation characterizing the link.
Although it is very widespread in practice, this uncertainty is not without effect on the outcomes resulting from classical graph analysis but it is understudied in the literature. Concerning the classi-fication of entities, one can cite the works of Sevon et al. or Pfeiffer et al.. For the prediction of links, Mallek et al., Ahmed et al., Murata et al. Finally for the detection of communities, we can mention Dahlin et al., Liu et al., Martin et al., Zhang et al. as well as our works (Benyahia et al.). In the frame of Graph Neural Networks, only few works focus on learning attribute-missing graph embeddings (Chen et al.). The applications on many downstream information communication, science and technology domains are nonetheless important, such as in databases (Banerjee et al.) and computer networks (Sasaki et al.). Thus, in this project our objectives are twofold. First, we will study the impact of the incompleteness of the relational data on network algorithms. Second, we will develop methods able to factor nodes and links incompleteness in graph processing algorithms as well as machine learning techniques based on graphs.

Profil du candidat :
The candidate should have a master degree or equivalent in Computer Science. The subject is at the intersection of several domains: graph theory, statistics, data mining and machine learning, big data. Thus the candidate should have strong backgrounds in several of these topics.
Other required skills:
• Good abilities in algorithm design and programming.
• Good technical skills regarding data mining, machine learning and data management
• A very good level (written and oral) in English.
• Good communication skills (oral and written).
• The ability to work in a team with colleagues from other scientific disciplines.
• Autonomy and motivation for research.

Formation et compétences requises :
Applicants are invited to contact as soon as possible.

The application file should contain the following documents:
1. a curriculum vitæ (CV);
2. the official academic transcripts of all the candidate’s higher education degrees (BSc, License, MSc, Master’s degree, Engineer degree, etc.). If the candidate is currently finishing a Master’s degree, s/he must send the transcript of the grades obtained so far, with the rank among her/his peers, and the list of classes taken during the last year;
3. some recommendation letters (quality is more important than quantity, there);
4. and a motivation letter written specifically for this position.
Send all of these documents by email to all the advisors:
• Christophe Gravier christophe.gravier@univ-st-etienne.fr
• Christine Largeron christine.largeron@univ-st-etienne.fr

Adresse d’emploi :
The PhD candidate will work at the Laboratoire Hubert Curien (UMR 5516) under the supervision of Christophe Gravier and Christine Largeron, Professor in Computer Science (LabHC – Université Jean Monnet, Saint-Etienne, France).

Document attaché : 202103081707_PHD-positionLabHC.pdf