2 open PhD positions in AI and databases

When:
01/12/2023 all-day
2023-12-01T01:00:00+01:00
2023-12-01T01:00:00+01:00

Offre en lien avec l’Action/le Réseau : – — –/Doctorants

Laboratoire/Entreprise : Télecom SudParis and Université d’Artois
Durée : 3 ans
Contact : aikaterini.tzompanaki@cyu.fr
Date limite de publication : 2023-12-01

Contexte :
The ANR project EXPIDA (EXplainable and parsimonious Preference models
to get the most out of Inconsistent DAtabases) (Project website: https://www.cril.univ-artois.fr/expida/index.html) aims to develop principled and
rigorous explainable techniques for dealing with imperfect data. More precisely,
EXPIDA aims to design tractable methods for dealing with conflicts in databases
by efficiently exploring novel inconsistency-tolerant semantics and quantifying
contradictions [6] to answer queries and to draw (high level) explanatory information.
While the set of repairs (maximal consistent datasets) is often large for
real databases, we aim to explore preference mechanisms (e.g., [4]) in order to
retrieve meaningful answers and explanations to identify the reasons of query answers,
and to assist end-users to “realize” query outputs. The EXPIDA project
aims in addition to be useful for applications intensively relying on multiple
heterogeneous data sources. Many such applications are nowadays developed
in various domains such as transportation control, health management, social
network analysis, data journalism, etc. This research project will advance the
state-of-the-art in two major ways: innovations in inconsistency management,
preferences and explanation for databases, and developing practical Artificial
Intelligence tools for managing inconsistent databases with validations on real
data. In this context we are proposing two PhD positions.

Sujet :
1st PhD position: This PhD thesis will focus on the following two main aspects. The first one
concerns query answering semantics with inconsistent databases. Despite the
fact that various methods have been studied, for drawing useful information in
presence of conflicts in the propositional and description logic settings [2, 5], we
are not aware of any existing work studying rich conflict-tolerant relations in
the context of databases equipped with flexible preference relations [7], such as
the partially ordered relation (e.g., [1, 3]). Then, investigating database query
answering under such varying conflict-tolerant methods should be accompanied
by studying the computational complexity of the different related problems,
which is among the main objectives of this thesis.
The second objective of the thesis is how to handle preferences in conflicting
databases. In fact, to answer queries over conflicting databases, it is crucial
to express priority among potential repairs in order to select the most optimal
candidates. As the number of potential repairs can be (very) large, one may
choose to rank repairs according to some preference criteria, and select a small
number of the most desirable repairs. Moreover, we note that the preference
among the sources of data could be of different nature. Often, conflict-tolerant
methods aim to find a stratification inducing a total preorder among all pieces
of information. Such a stratification allows to handle more easily inconsistencies
in data sources. Nonetheless, this can lead to a comparison of incomparable and
independent pieces of information. The main objective here is then to develop
a new framework to handle preferences in conflicting databases in order to draw
meaningful answers to user queries.
Applications should be submitted via email to Said Jabbour (jabbour@cril.fr),
Badran Raddaoui (badran.raddaoui@telecom-sudparis.eu) and Yue Ma (ma@lri.fr)
with the subject “Application for EXPIDA PhD 1”.

2nd PhD position: This PhD thesis will focus on the aspect of explainability in two ways, as
presented in what follows.
First, we will consider explanations for query results over inconsistent databases
with different conflict-tolerant semantics (e.g., consistent, brave, intersection repair,
intersection closed repair, non-objection, nonconsensus based semantics,
etc.). To this end, we will adapt the notion of lineage (or provenance) [2] in the
context of uncertain/inconsistent data and devise mathematical formalisations
that will provide the necessary properties for characterising and measuring the
‘quality’ of the explanations. Through the study of causality [7] and argumentation
[4, 9, 8, 3] in our setting, we further aim at improving the acceptability and
usefulness of the provided explanations by the end-user. Second, we will investigate
the complementary problem of explaining missing query results, widely
known as Why-Not explanations, which has not been yet addressed in the context
of inconsistent databases. In the setting of consistent databases, Why-Not
explanations ‘explain’ why certain results are not generated by a query (or a
workflow) by means of instance-based (i.e., source tuples), query-based (i.e.,
query operators) or refinement-based explanations (i.e., corrected query). Close
to our problem, [1] has proposed Why-Not provenance polynomials, which may
account for probabilistic tuples. It would be interesting to check how such formalisations
can be revisited to fit the inconsistent database’s different conflict tolerant semantics. Applications should be submitted via email to Badran RADDAOUI badran.raddaoui@telecomsudparis.
eu and Aikaterini TZOMPANAKI aikaterini.tzompanaki@cyu.fr, with
the subject “Application for EXPIDA PhD 2”.

Profil du candidat :
The PhD candidate should have a background on computer science and on at least on of the following domains: databases, logics, artificial intelligence, algorithms and complexity. They should be excellent students, self-motivated and eager learners.

Formation et compétences requises :
Engineering degree or Bachelor and Master’s degree in Computer science or (applied) mathematics.

Adresse d’emploi :
1st PhD: Université d’Artois, Lens-France
2nd PhD: Télécom SudParis, Paleseau-France

Document attaché : 202310032054_EXPIDA_PhD_subjects.pdf