Postdoc: Article separation in historical newspapers

When:
21/11/2021 – 22/11/2021 all-day
2021-11-21T01:00:00+01:00
2021-11-22T01:00:00+01:00

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : L3i – La Rochelle Université
Durée : 2 ans
Contact : antoine.doucet@univ-lr.fr
Date limite de publication : 2021-11-21

Contexte :
Joining a young group the crossroad between document analysis and NLP, located in a historical town by the Atlantic Ocean? And walk 10 minutes from the lab to the beach. We have open positions in the context of 2 ongoing Horizon 2020 projects: Embeddia and NewsEye as well as subsequent projects. In 2020-2021, we have among others published long papers in CORE A* and A conferences ACL, JCDL, ICDAR, CoNLL, DAS COLING, ICADL.. We coordinate the H2020 NewsEye project, focused on improving access to large European collections of historical newspapers. We developed the NewsEye platform for navigating through such collections, a platform it will build upon in future years. Full details on the NewsEye project are available on its website – http://newseye.eu/

Sujet :
Applications are invited for a postdoctoral researcher position on the separation of articles from digitized newspapers, in particular historical newspapers. This task is a critical first step for any use of digitized newspapers, which are initially only split per “page image” files.

Your goal will be to study the state of the art and devise methods combining visual and textual features so as improve the performance of article separation on a large scale. In particular, we seek for methods that function with limited training data and that function for several languages.

Profil du candidat :
Who we search for:
– proven record of high-level publications in one or more of those fields

Keywords: digitized documents, combination of visual and textual features, layout analysis, statistical NLP, language-independent approaches, deep/machine learning.

Formation et compétences requises :
– PhD in document analysis, NLP, IR, or ML, ideally followed by postdoctoral experience
– fluency in written and spoken English (French language skills are not relevant)

Adresse d’emploi :
Laboratoire L3i
Université de La Rochelle
Ave EINSTEIN
F-17000 LA ROCHELLE

Document attaché : 202109281405__2021-PosteANNA.pdf