Post-Doc at BABBAR.TECH: Web Page Segmentation

When:
30/04/2024 – 01/05/2024 all-day
2024-04-30T02:00:00+02:00
2024-05-01T02:00:00+02:00

Offre en lien avec l’Action/le Réseau : – — –/– — –

Laboratoire/Entreprise : GREYC / Babbar.tech
Durée : 12-18 months
Contact : recrutement@babbar.tech
Date limite de publication : 2024-04-30

Contexte :
The GREYC lab performs research works in the field of digital science with activities in image processing, machine learning, artificial intelligence, computer security, fundamental computer science, Web science, electronics.

Babbar is specialized in web data collection and provides a large scale view on the web graph to its users. Babbar crawls more than 1.5B pages per day, and its index currently contains information about more than 1500B urls.

Sujet :
The postdoctoral scholar will be working on Web Page Segmentation with a primary goal to detect the different zones of a web page, select interesting areas and extract meaningful content. This interdisciplinary research project combines structural analysis, natural language processing and machine learning techniques to develop advanced algorithms capable of segmenting web pages into meaningful and semantically distinct regions.

Profil du candidat :
– Hold a recent Ph.D. degree in Computer Science, Electrical Engineering, or a related field.

Formation et compétences requises :
– Demonstrate a strong research background in natural language processing or machine learning.
– Possess a track record of publications in top-tier conferences/journals related to machine learning, NLP, or related areas.
– Strong programming skills.
– Excellent written and verbal communication in English and interpersonal skills.

Adresse d’emploi :
Caen, France

Document attaché : 202401091508_Post-doc BABBAR.pdf