You can find below an offer for a PhD student contract in Natural Processing at 
Univ. of Lorraine, Nancy, France.

Subject: Automatic generation of explanations for multiword expressions in the 
context of language learning
Thesis supervisors: Mathieu Constant (ATILF, Univ. Lorraine, France) and 
Patrick Watrin (CENTAL, Univ. of Louvain, Belgium)
Thesis funded for three years by the ANR STAR-FLE project
Start date: 1 October 2024
Salary: 2135,00 € gross monthly
Host laboratory: ATILF (Computer Processing and Analysis of the French Language)
Location: Nancy, France
Application deadline: July 11, 2024

Scientific background: 

The successful candidate will join the ATILF, a research unit in language 
sciences, and in particular the research group on natural language processing 
(NLP). This research group works, among other things, on exploiting recent NLP 
models for linguistic modelling (e.g. lexical modelling) with applications in 
the medical field and language learning. In particular, its work is based on 
the integration of large (generative) language models and knowledge bases (e.g. 
scientific textual data, lexical resources).


More specifically, the thesis will be part of the STAR-FLE project (STrategic 
Adaptations for better Reading and Text Comprehension in FFL) funded by the 
Agence Nationale de la Recherche for 4 years (2024-2027). The project is in the 
field of computer-assisted language teaching. The aim of STAR-FLE is to gain a 
better understanding of the difficulties encountered by learners of French as a 
foreign language (FFL) when faced with the lexicon present in authentic texts. 
It will propose digital solutions based on natural language processing (NLP) to 
facilitate text comprehension and enable teachers to better manage 
heterogeneous levels in the classroom. Contextual aids and personalized 
vocabulary adaptations are envisaged, particularly for multiword expressions. 

Objectives:

The thesis will focus on multiword expressions. They correspond to combinations 
of several lexical units which are composed in an irregular manner on one or 
more linguistic levels (morphology, syntax, semantics, etc.). This term covers 
a wide variety of phenomena, such as idiomatic expressions (run around in 
circles, dry run), support verb constructions (take a walk), complex functional 
units (in spite of), etc. This non-compositionality, which can lead to a 
certain semantic opacity, can pose problems for learners when reading. 
In this thesis, the person recruited will develop methods based on new NLP 
techniques to produce in-context explanatory card enabling learners to better 
understand these expressions.
The production of these cards will be based on the prediction of linguistic 
properties (e.g. a dry run is not dry), on the generation of natural language 
explanations using large generative language models (e.g. paraphrases), or on 
semantic linking to different lexical resources (e.g. to retrieve definitions 
and lexical neighbors), depending on the context in which the expression 
occurs. One of the challenges will be to propose explanatory cards adapted to 
the learner's level.

Application requirements and procedures

Candidates should have the following skills and profiles:
        - a Master's degree in computational linguistics, in natural language 
processing, in computer science or  in cognitive science.
        - very good programming skills
        - very good skills in recent models of natural language processing 
(e.g. large language models). 

Applications should include a cover letter, CV and Master's grades, together 
with references or one or more letters of recommendation.
They should be submitted at the following url:
https://emploi.cnrs.fr/Offres/Doctorant/UMR7118-SABMAR-020/Default.aspx?lang=EN 
<https://emploi.cnrs.fr/Offres/Doctorant/UMR7118-SABMAR-020/Default.aspx?lang=EN>

For more information, do not hesitate to contact Mathieu Constant 
(mathieu.const...@univ-lorraine.fr <mailto:mathieu.const...@univ-lorraine.fr>).
_______________________________________________
Corpora mailing list -- corpora@list.elra.info
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to corpora-le...@list.elra.info

Reply via email to