Call for Participation

CWI Shared Task 2018 at BEA - Second Complex Word Identification (CWI) Shared 
Task

URL: https://sites.google.com/view/cwisharedtask2018

Many lexical simplification systems have been proposed to date (Glavaš
and Štajner, 2015; Paetzold and Specia, 2016a). As shown by Paetzold and
Specia (2016b), systems that distinguish between complex and simple words
before simplification tend to be more reliable in practice. Therefore,
the automatic identification of words that are difficult for a given target
population is an important step towards building better-performing lexical
simplification systems. This process is known as complex word identification
(CWI) (Shardlow, 2013).

The first shared task on CWI was organized at SemEval 2016 (Paetzold and
Specia, 2016c). It featured 21 teams, which submitted 42 systems
trained to predict whether words in a given context were complex or non-complex
for a non-native English speaker. Following the success of the first CWI shared
task at SemEval 2016, we are organizing a second edition of the challenge at
the BEA workshop 2018.

The goal of this year’s CWI shared task is to predict which words can be 
difficult for a non-native speaker, based on annotations collected from a 
mixture of native and non-native speakers.

Tracks

Monolingual English CWI shared task
Monolingual Spanish CWI shared task
Monolingual German CWI shared task
Multilingual CWI shared task with French test set (English, Spanish, and German 
datasets can be used for training)

Tasks

Binary classification task
Probabilistic classification task

In the binary classification task, participants are asked to label the
given target word in a particular context as complex or simple.

In the probabilistic classification task, participants are asked to give the
probability that the given target word in a particular context is complex.
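
To illustrate the difference between the two outputs, here is a minimal
Python sketch. It is not an official baseline: the word frequencies, the
weights, the 0.5 threshold, the 1/0 label encoding, and the function names
are all assumptions made for illustration, and the sketch ignores the
context of the target word, which real systems are expected to use.

```python
import math

# Hypothetical toy frequencies (occurrences per million words).
# These values are invented for illustration, not taken from the task data.
TOY_FREQ = {"the": 50000.0, "house": 400.0, "ubiquitous": 2.0}


def complexity_probability(word: str) -> float:
    """Probabilistic output: a score in [0, 1] that the word is complex.

    Longer and rarer words receive higher scores. The weights 0.3 and 0.8
    are arbitrary assumptions, not tuned on any data.
    """
    freq = TOY_FREQ.get(word.lower(), 1.0)  # unseen words count as rare
    z = 0.3 * len(word) - 0.8 * math.log10(freq)
    return 1.0 / (1.0 + math.exp(-z))  # logistic squashing into (0, 1)


def binary_label(word: str, threshold: float = 0.5) -> int:
    """Binary output: 1 for complex, 0 for simple (assumed encoding)."""
    return 1 if complexity_probability(word) >= threshold else 0


if __name__ == "__main__":
    for w in ("the", "house", "ubiquitous"):
        print(w, round(complexity_probability(w), 3), binary_label(w))
```

In this sketch the binary label is simply the thresholded probability; a
submitted system could equally well produce the two outputs with separate
models.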

Participants can submit up to two systems for each track and for each task.

Registration is now open. Please visit the website and sign up to receive more 
information.

Important Dates

Training set release: January 19, 2018
Test set release: February 26, 2018
Submissions due: February 28, 2018
Results announced: March 2, 2018
System papers deadline: March 26, 2018
Reviews due: April 5, 2018
Camera-ready versions: April 15, 2018
BEA Workshop: June 5 or 6, 2018

Organizers

Sanja Štajner (University of Mannheim)
Chris Biemann (University of Hamburg)
Shervin Malmasi (Harvard Medical School)
Gustavo Paetzold (University of Sheffield)
Lucia Specia (University of Sheffield)
Anaïs Tack (Université Catholique de Louvain and KU Leuven)
Seid Muhie Yimam (University of Hamburg)
Marcos Zampieri (University of Wolverhampton)

Contact

Sanja Štajner: sanja(at)informatik(dot)uni-mannheim(dot)de