Call for Participation CWI Shared Task 2018 at BEA - Second Complex Word Identification (CWI) Shared Task
URL: https://sites.google.com/view/cwisharedtask2018 Many lexical simplification systems have been proposed up to this date (Glavaš and Štajner, 2015; Paetzold and Specia, 2016a). As it has been shown by Paetzold and Specia, 2016b, systems that discern between complex and simple words before simplification tend to be more reliable in practice. Therefore, the automatic identification of words that are difficult for a given target population is an important step for building better performing lexical simplification systems. This process is known as complex word identification (CWI) (Shardlow, 2013). The first shared task on CWI was organized at the SemEval 2016 (Paetzold and Specia, 2016c). It featured 21 teams that competed submitting 42 systems trained to predict whether words in a given context were complex or non-complex for a non-native English speaker. Following the success of the first CWI shared task at SemEval 2016 we organize a second edition of the challenge at the BEA workshop 2018. The goal of this year’s CWI shared task is to predict which words can be difficult for a non-native speaker, based on annotations collected from a mixture of native and non-native speakers. Tracks Monolingual English CWI shared task Monolingual Spanish CWI shared task Monolingual German CWI shared task Multilingual CWI shared task with French test set (English, Spanish, and German datasets can be used for training) Tasks Binary classification task Probabilistic classification task In the binary classification task, the participants are asked to label the given target word in particular context as complex or simple. In the probabilistic classification task, the participants are asked to give a probability of the given target word in particular context being complex. Participants can submit up to two systems for each track and for each task. Registration is now open. Please visit the website and sign up to receive more information. Important Dates Training set release: January 19, 2018 Test set release: February 26, 2018 Submissions due: February 28, 2018 Results announced: March 2, 2018 System papers deadline: March 26, 2018 Reviews due: April 5, 2018 Camera-ready versions: April 15, 2018 BEA Workshop: June 5 or 6, 2018 Organizers Sanja Štajner (Unviersity of Mannheim) Chris Biemann (University of Hamburg) Shervin Malmasi (Harvard Medical School) Gustavo Paetzold (University of Sheffield) Lucia Specia (University of Sheffield) Anaïs Tack (Université Catholique de Louvain and KU Leuven) Seid Muhie Yimam (University of Hamburg) Marcos Zampieri (University of Wolverhampton) Contact Sanja Štajner: sanja(at)informatik(dot)uni-mannheim(dot)de _______________________________________________ Mt-list site list Mt-list@eamt.org http://lists.eamt.org/mailman/listinfo/mt-list