The “test suites” sub-task will be included for the sixth time in the
General MT Shared Task of the Conference on Machine Translation (WMT24).
*OVERVIEW*
Test suites are custom extensions to the test sets of the General MT
Shared Task, constructed so that they can focus on concrete aspects of
the MT output. They consist of a source-side test-set and a customized
evaluation service. As opposed to the standard evaluation process which
produces generic quality scores, test suites often produce separate
fine-grained results for each phenomenon.
Since the usage of LLMs for translation is getting more popular, and we
are expecting more LLMs participations in WMT this year, the theme of
this year’s test suite sub-task is "Help us break LLMs", i.e. to reveal
weaknesses and serious flaws of LLMs when translating, hidden within the
overall high-quality generation.
*IMPORTANT DATES*
* 11th April: Test suite source texts may be submitted for a pre-run
on SoTA MT systems
* 12th June: Test suite source texts must reach us
* 11th July: Translated test suites shipped back to test suites authors:
* TBC - August: Test suite description and analysis paper
* 12th-13th November: Conference
Potential participants are kindly requested to fill in this form
https://forms.office.com/e/e4JuMTSWFF <the “Test suites” sub-task will
be included for the sixth time in the General MT Shared Task of the
Conference on Machine Translation (WMT24). , ,,*OVERVIEW* ,,Test suites
are custom extensions to the test sets of the General MT Shared Task,
constructed so that they can focus on concrete aspects of the MT output.
They consist of a source-side test-set and a customized evaluation
service. As opposed to the standard evaluation process which produces
generic quality scores, test suites often produce separate fine-grained
results for each phenomenon. ,,Since the usage of LLMs for translation
is getting more popular, and we are expecting more LLMs participations
in WMT this year, the theme of this year’s test suite sub-task is "Help
us break LLMs", i.e. to reveal weaknesses and serious flaws of LLMs when
translating, hidden within the overall high-quality generation. ,,
,,*IMPORTANT DATES* ,,11th April: Test suite source texts may be
submitted for a pre-run on SoTA MT systems ,,12th June: Test suite
source texts must reach us ,,18th July: Translated test suites shipped
back to test suites authors: ,,TBC - August: Test suite description and
analysis paper ,, ,,Potential participants are kindly requested to fill
in this form ,,https://forms.office.com/e/e4JuMTSWFF ,, ,,Further
information can be found in the dedicated page of the WMT website
,,http://www2.statmt.org/wmt24/testsuite-subtask.html>
Further information can be found in the dedicated page of the WMT website
http://www2.statmt.org/wmt24/testsuite-subtask.html
<http://www2.statmt.org/wmt24/testsuite-subtask.html>
--
Eleftherios Avramidis, senior researcher
German Research Center for Artificial Intelligence (DFKI)
departments: Design Research eXplorations, Speech and Language Technology
short name: Lefteris, (pronouns: he/him), languages: English, German, Greek
Website:https://www.dfki.de/~elav01
Address: Alt Moabit 91c, 10559 Berlin, Germany
Tel.: +49 30 23895 1806
Sec.: +49 30 23895 1800
Fax.: +49 30 23895 1810
Firmensitz: Trippstadter Strasse 122, D-67663 Kaiserslautern, Germany
Geschäftsführung: Prof. Dr. Antonio Krüger, Helmut Ditzer
Vorsitzender des Aufsichtsrats: Dr. Ferri Abolhassan
Amtsgericht Kaiserslautern, HRB 2313
_______________________________________________
Corpora mailing list -- corpora@list.elra.info
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to corpora-le...@list.elra.info