[Moses-support] 1st CfP: 2nd Workshop on Natural Language Processing for Translation Memories (NLP4TM 2016) at LREC 2016

Carla Parra Fri, 18 Dec 2015 05:17:06 -0800

 

(apologies for cross-posting)


2ND WORKSHOP ON NATURAL LANGUAGE PROCESSING FOR TRANSLATION MEMORIES
(NLP4TM 2016) 

http://rgcl.wlv.ac.uk/nlp4tm2016/ [1] 

to be held at LREC 2016 (Portorož, Slovenia), May 28, 2016 

Submission deadline: February 10, 2016 

1. CALL FOR PAPERS 

Translation Memories (TM) are amongst the most used tools by
professional translators, if not the most used. The underlying idea of
TMs is that a translator should benefit as much as possible from
previous translations by being able to retrieve how a similar sentence
was translated before. Moreover, the usage of TMs aims at guaranteeing
that new translations follow the client's specified style and
terminology. Despite the fact that the core idea of these systems relies
on comparing segments (typically of sentence length) from the document
to be translated with segments from previous translations, most of the
existing TM systems hardly use any language processing for this. Instead
of addressing this issue, most of the work on translation memories
focused on improving the user experience by allowing processing of a
variety of document formats, intuitive user interfaces, etc. 

The term second generation translation memories has been around for more
than ten years and it promises translation memory software that
integrates linguistic processing in order to improve the translation
process. This linguistic processing can involve tasks such as matching
of subsentential chunks, edit distance operations between syntactic
trees, incorporation of semantic and discourse information in the
matching process. This workshop invites papers presenting second
generation translation memories and related initiatives. 

Terminologies, glossaries and ontologies are also very useful for
translation memories, by facilitating the task of the translator and
ensuring a consistent translation. The field of Natural Language
Processing (NLP) has proposed numerous methods for terminology
extraction and ontology extraction. Researchers are encouraged to submit
papers to the workshop which show how these methods are being
successfully applied to Translation Memories. In addition, papers
discussing the integration of Machine Translation and Translation
Memories or studies about automatic building of translation memories
from corpora are also welcomed. 

2. TOPICS OF INTEREST 

This workshop invites original papers which show how language processing
can help translation memories. Topics of interest include but are not
limited to: 

        * 

Improving matching and retrieval of segments by using morphological,
syntactic, semantic and discourse information 
        * 

Automatic extraction of terminologies and ontologies for translation
memories 
        * 

Integration of named entity recognition and terminologies in matching
and retrieval 
        * 

Using natural language processing for automatic construction of
translation memories 
        * 

Extracting and aligning TM segments from a parallel or comparable corpus

        * 

Construction of translation memories using the Internet 
        * 

Corpus based studies about the usefulness of TM for specific domains 
        * 

Development of hybrid TM and MT translation systems 
        * 

Study of NLP techniques used by TM tools available in the market 
        * 

Automatic methods for TM cleaning and maintenance 

  

3. SHARED TASK 

A shared task on cleaning translation memories will be organised. A
training set will be distributed to be used to develop and train the
participants' systems. The testing will be done on 500 segments
distributed during the testing phase.

        * TASK: Automatically clean translation memories

        * 

TRAINING SET: 1,500 TM segments annotated with information on whether
they are a valid translation of each other 
        * 

TEST SET: 500 TM segments 
        * 

LANGUAGE PAIRS: 

        * 

English-Italian 
        * 

English-German 
        * 

English-Spanish 

        * 

RELEASE OF THE TRAINING DATA: end of January 2016 

Participants are encouraged to submit working notes of their systems to
be presented during the workshop. More details, including the shared
task schedule will be announced soon in a dedicated Call for
Participation. 

4. SUBMISSION INFORMATION 

We invite contributions of either long papers (8 pages + 2 references)
which present unpublished original research or short paper/demos of
systems which present work in progress or working systems (4 pages + 2
references). The submissions do not need to be anonymised. 

All the papers will have to be submitted in PDF format via the START
system by following this link: https://www.softconf.com/lrec2016/NLP4TM/
[2] 

5. Identify, Describe and Share your LRs

As scientific work requires accurate citations of referenced work so as
to
allow the community to understand the whole context and also replicate
the
experiments conducted by other researchers, LREC 2016 endorses the need
to
uniquely Identify LRs through the use of the International Standard
Language
Resource Number (ISLRN, www.islrn.org [3]), a Persistent Unique
Identifier to be
assigned to each Language Resource. The assignment of ISLRNs to LRs
cited in
LREC papers will be offered at submission time. 

6. IMPORTANT DATES 

        * 

Submission deadline: 10th February 2016 
        * 

Acceptance notification: 7th March 2016 
        * 

Camera-ready versions: 31st March 2016 
        * 

Workshop date: 28th May 2016 

7. ORGANISING COMMITTEE 

For the workshop 

        * 

Constantin Orasan, University of Wolverhampton, UK 
        * 

Carla Parra, Hermes, Spain 
        * 

Eduard Barbu, Translated, Italy 
        * 

Marcello Federico, FBK, Italy 

For the shared task 

        * 

Eduard Barbu, Translated, Italy 
        * 

Carla Parra, Hermes, Spain 
        * 

Luca Mastrostefano, Translated, Italy 
        * 

Matteo Negri, FBK, Italy 
        * 

Marco Turchi, FBK, Italy 
        * 

Constantin Orasan, University of Wolverhampton, UK 

The organisers can be contacted by sending an email to
nlp4tm2...@gmail.com. 

8. PROGRAM COMMITTEE 

        * 

Juanjo Arevalillo, Hermes, Spain 
        * 

Yves Champollion, WordFast, France 
        * 

Gloria Corpas, University of Malaga, Spain 
        * 

Maud Ehrmann, EPFL, Switzerland 
        * 

Kevin Flanagan, Swansea University, UK 
        * 

Corina Forascu, University "Al. I. Cuza", Romania 
        * 

Gabriela Gonzalez, eTrad, Argentina 
        * 

Rohit Gupta, University of Wolverhampton, UK 
        * 

Manuel Herranz, Pangeanic, Spain 
        * 

Samuel Läubli, Autodesk, Switzerland 
        * 

Liangyou Li, DCU, Ireland 
        * 

Qun Liu, DCU, Ireland 
        * 

Ruslan Mitkov, University of Wolverhampton, UK 
        * 

Aleksandros Poulis, Lionbridge, Sweden 
        * 

Gabor Proszeky, Morphologic, Hungary 
        * 

Uwe Reinke, Cologne University of Applied Sciences, Germany 
        * 

Michel Simard, NRC, Canada 
        * 

Mark Shuttleworth, UCL, UK 
        * 

Masao Utiyama, NICT, Japan 
        * 

Mihaela Vela, Saarland University, Germany 
        * 

Andy Way, DCU, Ireland 
        * 

Jörn Wübker, Lilt, USA 
        * 

Marcos Zampieri, Saarland University and DFKI, Germany 

-- 

DR. CARLA PARRA ESCARTÍN 

 Especialista en tecnología aplicada - Investigadora Marie Curie -
EXPERT ITN [4]

 Applied Technology Engineer - Marie Curie Experienced Researcher -
EXPERT ITN [4] 

www.hermestrans.com [5] 

(+34) 91 640 7640 (Madrid) 

(+34) 95 202 0525 (Málaga)

AVISO LEGAL: Este mensaje está dirigido únicamente a su destinatario.
Contiene información CONFIDENCIAL sometida a secreto profesional o cuya
divulgación está prohibida por la ley. Si ha recibido este mensaje por
error, debe saber que su lectura, copia y uso no están autorizados. Le
rogamos que nos lo comunique inmediatamente por esta misma vía y proceda
a su destrucción. El correo electrónico mediante Internet no permite
asegurar la confidencialidad de los mensajes que se transmiten ni su
integridad o correcta recepción. Hermes Traducciones y Servicios
Lingüísticos, SL no asume responsabilidad alguna por estas
circunstancias y se reserva el derecho a ejercer las acciones legales
que le correspondan contra todo tercero que acceda de forma ilegítima al
contenido de este mensaje y al de los archivos en él contenidos. Si el
destinatario de este mensaje no consintiera la utilización del correo
electrónico por Internet y la grabación de los mensajes, rogamos que lo
ponga en nuestro conocimiento de forma inmediata. 

LEGAL NOTICE: This message is only intended for the addressee. It
contains CONFIDENTIAL information protected by professional secrecy.
Dissemination of such information is prohibited by law. If you have
received his message by mistake, please be aware that you are not
authorised to read, copy or use it. Please notify us immediately via
this means and destroy it. E-mail over the Internet does not allow to
ensure the confidentiality, integrity or correct reception of the
messages that are sent. Hermes Traducciones y Servicios Lingüísticos, SL
does not accept liability for these circumstances and reserves the right
to take the legal measures to which it is entitled against any third
party that unlawfully accesses the content of this message and the files
attached here to. If the addressee of this message does not consent to
the use of e-mail via the Internet and to messages being saved, please
notify us on an immediate basis.

 

Links:
------
[1] http://rgcl.wlv.ac.uk/nlp4tm2016/
[2] https://www.softconf.com/lrec2016/NLP4TM/
[3] http://www.islrn.org
[4] http://expert-itn.eu/
[5] http://www.hermestrans.com

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

[Moses-support] 1st CfP: 2nd Workshop on Natural Language Processing for Translation Memories (NLP4TM 2016) at LREC 2016

Reply via email to