date:20220228

[Mt-list] CfP: Workshop on Multilingual de-identification of (sensitive) LRs at LREC 2022 - Marseille, June 20, 2022

2022-02-28 Thread ELRA ELDA Information


[Apologies for multiple postings]

CALL FOR PAPERS

Workshop on Multilingual de-identification of (sensitive) language resources

To be held in conjunction with the 13th International Language Resources 
and Evaluation Conference (LREC 2022)

20 June 2022, Le Palais du Pharo, Marseille, France

*Deadline for submission: 10 April 2022*

*Description*
The General Data Protection Regulation (GDPR - Regulation (EU) 2016/679 
of the European Parliament and of the Council of 27 April 2016) ensures 
the protection of natural persons with regard to the processing of 
personal data and on the free movement of such data. The GDPR outlines a 
specific set of rules that protect citizens and user data and create 
transparency in information sharing. GDPR is the strictest data privacy 
regulation in the world, and considerable work is taking place to 
develop techniques and deploy systems that help comply with this 
regulation while rendering data accessible and, thus, usable for further 
processing. Different techniques are studied to guarantee such 
compliance, implying different levels of sensitive content protection 
and with a short- or long-term guarantee depending on whether we may 
have access to additional related information. In this regard, we can 
read about work on anonymization, de-identification and 
pseudonymization. While anonymization implies a zero re-identification 
risk, which is extremely difficult to secure, de-identification and 
pseudonymization represent an attainable target under the GDPR, given 
that this regulation defines pseudonymization as “the processing of 
personal data in such a manner that the personal data can no longer be 
attributed to a specific data subject without the use of additional 
information, provided that such additional information is kept 
separately and is subject to technical and organisational measures to 
ensure that the personal data are not attributed to an identified or 
identifiable natural person.” Bearing this context in mind, multilingual 
approaches and kits for (sensitive) language resources de-identification 
may provide the means to share language data while also protecting 
private or sensitive data by spotting then deleting, obfuscating, 
pseudonymizing or encrypting person identifying information.


De-identification is typically performed for the purpose of protecting 
an individual’s private ctivities while maintaining the usefulness of 
the gathered data for research and development
purposes. This workshop aims at discussing the various approaches to 
effective and reliable text de-identification, focusing on some 
sensitive domains such as the medical and legal domains, but not only.


Based on these premises a consensus emerges that shows a clear situation 
and needs:


1. Tools for the multilingual de-identification of (sensitive) language
   resources are becoming essential to ensure that such resources can
   be shared.
2. De-identification is crucial to ensure that all legal & ethical
   considerations are taken into account during the
   production/repurposing phases but also that the quality/nature of
   the de-identified data sets remains appropriate to conduct research
   activities.
3. European Public Administrations need personal data processing tools
   to handle the extremely large amounts of data they manage.
4. Europe’s multilingual context will benefit from approaches and tools
   that can support the European Digital Market in their multilingual
   data exchanges.


*Workshop Objectives and Topics of Interest**
*
This workshop is organised by members of the MAPA project, funded by the 
EU Connecting Europe Facility (CEF) program (https://mapa-project.eu/). 
This project has developed a toolkit for the de-identification of texts 
in the medical and legal fields which addresses all EU official 
languages. It has followed a BERT-based Named Entity Recognition 
approach for personal information identification. A wide range of topics 
have been considered and are hot topics open for discussion to all 
participants of this workshop. Among them, we have the following:


1. Sensitive personal information, domains and services that require
   de-identification
2. Corpora annotation and/or creation
3. Annotation guidelines and platforms
4. De-identification tools, data and/or applications
5. De-identification and minority languages
6. Multi-domain and/or multilingual processing
7. NLP techniques and tools used for de-identification
8. Multimodal de-identification
9. Validation and benchmarking of de-identified resources
10. Evaluation of de-identification tools and applications
11. Evaluation protocols: how to evaluate, metrics, approaches, data,
   experiences
12. Best practices
13. Approaches, activities and systems addressing “anonymization” are
   also welcome to share their experience.
14. Any other topic related to de-identification


This workshop will also be a good forum to discuss the possibility to 
design and initiate a new

[Mt-list] DEADLINE EXTENSION: NeTTT - New Trends in Translation and Technology

2022-02-28 Thread Sheila Castilho M de Sousa

 *apologies for cross-posting*

*International Conference '**New Trends in Translation and Technology'
(NeTTT)*

*Rhodes Island, Greece, 4-6 July 2022 *





DEADLINE EXTENSION



*Abstract submission: *



*15 March 2022  (*extended abstracts for proposals for oral or poster
presentations)




The International Conference 'New Trends in Translation and Technology'
(NeTTT'2022) will take place on the island of Rhodes, Greece, 4-6 July
2022.   NeTTT'2022  invites submission (
https://www.softconf.com/l/nettt2022/) on the following topics:



*The conference *

The International Conference 'New Trends in Translation and Technology'
(NeTTT'2022) will take place on the island of Rhodes, Greece, 4-6 July
2022.

 The objective of the conference is to bring together academics in
linguistics, translation studies, machine translation, translation tools
and natural language processing, as well as developers, practitioners,
language service providers and vendors who work on or are interested in
different aspects of technology for translation. The conference will be
a distinctive event for discussing the latest developments and practices in
translation and technology. NeTTT'2022 invites all professionals who would
like to learn about recent trends, present their latest work, and/or share
their experiences in the field; the conference will be an idea place to
establish business and research contacts, collaborations and new ventures.

 The conference will feature a research track and an industry track and
will take the form of presentations (peer-reviewed research and industry
presentations, as well as keynote speeches), and posters. The authors of
the accepted abstracts will be offered the opportunity to write up full
papers which will be published as open-access conference e-proceedings.



NeTTT'2022 will be preceded by a two-day summer school (2-3 July) on Neural
Machine Translation, a workshop on Translation Technology for Creative
Domains (3 July) and a series of tutorials on July 7.





*Conference topics *



We expect contributions to the following topics:



*(A) Research track *



Submissions are invited for papers reporting on original work related to
any topic regarding the latest technology used in machine translation (MT),
human translation, computer-assisted translation (CAT), audiovisual
translation (AVT), localisation and interpreting. Research papers should
have a strong theoretical and/or methodological contribution or should
report significant research results. Topics include, but are not limited
to, the following:



   - Machine Translation (any MT-related research activity including Neural
   Machine Translation, from development to human factors, MT in literary
   texts, MT in AVT, MT in localisation, MT and creative texts, MT and
   interpreting)
   - CAT tools (Translation Memory (TM) systems, integration of MT in TM
   systems)
   - Quality assessment and quality control
   - MT evaluation techniques, metrics, and evaluation results
   - Speech Technology
   - Terminology management
   - Crowdsourcing
   - Translation Workflow and Management
   - Training (to include university programmes on translation and
   interpreting and training in this fast-changing industry)
   - New and emerging language and mobile technologies
   - Tools and resources that can support the work of translators,
   localisers, subtitlers and interpreters, Linguistic resources for MT e.g.
   dictionaries, termbases, corpora, etc.
   - Human and cognitive factors in MT and user interfaces
   - Ethical issues in translation and technology
   - NLP support for translation
   - MT integration with NLP tasks





*(B) User track *



Submissions are invited for reports on case studies and implementation
experience with translation technology in LSPs, large corporations,
government organisations and NGOs as well as by translation
practitioners. Topics
include, but are not limited to, the following:



   - Integrating MT and computer-assisted translation into the translation
   workflow
   - Translation Memory systems in practice
   - Integration of Translation Memory with Machine Translation
   - Terminology tools
   - QA and MT evaluation in business settings
   - Post-editing strategies and tools
   - Legal and ethical issues associated with translation technology (e.g.
   copyright, privacy)
   - Using MT in social media or business communication
   - Impact of MT on translators' work: ergonomics, invoicing issues, new
   skill sets required
   - Freelance translators' independent use of MT
   - MT and usability
   - MT in crisis settings
   - Translation technologies training





*Schedule *

*15 March 2022 *-*  Submission deadline (*extended abstracts for proposals
for oral or poster presentations)

15 April 2022  -*  Notification of acceptance of abstracts*

20 June 2022  -* Full Paper Submission *(optional: full-length papers for
publication in the e-proceedings)

2-3 July 2022  -* Summer School on Neural

[Mt-list] CfP: Workshop on Multilingual de-identification of (sensitive) LRs at LREC 2022 - Marseille, June 20, 2022

[Mt-list] DEADLINE EXTENSION: NeTTT - New Trends in Translation and Technology

2 matches

Site Navigation

Mail list logo

Footer information