LINGUIST List 31.380

Mon Jan 27 2020

Calls: English; Computational Linguistics/France

Editor for this issue: Lauren Perkins <lauren@linguistlist.org>



Date: 26-Jan-2020
From: Reinhard Rapp <reinhardrapp@gmx.de>
Subject: 13th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA

Full Title: 13th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA
Short Title: BUCC 2020

Date: 11-May-2020 - 11-May-2020
Location: Marseille, France
Contact Person: Reinhard Rapp
Web Site: https://comparable.limsi.fr/bucc2020/

Linguistic Field(s): Computational Linguistics

Subject Language(s): English

Call Deadline: 25-Feb-2020

Meeting Description:

13th WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA

Co-located with LREC 2020, Pharo Palace, Marseille, France

Monday, May 11, 2020

SHARED TASK: Bilingual dictionary induction from comparable corpora

Workshop website: https://comparable.limsi.fr/bucc2020/

Shared task website: https://comparable.limsi.fr/bucc2020/bucc2020-task.html

Invited speaker: Holger Schwenk, Facebook AI Research


Call for Papers:

Motivation:
In the language engineering and linguistics communities, research on comparable corpora has two main motivations. In language engineering, it is chiefly motivated by the need to use comparable corpora as training data for statistical NLP applications such as statistical and neural machine translation or cross-lingual retrieval. In linguistics, comparable corpora are of interest in themselves because they make cross-language discoveries and comparisons possible. It is generally accepted in both communities that comparable corpora are documents in one or several languages that are comparable in content and form to various degrees and along various dimensions. We believe that the linguistic definitions and observations related to comparable corpora can improve methods for mining such corpora for statistical NLP applications. It is therefore of great interest to bring together builders and users of such corpora.

Topics:
We solicit contributions on all topics related to comparable corpora, including but not limited to the following:

Building Comparable Corpora:
• Human translations
• Automatic and semi-automatic methods
• Methods to mine parallel and non-parallel corpora from the web
• Tools and criteria to evaluate the comparability of corpora
• Parallel vs non-parallel corpora, monolingual corpora
• Rare and minority languages, across language families
• Multi-media/multi-modal comparable corpora

Applications of comparable corpora:
• Human translations
• Language learning
• Cross-language information retrieval & document categorization
• Bilingual projections
• Machine translation
• Writing assistance
• Machine learning techniques using comparable corpora

Mining from Comparable Corpora:
• Induction of morphological, grammatical, and translation rules from comparable corpora
• Extraction of parallel segments or paraphrases from comparable corpora
• Extraction of bilingual and multilingual translations of single words and multi-word expressions, proper names, and named entities from comparable corpora
• Induction of multilingual word classes from comparable corpora
• Cross-language distributional semantics

Submission Information: Please see the BUCC 2020 website at http://comparable.limsi.fr/bucc2020/

Important Dates:
25 February 2020: Paper submission deadline
12 March 2020: Notification of acceptance
Mid-March 2020 (tentative): Early-bird registration (reduced rates)
2 April 2020: Camera-ready final papers
11 May 2020: Workshop date


Shared Task: Bilingual dictionary induction from comparable corpora. For information on the shared task, see https://comparable.limsi.fr/bucc2020/bucc2020-task.html

Workshop and Shared Task Organizers:
Reinhard Rapp (Magdeburg-Stendal University of Applied Sciences and University of Mainz, Germany), Chair and contact person: reinhardrapp (at) gmx (dot) de
Pierre Zweigenbaum (Université Paris-Saclay, CNRS, Orsay, France)
Serge Sharoff (University of Leeds, United Kingdom)




Page Updated: 27-Jan-2020