BUCC, 14th Workshop on Building and Using Comparable Corpora
TOPICS
This year our special topic is "Neural Networks in Comparable Corpora Research". But we solicit contributions on all topics related to comparable (and parallel) corpora, including but not limited to the following:
- Automatic and semi-automatic methods
- Methods to mine parallel and non-parallel corpora from the web
- Tools and criteria to evaluate the comparability of corpora
- Parallel vs non-parallel corpora, monolingual corpora
- Rare and minority languages, across language families
- Multi-media/multi-modal comparable corpora
- Human translation
- Language learning
- Cross-language information retrieval & document categorization
- Bilingual and multilingual projections
- Machine translation
- Writing assistance
- Machine learning techniques using comparable corpora
- Cross-language distributional semantics, word embeddings and pre-trained multilingual transformer models
- Extraction of parallel segments or paraphrases from comparable corpora
- Methods to derive parallel from non-parallel corpora (e.g. to provide for low-resource languages in neural machine translation)
- Extraction of bilingual and multilingual translations of single words and multi-word expressions, proper names, and named entities from comparable corpora
- Induction of morphological, grammatical, and translation rules from comparable corpora
- Induction of multilingual word classes from comparable corpora
IMPORTANT DATES
PRACTICAL INFORMATION
The workshop proceedings (full PDF; full list of BibTeX entries) are published in the ACL Anthology. See the Program page for information about connection.
Workshop fees are 45 Euros for presenters and 15 Euros for non-presenters. For further details see https://ranlp.org/ranlp2021/fees.php
SUBMISSION INFORMATION
Please follow the style sheet and templates provided for the main conference at https://www.softconf.com/ranlp2021/BUCC2021/. Papers should be submitted as a PDF file at https://www.softconf.com/ranlp2021/BUCC2021/. Submissions must describe original and unpublished work and range from four (4) to eight (8) pages plus unlimited references.
Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings (full PDF; full list of BibTeX entries).
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers.