BUCC, 12th Workshop on Building and Using Comparable Corpora
TOPICS
The special topic for this year is Neural Networks for Building and Using Comparable Corpora. More broadly, we solicit contributions to the following topics:
- Automatic and semi-automatic methods
- Methods to mine parallel and non-parallel corpora from the Web
- Tools and criteria to evaluate the comparability of corpora
- Parallel vs non-parallel corpora, monolingual corpora
- Rare and minority languages, across language families
- Multi-media/multi-modal comparable corpora
- Human translations
- Language learning
- Cross-language information retrieval & document categorization
- Bilingual projections
- Machine translation
- Writing assistance
- Cross-language distributional semantics and word embeddings
- Extraction of parallel segments or paraphrases from comparable corpora
- Methods to extract parallel from non-parallel corpora (e.g. to provide for low-resource languages in neural machine translation)
- Extraction of bilingual and multilingual translations of single words and multi-word expressions; proper names, named entities, etc., from comparable corpora
IMPORTANT DATES
SUBMISSION INFORMATION
Please follow the style sheet and templates provided for the main conference at http://lml.bas.bg/ranlp2019/submissions.php. Papers should be submitted as a PDF file at https://www.softconf.com/ranlp2019/BUCC/. Submissions must describe original and unpublished work and range from four (4) to eight (8) pages plus unlimited references.
Reviewing will be double blind, so the papers should not reveal the authors' identity. Accepted papers will be published in the workshop proceedings.
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers.
For further information, please contact Serge Sharoff <S (dot) Sharoff (at) leeds (dot) ac (dot) uk>