BUCC, 10th Workshop on Building and Using Comparable Corpora
Shared task: parallel sentence extraction from comparable corpora
Sample, training and test data are available for four language pairs
TOPICS
We solicit contributions including but not limited to the following topics:
- Building Comparable Corpora:
-
- Human translations
- Automatic and semi-automatic methods
- Methods to mine parallel and non-parallel corpora from the Web
- Tools and criteria to evaluate the comparability of corpora
- Parallel vs non-parallel corpora, monolingual corpora
- Rare and minority languages, across language families
- Multi-media/multi-modal comparable corpora
- Applications of comparable corpora:
-
- Human translations
- Language learning
- Cross-language information retrieval & document categorization
- Bilingual projections
- Machine translation
- Writing assistance
- Machine learning techniques using comparable corpora
- Mining from Comparable Corpora:
-
- Induction of morphological, grammatical, and translation rules from comparable corpora
- Extraction of parallel segments or paraphrases from comparable corpora
- Extraction of bilingual and multilingual translations of single words and multi-word expressions, proper names, and named entities from comparable corpora
- Induction of multilingual word classes from comparable corpora
- Cross-language distributional semantics
IMPORTANT DATES
27 April 2017 | Deadline for submission of full papers |
19 May 2017 | Notification to authors |
26 May 2017 | Camera-ready papers due |
3 August 2017 | Workshop date |
SUBMISSION INFORMATION
Papers should follow the ACL main conference formatting details (see the ACL conference website http://acl2017.org/calls/papers/) and should be submitted as a PDF-file via the START workshop manager at https://www.softconf.com/acl2017/bucc/.
Contributions can be short or long papers. Short paper submission must describe original and unpublished work without exceeding four (4) pages of content, plus unlimited references. Characteristics of short papers include: a small, focused contribution; work in progress; a negative result; an opinion piece; an interesting application nugget. Long paper submissions must describe substantial, original, completed and unpublished work without exceeding eight (8) pages of content, plus unlimited references.
Reviewing will be double blind, so the papers should not reveal the authors’ identity. Accepted papers will be published in the workshop proceedings.
Double submission policy: Parallel submission to other meetings or publications is possible but must be immediately notified to the workshop organizers.
For further information, please contact Serge Sharoff <S (dot) Sharoff (at) leeds (dot) ac (dot) uk>
Plain-text CFP : bucc2017-_cfp.txt
PDF CFP : bucc2017-_cfp.pdf
Last modified: 17 May 2017