Subject: 8th BUCC Workshop

BUCC, 8TH WORKSHOP ON BUILDING AND USING COMPARABLE CORPORA

Co-located with ACL 2015
Beijing (China)
30 July 2015

Extended deadline for papers: 15 May 2015
Website: http://comparable.limsi.fr/bucc2015/

INVITED SPEAKER: Benjamin Tsou, City University of Hong Kong

IMPORTANT DATES

15 May 2015    Deadline for submission of full papers
4 June 2015    Notification of acceptance
21 June 2015   Camera-ready papers due
30 July 2015   Workshop date

TOPICS

We solicit contributions including but not limited to the following topics.

Building Comparable Corpora:
• Human translations
• Automatic and semi-automatic methods
• Methods to mine parallel and non-parallel corpora from the Web
• Tools and criteria to evaluate the comparability of corpora
• Parallel vs non-parallel corpora, monolingual corpora
• Rare and minority languages, across language families
• Multi-media/multi-modal comparable corpora

Applications of comparable corpora:
• Human translations
• Language learning
• Cross-language information retrieval & document categorization
• Bilingual projections
• Machine translation
• Writing assistance
• Machine learning techniques using comparable corpora

Mining from Comparable Corpora:
• Induction of morphological, grammatical, and translation rules from comparable corpora
• Extraction of parallel segments or paraphrases from comparable corpora
• Extraction of bilingual and multilingual translations of single words and multi-word expressions, proper names, and named entities from comparable corpora
• Induction of multilingual word classes from comparable corpora
• Cross-language distributional semantics

SUBMISSION INFORMATION

Submissions should follow the ACL 2015 length and formatting requirements found at http://acl2015.org/call_for_papers.html: long papers can have a maximum of eight (8) pages of content plus two (2) extra pages for references, while short papers can have a maximum of four (4) pages of content plus two (2) extra pages for references.
For more detail, see the BUCC 2015 website: http://comparable.limsi.fr/bucc2015/bucc2015-cfp.html

ORGANISERS

Pierre Zweigenbaum, LIMSI, CNRS, Orsay (France), Chair
Serge Sharoff, University of Leeds (UK), Shared Task Chair
Reinhard Rapp, University of Mainz (Germany)

SCIENTIFIC COMMITTEE

Ahmet Aker (University of Sheffield, UK)
Srinivas Bangalore (AT&T Labs, US)
Caroline Barrière (CRIM, Montréal, Canada)
Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
Kurt Eberle (Lingenio, Heidelberg, Germany)
Andreas Eisele (European Commission, Luxembourg)
Éric Gaussier (Université Joseph Fourier, Grenoble, France)
Gregory Grefenstette (INRIA, Saclay, France)
Silvia Hansen-Schirra (University of Mainz, Germany)
Hitoshi Isahara (Toyohashi University of Technology)
Kyo Kageura (University of Tokyo, Japan)
Adam Kilgarriff (Lexical Computing Ltd, UK)
Natalie Kübler (Université Paris Diderot, France)
Philippe Langlais (Université de Montréal, Canada)
Michael Mohler (Language Computer Corp., US)
Emmanuel Morin (Université de Nantes, France)
Dragos Stefan Munteanu (Language Weaver, Inc., US)
Lene Offersgaard (University of Copenhagen, Denmark)
Ted Pedersen (University of Minnesota, Duluth, US)
Reinhard Rapp (Université Aix-Marseille, France)
Sujith Ravi (Google, US)
Serge Sharoff (University of Leeds, UK)
Michel Simard (National Research Council Canada)
Tim Van de Cruys (IRIT-CNRS, Toulouse, France)
Stephan Vogel (QCRI, Qatar)
Guillaume Wisniewski (Université Paris Sud & LIMSI-CNRS, Orsay, France)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)