3rd Workshop on Building and Using Comparable Corpora
Applications of Parallel and Comparable Corpora in Natural Language
Engineering and the Humanities
LREC 2010 post-conference workshop, 22 May 2010
Mediterranean Conference Centre, Valletta, Malta
http://www.fb06.uni-mainz.de/lk/bucc2010
We invite submissions for the third edition of the Workshop on Building and Using Comparable Corpora. Comparable corpora are collections of documents that are comparable in content and form to various degrees and along various dimensions. This definition covers many types of parallel and non-parallel multilingual corpora, as well as sets of monolingual corpora that are used for comparative purposes. By bringing together researchers from several disciplines, including natural language engineering and the humanities, the workshop aims to show the full breadth of research on comparable corpora. We solicit contributions on a range of topics, including but not limited to:
Building Comparable Corpora
Applications of Comparable Corpora
Mining from Comparable Corpora
Important Dates
March 1, 2010 | Deadline for paper submissions
March 12, 2010 | Notification of acceptance
March 29, 2010 | Camera-ready copies due
May 22, 2010 | Workshop
Invited Speaker
Adam Kilgarriff (Lexical Computing Ltd)
Workshop Co-chairs
Reinhard Rapp (University of Tarragona, Spain)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)
Serge Sharoff (University of Leeds, UK)
Programme Committee
Srinivas Bangalore (AT&T Labs, USA)
Caroline Barrière (National Research Council Canada)
Chris Biemann (Microsoft / Powerset, San Francisco, USA)
Lynne Bowker (University of Ottawa, Canada)
Hervé Déjean (Xerox Research Centre Europe, Grenoble, France)
Kurt Eberle (Lingenio, Heidelberg, Germany)
Andreas Eisele (DFKI, Saarbrücken, Germany)
Pascale Fung (Hong Kong University of Science & Technology, China)
Éric Gaussier (Université Joseph Fourier, Grenoble, France)
Gregory Grefenstette (Exalead, Paris, France)
Silvia Hansen-Schirra (University of Mainz, Germany)
Hitoshi Isahara (NICT, Tokyo, Japan)
Kyo Kageura (University of Tokyo, Japan)
Min-Yen Kan (National University of Singapore)
Adam Kilgarriff (Lexical Computing Ltd, UK)
Natalie Kübler (Université Paris Diderot, France)
Philippe Langlais (Université de Montréal, Canada)
Tony McEnery (Lancaster University, UK)
Emmanuel Morin (Université de Nantes, France)
Dragos Stefan Munteanu (Language Weaver Inc., USA)
Carol Peters (ISTI-CNR, Pisa, Italy)
Emmanuel Prochasson (Hong Kong University of Science & Technology, China)
Reinhard Rapp (University of Tarragona, Spain)
Sujith Ravi (ISI, University of Southern California, USA)
Serge Sharoff (University of Leeds, UK)
Michel Simard (National Research Council Canada)
Richard Sproat (OGI School of Science and Technology, USA)
Michael Zock (LIF, CNRS Marseille, France)
Pierre Zweigenbaum (LIMSI-CNRS, Orsay, France)
Submissions
Papers of no more than 10 pages that follow the LREC main conference formatting guidelines (available at http://www.lrec-conf.org/lrec2010/?Author-s-Kit-and-Templates) should be submitted as a PDF file via the START conference manager at https://www.softconf.com/lrec2010/BUCC2010/. Papers will be blind reviewed by at least two members of the Programme Committee, so authors' names and affiliations should not appear in the paper. Accepted papers will be published in the workshop proceedings.
This year, when submitting a paper from the START page, authors will be asked to participate in a survey on the use of language resources by providing essential information about the resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that were used for the work described in the paper or that are a new result of the research. For further information on this new initiative, please refer to http://www.lrec-conf.org/lrec2010/?LREC2010-Map-of-Language-Resources.
Further Information
If you have questions, please consult the workshop website at http://www.fb06.uni-mainz.de/lk/bucc2010 or contact Reinhard Rapp at reinhardrapp AT gmx DOT de