6th Workshop on Building and Using Comparable Corpora

Conference Program

  Session: Invited talk
9:00-10:00 Three dimensions of comparable corpora: same or different language, given or inferred comparability, means to an end or end in itself
  Hinrich Schütze
  Session: (10:00-12:30) Terminology
10:00-10:30 Cross-lingual WSD for Translation Extraction from Comparable Corpora
  Marianna Apidianaki, Nikola Ljubešić and Darja Fišer
  Coffee break: (10:30-11:00)
11:00-11:30 Bilingual Lexicon Extraction via Pivot Language and Word Alignment Tool
  Hong-seok Kwon, Hyeong-won Seo and Jae-hoon Kim
11:30-12:00 Using WordNet and Semantic Similarity for Bilingual Terminology Mining from Comparable Corpora
  Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum
12:00-12:30 A Comparison of Smoothing Techniques for Bilingual Lexicon Extraction from Comparable Corpora
  Amir Hazem and Emmanuel Morin
  Session: (14:00-15:00) Comparable corpora
14:00-14:30 Finding More Bilingual Webpages with High Credibility via Link Analysis
  Chengzhi Zhang, Xuchen Yao and Chunyu Kit
14:30-15:00 A modular open-source focused crawler for mining monolingual and bilingual corpora from the web
  Vassilis Papavassiliou, Prokopis Prokopidis and Gregor Thurmair
  Session: (15:00-15:30) Posters with Booster Session
15:00-15:03 Building basic vocabulary across 40 languages
  Judit Ács, Katalin Pajkossy and András Kornai
15:04-15:07 Scientific registers and disciplinary diversification: a comparable corpus approach
  Elke Teich, Stefania Degaetano-Ortlieb, Hannah Kermes and Ekaterina Lapshinova-Koltunski
15:08-15:11 Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora
  Rajdeep Gupta, Santanu Pal and Sivaji Bandyopadhyay
15:12-15:15 VARTRA: A Comparable Corpus for Analysis of Translation Variation
  Ekaterina Lapshinova-Koltunski
15:16-15:19 Building Ontologies from Collaborative Knowledge Bases to Search and Interpret Multilingual Corpora
  Yegin Genc, Elizabeth Lennon, Winter Mason and Jeffrey Nickerson
15:20-15:23 Using a Random Forest Classifier to recognise translations of biomedical terms across languages
  Georgios Kontonatsios, Ioannis Korkontzelos, Sophia Ananiadou and Jun’ichi Tsujii
15:24-15:27 Comparing Multilingual Comparable Articles Based On Opinions
  Motaz Saad, David Langlois and Kamel Smaili
  Coffee break: (15:30-16:00)
  Session: (16:00-18:00) Comparable corpora
16:00-16:30 Mining for Domain-specific Parallel Text from Wikipedia
  Magdalena Plamada and Martin Volk
16:30-17:00 Gathering and Generating Paraphrases from Twitter with Application to Normalization
  Wei Xu, Alan Ritter and Ralph Grishman
17:00-17:30 Learning Comparable Corpora from Latent Semantic Analysis Simplified Document Space
  Ekaterina Stambolieva
17:30-18:00 Chinese-Japanese Parallel Sentence Extraction from Quasi-Comparable Corpora
  Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi

Serge Sharoff 2013-07-10