6th Workshop on Building and Using Comparable Corpora

Conference Program


  Session: Invited talk
   
9:00-10:00 Three dimensions of comparable corpora: same or different language, given or inferred comparability, means to an end or end in itself
  Hinrich Schütze
   
  Session: (10:00-12:30) Terminology
   
10:00-10:30 Cross-lingual WSD for Translation Extraction from Comparable Corpora
  Marianna Apidianaki, Nikola Ljubešic and Darja Fišer
   
  Coffee break: (10:30-11:00)
   
11:00-11:30 Bilingual Lexicon Extraction via Pivot Language and Word Alignment Tool
  Hong-seok Kwon, Hyeong-won Seo and Jae-hoon Kim
   
11:30-12:00 Using WordNet and Semantic Similarity for Bilingual Terminology Mining from Comparable Corpora
  Dhouha Bouamor, Nasredine Semmar and Pierre Zweigenbaum
   
12:00-12:30 A Comparison of Smoothing Techniques for Bilingual Lexicon Extraction from Comparable Corpora
  Amir Hazem and Emmanuel Morin
   
  Session: (14:00-15:00) Comparable corpora
   
14:00-14:30 Finding More Bilingual Webpages with High Credibility via Link Analysis
  Chengzhi Zhang, Xuchen Yao and Chunyu Kit
   
14:30-15:00 A modular open-source focused crawler for mining monolingual and bilingual corpora from the web
  Vassilis Papavassiliou, Prokopis Prokopidis and Gregor Thurmair
   
  Session: (15:00-15:30) Posters with Booster Session
   
15:00-15:03 Building basic vocabulary across 40 languages
  Judit Acs, Katalin Pajkossy and Andras Kornai
   
15:04-15:07 Scientific registers and disciplinary diversification: a comparable corpus approach
  Elke Teich, Stefania Degaetano-Ortlieb, Hannah Kermes and Ekaterina Lapshinova-Koltunski
   
15:08-15:11 Improving MT System Using Extracted Parallel Fragments of Text from Comparable Corpora
  Rajdeep Gupta, Santanu Pal and Sivaji Bandyopadhyay
   
15:12-15:15 VARTRA: A Comparable Corpus for Analysis of Translation Variation
  Ekaterina Lapshinova-Koltunski
   
15:16-15:19 Building Ontologies from Collaborative Knowledge Bases to Search and Interpret Multilingual Corpora
  Yegin Genc, Elizabeth Lennon, Winter Mason and Jeffrey Nickerson
   
15:20-15:23 Using a Random Forest Classifier to recognise translations of biomedical terms across languages
  Georgios Kontonatsios, Ioannis Korkontzelos, Sophia Ananiadou and Jun’ichi Tsujii
   
15:24-15:27 Comparing Multilingual Comparable Articles Based On Opinions
  Motaz Saad, David Langlois and Kamel Smaili
   
  Coffee break: (15:30-16:00)
   
  Session: (16:00-18:00) Comparable corpora
   
16:00-16:30 Mining for Domain-specific Parallel Text from Wikipedia
  Magdalena Plamada and Martin Volk
   
16:30-17:00 Gathering and Generating Paraphrases from Twitter with Application to Normalization
  Wei Xu, Alan Ritter and Ralph Grishman
   
17:00-17:30 Learning Comparable Corpora from Latent Semantic Analysis Simplified Document Space
  Ekaterina Stambolieva
   
17:30-18:00 Chinese-Japanese Parallel Sentence Extraction from Quasi-Comparable Corpora
  Chenhui Chu, Toshiaki Nakazawa and Sadao Kurohashi
   


Serge Sharoff 2013-07-10