11th Workshop on Building and Using Comparable Corpora; Co-located with LREC 2018

11th Workshop on Building and Using Comparable Corpora

08:55-9:00	*Opening Remarks*

	Session 1: Invited Presentation
09:00-10:00	Kyo Kageura
	Cross-lingual Correspondences of Terms in Texts and Terminologies: Theoretical Issues and Practical Implications

	Session 2a: Applications of Comparable Corpora
10:00-10:30	Laurent Prévot, Matthieu Stali and Shu-Chuan Tseng
	Grouping conversational markers across languages by exploiting large comparable corpora and unsupervised segmentation
10:30-11:00	*Coffee Break*

	Session 2b: Applications of Comparable Corpora
11:00-11:30	Bartholomäus Wloka
	Identifying Bilingual Topics in Wikipedia for Efficient Parallel Corpus Extraction and Building Domain-Specific Glossaries for the Japanese-English Language Pair
11:30-12:00	Firas Sabbah and Ahmet Aker
	Creating Comparable Corpora through Topic Mappings
12:00-12:30	Pierre Lison and A. Seza Dogruöz
	Detecting Machine-translated Subtitles in Large Parallel Corpora
12:30-14:00	*Lunch Break*

	Session 3: Invited Presentation
14:00-15:00	Yves Lepage
	Quasi-Parallel Corpora: Hallucinating Translations for the Chinese-Japanese Language Pair

	Session 4a: Shared Task: Parallel Sentence Extraction
15:00-15:30	Pierre Zweigenbaum, Serge Sharoff and Reinhard Rapp
	Overview of the Third BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora
15:30-16:00	Houda Bouamor and Hassan Sajjad
	H2@BUCC18: Parallel Sentence Extraction from Comparable Corpora Using Multilingual Sentence Embeddings
16:00-16:30	*Coffee Break*

	Session 4b: Shared Task: Parallel Sentence Extraction
16:30-17:00	Andoni Azpeitia, Thierry Etchegoyhen and Eva Martinez Garcia
	Extracting Parallel Sentences from Comparable Corpora with STACC Variants
17:00-17:30	Chongman Leong, Derek F. Wong and Lidia S. Chao
	UM-pAligner: Neural Network-Based Parallel Sentence Identification Model
17:30-17:35	*Closing*