Transcript [poster]

Using Monolingual Crowds to
Improve Translation
1
Kronrod ,
2
Hu ,
2
Buzek ,
2
Quinn
Yakov
Chang
Olivia
Alex
University of Maryland (1) Dept. of Linguistics (2) Dept. of Computer Science
ParaTrans: Targeted Paraphrasing
Space of Available Translation Options
Affordability
Monolingual
Human
Participation

Monolingual speakers improving translation quality

Human participants perform the following tasks:
1. Identify mistranslated spans in target language
2. Create paraphrases for spans in source language
responsible for the target error spans.
•
Produces gains of 0.82 and 1.32 for adequacy and
fluency on a 5-point scale
•
Improves 65.3% of sentences examined
•
Yields increases of +0.9 BLEU to +3.8 BLEU,
depending on size of error span, in an oracle study.
Quality
Paraphrases from the Crowd
Translations and Human Judgments
Original Google Translate output (GT), with translations produced by the targeted
paraphrase translation process (TP), selected to show a range from strong to weak
improvements in fluency. Fluency and adequacy (preservation of meaning) were
rated on a 5-point scale by independent judges not familiar with this research.
The Bigger Picture: MonoTrans
Source Language
Target Language
MT
Sentence
Sentence
Crowd
Tasks:
Features
Results
•Online system for collaborative
monolingual translation
•Asynchronous edits, short
tasks, minimal context
1 Vote
2 Identify error spans
3 Create new
translation
candidates
Crowd
Tasks:
1 Vote
2 Paraphrase
3 Explain errors
New candidate
repeat …
…
1
2
3
68% of sentences translated
perfectly with no bilingual humans
UMD HCI Lab (cs.umd.edu/hcil)
Comp. Ling. and Info. Processing Lab (umiacs.umd.edu/labs/clip)
MonoTrans Project Page: (http://www.cs.umd.edu/hcil/monotrans)