Using Translation Memory to Speed up Translation Process

Marija Brkić
Department of Informatics - University of Rijeka
Sanja Seljan
Department of Information Sciences
Faculty of Humanities and Social Sciences - University of Zagreb
Božena Bašić Mikulić
Primary school “Turnić”
Outline
• Motivation
• Translation memories
• Experimental study
• Description
• Results
• Evaluation
• Conclusion
Marija Brkić, Sanja Seljan, Božena Bašić Mikulić
2/22
Motivation
• Globalization
– languages differ culturally, stylistically, syntactically and lexically
• EU accession negotiations
– consistency
• Information exchange
– data sharing
– insufficient number of translators
– short deadlines
Translation memories
• Approaches
– Sentence-based approach
– Character-string-in-bitext-based approach
• Advantages
• Disadvantages
Sentence-based approach
• Translation units (TUs)
– Sentences
– Titles
– Subtitles
– List entries
• High precision, low recall search
• Fuzzy matching
• Noise
• Silence
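As a rough illustration of sentence-level lookup with fuzzy matching, here is a minimal Python sketch. It is not how Déjà Vu or any other TM tool scores matches: the similarity measure (`difflib.SequenceMatcher`), the 0.75 threshold, the function names and the sample English-Croatian units are all assumptions made for this example. Lowering the threshold tends to return more irrelevant candidates (noise), while raising it tends to hide useful ones (silence).

```python
from difflib import SequenceMatcher

# Hypothetical translation memory: source segment -> target segment.
TM = {
    "Click the Save button.": "Kliknite gumb Spremi.",
    "Click the Cancel button.": "Kliknite gumb Odustani.",
    "The file could not be opened.": "Datoteku nije moguće otvoriti.",
}

def similarity(a: str, b: str) -> float:
    """Character-based similarity in [0, 1]; a stand-in for a TM tool's match score."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def lookup(segment: str, tm: dict, threshold: float = 0.75):
    """Return (score, source, target) for stored units scoring at or above the threshold, best first."""
    hits = [(similarity(segment, src), src, tgt) for src, tgt in tm.items()]
    hits = [h for h in hits if h[0] >= threshold]
    return sorted(hits, key=lambda h: h[0], reverse=True)

exact = lookup("Click the Save button.", TM)    # contains a 100% match
fuzzy = lookup("Click the Close button.", TM)   # only partial matches
```

An exact hit can be reused as is, whereas a fuzzy hit is only a suggestion that the translator still has to edit.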
Character-string-in-bitext-based approach
• Character strings of any length
– Several consecutive identical sentences
• Unreliably small matches
• Noise
• No support for fuzzy matching
• Silence
• Recycling internal repetitions through terminology databases
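The idea of matching character strings of any length, rather than whole sentences, can be sketched as a longest-common-string search over the source side of a stored bitext. This is only a sketch under stated assumptions: the function name, the 15-character cutoff for discarding unreliably small matches and the sample texts are invented here, and retrieving the aligned target-side string is left out.

```python
from difflib import SequenceMatcher

def longest_shared_string(new_text: str, stored_source: str, min_chars: int = 15):
    """Return the longest character string common to both texts, or None if it is
    shorter than min_chars (very short matches are treated as unreliable)."""
    matcher = SequenceMatcher(None, new_text, stored_source, autojunk=False)
    m = matcher.find_longest_match(0, len(new_text), 0, len(stored_source))
    if m.size < min_chars:
        return None
    return new_text[m.a : m.a + m.size]

# Made-up source texts; the shared string spans a sentence boundary.
stored = "Press the power button. Wait until the green light is on. Then open the cover."
new = "Close the cover. Wait until the green light is on. Then press start."

match = longest_shared_string(new, stored)
```

Because the search ignores sentence boundaries, the match here extends beyond the shared sentence into the neighbouring text, and very short overlaps are dropped by the cutoff.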
Advantages
• Consistency
– Internal and external repetitions
– Shared projects
• Speed
– Globalization
• Cost reduction
• Preservation of original page layout
• Statistics
– Scheduling localization projects
Disadvantages
• Initial decrease in productivity
• Beneficial effects felt only on repetitive texts
• TM needs to be of considerable size
• Is alignment worthwhile, and are its preconditions met?
• Database maintenance

Disadvantages (cont.)
• File format support
• The quality of translation
• Lack of language knowledge
• Context insensitivity
• Additional costs
Experimental study
• Feature: the speed of the translation process
• Tool: ATRIL – Déjà vu
• Parameters:
– Measure
– Evaluation procedure
– Score
– Metric
– Languages
– Text type
– System used

Experimental study (cont.)
• Text selection phase
• Preparation phase
• Translation phase
• Revision phase

Experimental study (cont.)
Figure 1: Matches found by DVX after the third source text was inserted
Problems
• morphology
• word order
• capitalization
• punctuation
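The four problem types behave differently when segments are compared, which the following Python sketch tries to show. It is an illustration only: the slides do not say that any normalization was applied, and the `normalize` helper and the made-up Croatian segments are assumptions. Capitalization and punctuation differences disappear after simple normalization, whereas morphology and word order differences do not.

```python
import string

def normalize(segment: str) -> str:
    """Lowercase, strip ASCII punctuation and collapse whitespace (illustrative only)."""
    no_punct = segment.translate(str.maketrans("", "", string.punctuation))
    return " ".join(no_punct.lower().split())

stored = "Otvorite datoteku."  # hypothetical segment already in the TM

# One hypothetical variant per problem type listed above.
variants = {
    "morphology": "Otvorite datoteke.",
    "word order": "Datoteku otvorite.",
    "capitalization": "otvorite datoteku.",
    "punctuation": "Otvorite datoteku!",
}

# For each problem: (exact match?, match after normalization?)
results = {
    problem: (text == stored, normalize(text) == normalize(stored))
    for problem, text in variants.items()
}

for problem, (exact, normalized) in results.items():
    print(f"{problem:15} exact={exact} normalized={normalized}")
```

None of the variants is an exact match; only the capitalization and punctuation variants match after normalization, so the other two would have to be caught by fuzzy matching or linguistic processing.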
Evaluation
• Automatic
• Manual
– Post-editing
– 1-4 scale
Chart 1: Effectiveness of TM (categories: absolutely useless, many changes needed, few changes needed, no changes needed; shares as extracted: 7%, 2%, 20%, 71%)
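A manual evaluation on a 1-4 post-editing scale can be summarized as percentage shares per category, as in the sketch below. The labels come from the chart legend, but the assignment of numbers to labels, the function name and the sample ratings are assumptions; the sample is made up and is not the study's data.

```python
from collections import Counter

# Labels from the chart legend; which number maps to which label is an assumption.
SCALE = {
    1: "absolutely useless",
    2: "many changes needed",
    3: "few changes needed",
    4: "no changes needed",
}

def rating_shares(ratings):
    """Return the percentage of rated segments per label, rounded to whole percent."""
    counts = Counter(ratings)
    total = len(ratings)
    return {label: round(100 * counts[score] / total) for score, label in SCALE.items()}

# Made-up ratings for ten TM suggestions, not the study's data.
sample = [4, 4, 4, 3, 4, 2, 4, 3, 4, 1]
shares = rating_shares(sample)
```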

Evaluation (cont.)
Chart 2: Total time for traditional translation and TM (y-axis: time in minutes; series: traditional translation, TM; categories: first text, second text, third text; bar labels as extracted: 69, 63, 42, 36, 36, 34)

Evaluation (cont.)
Chart 3: Time spent in translation process for traditional translation and TM (y-axis: time in minutes; series: traditional translation, TM; categories: first text, second text, third text; bar labels as extracted: 37, 31, 30, 29, 23)
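A comparison of timings like this one is usually reduced to a relative saving per text. The sketch below shows that arithmetic; the function name and the timings are placeholders chosen for illustration and are not the values measured in the study.

```python
def time_saving(traditional_min: float, tm_min: float) -> float:
    """Relative saving of TM over traditional translation, as a percentage."""
    if traditional_min <= 0:
        raise ValueError("traditional time must be positive")
    return 100 * (traditional_min - tm_min) / traditional_min

# Placeholder timings in minutes as (traditional, TM); NOT the values from the charts.
example = {"text A": (60, 45), "text B": (40, 30)}
savings = {name: round(time_saving(t, m), 1) for name, (t, m) in example.items()}
```

A positive value means the TM-assisted translation was faster; a negative value would indicate the kind of initial productivity drop listed among the disadvantages.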
Conclusion
• TMs speed up the translation process
• Database maintenance is time-consuming only at the beginning
• Fast, consistent and professional-quality translations
Thank you for your attention!