Automating Your QA Process to Save Time and Improve Consistency Iñaki Hernández-Lasa Linguistic Engineering Department Manager Tek Translation International.
Download ReportTranscript Automating Your QA Process to Save Time and Improve Consistency Iñaki Hernández-Lasa Linguistic Engineering Department Manager Tek Translation International.
Automating Your QA Process to Save Time and Improve Consistency
Iñaki Hernández-Lasa Linguistic Engineering Department Manager Tek Translation International
Agenda
The Need For Automated Language Quality What Can Be Automated Technologies Available Real Production Case Study The Language Quality Checker
The Need For Automated Language Quality
Language Quality tasks are far too time-consuming.
Manual language review activity is prone to error.
Manual language review provides no statistical analysis.
Manual language review cannot “catch” all errors.
Reviewers are subjective and constantly introduce exceptions.
There is a need to introduce a more empirical and objective approach to language.
Statistical analysis for Language Quality Benchmarking (LQB) is critical in our industry.
Issues with traditional spell checking tools – insertion of lists for acceptance.
What Can Be Automated
Spelling verification Source segment with multiple translations Source segment = Target segment Punctuation Terminology adherence to an approved glossary Overtranslations: CaMel words and Product Names
Technologies Available
Hashtables: Association between keys and values Fast searches, for instance source with multiple targets Sou=Key Targ= Value Regular expressions engine: special text string for describing a pattern Overtranslation : CaMel word in source and in target iFilter Windows interface: interpreting different file formats .rtf, .doc, .tmx, .ttx, .txt, .xliff, .csv
XML parser: interpret different formats like .ttx and .tmx into .xml
XSL transformation: Generation of different output files (.xls, .html and .txt)
EXAMPLE OF E-MAIL ADDRESS
[^\s]+\@[^\s]+\.[A-Za-z]{2,4} Any number or letter followed by @ followed by any number of letter followed by . followed by any letter capitalised or not which ranges between 2 or 4 characters.
Real Production Case Study
PROOF READING SCENARIO WITH NO LANGUAGE TECHNOLOGY Real case study LQC Project for one of the largest US Energy Enterprises Installation Guide – 9 files – Total 4,368 words Instruction Manual – 27 files – Total 49,723 words Based on the total of 54,091 words
Real Production Case Study
PROOF READING SCENARIO Without Language Technologies
18 hours
(Based on 3,000 words per hour)
Real Production Case Study
PROOF READING SCENARIO WITH LANGUAGE TECHNOLOGY Real case study LQC Project for one of the largest US Energy Enterprises Installation Guide – 9 files – Total 4,368 words Instruction Manual – 27 files – Total 49,723 words Based on the total of 54,091 words Running an LE tool: 5 minutes Statistical Analysis: 2,031 errors = 3,7% errors Implementation of errors reported at a rate of 5 per minute = 300 errors per hour Total implementation time for 2,031 errors = 6.5 hours
Real Production Case Study
PROOF READING SCENARIO – TIME SAVINGS Without LQC
18 hours
(Based on 3,000 words per hour) With LQC
6.5 hours
Real Production Case Study
ERROR CATEGORISATION & OCURRENCES 357 Translation SOU=TAR Spelling 612 Terminology 231 179 Consistency 652 Punctuation
Real Production Case Study
EXAMPLES OF ERRORS FOUND
Be careful when installing and inserting each wire.
Tenga cuidado cuando instale e inserte cada cable.
Sea cuidadoso cuando instale e inserte cada conductor.
Water-Oil Condensation Temperature
Temperatura de condensación agua-aceite Temperatura de la condensación del agua-aceite
There are two possibilities:
Existen dos posibilidades: Hay dos posibilidades:
First, remove the plug present in the modem’s connector.
Primero, quite el enchufe presente en el conector del módem.
Approved Glossary entry remove = retirar
Real Production Case Study
PROOF READING SCENARIO – COST SAVINGS Reduction of costs by up to 65% (from 18 hours to 6.5 hours)
65% cost savings
Real Production Case Study
AIM OF TEK’S LANGUAGE QUALITY CHECKER (LQC) To assist in Language Quality tasks through Linguistic Engineering automation To provide Language Quality Benchmarking (LQB) Quantitative analysis - reporting objectively on the number of mistakes Qualitative analysis – reporting empirically on the type of mistakes To guarantee consistency of existing lexical resources: terminology and consistent segments To save time during proofreading cycles To improve higher cost containment in localization practices
Real Production Case Study
TYPES OF MISTAKES IDENTIFIED BY TEK’S LQC Glossary adherence - Non-adherence to an approved glossary Consistency Checker - Cases where SOU=Multiple Translations Translation Checker – Empty translations and cases where SOU=TAR Spelling Checker – Target spelling Grammar Errors – Target grammatical errors. List of configurable forbidden words.
Overtranslations – Terms that should not have been translated: CaMel words, Product Names, etc.
Mistranslations - Terms that should have one translation and have another (false friends) Punctuation Checker – Source vs. Target equivalence / (¡ / ¿)
LQC UI
Where Does The LQC Reside?
Business Intelligence Suite Personalised Workflow Linguistic Tools
Early Warning System On-line Quotation Change Management Project Management ETM TekTerm TM Warehouse
Document Management Open Architecture
LQC Self-Service On-Line Translation
Tek Overview
Tek's OneWorld Solution provides an easy-to-use localization business management system for the Manufacturing, IT and Life Sciences Industries.
It provides:
Technology
One On-Demand Globalization Platform for business intelligence, language management, open connectivity and worldwide collaboration.
One powerful ERP system driving processes for all users.
Process
One set of tools and ISO approved quality standards covering over 60 languages.
One “Follow the Sun” approach through our Client OneWorld Centers in different time zones for a streamlined service to our clients.
People
One highly specialized and experienced team of people sharing the OneWorld vision.
One united company since 1961.
Tek’s LE Department
LINGUISTIC ENGINEERING DEPARTMENT TM MANAGEMENT ENTERPRISE TERMINOLOGY MANAGEMENT L10N / TECH. WRITING STYLE GUIDES Quality@Source Consistency in Project Components Reduction of Validation Time Reduction of Management & Effort More streamlined / efficient L10N Cost Savings
Thank You!
For any enquiries please contact me at ih AT tektrans.com