Automating Your QA Process to Save Time and Improve Consistency

Iñaki Hernández-Lasa, Linguistic Engineering Department Manager, Tek Translation International

Agenda

The Need For Automated Language Quality
What Can Be Automated
Technologies Available
Real Production Case Study
The Language Quality Checker

The Need For Automated Language Quality

        Language Quality tasks are far too time-consuming.

Manual language review activity is prone to error.

Manual language review provides no statistical analysis.

Manual language review cannot “catch” all errors.

Reviewers are subjective and constantly introduce exceptions.

There is a need to introduce a more empirical and objective approach to language.

Statistical analysis for Language Quality Benchmarking (LQB) is critical in our industry.

Traditional spell-checking tools bring issues of their own, such as having to insert lists of accepted words.

What Can Be Automated

Spelling verification
Source segment with multiple translations
Source segment = Target segment
Punctuation
Terminology adherence to an approved glossary
Overtranslations: CaMel words and Product Names
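Two of the checks listed above are simple enough to sketch in a few lines. The following is only an illustration: the function names are hypothetical and are not taken from the LQC, and a match is best treated as a candidate for review, since a target identical to its source can be legitimate (a product name, for example).

```python
def is_untranslated(source: str, target: str) -> bool:
    """Flag an empty target or a target identical to its source."""
    return not target.strip() or source.strip() == target.strip()


def final_mark(text: str) -> str:
    """Return the closing punctuation mark of a segment, or '' if there is none."""
    stripped = text.strip()
    return stripped[-1] if stripped and stripped[-1] in ".:;!?" else ""


def punctuation_mismatch(source: str, target: str) -> bool:
    """Flag segments whose closing punctuation differs between source and target."""
    return final_mark(source) != final_mark(target)


source = "There are two possibilities:"
print(is_untranslated(source, source))                             # True
print(punctuation_mismatch(source, "Existen dos posibilidades:"))  # False
print(punctuation_mismatch(source, "Existen dos posibilidades."))  # True
```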

Technologies Available

Hashtables: association between keys and values
  Fast searches, for instance finding a source with multiple targets (source = key, target = value)
Regular expressions engine: a regular expression is a special text string for describing a pattern
  Overtranslation: a CaMel word in the source should also appear in the target
iFilter Windows interface: interpreting different file formats
  .rtf, .doc, .tmx, .ttx, .txt, .xliff, .csv

XML parser: interpret different formats like .ttx and .tmx into .xml

XSL transformation: Generation of different output files (.xls, .html and .txt)
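The hashtable idea (source as key, targets as value) can be sketched as follows. This is a minimal illustration rather than the LQC's implementation: the function name is hypothetical, and the sample segments are borrowed from the error examples shown later in the deck.

```python
from collections import defaultdict


def find_inconsistent_sources(segments):
    """Return every source segment that maps to more than one distinct target."""
    targets_by_source = defaultdict(set)  # hashtable: key = source, value = set of targets
    for source, target in segments:
        targets_by_source[source].add(target)
    return {
        source: sorted(targets)
        for source, targets in targets_by_source.items()
        if len(targets) > 1
    }


segments = [
    ("There are two possibilities:", "Existen dos posibilidades:"),
    ("There are two possibilities:", "Hay dos posibilidades:"),
    ("Water-Oil Condensation Temperature", "Temperatura de condensación agua-aceite"),
]

inconsistent = find_inconsistent_sources(segments)
for source, targets in inconsistent.items():
    print(source, "->", targets)
```

Because each lookup in the dictionary takes roughly constant time, the whole pass stays linear in the number of segments, which is what makes this kind of check cheap to run over a full project.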

EXAMPLE OF E-MAIL ADDRESS

[^\s]+\@[^\s]+\.[A-Za-z]{2,4}

One or more non-whitespace characters, followed by @, followed by one or more non-whitespace characters, followed by a dot, followed by two to four letters, capitalised or not.
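As a quick way to try the pattern above, here is a short sketch using Python's `re` module. The sample sentence and the `example.com` address are made up for illustration.

```python
import re

# The pattern from the slide, compiled unchanged.
EMAIL_PATTERN = re.compile(r"[^\s]+\@[^\s]+\.[A-Za-z]{2,4}")

text = "For any enquiries please write to someone@example.com today."
matches = EMAIL_PATTERN.findall(text)
print(matches)
```

The pattern is deliberately loose: it accepts any non-whitespace characters around the @, and because the final part is limited to four letters, an address with a longer top-level domain is only matched up to its fourth letter.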

Real Production Case Study

PROOF READING SCENARIO WITH NO LANGUAGE TECHNOLOGY

- Real case study: LQC project for one of the largest US energy enterprises
- Installation Guide – 9 files – Total 4,368 words
- Instruction Manual – 27 files – Total 49,723 words
- Based on the total of 54,091 words

PROOF READING SCENARIO Without Language Technologies

18 hours

(Based on 3,000 words per hour)

PROOF READING SCENARIO WITH LANGUAGE TECHNOLOGY

- Real case study: LQC project for one of the largest US energy enterprises
- Installation Guide – 9 files – Total 4,368 words
- Instruction Manual – 27 files – Total 49,723 words
- Based on the total of 54,091 words
- Running an LE tool: 5 minutes
- Statistical analysis: 2,031 errors = 3.7% errors
- Corrections for the reported errors implemented at a rate of 5 per minute = 300 errors per hour
- Total implementation time for 2,031 errors = 6.5 hours

PROOF READING SCENARIO – TIME SAVINGS

Without LQC: 18 hours (based on 3,000 words per hour)

With LQC: 6.5 hours

ERROR CATEGORISATION & OCCURRENCES

[Chart: errors by category. Categories: Translation (SOU=TAR), Spelling, Terminology, Consistency, Punctuation. Counts shown: 357, 612, 231, 179, 652 (2,031 in total).]

EXAMPLES OF ERRORS FOUND

Be careful when installing and inserting each wire.

Tenga cuidado cuando instale e inserte cada cable.

Sea cuidadoso cuando instale e inserte cada conductor.

Water-Oil Condensation Temperature

Temperatura de condensación agua-aceite

Temperatura de la condensación del agua-aceite

There are two possibilities:

Existen dos posibilidades:

Hay dos posibilidades:

First, remove the plug present in the modem’s connector.

Primero, quite el enchufe presente en el conector del módem.

Approved glossary entry: remove = retirar
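The last example, where the translation uses "quite" although the approved glossary prescribes "retirar", suggests how a glossary-adherence check might work. The sketch below is an assumption about one possible approach, not the LQC's actual logic, and the function name is hypothetical.

```python
import re

glossary = {"remove": "retirar"}  # approved glossary entry from the slide


def glossary_violations(source, target, glossary):
    """List glossary pairs whose source term occurs but whose approved target term does not."""
    violations = []
    for src_term, tgt_term in glossary.items():
        in_source = re.search(rf"\b{re.escape(src_term)}\b", source, re.IGNORECASE)
        in_target = re.search(rf"\b{re.escape(tgt_term)}\b", target, re.IGNORECASE)
        if in_source and not in_target:
            violations.append((src_term, tgt_term))
    return violations


source = "First, remove the plug present in the modem’s connector."
target = "Primero, quite el enchufe presente en el conector del módem."
print(glossary_violations(source, target, glossary))
```

A literal lookup like this would also flag a correct inflected form such as "retire", so a production checker would need some form of stemming or lemmatisation on top of it.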

PROOF READING SCENARIO – COST SAVINGS

Reduction of costs by up to 65% (from 18 hours to 6.5 hours)

65% cost savings
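The case-study figures can be re-derived from the numbers quoted on the slides. The sketch below only repeats that arithmetic; the variable names are illustrative. Dividing 2,031 errors by 300 per hour gives roughly 6.8 hours, slightly more than the 6.5 hours stated on the slide, and the saving computed from the slide's own 18 and 6.5 hours comes to about 64%, in line with the "up to 65%" claim.

```python
total_words = 4_368 + 49_723           # Installation Guide + Instruction Manual
manual_hours = total_words / 3_000     # proofreading at 3,000 words per hour

errors = 2_031
error_rate_pct = errors / total_words * 100
fix_hours = errors / 300               # 5 corrections per minute = 300 per hour

# Saving computed from the hours quoted on the slide (18 and 6.5).
saving_pct = (18 - 6.5) / 18 * 100

print(f"Total words:        {total_words}")
print(f"Manual proofread:   {manual_hours:.1f} hours")
print(f"Error rate:         {error_rate_pct:.2f}%")
print(f"Correction time:    {fix_hours:.2f} hours")
print(f"Saving (18 -> 6.5): {saving_pct:.1f}%")
```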

AIM OF TEK’S LANGUAGE QUALITY CHECKER (LQC)

- To assist in Language Quality tasks through Linguistic Engineering automation
- To provide Language Quality Benchmarking (LQB)
  - Quantitative analysis – reporting objectively on the number of mistakes
  - Qualitative analysis – reporting empirically on the type of mistakes
- To guarantee consistency of existing lexical resources: terminology and consistent segments
- To save time during proofreading cycles
- To improve cost containment in localization practices

TYPES OF MISTAKES IDENTIFIED BY TEK’S LQC

- Glossary adherence – Non-adherence to an approved glossary
- Consistency Checker – Cases where SOU=Multiple Translations
- Translation Checker – Empty translations and cases where SOU=TAR
- Spelling Checker – Target spelling
- Grammar Errors – Target grammatical errors. List of configurable forbidden words.
- Overtranslations – Terms that should not have been translated: CaMel words, Product Names, etc.
- Mistranslations – Terms that should have one translation and have another (false friends)
- Punctuation Checker – Source vs. target equivalence, including the Spanish opening marks (¡ / ¿)
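The overtranslation check on CaMel words lends itself to a regular expression. The sketch below is one possible reading of that check, not the LQC's own rule: the pattern, the function name and the sample sentences are all assumptions made for illustration (TekTerm is a name taken from these slides).

```python
import re

# A word containing a lowercase letter directly followed by an uppercase one,
# e.g. "OneWorld", "TekTerm" or "iFilter".
CAMEL_WORD = re.compile(r"\b[A-Za-z]*[a-z][A-Z][A-Za-z]*\b")


def overtranslated_terms(source, target):
    """Return CaMel words found in the source that do not appear unchanged in the target."""
    return [word for word in CAMEL_WORD.findall(source) if word not in target]


source = "Open TekTerm to view the glossary."
print(overtranslated_terms(source, "Abra TekTerm para ver el glosario."))        # []
print(overtranslated_terms(source, "Abra el término Tek para ver el glosario."))  # ['TekTerm']
```

The same comparison could be driven by a configurable list of product names for terms that are not written in mixed case.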

LQC UI

Where Does The LQC Reside?

[Diagram labels: Business Intelligence Suite, Personalised Workflow, Linguistic Tools, Early Warning System, On-line Quotation, Change Management, Project Management, ETM, TekTerm, TM Warehouse, Document Management, Open Architecture, LQC, Self-Service, On-Line Translation]

Tek Overview

Tek's OneWorld Solution provides an easy-to-use localization business management system for the Manufacturing, IT and Life Sciences Industries.

It provides:

Technology

  One On-Demand Globalization Platform for business intelligence, language management, open connectivity and worldwide collaboration.

One powerful ERP system driving processes for all users.

Process

  One set of tools and ISO approved quality standards covering over 60 languages.

One “Follow the Sun” approach through our Client OneWorld Centers in different time zones for a streamlined service to our clients.

People

  One highly specialized and experienced team of people sharing the OneWorld vision.

One united company since 1961.

Tek’s LE Department

LINGUISTIC ENGINEERING DEPARTMENT

TM MANAGEMENT
ENTERPRISE TERMINOLOGY MANAGEMENT
L10N / TECH. WRITING STYLE GUIDES

- Quality@Source
- Consistency in Project Components
- Reduction of Validation Time
- Reduction of Management & Effort
- More streamlined / efficient L10N
- Cost Savings

Thank You!

For any enquiries please contact me at ih AT tektrans.com