Automatic evaluation of machine translation quality using n-gram co-occurrence statistics

Author:
George Doddington

HLT '02: Proceedings of the second international conference on Human Language Technology Research, March 2002, Pages 138–145

Published: 24 March 2002

ABSTRACT

Evaluation is recognized as an extremely helpful forcing function in Human Language Technology R&D. Unfortunately, evaluation has not been a very powerful tool in machine translation (MT) research because it requires human judgments and is thus expensive and time-consuming and not easily factored into the MT research agenda. However, at the July 2001 TIDES PI meeting in Philadelphia, IBM described an automatic MT evaluation technique that can provide immediate feedback and guidance in MT research. Their idea, which they call an "evaluation understudy", compares MT output with expert reference translations in terms of the statistics of short sequences of words (word N-grams). The more of these N-grams that a translation shares with the reference translations, the better the translation is judged to be. The idea is elegant in its simplicity. But far more important, IBM showed a strong correlation between these automatically generated scores and human judgments of translation quality. As a result, DARPA commissioned NIST to develop an MT evaluation facility based on the IBM work. This utility is now available from NIST and serves as the primary evaluation measure for TIDES MT research.
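The co-occurrence idea described in the abstract can be illustrated with a minimal sketch. This is not the IBM or NIST scoring formula, which the abstract does not give; it only computes the fraction of a candidate translation's word N-grams that also appear in at least one reference translation. The function names `ngrams` and `cooccurrence_precision`, the whitespace tokenization, the count clipping, and the example sentences are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all word n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def cooccurrence_precision(candidate, references, n):
    """Fraction of candidate n-grams that also occur in a reference.

    Assumption: each candidate n-gram is credited at most as many times
    as it appears in the reference that contains it most often.
    """
    cand_counts = ngrams(candidate.split(), n)
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    # Maximum count of each n-gram over all reference translations.
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref.split(), n).items():
            max_ref[gram] = max(max_ref[gram], count)
    matched = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return matched / total


# Hypothetical data: two reference translations and two candidates.
references = ["the cat is on the mat", "there is a cat on the mat"]
good = "the cat is on the mat"
poor = "mat the the on cat"

for n in (1, 2):
    print(n,
          cooccurrence_precision(good, references, n),
          cooccurrence_precision(poor, references, n))
```

In this toy example the scrambled candidate shares every one of its words with the references but none of its word pairs, so only the longer N-grams separate it from the fluent candidate. That is one reason to count sequences of words rather than single words.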

Published in
HLT '02: Proceedings of the second international conference on Human Language Technology Research
March 2002
436 pages
Conference Chair: Mitchell Marcus, University of Pennsylvania
Publisher: Morgan Kaufmann Publishers Inc., San Francisco, CA, United States

Acceptance Rates
Overall Acceptance Rate: 240 of 768 submissions, 31%