A phrase-based, joint probability model for statistical machine translation

Authors:
Daniel Marcu, University of Southern California, Marina del Rey, CA
William Wong, Language Weaver Inc., Santa Monica, CA

EMNLP '02: Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10, July 2002, Pages 133–139, https://doi.org/10.3115/1118693.1118711

Published: 06 July 2002

ABSTRACT

We present a joint probability model for statistical machine translation, which automatically learns word and phrase equivalents from bilingual corpora. Translations produced with parameters estimated using the joint model are more accurate than translations produced using IBM Model 4.


Published in

EMNLP '02: Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
July 2002, 328 pages
Publisher: Association for Computational Linguistics, United States
Overall acceptance rate: 73 of 234 submissions (31%)