Copyright © 2005 Elsevier B.V. All rights reserved.
Integration of speech recognition and machine translation: Speech recognition word lattice translation
Received 30 December 2004;
References and further reading may be available for this article. To view references and further reading you must purchase this article.
Abstract
An important issue in speech translation is to minimize the negative effect of speech recognition errors on machine translation. We propose a novel statistical machine translation decoding algorithm for speech translation to improve speech translation quality. The algorithm can translate the speech recognition word lattice, where more hypotheses are utilized to bypass the misrecognized single-best hypothesis. The decoding involves converting the recognition word lattice to a translation word graph by a graph-based search, followed by a fine rescoring by an A* search. We show that a speech recognition confidence measure implemented by posterior probability is effective to improve speech translation. The proposed techniques were tested in a Japanese-to-English speech translation task, in which we measured the translation results in terms of a number of automatic evaluation metrics. The experimental results demonstrate a consistent and significant improvement in speech translation achieved by the proposed techniques.
Article Outline
- 1. Introduction
- 2. Proposed speech translation structure
- 3. Speech recognition word lattice translation—WLT
- 3.1. First pass—from source word lattice to target word graph
- 3.2. Second pass—by an A* search to find the best outcome through the TWG
- 4. Downsizing the source word lattice
- 5. Selection of hypotheses by confidence measure (CM) filtering
- 6. Experiments
- 7. Discussion and conclusions
- Acknowledgements
- References






E-mail Article
Add to my Quick Links

Cited By in Scopus (0)






