ABSTRACT
It is challenging to translate names and technical terms across languages with different alphabets and sound inventories. These items are commonly transliterated, i.e., replaced with approximate phonetic equivalents. For example, computer in English comes out as (konpyuutaa) in Japanese. Translating such items from Japanese back to English is even more challenging, and of practical interest, as transliterated items make up the bulk of text phrases not found in bilingual dictionaries. We describe and evaluate a method for performing backwards transliterations by machine. This method uses a generative model, incorporating several distinct stages in the transliteration process.
- M. Arbabi, S. M. Fischthal, and V. C. Cheng andd E. Bart. 1994. Algorithms for Arabic name transliteration. IBM J. Res. Develop., 38(2). Google ScholarDigital Library
- L. E. Baum. 1972. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process. Inequalities, 3.Google Scholar
- E. W. Dijkstra. 1959. A note on two problems in connexion with graphs. Numerische Mathematik, 1.Google Scholar
- David Eppstein. 1994. Finding the k shortest paths. In Proc. 35th Symp. Foundations of Computer Science. IEEE.Google ScholarDigital Library
- E. H. Jorden and H. I. Chaplin. 1976. Reading Japanese. Yale University Press, New Haven.Google Scholar
- F. Pereira and M. Riley. 1996. Speech recognition by composition of weighted finite automata. In preprint, cmp-lg/9603001.Google Scholar
- F. Pereira, M. Riley, and R. Sproat. 1994. Weighted rational transductions and their application to human language processing. In Proc. ARPA Human Language Technology Workshop. Google ScholarDigital Library
- J. Yamron, J. Cant, A. Demedts, T. Dietzel, and Y. Ito. 1994. The automatic component of the LINGSTAT machine-aided translation system. In Proc. ARPA Workshop on Human Language Technology. Google ScholarDigital Library
- Machine transliteration
Recommendations
Machine transliteration survey
Machine transliteration is the process of automatically transforming the script of a word from a source language to a target language, while preserving pronunciation. The development of algorithms specifically for machine transliteration began over a ...
Machine transliteration
It is challenging to translate names and technical terms across languages with different alphabets and sound inventories. These items are commonly trnasliterated, i.e., replaced with approximate phonetic equivalents. For example, "computer" in English ...
Punjabi machine transliteration
ACL-44: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational LinguisticsMachine Transliteration is to transcribe a word written in a script with approximate phonetic equivalence in another language. It is useful for machine translation, cross-lingual information retrieval, multilingual text and speech processing. Punjabi ...
Comments