Abstract
Transliterating foreign entities into Chinese is usually done through direct-style approach. The direct-style approach transliterates each syllable rendered from foreign terms into Chinese directly. Not every syllable can be rendered. An approach utilizing contextual information for term transliteration is proposed in this paper to attack this problem. Traditionally, evaluating transliteration performances always uses character and word error rates. However, many transliteration variants of the same term always found. Validating the generated results using Web corpora is more suitable and is proposed in this paper. Using the proposed evaluation method, experiments on term transliteration were conducted. From the experimental results show that taking contextual information is helpful to term transliteration and validating the generated results using Web corpora can provide more concrete evidences to transliteration evaluation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gao, W., Wong, K.F., Lam, W.: Phoneme-based Transliteration of Foreign Names for OOV Problem. In: Proceedings of 1st International Joint Chinese Natural Language Processing, Hainan, China, pp. 374–381 (2004)
Jurafsky, D., Martin, J.H.: Speech and Language Processing, pp. 102–120. Prentice-Hall, Englewood Cliffs (2000)
Knight, K., Graehl, J.: Machine Transliteration. Computational Linguistics 24(4), 599–612 (1998)
Kuo, J.S., Yang, Y.K.: Incorporating Pronunciation Variation into Extraction of Transliterated-term Pairs from Web Corpora. In: Proceedings of International Conference on Chinese Computing, Singapore, pp. 131–138 (2005)
Li, H., Zhang, M., Su, J.: A Joint Source-Channel Model for Machine Transliteration. In: Proceedings of 42nd ACL, Barcelona, Spain, pp. 159–166 (2004)
Meng, H., Lo, W.K., Chen, B., Tang, K.: Generating Phonetic Cognates to Handle Named Entities in English-Chinese Cross-Language Spoken Document Retrieval. In: Proceedings of the Automatic Speech Recognition and Understanding Workshop, pp. 311–314 (2001)
Virga, P., Khudanpur, S.: Transliteration of Proper Names in Cross-Lingual Information Retrieval. In: Proceedings of 41st ACL Workshop on Multilingual and Mixed Language Named Entity Recognition, pp. 57–64 (2003)
Wan, S., Verspoor, C.M.: Automatic English-Chinese Name Transliteration for Development of Multilingual Resources. In: Proceedings of 17th COLING and 36th ACL, Montreal Quebec, Canada, pp. 1352–1356 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kuo, JS. (2005). Generating Term Transliterations Using Contextual Information and Validating Generated Results Using Web Corpora. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.H. (eds) Information Retrieval Technology. AIRS 2005. Lecture Notes in Computer Science, vol 3689. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562382_64
Download citation
DOI: https://doi.org/10.1007/11562382_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29186-2
Online ISBN: 978-3-540-32001-2
eBook Packages: Computer ScienceComputer Science (R0)