ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Information Processing & Management
Volume 41, Issue 3, May 2005, Pages 433-455
Cross-Language Information Retrieval
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (420 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
Special issue
View Record in Scopus
 
doi:10.1016/j.ipm.2004.06.007    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2004 Elsevier Ltd All rights reserved.

Technical issues of cross-language information retrieval: a review

Kazuaki KishidaCorresponding Author Contact Information, E-mail The Corresponding Author

Faculty of Cultural Information Resources, Surugadai University, 698 Azu, Hanno, Saitama 357-8555, Japan

Received 10 June 2004; 
accepted 14 June 2004. 
Available online 23 August 2004.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

This paper reviews state-of-the-art techniques and methods for enhancing effectiveness of cross-language information retrieval (CLIR). The following research issues are covered: (1) matching strategies and translation techniques, (2) methods for solving the problem of translation ambiguity, (3) formal models for CLIR such as application of the language model, (4) the pivot language approach, (5) methods for searching multilingual document collection, (6) techniques for combining multiple language resources, etc.

Keywords: Cross-language information retrieval; Machine translation; Word sense disambiguation; Language model

Article Outline

1. Introduction
2. Matching strategies and translation
2.1. Matching strategies
2.1.1. Types of matching strategies
2.1.2. Cognate matching
2.1.3. Query translation
2.1.4. Document translation
2.1.5. Interlingual techniques
2.2. Translation techniques
2.2.1. Machine translation techniques
2.2.2. Dictionary-based methods
2.2.3. Parallel corpora-based method
2.2.4. Use of WWW resources
3. Term disambiguation techniques
3.1. Translation ambiguity
3.2. Use of part-of-speech tags
3.3. Parallel corpus-based disambiguation
3.4. Disambiguation based on co-occurrence statistics
3.5. Query expansion for disambiguation
3.6. Structured query model
3.7. Another method for disambiguation
3.8. Phrasal translation techniques
4. Formal model for CLIR
4.1. Retrieval model for estimating document scores
4.2. Language model for CLIR
5. Other research topics in CLIR
5.1. Pivot language approach
5.2. Merging strategy for multilingual information retrieval
5.3. Combination of some language resources
5.4. Language processing issues
5.4.1. Text processing for languages other than English
5.4.2. Tokenization
5.4.3. Stopword list
5.4.4. Stemming
5.5. User interfaces for interactive CLIR
5.6. Evaluation of CLIR
6. Concluding remarks: future directions for research
Acknowledgements
References

Information Processing & Management
Volume 41, Issue 3, May 2005, Pages 433-455
Cross-Language Information Retrieval
 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.