Abstract
In this paper, we analyzed the appropriate category level for a Web directory for cross-language information retrieval (CLIR). Our proposed method for CLIR is based on an estimation of the domains of a query using the hierarchical structures of Web directories. Therefore, correct domain estimation is necessary for detecting the appropriate category level of the Web directory. We conducted retrieval experiments using four-category levels to find the appropriate category levels of a Web directory. We found that the 2nd or 3rd levels the most appropriate for CLIR.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kishida, K., Kando, N., and Chen, K.-H. (2004). Two-stage Refinement of Transitive Query Translation with English Disambiguation for Cross- Language Information Retrieval: A Trial at CLEF 2004, Working Notes for the CLEF 2004 Workshop, pp. 135–142.
Sakai, T., Manabe, T., Kumano, A., Koyama, M., and Kokubu, T. (2005). Toshiba BRIDJE at NTCIR-5 CLIR: Evaluation using Geometric Means, Proceedings of NTCIR-5 Workshop Meeting, pp. 56–63.
Davis, M. (1996). New experiments in cross-language text retrieval at NMSU’s computing research lab. In the fifth text retrieval conference (TREC-5).
Hull, D. A. and Grefenstette, G. (1996). Querying across languages: a dictionary-based approach to multilingual information retrieval. the 19th ACM SIGIR conference (SIGIR '96).
Eichmann, D., Ruiz, M. E., and Srinivasan, P. (1998) .Cross-language information retrieval with the UMLS metathesaurus. The 21st ACM SIGIR conference (SIGIR '98).
Seo, H. C., Kim, S. B., Rim, H. C., and Myaeng, S. H. (2005). Improving Query Translation in English-Korean Cross- Language Information Retrieval. Information Processing and Management, 41(3), pp. 507–522.
Gonzalo, J., Verdejo, F., Peters, C., and Calzolari, N. (1998). Applying EuroWordNet to Cross- Language Text Retrieval, Computers and the Humanities, 32(2.3), pp. 185–207.
Deerwester, S., Dumais, S. T., Furnas, T. K., Landauer, G. W., and Harshman, R. A. (1990). Indexing by latent semantic analysis, journal of the American Society for Information Science, 46(1), pp. 391–407,
Dumais, S. T. (1995). Using LSI for information filtering: TREC-3 experiments. In The Third Text Retrieval Conference (TREC3), pp. 219–230,
Rehder, B., Littman, M.L., Dumais, S., and Landauer, T.K. (1997). Automatic 3-Language Cross- Language Information Retrieval with Latent Semantic Indexing, Proceedings of the Sixth Text Retrieval Conference (TREC-6), pp. 233–239.
Jansen, B. J., Spink, A., and Saracevic, T. (2000). Real life, real user queries on the Web. Information Processing & Management, 36(2), pp. 207–227.
Resnik, P., and Smith, N. A. (2003). The Web as a Parallel Corpus, Computational Linguistics, 29(3), pp. 349–380.
Hull, D. A. (1997). Using structured queries for disambiguation in cross-language information retrieval. Electronic Working Notes of the AAAI Symposium on Cross-Language Text and Speech Retrieval.
Lin, C. J., Lin, W. C., Bian, G. W., and Chen, H. H. (1999). Description of the NTU Japanese-English cross-lingual information retrieval system used for NTCIR workshop. First NTCIR Workshop on Research in Japanese Text Retrieval and Term Recognition, pp. 145–148.
Kimura, F., Maeda, A., Yoshikawa, M., and Uemura, S. (2003). Cross-Language Information Retrieval using Web Directories. Proceedings of IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM '03), pp. 911–914.
Cho, K. and Kim, J. (1997). Automatic Text Categorization on Hierarchical Category Structure by using ICF (Inverted Category Frequency) Weighting, Proc. of KISS Conference, pp. 507–510.
Frakes, W. and Baeza-Yates, R. (1992). Information Retrieval: Data Structures and Algorithms, chapter 7. Prentice-Hall, Englewood Cliffs.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media B.V
About this chapter
Cite this chapter
Kimura, F., Maeda, A., Hatano, K., Miyazaki, J., Uemura, S. (2009). Utilizing Web Directories for Translation Disambiguation in Cross-Language Information Retrieval. In: Wai, PK., Huang, X., Ao, SI. (eds) Trends in Communication Technologies and Engineering Science. Lecture Notes in Electrical Engineering, vol 33. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-9532-0_8
Download citation
DOI: https://doi.org/10.1007/978-1-4020-9532-0_8
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-9492-7
Online ISBN: 978-1-4020-9532-0
eBook Packages: EngineeringEngineering (R0)