Copyright © 2006 Elsevier Ltd. All rights reserved.
Employing web mining and data fusion to improve weak ad hoc retrieval
Received 27 May 2006.
Abstract
When a user issues a reasonable query to a retrieval system and obtains no relevant documents, he or she is bound to feel frustrated. We call such queries, and the retrievals they produce, weak. Improving their effectiveness is an important issue for ad hoc retrieval and would be most rewarding for these users. We explain why data fusion of sufficiently dissimilar retrieval lists can improve weak query results, and we confirm this with experiments using short and medium-length queries. To obtain sufficiently dissimilar retrieval lists, we propose composing alternate queries through web search and mining, employing them for retrieval from the target collection, and combining the resulting lists with the retrieval list of the original query. Methods of forming web probes from longer queries, including salient term selection and query text window rotation, are investigated. Compared with normal ad hoc retrieval, web assistance and data fusion can more than double the effectiveness of the original weak queries. Queries that are not weak can also improve along with the weak ones.
Keywords: Weak query; Robust retrieval; Salient term selection; Web mining; Alternate queries; Data fusion
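
The fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which fusion method the article uses, so this example assumes weighted CombSUM over min-max normalized scores, a common data fusion technique chosen here only for illustration. The function names, document identifiers, and scores are all hypothetical.

```python
from collections import defaultdict


def min_max_normalize(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores are equal, so treat every document as a top hit.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def fuse(lists, weights=None):
    """Weighted CombSUM (an assumed method, not necessarily the article's).

    Each retrieval list is normalized, then per-document scores are summed.
    Returns (doc_id, fused_score) pairs, best first; ties break on doc_id.
    """
    if weights is None:
        weights = [1.0] * len(lists)
    fused = defaultdict(float)
    for scores, weight in zip(lists, weights):
        for doc, s in min_max_normalize(scores).items():
            fused[doc] += weight * s
    return sorted(fused.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical scores: one list from the original query and one from an
# alternate query composed through web mining.
original = {"d1": 2.0, "d2": 1.0, "d3": 0.5}
alternate = {"d3": 9.0, "d4": 6.0, "d1": 3.0}

ranking = fuse([original, alternate])
for doc, score in ranking:
    print(doc, round(score, 3))
```

In this toy case the two lists disagree: "d3" is last in the original list but first in the alternate list, and "d4" appears only in the alternate list. After fusion both move up the ranking, which is the kind of benefit that combining dissimilar lists is meant to provide.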
Article Outline
- 1. Introduction
- 2. Weak queries
- 3. External resources for enhancing query and retrieval
- 4. Exploiting the web to form alternate queries
- 5. Experiments with short ‘Title’ queries
- 5.1. Retrieval results – ‘Title’ and alternate queries
- 5.2. Data fusion of ‘Title’ and alternate query lists
- 6. Experiments with medium length ‘Description’ queries
- 6.1. Forming web probes by term selection
- 6.2. Forming web probes by window rotation
- 6.3. Data fusion experiments for ‘Description’ queries
- 7. Conclusion
- Acknowledgements
- References














