Query-Free News Search

Henzinger, Monika; Chang, Bay-Wei; Milch, Brian; Brin, Sergey

doi:10.1007/s11280-004-4870-6

Query-Free News Search

Published: June 2005

Volume 8, pages 101–126, (2005)
Cite this article

World Wide Web Aims and scope Submit manuscript

Monika Henzinger¹,
Bay-Wei Chang¹,
Brian Milch¹ &
…
Sergey Brin¹

239 Accesses
30 Citations
3 Altmetric
Explore all metrics

Abstract

Many daily activities present information in the form of a stream of text, and often people can benefit from additional information on the topic discussed. TV broadcast news can be treated as one such stream of text; in this paper we discuss finding news articles on the web that are relevant to news currently being broadcast.

We evaluated a variety of algorithms for this problem, looking at the impact of inverse document frequency, stemming, compounds, history, and query length on the relevance and coverage of news articles returned in real time during a broadcast. We also evaluated several postprocessing techniques for improving the precision, including reranking using additional terms, reranking by document similarity, and filtering on document similarity. For the best algorithm, 84–91% of the articles found were relevant, with at least 64% of the articles being on the exact topic of the broadcast. In addition, a relevant article was found for at least 70% of the topics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

J. Allan, R. Gupta, and V. Khandelwal, “Temporal summaries of news topics,” in Research and Development in Information Retrieval, 2001, pp. 10–18.
E. Brill, “Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging,” Computation Linguistics 21(4), 1995, 543–565.
Google Scholar
S. Brin, R. Motwani, L. Page, and T. Winograd, “What can you do with a web in your pocket?” Data Engineering Bulletin 21(2), 1998, 37–47.
Google Scholar
J. Budzik, K. Hammond, and L. Birnbaum, “Information access in context,” Knowledge Based Systems 14(1–2), 2001, 37–53.
Google Scholar
J. Davis, “Intercast dying of neglect,” CNET News, January 29, 1997.
Electronic Industries Alliance, “Transport of internet uniform resource locator (url) information using text-2 (t-2) service,” Technical Report, EIA-746-A, 1998.
E. Frank, G. W. Paynter, I. H. Witten, C. Gutwin, and C. G. Nevill-Manning, “Domain-specific keyphrase extraction,” in IJCAI, 1999, pp. 668–673.
P. Hart and J. Graham, “Query-free information retrieval,” IEEE Expert 12(5), 1997, 32–37.
Google Scholar
B. Krulwich and C. Burkey, “Learning user information interests through the extraction of semantically significant phrases,” in AAAI 1996 Spring Symposium on Machine Learning in Information Access, 1996.
H. Lieberman, “Letizia: An agent that assists web browsing,” in C. S. Mellish (ed.), Proceedings of the 14th International Joint Conference on Artificial Intelligence ({IJCAI}-95), 1995, pp. 924–929.
K. Livingston, M. Dredze, K. Hammond, and L. Birnbaum, “Beyond broadcast,” in International Conference on Intelligent User Interfaces, 2003.
P. Maglio, R. Barrett, C. Campbell, and T. Selker, “Suitor: An attentive information system,” in International Conference on Intelligent User Interfaces, 2000.
A. Munoz, “Compound key word generation from document databases using a hierarchical clustering art model,” Intelligent Data Analysis 1(1), 1997.
M. N. Price, G. Golovchinsky, and B. N. Schilit, “Linking by inking: Trailblazing in a paper-like hypertext,” in Proceedings of the Hypertext’98, 1998, pp. 30–39.
B. Rhodes and P. Maes, “Just-in-time information retrieval agents,” IBM Systems Journal 39(3–4), 2000.
B. J. Rhodes, “Just-in-time information retrieval,” Ph.D. Thesis, MIT Media Laboratory, Cambridge, MA, May 2000.
S. Robertson, S. Walker, and M. Beaulieu, “Okapi at TREC-7: automatic ad hoc, filtering, VLC and interactive track,” in Proceedings of the 7th International Text Retrieval Conference (TREC), 1999, pp. 253–264.
G. D. Robson, “Closed captions, V-chip, and other VBI data,” Nuts and Volts, 2000.
G. Salton, The SMART System—Experiments in Automatic Document Processing, Prentice Hall, 1971.
A. M. Steier and R. K. Belew, “Exporting phrases: A statistical analysis of topical language,” in Proceedings of the 2nd Symposium on Document Analysis and Information Retrieval, 1993, pp. 179–190.
P. D. Turney, “Learning algorithms for keyphrase extraction,” Information Retrieval 2(4), 2000, 303–336.
Google Scholar

Download references

Author information

Authors and Affiliations

Google Inc., 1600 Amphitheatre Parkway, Mountain View, CA, 94043, USA
Monika Henzinger, Bay-Wei Chang, Brian Milch & Sergey Brin

Authors

Monika Henzinger
View author publications
You can also search for this author in PubMed Google Scholar
Bay-Wei Chang
View author publications
You can also search for this author in PubMed Google Scholar
Brian Milch
View author publications
You can also search for this author in PubMed Google Scholar
Sergey Brin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bay-Wei Chang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Henzinger, M., Chang, BW., Milch, B. et al. Query-Free News Search. World Wide Web 8, 101–126 (2005). https://doi.org/10.1007/s11280-004-4870-6

Download citation

Issue Date: June 2005
DOI: https://doi.org/10.1007/s11280-004-4870-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Query-Free News Search

Abstract

Access this article

Similar content being viewed by others

Pagerank-Like Algorithm for Ranking News Stories and News Portals

Comparing Two Strategies for Query Expansion in a News Monitoring System

Context-Aware News Recommendation System: Incorporating Contextual Information and Collaborative Filtering Techniques

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Abstract

Access this article

Similar content being viewed by others

Pagerank-Like Algorithm for Ranking News Stories and News Portals

Comparing Two Strategies for Query Expansion in a News Monitoring System

Context-Aware News Recommendation System: Incorporating Contextual Information and Collaborative Filtering Techniques

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation