ABSTRACT
We examine the accuracy of first story detection on traditional news collections and on a re-purposed source of academic material. The impact on accuracy of detecting an early rather than the first story is examined, showing that accuracy increases under a broader time window, however, the increases on some collections are small. Even on collections where the increase is large, many new events are still missed and there remains an underlying challenge to detecting new events. An analysis of temporal and vocabulary profiles of topics within their source collections is conducted. Analysis of the results establish the underlying causes of the patterns seen in the experimental results with respect to the different source types and performance. The usefulness of new criteria for new event detection and success across source types is discussed.
- TREC Genomics Track. available at http://ir.ohsu.edu/genomics/.Google Scholar
- C. Aggarwal and K. Subbian. Event Detection in Social Streams. In SIAM International Conference on Data Mining, pages 624--635, 2012.Google Scholar
- A. Ahmed, Q. Ho, J. Eisenstein, E. Xing, A. Smola, and C. Teo. Unified analysis of streaming news. In WWW '11, pages 267--276. ACM, 2011. Google ScholarDigital Library
- S. Ahmed, R. Bhindwale, and H. Davulcu. Tracking terrorism news threads by extracting event signatures. In ISI '09, pages 182--184. IEEE, 2009. Google ScholarDigital Library
- J. Allan, J. Carbonell, G. Doddington, J. Yamron, and Y. Yang. Topic detection and tracking pilot study: final report. In DARPA Broadcast News Transcription and Understanding Workshop, pages 194--218, 1998.Google Scholar
- J. Allan, H. Jin, M. Rajman, C. Wayne, D. Gildea, V. Lavrenko, R. Hoberman, and D. Caputo, editors. Topic-based novelty detection 1999 summer workshop at CLSP final report, 1999. available at http://www.clsp.jhu.edu/ws99/final/Topic-based.pdf.Google Scholar
- J. Allan and G. Kumaran. Text classification and named entities for new event detection. In SIGIR '04, pages 297--304, 2004. Google ScholarDigital Library
- J. Allan, V. Lavrenko, and H. Hin. First story detection in TDT is hard. In CIKM '00, pages 374--318, 2000. Google ScholarDigital Library
- J. Allan, V. Lavrenko, and R.Swan. Explorations within topic tracking and detection. In J. Allan, editor, Topic Detection and Tracking; Event-based Information Organization, pages 197--224. Kluwer Academic Publishers, 2002. Google ScholarDigital Library
- S. Brandt. Statistical and computational methods in data analysis. Amsterdam: North-Holland, 1976, 2nd rev. ed., 5th repr. 1989, 1, 1989.Google Scholar
- T. Brants and F. Chen. A System for new event detection. In SIGIR '03, pages 330--337, 2003. Google ScholarDigital Library
- J. Cao, C. Ngo, Y. Zhang, D. Zhang, and L. Ma. Trajectory-based visualization of web video topics. In MULTIMEDIA '10, pages 1639--1642. ACM, 2010. Google ScholarDigital Library
- P. Clarkson and R. Rosenfeld. Statistical language modeling using the CMU-Cambridge toolkit. In Fifth European Conference on Speech Communication and Technology, pages 2707--2710, 1997.Google ScholarCross Ref
- J. Fiscus and G. Doddington. Topic detection and tracking overview. In J. Allan, editor, Topic Detection and Tracking; Event-based Information Organization, pages 17--30. Kluwer Academic Publishers, 2002. Google ScholarDigital Library
- F. Jelinek, R. L. Mercer, L. R. Bahl, and J. K. Baker. Perplexity - a measure of the difficulty of speech recognition tasks. The Journal of the Acoustical Society of America, 62(1):63, 1977.Google ScholarCross Ref
- H. Jin, R. Schwartz, S. Sista, and F. Walls. Topic tracking for radio, TV broadcast, and newswire. In DARPA Broadcast News Workshop, pages 199--204, 1999.Google ScholarCross Ref
- T. Leek, R. Schwartz, and S. Sista. Probabilistic approaches to Topic Detection and Tracking. In J. Allan, editor, Topic Detection and Tracking; Event-based Information Organization, pages 67--85. Kluwer Academic Publishers, 2002. Google ScholarDigital Library
- J. Lin, R. Snow, and W. Morgan. Smoothing techniques for adaptive online language models: topic tracking in tweet streams. In SIGKDD '11, pages 422--429. ACM, 2011. Google ScholarDigital Library
- I. Ounis, G. Amati, V. Plachouras, B. He, C. Macdonald, and C. Lioma. Terrier: A high performance and scalable information retrieval platform. In OSIR '06, volume 2006, pages 18--25, 2006.Google Scholar
- M. Porter. An algorithm for suffix stripping. Program, 14(3):130--137, 1980.Google ScholarCross Ref
- G. Smith. Video scene detection using closed caption text. Virginia Commonwealth University, 2009. available at https://digarchive.library.vcu.edu/handle/10156/2649.Google Scholar
- N. Stokes and J. Carthy. Combining semantic and syntactic document classifiers to improve first story detection. In SIGIR '01, pages 424--425, 2001. Google ScholarDigital Library
- S. Strassel, J. Kong, and D. Graff. TDT4 Multilingual Text and Annotations. Linguistic Data Consortium, Philadelphia, 2005.Google Scholar
- K. Zhang, J. Zi, and L. Wu. New event detection based on indexing-tree and named entity. In SIGIR '07, pages 215--222, 2007. Google ScholarDigital Library
Recommendations
Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search
Although sliding window-based approaches have been quite successful in detecting objects in images, it is not a trivial problem to extend them to detecting events in videos. We propose to search for spatiotemporal paths for video event detection. This ...
Story link detection and new event detection are asymmetric
NAACL-Short '03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2Story link detection has been regarded as a core technology for other Topic Detection and Tracking tasks such as new event detection. In this paper we analyze story link detection and new event detection in a retrieval framework and examine the effect ...
Comments