ACM Home Page
Please provide us with feedback. Feedback
An exploration of the principles underlying redundancy-based factoid question answering
Full text pdf formatPdf (765 KB)
Source
ACM Transactions on Information Systems (TOIS) archive
Volume 25 ,  Issue 2  (April 2007) table of contents
Article No. 6  
Year of Publication: 2007
ISSN:1046-8188
Author
Jimmy Lin  University of Maryland, College Park, College Park, MD
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 21,   Downloads (12 Months): 275,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1229179.1229180
What is a DOI?

ABSTRACT

The so-called “redundancy-based” approach to question answering represents a successful strategy for mining answers to factoid questions such as “Who shot Abraham Lincoln?” from the World Wide Web. Through contrastive and ablation experiments with Aranea, a system that has performed well in several TREC QA evaluations, this work examines the underlying assumptions and principles behind redundancy-based techniques. Specifically, we develop two theses: that stable characteristics of data redundancy allow factoid systems to rely on external “black box” components, and that despite embodying a data-driven approach, redundancy-based methods encode a substantial amount of knowledge in the form of heuristics. Overall, this work attempts to address the broader question of “what really matters” and to provide guidance for future researchers.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
 
4
 
5
 
6
7
 
8
 
9
 
10
Brill, E., Lin, J., Banko, M., Dumais, S., and Ng, A. 2001. Data-intensive question answering. In Proceedings of the Tenth Text REtrieval Conference (TREC 2001). 393--400.
 
11
Brill, E. and Mooney, R. J. 1997. An overview of empirical natural language processing. AI Mag. 18, 4, 13--24.
 
12
 
13
Cahn, S. M., Kitcher, P., Sher, G., and Markie, P. J. 1996. Reason at Work: Introductory Readings in Philosophy, 3rd ed. Hardcourt Brace College Publishers, Fort Worth, TX.
 
14
 
15
16
 
17
Clarke, C., Cormack, G., Lynam, T., Li, C., and McLearn, G. 2001b. Web reinforced question answering (MultiText experiments for TREC 2001). In Proceedings of the Tenth Text REtrieval Conference (TREC 2001). 673--679.
18
 
19
Dang, H. 2005. Overview of DUC 2005. In Proceedings of the 2005 Document Understanding Conference (DUC 2005) at NLT/EMNLP 2005.
 
20
Dang, H., Lin, J., and Kelly, D. 2006. Overview of the TREC 2006 question answering track. In Proceedings of the Fifteenth Text REtrieval Conference (TREC 2006).
21
 
22
 
23
 
24
Fukumoto, J., Kato, T., and Masui, F. 2002. Question Answering Challenge (QAC-1): An evaluation of question answering task at NTCIR Workshop 3. In Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, Automatic Text Summarization and Question Answering.
 
25
Harabagiu, S., Moldovan, D., Paşca, M., Mihalcea, R., Surdeanu, M., Bunescu, R., Gîrju, R., Rus, V., and Morărescu, P. 2000a. FALCON: Boosting knowledge for answer engines. In Proceedings of the Ninth Text REtrieval Conference (TREC-9). 497--506.
 
26
 
27
Hildebrandt, W., Katz, B., and Lin, J. 2004. Answering definition questions with multiple knowledge sources. In Proceedings of the 2004 Human Language Technology Conference and the North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT/NAACL 2004). 49--56.
 
28
 
29
Hovy, E., Gerber, L., Hermjakob, U., Junk, M., and Lin, C.-Y. 2000. Question answering in Webclopedia. In Proceedings of the Ninth Text REtrieval Conference (TREC-9). 655--664.
 
30
Ittycheriah, A., Franz, M., Zhu, W.-J., and Ratnaparkhi, A. 2000. IBM's statistical question answering system. In Proceedings of the Ninth Text REtrieval Conference (TREC-9). 258--264.
 
31
Kato, T., Fukumoto, J., Masui, F., and Kando, N. 2004. Handling information access dialogue through QA technologies---a novel challenge for open-domain question answering. In Proceedings of the HLT-NAACL 2004 Workshop on Pragmatics of Question Answering. 70--77.
 
32
Katz, B. 1997. Annotating the World Wide Web using natural language. In Proceedings of the 5th RIAO Conference on Computer Assisted Information Searching on the Internet (RIAO 1997). 136--155.
 
33
34
 
35
36
 
37
 
38
Lin, J., Fernandes, A., Katz, B., Marton, G., and Tellex, S. 2002. Extracting answers from the Web using knowledge annotation and knowledge mining techniques. In Proceedings of the Eleventh Text REtrieval Conference (TREC 2002).
39
 
40
41
 
42
Lowe, J. B. 2000. What's in store for question answering? (Invited talk.) In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC-2000).
 
43
Magnini, B., Romagnoli, S., Vallin, A., Herrera, J., Peñas, A., Peinado, V., Verdejo, F., and de Rijke, M. 2004. The multiple language question answering track at CLEF 2003. In Comparative Evaluation of Multilingual Information Access Systems: 4th Workshop of the Cross-Language Evaluation Forum, CLEF 2003, Trondheim, Norway, August 21--22, 2003, Revised Selected Papers, C. Peters, J. Gonzalo, M. Braschler, and M. Kluck, Eds. Lecture Notes in Computer Science, vol. 3237. Springer, Berlin, Germany, 471--486.
 
44
 
45
 
46
Moffat, A., Sacks-Davis, R., Wilkinson, R., and Zobel, J. 1993. Retrieval of partial documents. In Proceedings of the Second Text REtrieval Conference (TREC-2). 181--190.
 
47
48
49
 
50
 
51
 
52
Robertson, S. 1977. The probability ranking principle in IR. J. Documentat. 33, 4, 294--304.
 
53
Robertson, S. 2004. Understanding inverse document frequency: On theoretical arguments for IDF. J. Documentat. 60, 5, 503--520.
54
 
55
Srihari, R. and Li, W. 1999. Information extraction supported question answering. In Proceedings of the Eighth Text REtrieval Conference (TREC-8). 185--196.
56
 
57
Voorhees, E. 2001. Overview of the TREC 2001 question answering track. In Proceedings of the Tenth Text REtrieval Conference (TREC 2001). 42--51.
 
58
Voorhees, E. 2002. Overview of the TREC 2002 question answering track. In Proceedings of the Eleventh Text REtrieval Conference (TREC 2002). 57--68.
 
59
Voorhees, E. 2003. Overview of the TREC 2003 question answering track. In Proceedings of the Twelfth Text REtrieval Conference (TREC 2003). 54--68.
 
60
Voorhees, E. 2004. Overview of the TREC 2004 question answering track. In Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004). 52--69.
 
61
Voorhees, E. and Tice, D. 1999. The TREC-8 question answering track evaluation. In Proceedings of the Eighth Text REtrieval Conference (TREC-8). 83--106.
62
 
63