Skip to main content

Argumentative Zoning for Improved Citation Indexing

  • Chapter

Part of the book series: The Information Retrieval Series ((INRE,volume 20))

Abstract

We address the problem of automatically classifying academic citations in scientific articles according to author affect. There are many ways how a citation might fit into the overall argumentation of the article: as part of the solution, as rival approach or as flawed approach that justifies the current research. Our motivation for this work is to improve citation indexing. The method we use for this task is machine learning from indicators of affect (such as “we follow X in assuming that…”, or “in contrast to Y, our system solves this problem”) and of presentation of ownership of ideas (such as “We present a new method for…”, or “They claim that…”). Some of these features are borrowed from Argumentative Zoning (Teufel and Moens, 2002), a technique for determining the rhetorical status of each sentence in a scientific article. These features include the type of subject of the sentence, the citation type, the semantic class of main verb, and a list of indicator phrases. Evaluation will be both intrinsic and extrinsic, involving the measurement of human agreement on the task and a comparison of human and automatic evaluation, as well as a comparison of task-performance with our system versus task performance with a standard citation indexer (CiteSeer, Lawrence et al., 1999).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

8. Bibliography

  • Computation and Language Archive (1994). http://xxx.lanl.gov/cmp-lg

    Google Scholar 

  • Ge, N. Hale, J., and Charniak, E. (1998). A statistical approach to anaphora resolution. In Proceedings of the Sixth Workshop on Very Large Corpora.

    Google Scholar 

  • Giles, C, Bollacker, K., and Lawrence, S. (1998). Citeseer: An automatic citation indexing system. In Proceedings of the Third ACM Conference on Digital Libraries.

    Google Scholar 

  • Kupiec, J., Pedersen, J., and Chen, F. (1995). A trainable document summarizer. In Proceedings of the 18thAnnual International Conference on Research and Development in Information Retrieval (SIGIR-95).

    Google Scholar 

  • Lawrence, S., Giles, C., and Bollacker, K. (1999). Digital libraries and autonomous citation indexing. IEEE Computer 32(6).

    Google Scholar 

  • Lewis, D. (1991). Evaluating text categorisation. In Speech and Natural Language: Proceedings of the ARPA Workshop of Human Language Technology.

    Google Scholar 

  • Myers, G. (1992). In this paper we report…-speech acts and scientific facts. Journal of Pragmatics 17(4).

    Google Scholar 

  • Nanba and Okumura (1999). Towards multi-paper summarization using reference information. In Proceedings of IJCAI-99.

    Google Scholar 

  • Pang, B., Lee, L., and Vaithyanathan, S. (2002). Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).

    Google Scholar 

  • Shum, S. (1998). Evolving the web for scientific knowledge: First steps towards an “HCI knowledge web”. Interfaces, British HCI Group Magazine 39.

    Google Scholar 

  • Siegel, S. and Castellan, N. (1988). Nonparametric Statistics for the Behavioral Sciences. Berkeley, CA: McGraw-Hill, 2nd edition.

    Google Scholar 

  • Swales, J. (1990). Genre Analysis: English in Academic and Research Settings. Chapter 7: Research articles in English. Cambridge, UK: Cambridge University Press.

    Google Scholar 

  • Teufel, S. and Moens, M. (2002). Summarising scientific articles — experiments with relevance and rhetorical status. Computational Linguistics 28(4).

    Google Scholar 

  • Teufel, S., Carletta, J., and Moens, M. (1999). An annotation scheme for discourse-level argumentation in research articles. In Proceedings of the Ninth Meeting of the European Chapter of the Association for Computational Linguistics (EACL-99).

    Google Scholar 

  • Teufel, S. (2001). Task-based evaluation of summary quality: Describing relationships between scientific papers. In Proceedings of NAACL-01 Workshop “Automatic Text Summarization”.

    Google Scholar 

  • Weinstock, M. (1971). Citation Indexes. In Encyclopaedia of Library and Information Science, volume 5, New York, NY: Dekker.

    Google Scholar 

  • Wiebe, J. (1994). Tracking point of view in narrative. Computational Linguistics 20(2).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer

About this chapter

Cite this chapter

Teufel, S. (2006). Argumentative Zoning for Improved Citation Indexing. In: Shanahan, J.G., Qu, Y., Wiebe, J. (eds) Computing Attitude and Affect in Text: Theory and Applications. The Information Retrieval Series, vol 20. Springer, Dordrecht. https://doi.org/10.1007/1-4020-4102-0_13

Download citation

  • DOI: https://doi.org/10.1007/1-4020-4102-0_13

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-4026-9

  • Online ISBN: 978-1-4020-4102-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics