Extracting Information from Short Messages

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3513)

Abstract

Much of the information transmitted today takes the form of e-mails or SMS text messages, so extracting information from such short messages is increasingly important. The words in a message can be partitioned into three aspects: the syntactic structure, terms from the domain of discourse, and the data being transmitted. This paper describes a lightweight Information Extraction component that uses pattern matching to separate these aspects: the structure is supplied as a template; domain terms are the metadata of a data source (or their synonyms); and the data are the words that match placeholders in the templates.
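
The three-way split described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' component: the template syntax (`<name>` placeholders), the `DOMAIN_TERMS` vocabulary, and the function names are all assumptions made for illustration. The template supplies the structure, a lookup table maps synonyms to the metadata names of an imagined data source, and the words that fill the remaining placeholders are returned as data.

```python
import re

# Hypothetical domain vocabulary: metadata names of an imagined data source,
# each mapped to a set of accepted synonyms (an assumption, not from the paper).
DOMAIN_TERMS = {
    "grade": {"grade", "mark", "score"},
    "email": {"email", "e-mail", "address"},
}


def canonical_term(word):
    """Return the metadata name a word denotes, or None if it is not a domain term."""
    word = word.lower()
    for name, synonyms in DOMAIN_TERMS.items():
        if word in synonyms:
            return name
    return None


def compile_template(template):
    """Compile a template such as 'the <term> of <key> is <value>' into a regex.

    Literal words must appear as written (case-insensitively); each <name>
    placeholder becomes a named group that matches a single word.
    """
    pieces = []
    for token in template.split():
        placeholder = re.fullmatch(r"<(\w+)>", token)
        if placeholder:
            pieces.append(rf"(?P<{placeholder.group(1)}>\S+)")
        else:
            pieces.append(re.escape(token))
    return re.compile(r"\s+".join(pieces), re.IGNORECASE)


def extract(template, message):
    """Match a message against a template and separate domain term from data.

    Returns a dict of the captured fields with the <term> placeholder replaced
    by its canonical metadata name, or None if the message does not fit the
    template or the word in the <term> slot is not a known domain term.
    """
    match = compile_template(template).fullmatch(message.strip())
    if match is None:
        return None
    fields = match.groupdict()
    term = canonical_term(fields.pop("term", ""))
    if term is None:
        return None
    return {"term": term, **fields}


if __name__ == "__main__":
    print(extract("the <term> of <key> is <value>", "The mark of Alice is 72"))
```

In this sketch a message that does not fit the template, or whose term slot holds a word outside the vocabulary, is simply rejected; a fuller system would presumably try several templates in turn.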

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cooper, R., Ali, S., Bi, C. (2005). Extracting Information from Short Messages. In: Montoyo, A., Muñoz, R., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2005. Lecture Notes in Computer Science, vol 3513. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11428817_44

  • DOI: https://doi.org/10.1007/11428817_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26031-8

  • Online ISBN: 978-3-540-32110-1

  • eBook Packages: Computer Science, Computer Science (R0)
