Abstract
Much of the information transmitted today takes the form of e-mail or SMS text messages, so extracting information from such short messages is increasingly important. The words in a message can be partitioned into three aspects: the syntactic structure, terms from the domain of discourse, and the data being transmitted. This paper describes a lightweight Information Extraction component that uses pattern matching to separate these three aspects: the structure is supplied as a template; domain terms are the metadata of a data source (or their synonyms); and data is extracted as those words that match placeholders in the templates.
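The separation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the template syntax, the `SYNONYMS` table, and the function names `compile_template` and `extract` are all hypothetical, chosen only to show how a template can supply the structure, a metadata lookup can recognise domain terms, and placeholders can capture the data.

```python
import re

# Hypothetical metadata for a data source: each known word (a column name
# or a synonym for one) maps to its canonical column name.
SYNONYMS = {
    "price": "price",
    "cost": "price",
    "quantity": "quantity",
    "amount": "quantity",
}

# Hypothetical template: the fixed words give the structure, {term} must be
# a domain term, and {key} / {value} are placeholders for transmitted data.
TEMPLATE = "the {term} of {key} is {value}"


def compile_template(template):
    """Turn a template into a regex with one named group per placeholder."""
    parts = []
    for token in template.split():
        if token.startswith("{") and token.endswith("}"):
            parts.append(r"(?P<%s>\S+)" % token[1:-1])
        else:
            parts.append(re.escape(token))
    return re.compile(r"\s+".join(parts), re.IGNORECASE)


def extract(message, template=TEMPLATE, synonyms=SYNONYMS):
    """Return the extracted fields, or None if the message does not fit."""
    match = compile_template(template).fullmatch(message.strip())
    if match is None:
        return None  # the message does not have the template's structure
    fields = match.groupdict()
    term = synonyms.get(fields["term"].lower())
    if term is None:
        return None  # the word in the {term} slot is not a known domain term
    fields["term"] = term  # replace a synonym by the canonical name
    return fields


result = extract("The cost of widget is 42")
```

Under these assumptions, `result` holds the canonical domain term (`price`) together with the words captured by the `{key}` and `{value}` placeholders, while a message that does not fit the template, or whose term is unknown to the metadata, yields `None`.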
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cooper, R., Ali, S., Bi, C. (2005). Extracting Information from Short Messages. In: Montoyo, A., Muñoz, R., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2005. Lecture Notes in Computer Science, vol 3513. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11428817_44
DOI: https://doi.org/10.1007/11428817_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26031-8
Online ISBN: 978-3-540-32110-1
eBook Packages: Computer Science, Computer Science (R0)