Buy, Sell, or Hold? Information Extraction from Stock Analyst Reports

Lee, Yeong Su; Geierhos, Michaela

doi:10.1007/978-3-642-24279-3_19

Yeong Su Lee²⁴ &
Michaela Geierhos²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6967))

Included in the following conference series:

International and Interdisciplinary Conference on Modeling and Using Context

1002 Accesses
1 Citations

Abstract

This paper presents a novel linguistic information extraction approach exploiting analysts’ stock ratings for statistical decision making. Over a period of one year, we gathered German stock analyst reports in order to determine market trends. Our goal is to provide business statistics over time to illustrate market trends for a user-selected company. We therefore recognize named entities within the very short stock analyst reports such as organization names (e.g. BASF, BMW, Ericsson), analyst houses (e.g. Gartner, Citigroup, Goldman Sachs), ratings (e.g. buy, sell, hold, underperform, recommended list) and price estimations by using lexicalized finite-state graphs, so-called local grammars. Then, company names and their acronyms respectively have to be cross-checked against data the analysts provide. Finally, all extracted values are compared and presented into charts with different views depending on the evaluation criteria (e.g. by time line). Thanks to this approach it will be easier and even more comfortable in the future to pay attention to analysts’ buy/sell signals without reading all their reports.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barber, B.M., Lehaby, R., McNichols, M., Trueman, B.: Buys, holds, and sells: The distribution of investment banks’ stock ratings and the implications for the profitability of analysts’ recommendations. Journal of Accounting and Economics 41, 87–117 (2006)
Article Google Scholar
Lin, L., Liotta, A., Hippisley, A.: A method for automating the extraction of specialized information from the web. CIS 1, 489–494 (2005)
Google Scholar
Gross, M.: The Construction of Local Grammars. In: Roche, E., Schabés, Y. (eds.) Finite-State Language Processing. Language, Speech, and Communication, pp. 329–354. MIT Press, Cambridge (1997)
Google Scholar
Surdeanu, M., Harabagiu, S., Williams, J., Aarseth, P.: Using Predicate-Argument Structures for Informaiton Extraction. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 8–15 (2003)
Google Scholar
Baumgartner, R., Frölich, O., Gottlob, G., Harz, P.: Web Data Extraction for Business Intelligence: the Lixto Approach. In: Proc. of BTW 2005 (2005)
Google Scholar
Maynard, D., Saggion, H., Yankova, M., Bontcheva, K., Peters, W.: Natural Language Technology for Information Integration in Business Intelligence. In: 10th International Conference on Business Information Systems, Poland (2007)
Google Scholar
Saggion, H., Funk, A., Maynard, D., Bontcheva, K.: Ontology-Based Information Extraction for Business Intelligence. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 843–856. Springer, Heidelberg (2007)
Chapter Google Scholar
Paradis, F., Nie, J.Y., Tajarobi, A.: Discovery of Business Opportunities on the Internet with Informaiton Extraction. In: IJCAI 2005, Edinburgh, pp. 47–54 (2005)
Google Scholar
Silva, J., Kozareva, Z., Noncheva, V., Lopes, G.: Extracting Named Entities. A Statistical Approach. In: TALN 2004, Fez, Marroco, ATALA, pp. 347–351 (2004)
Google Scholar
Downey, D., Broadhead, M., Etzioni, O.: Locating Complex Named Entities in web Text. In: Proc. of the Twentieth International Joint Conference on Artificial Intelligence (IJCAI 2007), Hyderabad, India (2007)
Google Scholar
McDonald, D.: Internal and external evidence in the identification and semantic categorization of proper names. In: Boguraev, B., Pustejovsky, J. (eds.) Corpus Processing for Lexical Acquisition, pp. 21–39. MIT Press, Cambridge (1996)
Google Scholar
Mikheev, A., Moens, M., Grover, C.: Named entity recognition without gazetteers. In: Proceedings of the Ninth Conference of the European Chapter of the Association for Computational Linguistics, pp. 1–8 (1999)
Google Scholar
Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised Named-Entity Extraction from the web: An Experimental Study. Artificial Intelligence 165(1), 134–191 (2005)
Article Google Scholar
Banko, M., Cafarella, M.J., Soderland, S., Broadhead, M., Etzioni, O.: Open Information Extraction from the Web. In: Proceedings of IJCAI (2007)
Google Scholar
Giuliano, C., Lavelli, A., Romano, L.: Exploiting shallow linguistic information for relation extraction from biomedical literature. In: Proc. EACL 2006 (2006)
Google Scholar
Bsiri, S., Geierhos, M., Ringlstetter, C.: Structuring Job Search via Local Grammars. Advances in Natural Language Processing and Applications. Research in Computing Science (RCS) 33, 201–212 (2008)
Google Scholar
Geierhos, M., Blanc, O.: BiographIE – Biographical Information Extraction from Business News. In: De Gioia, M. (ed.) Actes du 27e Colloque international sur le lexique et la grammaire, L’Aquila, September 10-13 (2008); Seconde partie. Lingue d’Europa e del Mediterraneo: Grammatica comparata. Aracne, Rome, Italy (2010)
Google Scholar
Traboulsi, H.N.: Named Entity Recognition: A Local Grammar-based Approach. PhD thesis, University of Surrey (2006)
Google Scholar
Woods, W.A.: Transition network grammars for natural language analysis. Commun. ACM 13(10), 591–606 (1970)
Article MATH Google Scholar
Paumier, S.: Unitex User Manual 2.1. (2010), http://igm.univ-mlv.fr/~unitex/UnitexManual2.1.pdf
Chi, C.H., Ding, C.: Word Segmentation and Recognition for Web Document Framework. In: Proceedings of Conference on Information and Knowledge Management, CIKM (1999)
Google Scholar
Lee, Y.S.: Website-Klassifikation und Informationsextraktion aus Informationsseiten einer Firmenwebsite. PhD thesis, University of Munich, Germany (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

CIS, University of Munich, Germany
Yeong Su Lee & Michaela Geierhos

Authors

Yeong Su Lee
View author publications
You can also search for this author in PubMed Google Scholar
Michaela Geierhos
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

TecO/Pervasive Computing Systems, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131, Karlsruhe, Germany
Michael Beigl & Hedda R. Schmidtke &
Department of Communication, Business and Information Technologies, Roskilde University, P.O. box 260, 4000, Roskilde, Denmark
Henning Christiansen
Institut für Informatik, AG Erklärungsfähige Softwaresystems, Universität Hildesheim, Marienburger Platz 22, 31141, Hildesheim, Germany
Thomas R. Roth-Berghofer
Department of Computer and Information Science, Intelligent Systems Group, Norwegian University of Science and Technology (NTNU), Sem Saelandsvei 7-9,, 7491, Trondheim, Norway
Anders Kofod-Petersen
Cognition and Communication Research Centre, Northumbria University, Newcastle upon Tyne, NE1 8ST, UK
Kenny R. Coventry

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lee, Y.S., Geierhos, M. (2011). Buy, Sell, or Hold? Information Extraction from Stock Analyst Reports. In: Beigl, M., Christiansen, H., Roth-Berghofer, T.R., Kofod-Petersen, A., Coventry, K.R., Schmidtke, H.R. (eds) Modeling and Using Context. CONTEXT 2011. Lecture Notes in Computer Science(), vol 6967. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24279-3_19

Download citation

DOI: https://doi.org/10.1007/978-3-642-24279-3_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24278-6
Online ISBN: 978-3-642-24279-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics