ABSTRACT
The analysis of microblogging data related with stock markets can reveal relevant new signals of investor sentiment and attention. It may also provide sentiment and attention indicators in a more rapid and cost-effective manner than other sources. In this study, we created several indicators using Twitter data and investigated their value when modeling relevant stock market variables, namely returns, trading volume and volatility. We collected recent data from nine major technological companies. Several sentiment analysis methods were explored, by comparing 5 popular lexical resources and two novel lexicons (emoticon based and the merge of all 6 lexicons) and sentiment indicators produced using two strategies (based on daily words and individual tweet classifications). Also, we measured posting volume associated with tweets related to the analyzed companies. While a short time period is considered (32 days), we found scarce evidence that sentiment indicators can explain these stock returns. However, interesting results were obtained when measuring the value of using posting volume for fitting trading volume and, in particular, volatility.
- Y. Amihud, H. Mendelson, and L. H. Pedersen. Liquidity and Asset Prices. Foundations and Trends in Finance, 1(4):269--364, Aug. 2007.Google ScholarCross Ref
- S. Baccianella, A. Esuli, and F. Sebastiani. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), Valletta, Malta, May. European Language Resources Association (ELRA), 2010.Google Scholar
- J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1):1--8, 2011.Google ScholarCross Ref
- P. Cortez. Data Mining with Neural Networks and Support Vector Machines using the R/rminer Tool. In P. Perner, editor, Advances in Data Mining -- Applications and Theoretical Aspects, 10th Industrial Conference on Data Mining, pages 572--583, Berlin, Germany, July 2010. LNAI 6171, Springer. Google ScholarDigital Library
- L. Ederington and W. Guan. Is implied volatility an informationally efficient and effective predictor of future volatility? Journal of Risk, 4:29--46, 2002.Google ScholarCross Ref
- I. Feinerer, K. Hornik, and D. Meyer. Text mining infrastructure in r. Journal of Statistical Software, 25(5), March 2008.Google ScholarCross Ref
- T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer-Verlag, NY, USA, 2nd edition, 2008.Google Scholar
- M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 168--177. ACM, 2004. Google ScholarDigital Library
- H. Mao, S. Counts, and J. Bollen. Predicting financial markets: Comparing survey, news, twitter and search engine data. arXiv preprint arXiv:1112.1051, 2011.Google Scholar
- S. Mohammad, C. Dunne, and B. Dorr. Generating high-coverage semantic orientation lexicons from overtly marked words and a thesaurus. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2--Volume 2, pages 599--608. Association for Computational Linguistics, 2009. Google ScholarDigital Library
- J. Nofsinger. Social mood and financial economics. The Journal of Behavioral Finance, 6(3):144--160, 2005.Google ScholarCross Ref
- C. Oh and O. Sheng. Investigating predictive power of stock micro blog sentiment in forecasting future stock price directional movement. ICIS 2011 Proceedings, 2011.Google Scholar
- R. Peterson. Affect and financial decision-making: How neuroscience can inform market participants. The Journal of Behavioral Finance, 8(2):70--78, 2007.Google ScholarCross Ref
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2012. ISBN 3-900051-07-0.Google Scholar
- R. Schumaker and H. Chen. Textual analysis of stock market prediction using breaking financial news: The azfin text system. ACM Transactions on Information Systems (TOIS), 27(2):12, 2009. Google ScholarDigital Library
- T. Sprenger and I. Welpe. Tweets and trades: The information content of stock microblogs. Social Science Research Network Working Paper Series, pages 1--89, 2010.Google ScholarCross Ref
- P. Stone, D. Dunphy, and M. Smith. The General Inquirer: A Computer Approach to Content Analysis. MIT press, 1966.Google Scholar
- A. Timmermann. Elusive return predictability. International Journal of Forecasting, 24(1):1--18, Jan. 2008.Google ScholarCross Ref
- T. Wilson, P. Hoffmann, S. Somasundaran, J. Kessler, J. Wiebe, Y. Choi, C. Cardie, E. Riloff, and S. Patwardhan. Opinionfinder: A system for subjectivity analysis. In Proceedings of HLT/EMNLP on Interactive Demonstrations, pages 34--35. Association for Computational Linguistics, 2005. Google ScholarDigital Library
- T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity: An exploration of features for phrase-level sentiment analysis. Computational linguistics, 35(3):399--433, 2009. Google ScholarDigital Library
- I. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco, CA, 2005. Google ScholarDigital Library
Index Terms
- Some experiments on modeling stock market behavior using investor sentiment analysis and posting volume from Twitter
Recommendations
Automatic creation of stock market lexicons for sentiment analysis using StockTwits data
IDEAS '14: Proceedings of the 18th International Database Engineering & Applications SymposiumSentiment analysis has been increasingly applied to the stock market domain. In particular, investor sentiment indicators can be used to model and predict stock market variables. In this context, the quality of the sentiment analysis is highly dependent ...
Analyzing Stock Market Movements Using Twitter Sentiment Analysis
ASONAM '12: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)In this paper we investigate the complex relationship between tweet board literature (like bullishness, volume, agreement etc) with the financial market instruments (like volatility, trading volume and stock prices). We have analyzed sentiments for more ...
Stock market sentiment lexicon acquisition using microblogging data and statistical measures
Lexicon acquisition is a key issue for sentiment analysis. This paper presents a novel and fast approach for creating stock market lexicons. The approach is based on statistical measures applied over a vast set of labeled messages from StockTwits, which ...
Comments