Abstract
With the emergence of social networks, opinion detection has become an active research area with different applications and several opinionated resources such as product reviews, social media posts and online blogs. Many social actors (e.g., companies, government departments, journalists) seek to understand people’s opinions for various purposes such as analyzing consumer reactions to certain products’ promotion (Marketing). In this regard, the last decade has witnessed a steady growth in opinion mining and sentiment analysis mainly explained by the scientific challenges and it bears such as natural language processing ambiguity, spam opinion detection, sarcasm, and using abbreviations. As a result, an extended survey focusing on the different aspects of those challenges is required. In this work, we present the problem statement and preliminaries, as well as the data sources and acquisition techniques. We then propose a thorough examination of well-cited, classical and recent opinion mining approaches, with an emphasis on the techniques employed in each of the sub-tasks of opinion mining.
Similar content being viewed by others
Notes
References
Adnan K, Akbar R (2019) Limitations of information extraction methods and techniques for heterogeneous unstructured big data. Int J Eng Bus Manag 11:1847979019890771
Akhmedova S, Semenkin E, Stanovov V (2018) Co-operation of biology related algorithms for solving opinion mining problems by using different term weighting schemes. In: Madani K, Peaucelle D, Gusikhin O (eds) Informatics in control, automation and robotics (pp. 73–90). Springer
Alkula R (2001) From plain character strings to meaningful words: producing better full text databases for inflectional and compounding languages with morphological analysis software. Inf Retr 4(3–4):195–208
Balahur A, Hermida JM, Montoyo A (2011) Building and exploiting emotinet, a knowledge base for emotion detection based on the appraisal theory model. IEEE Trans Affect Comput 3(1):88–101
Balazs JA, Velásquez JD (2016) Opinion mining and information fusion: a survey. Inf Fus 27:95–110
Cambria E (2016) Affective computing and sentiment analysis. IEEE Intell Syst 31(2):102–107
Cambria E, Speer R, Havasi C, Hussain A (2010) Senticnet: a publicly available semantic resource for opinion mining. In: AAAI fall symposium: commonsense knowledge, vol 10
Cambria E, Havasi C, Hussain A (2012) Senticnet 2: a semantic and affective resource for opinion mining and sentiment analysis. In: Twenty-fifth international flairs conference
Cambria E, Schuller B, Xia Y, Havasi C (2013) New avenues in opinion mining and sentiment analysis. IEEE Intell Syst 28(2):15–21
Chelaru S, Altingovde IS, Siersdorfer S, Nejdl W (2013) Analyzing, detecting, and exploiting sentiment in web queries. ACM Trans Web (TWEB) 8(1):1–28
Chifu ES, Chifu VR (2019) An unsupervised neural model for aspect based opinion mining. In: 2019 IEEE 15th international conference on intelligent computer communication and processing (ICCP), pp 151–157
Cui A, Zhang M, Liu Y, Ma S (2011) Emotion tokens: bridging the gap among multilingual twitter sentiment analysis. In: Asia information retrieval symposium, pp 238–249
Esuli A, Sebastiani F (2006) Sentiwordnet: a publicly available lexical resource for opinion mining. In: LREC, vol 6, pp 417–422
Fu T, Abbasi A, Zeng D, Chen H (2012) Sentimental spidering: leveraging opinion information in focused crawlers. ACM Trans Inf Syst (TOIS) 30(4):1–30
Gruber TR (1993) A translation approach to portable ontology specifications. Knowl Acquis 5(2):199–220
Guarino N (1995) Formal ontology, conceptual analysis and knowledge representation. Int J Hum Comput Stud 43(5–6):625–640
Guo K, Shi L, Ye W, Li X (2014) A survey of internet public opinion mining. In: 2014 IEEE international conference on progress in informatics and computing
Hangya V, Farkas R (2013) Target-oriented opinion mining from tweets. In: 2013 IEEE 4th international conference on cognitive infocommunications (coginfocom), pp 251–254
Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, pp 168–177
Hu N, Pavlou PA, Zhang J (2006) Can online reviews reveal a product’s true quality? empirical findings and analytical modeling of online word-of-mouth communication. In: Proceedings of the 7th ACM conference on electronic commerce, pp 324–330
Kamath U, Liu J, Whitaker J (2019) Deep learning for NLP and speech recognition, vol 84. Springer
Kaur A, Gupta V (2013) A survey on sentiment analysis and opinion mining techniques. J Emerg Technol Web Intell 5(4):367–371
Keyvanpour M, Zandian ZK, Heidarypanah M (2020) Omlml: a helpful opinion mining method based on lexicon and machine learning in social networks. Soc Netw Anal Min 10(1):1–17
Korenius T, Laurikkala J, Järvelin K, Juhola M (2004) Stemming and lemmatization in the clustering of finnish text documents. In: Proceedings of the thirteenth ACM international conference on information and knowledge management, pp 625–633
Krishna BV, Pandey AK, Kumar AS (2018) Feature based opinion mining and sentiment analysis using fuzzy logic. In: Gurumoorthy S, Rao BNK, Gao X-Z (eds) Cognitive science and artificial intelligence, pp 79–89. Springer
Liu B (2007) Web data mining: exploring hyperlinks, contents, and usage data. Springer
Liu B (2012) Sentiment analysis and opinion mining. Synth Lect Hum Lang Technol 5(1):1–167
Liu B et al (2010) Sentiment analysis and subjectivity. Handb Nat Lang Process 2(2010):627–666
Meriem AB, Hlaoua L, Romdhane LB (2021) A fuzzy approach for sarcasm detection in social networks. Procedia Comput Sci 192:602–611
Miller GA (1998) Wordnet: an electronic lexical database. MIT Press
Missen MMS, Boughanem M, Cabanac G (2013) Opinion mining: reviewed from word to document level. Social Netw Anal Min 3(1):107–125
Montejo-Ráez A, Martínez-Cámara E, Martin-Valdivia MT, López LAU (2012) Random walk weighting over sentiwordnet for sentiment polarity detection on twitter. In: Proceedings of the 3rd workshop in computational approaches to subjectivity and sentiment analysis, pp 3–10
Montoyo A, MartíNez-Barco P, Balahur A (2012) Subjectivity and sentiment analysis: an overview of the current state of the area and envisaged developments. Elsevier
Ortega-Bueno R, Muniz-Cuza CE, Pagola JEM, Rosso P (2018) UO UPV: deep linguistic humor detection in Spanish social media. In: Proceedings of the third workshop on evaluation of human language technologies for Iberian languages (IberEval 2018) co-located with 34th conference of the Spanish society for natural language processing (SEPLN 2018), pp 204–213
Palmer DD (2000) Tokenisation and sentence segmentation. In: Dale R, Moisel H, Somers H (eds) Handbook of natural language processing, pp 11–35
Pang B, Lee L (2009) Opinion mining and sentiment analysis. Comput Linguist 35(2):311–312
Pang B, Lee L, Vaithyanathan S (2002a) Thumbs up? sentiment classification using machine learning techniques. arXiv preprint cs/0205070
Poecze F, Ebster C, Strauss C (2018) Social media metrics and sentiment analysis to evaluate the effectiveness of social media posts. Procedia Comput Sci 130:660–666
Pooja B, Jaswinder S (2021) A study on classification techniques based on opinions. IOP Conf Ser Mater Sci Eng 1022:012091
Popescu O, Strapparava C (2014) Time corpora: epochs, opinions and changes. Knowl Based Syst 69:3–13
Poria S, Cambria E, Gelbukh A (2016) Aspect extraction for opinion mining with a deep convolutional neural network. Knowl Based Syst 108:42–49
Porter MF et al (1980) An algorithm for suffix stripping. Program 14(3):130–137
Ravi K, Ravi V (2015) A survey on opinion mining and sentiment analysis: tasks, approaches and applications. Knowl Based Syst 89:14–46
Reddy CS, Raju K (2009) An improved fuzzy approach for COCOMO’s effort estimation using gaussian membership function. J Softw 4(5):452–459
Santorini B (1990) Part-of-speech tagging guidelines for the Penn treebank project (3rd revision). Technical Reports (CIS). University of Pennsylvania, School of Engineering and Applied Science
Shelke NM, Deshpande S, Thakre V (2012) Survey of techniques for opinion mining. Int J Comput Appl 57:13
Silva C, Ribeiro B (2003) The importance of stop word removal on recall values in text categorization. In: Proceedings of the international joint conference on neural networks, vol 3, pp 1661–1666
Takagi T, Sugeno M (1985) Fuzzy identification of systems and its applications to modeling and control. IEEE Trans Syst Man Cybern 1:116–132
Tang H, Tan S, Cheng X (2009) A survey on sentiment detection of reviews. Expert Syst Appl 36(7):10760–10773
Toutanova K, Klein D, Manning CD, Singer Y (2003) Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proceedings of the 2003 conference of the North American chapter of the association for computational linguistics on human language technology, vol 1, pp 173–180
Turney PD (2002) Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. arXiv preprint cs/0212032
Vilares D, Alonso MA, Gómez-Rodríguez C (2015) A syntactic approach for opinion mining on Spanish reviews. Nat. Lang. Eng. 21(1):139–163
Vinodhini G, Chandrasekaran R (2012) Sentiment analysis and opinion mining: a survey. Int J 2(6):282–292
Wang J, Li J, Li S, Kang Y, Zhang M, Si L, Zhou G (2018) Aspect sentiment classification with both word-level and clause-level attention networks. In: IJCAI, vol 2018, pp 4439–4445
Webster JJ, Kit C (1992) Tokenization as the initial phase in NLP. In: Coling 1992 volume 4: the 15th international conference on computational linguistics
Wilson T, Wiebe J, Hwa R (2004) Just how mad are you? Finding strong and weak opinion clauses. In: AAAI, vol 4, pp 761–769
Zhou L, Chaovalit P (2008) Ontology-supported polarity mining. J Am Soc Inf Sci Technol 59(1):98–110
Acknowledgements
The authors would like to thank the anonymous reviewers for their careful reading of our manuscript and their many insightful comments and suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Messaoudi, C., Guessoum, Z. & Ben Romdhane, L. Opinion mining in online social media: a survey. Soc. Netw. Anal. Min. 12, 25 (2022). https://doi.org/10.1007/s13278-021-00855-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-021-00855-8