skip to main content
10.1145/3535508.3545594acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

An Italian lexicon-based sentiment analysis approach for medical applications

Published:07 August 2022Publication History

ABSTRACT

Sentiment analysis aims at extracting opinions and or emotions mainly from written text. The most popular problem in sentiment analysis certainly is polarity detection, which falls into the broader class of Natural Language Processing (NLP) problems of text classification. To date, state-of-the-art approaches to text classification use neural language models built on popular architectures such as Transformers. However, these approaches are difficult to apply in low-resource languages and domains, as for instance the Italian language or small clinical trials. Motivated by this, this paper presents VADER-IT, a lexicon-based algorithm for polarity prediction in written text, that is an adaptation to the Italian language of the popular VADER. Unlike VADER, our system also predicts a polarity class (i.e. positive, negative or neutral). The system was tested on a dataset of 5495 healthcare related reviews from QSalute https://www.qsalute.it/, reaching a micro averaged F1--score = 81% and a micro averaged Jaccard - score = 73%.

References

  1. Apoorv Agarwal, Boyi Xie, Ilia Vovsha, Owen Rambow, and Rebecca J Passonneau. 2011. Sentiment analysis of twitter data. In Proceedings of the workshop on language in social media (LSM 2011). 30--38.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Luca Bacco, Andrea Cimino, Luca Paulon, Mario Merone, and Felice Dell'Orletta. 2020. A Machine Learning approach for Sentiment Analysis for Italian Reviews in Healthcare. Computational Linguistics CLiC-it 2020 630, 699 (2020), 16.Google ScholarGoogle Scholar
  3. Valerio Basile and Malvina Nissim. 2013. Sentiment analysis on Italian tweets. In Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Association for Computational Linguistics, Atlanta, Georgia, 100--107. https://www.aclweb.org/anthology/W13-1614Google ScholarGoogle Scholar
  4. Margaret M Bradley and Peter J Lang. 1999. Affective norms for English words (ANEW): Instruction manual and affective ratings. Technical Report. Technical report C-1, the center for research in psychophysiology ....Google ScholarGoogle Scholar
  5. Rosario Catelli, Serena Pelosi, and Massimo Esposito. 2022. Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian. Electronics 11 (01 2022), 374. Google ScholarGoogle ScholarCross RefCross Ref
  6. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186. Google ScholarGoogle ScholarCross RefCross Ref
  7. Shihab Elbagir and Jing Yang. 2019. Twitter sentiment analysis using natural language toolkit and VADER sentiment. In Proceedings of the international multiconference of engineers and computer scientists, Vol. 122. 16.Google ScholarGoogle Scholar
  8. Yu Gu, Robert Tinn, Hao Cheng, Michael Lucas, Naoto Usuyama, Xiaodong Liu, Tristan Naumann, Jianfeng Gao, and Hoifung Poon. 2020. Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing. arXiv:arXiv:2007.15779Google ScholarGoogle Scholar
  9. Kexin Huang, Jaan Altosaar, and Rajesh Ranganath. 2019. Clinicalbert: Modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:1904.05342 (2019).Google ScholarGoogle Scholar
  10. Clayton Hutto and Eric Gilbert. 2014. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In Proceedings of the international AAAI conference on web and social media, Vol. 8. 216--225.Google ScholarGoogle ScholarCross RefCross Ref
  11. Katikapalli Subramanyam Kalyan, Ajit Rajasekharan, and Sivanesan Sangeetha. 2022. AMMU: A survey of transformer-based biomedical pretrained language models. Journal of Biomedical Informatics 126 (2022), 103982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Svetlana Kiritchenko, Xiaodan Zhu, and Saif M Mohammad. 2014. Sentiment analysis of short informal texts. Journal of Artificial Intelligence Research 50 (2014), 723--762.Google ScholarGoogle ScholarCross RefCross Ref
  13. Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36, 4 (2020), 1234--1240.Google ScholarGoogle ScholarCross RefCross Ref
  14. Shervin Minaee, Nal Kalchbrenner, Erik Cambria, Narjes Nikzad, Meysam Chenaghlu, and Jianfeng Gao. 2021. Deep learning-based text classification: a comprehensive review. ACM Computing Surveys (CSUR) 54, 3 (2021), 1--40.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. James W Pennebaker, Ryan L Boyd, Kayla Jordan, and Kate Blackburn. 2015. The development and psychometric properties of LIWC2015. Technical Report.Google ScholarGoogle Scholar
  16. Emanuele Pianta, Luisa Bentivogli, and Christian Girardi. 2002. MultiWordNet: developing an aligned multilingual database. In First international conference on global WordNet. 293--302.Google ScholarGoogle Scholar
  17. Laila Rasmy, Yang Xiang, Ziqian Xie, Cui Tao, and Degui Zhi. 2021. Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction. NPJ digital medicine 4, 1 (2021), 1--13.Google ScholarGoogle Scholar
  18. Karsten Tymann, Matthias Lutz, Patrick Palsbröker, and Carsten Gips. 2019. GerVADER-A German Adaptation of the VADER Sentiment Analysis Tool for Social Media Texts.. In LWDA. 178--189.Google ScholarGoogle Scholar
  19. Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.Google ScholarGoogle Scholar
  20. Chiara Zucco, Barbara Calabrese, Giuseppe Agapito, Pietro H Guzzi, and Mario Cannataro. 2020. Sentiment analysis for mining texts and social networks data: Methods and tools. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 10, 1 (2020), e1333.Google ScholarGoogle ScholarCross RefCross Ref
  21. Chiara Zucco, Clarissa Paglia, Sonia Graziano, Sergio Bella, and Mario Cannataro. 2020. Sentiment analysis and text mining of questionnaires to support telemonitoring programs. Information 11, 12 (2020), 550.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. An Italian lexicon-based sentiment analysis approach for medical applications

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      BCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics
      August 2022
      549 pages
      ISBN:9781450393867
      DOI:10.1145/3535508

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 August 2022

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate254of885submissions,29%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader