skip to main content
10.1145/2872427.2883072acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article
Public Access

Hidden Topic Sentiment Model

Published:11 April 2016Publication History

ABSTRACT

Various topic models have been developed for sentiment analysis tasks. But the simple topic-sentiment mixture assumption prohibits them from finding fine-grained dependency between topical aspects and sentiments. In this paper, we build a Hidden Topic Sentiment Model (HTSM) to explicitly capture topic coherence and sentiment consistency in an opinionated text document to accurately extract latent aspects and corresponding sentiment polarities. In HTSM, 1) topic coherence is achieved by enforcing words in the same sentence to share the same topic assignment and modeling topic transition between successive sentences; 2) sentiment consistency is imposed by constraining topic transitions via tracking sentiment changes; and 3) both topic transition and sentiment transition are guided by a parameterized logistic function based on the linguistic signals directly observable in a document. Extensive experiments on four categories of product reviews from both Amazon and NewEgg validate the effectiveness of the proposed model.

References

  1. S. Baccianella, A. Esuli, and F. Sebastiani. Sentiwordnet 3.0: An enhanced lexical resource for sentiment analysis and opinion mining. In in Proc. of LREC, 2010.Google ScholarGoogle Scholar
  2. D. M. Blei and P. J. Moreno. Topic segmentation with an aspect hidden markov model. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 343--348. ACM, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. the Journal of machine Learning research, 3:993--1022, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Chang, S. Gerrish, C. Wang, J. L. Boyd-Graber, and D. M. Blei. Reading tea leaves: How humans interpret topic models. In Advances in neural information processing systems, pages 288--296, 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. P. Dempster, N. M. Laird, and D. B. Rubin. Maximum likelihood from incomplete data via the em algorithm. Journal of the royal statistical society. Series B (methodological), pages 1--38, 1977.Google ScholarGoogle Scholar
  6. Y. Fang, L. Si, N. Somasundaram, and Z. Yu. Mining contrastive opinions on political texts using cross-perspective topic model. In Proceedings of the fifth ACM international conference on Web search and data mining, pages 63--72. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. T. Griffiths, M. Steyvers, D. Blei, and J. Tenenbaum. Integrating topics and syntax. Advances in neural information processing systems, 17:537--544, 2005.Google ScholarGoogle Scholar
  8. A. Gruber, Y. Weiss, and M. Rosen-Zvi. Hidden topic markov models. In International Conference on Artificial Intelligence and Statistics, pages 163--170, 2007.Google ScholarGoogle Scholar
  9. T. Hofmann. Probabilistic latent semantic analysis. In Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence, pages 289--296. Morgan Kaufmann Publishers Inc., 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. E. H. Hovy. Automated discourse generation using discourse structure relations. Artificial intelligence, 63(1):341--385, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. W. Jin, H. H. Ho, and R. K. Srihari. A novel lexicalized hmm-based learning framework for web opinion mining. In Proceedings of the 26th Annual International Conference on Machine Learning, pages 465--472. Citeseer, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Jo and A. H. Oh. Aspect and sentiment unification model for online review analysis. In Proceedings of the fourth ACM international conference on Web search and data mining, pages 815--824. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H. Kamp. A theory of truth and semantic representation. Formal methods in the study of language, 1:277--322, 1981.Google ScholarGoogle Scholar
  14. D. D. Lewis, Y. Yang, T. G. Rose, and F. Li. Smart stopword list, 2004.Google ScholarGoogle Scholar
  15. C. Lin and Y. He. Joint sentiment/topic model for sentiment analysis. In Proceedings of the 18th ACM conference on Information and knowledge management, pages 375--384. ACM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. McAuley and J. Leskovec. Hidden factors and hidden topics: understanding rating dimensions with review text. In Proceedings of the 7th ACM conference on Recommender systems, pages 165--172. ACM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. D. Mcauliffe and D. M. Blei. Supervised topic models. In Advances in neural information processing systems, pages 121--128, 2008.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Q. Mei, X. Ling, M. Wondra, H. Su, and C. Zhai. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th international conference on World Wide Web, pages 171--180. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Mimno and A. McCallum. Topic models conditioned on arbitrary features with dirichlet-multinomial regression. The 24th Conference on Uncertainty in Artificial Intelligence, pages 411--418, 2008.Google ScholarGoogle Scholar
  20. K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell. Text classification from labeled and unlabeled documents using em. Machine learning, 39(2--3):103--134, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. B. Pang and L. Lee. Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pages 115--124. Association for Computational Linguistics, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B. Pang and L. Lee. Opinion mining and sentiment analysis. Foundations and trends in information retrieval, 2(1--2):1--135, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. L. R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257--286, 1989. Google ScholarGoogle ScholarCross RefCross Ref
  24. M. Steyvers and T. Griffiths. Probabilistic topic models. Handbook of latent semantic analysis, 427(7):424--440.Google ScholarGoogle Scholar
  25. I. Titov and R. T. McDonald. A joint model of text and aspect ratings for sentiment summarization. In ACL, volume 8, pages 308--316. Citeseer, 2008.Google ScholarGoogle Scholar
  26. P. D. Turney and M. L. Littman. Measuring praise and criticism: Inference of semantic orientation from association. ACM Trans. Inf. Syst., 21(4):315--346, Oct. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. A. J. Viera, J. M. Garrett, et al. Understanding interobserver agreement: the kappa statistic. Fam Med, 37(5):360--363, 2005.Google ScholarGoogle Scholar
  28. H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis without aspect keyword supervision. In Proceedings of the 17th ACM SIGKDD Conference, pages 618--626. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. H. Wang, D. Zhang, and C. Zhai. Structural topic model for latent topical structure analysis. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1, pages 1526--1535. Association for Computational Linguistics, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. P. Willett. The porter stemming algorithm: then and now. Program, 40(3):219--223, 2006. Google ScholarGoogle Scholar
  31. W. X. Zhao, J. Jiang, H. Yan, and X. Li. Jointly modeling aspects and opinions with a maxent-lda hybrid. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 56--65. Association for Computational Linguistics, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Hidden Topic Sentiment Model

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        WWW '16: Proceedings of the 25th International Conference on World Wide Web
        April 2016
        1482 pages
        ISBN:9781450341431

        Copyright © 2016 Copyright is held by the International World Wide Web Conference Committee (IW3C2)

        Publisher

        International World Wide Web Conferences Steering Committee

        Republic and Canton of Geneva, Switzerland

        Publication History

        • Published: 11 April 2016

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        WWW '16 Paper Acceptance Rate115of727submissions,16%Overall Acceptance Rate1,899of8,196submissions,23%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader