ABSTRACT
Within the area of general-purpose fine-grained subjectivity analysis, opinion topic identification has, to date, received little attention due to both the difficulty of the task and the lack of appropriately annotated resources. In this paper, we provide an operational definition of opinion topic and present an algorithm for opinion topic identification that, following our new definition, treats the task as a problem in topic coreference resolution. We develop a methodology for the manual annotation of opinion topics and use it to annotate topic information for a portion of an existing general-purpose opinion corpus. In experiments using the corpus, our topic identification approach statistically significantly outperforms several non-trivial baselines according to three evaluation measures.
- ACE. 2005. The NIST ACE evaluation website. http://www.nist.gov/speech/tests/ace/.Google Scholar
- Bagga, A. and B. Baldwin. 1998. Algorithms for scoring coreference chains. In In Proceedings of MUC7.Google Scholar
- Bethard, S., H. Yu, A. Thornton, V. Hativassiloglou, and D. Jurafsky. 2004. Automatic extraction of opinion propositions and their holders. In 2004 AAAI Spring Symposium on Exploring Attitude and Affect in Text.Google Scholar
- Breck, E., Y. Choi, and C. Cardie. 2007. Identifying expressions of opinion in context. In Proceedings of IJCAI. Google ScholarDigital Library
- Choi, Y., C. Cardie, E. Riloff, and S. Patwardhan. 2005. Identifying sources of opinions with conditional random fields and extraction patterns. In Proceedings of EMNLP. Google ScholarDigital Library
- Choi, Y., E. Breck, and C. Cardie. 2006. Joint extraction of entities and relations for opinion recognition. In Proceedings of EMNLP. Google ScholarDigital Library
- Choi, F. 2000. Advances in domain independent linear text segmentation. Proceedings of NAACL. Google ScholarDigital Library
- Cohen, W. 1995. Fast effective rule induction. In Proceedings of ICML.Google ScholarCross Ref
- Freund, Y. and R. Schapire. 1998. Large margin classification using the perceptron algorithm. In Proceedings of Computational Learing Theory. Google ScholarDigital Library
- Hasegawa, T., S. Sekine, and R. Grishman. 2004. Discovering relations among named entities from large corpora. In Proceedings of ACL. Google ScholarDigital Library
- Hu, M. and B. Liu. 2004. Mining opinion features in customer reviews. In AAAI. Google ScholarDigital Library
- Joachims, T. 1998. Making large-scale support vector machine learning practical. In B. Schölkopf, C. Burges, A. Smola, editor, Advances in Kernel Methods: Support Vector Machines. MIT Press, Cambridge, MA. Google ScholarDigital Library
- Kim, S. and E. Hovy. 2006. Extracting opinions, opinion holders, and topics expressed in online news media text. In Proceedings of ACL/COLING Workshop on Sentiment and Subjectivity in Text. Google ScholarDigital Library
- Kobayashi, N., K. Inui, Y. Matsumoto, K. Tateishi, and T. Fukushima. 2004. Collecting evaluative expressions for opinion extraction. In Proceedings of IJCNLP.Google Scholar
- Krippendorff, K. 1980. Content Analysis: An Introduction to Its Methodology. Sage Publications, Beverly Hills, CA.Google Scholar
- Luo, X. 2005. On coreference resolution performance metrics. In Proceedings of EMNLP. Google ScholarDigital Library
- Malioutov, I. and R. Barzilay. 2006. Minimum cut model for spoken lecture segmentation. In Proceedings of ACL/COLING. Google ScholarDigital Library
- Ng, V. and C. Cardie. 2002. Improving machine learning approaches to coreference resolution. In In Proceedings of ACL. Google ScholarDigital Library
- Pang, B., L. Lee, and S. Vaithyanathan. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of EMNLP. Google ScholarDigital Library
- Passonneau, R. 2004. Computing reliability for coreference annotation. In Proceedings of LREC.Google Scholar
- Popescu, A. and O. Etzioni. 2005. Extracting product features and opinions from reviews. In Proceedings of HLT/EMNLP. Google ScholarDigital Library
- Rosenfeld, B. and R. Feldman. 2007. Clustering for unsupervised relation identification. In Proceedings of CIKM. Google ScholarDigital Library
- Soon, W., H. Ng, and D. Lim. 2001. A machine learning approach to coreference resolution of noun phrases. Computational Linguistics, 27(4). Google ScholarDigital Library
- Stoyanov, V. and C. Cardie. 2008. Annotating topics of opinions. In Proceedings of LREC.Google Scholar
- Turney, P. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of ACL. Google ScholarDigital Library
- Vilain, M., J. Burger, J. Aberdeen, D. Connolly, and L. Hirschman. 1995. A model-theoretic coreference scoring scheme. In Proceedings of the MUC6. Google ScholarDigital Library
- Voorhees, E. and L. Buckland. 2003. Overview of the TREC 2003 Question Answering Track. In Proceedings of TREC 12.Google Scholar
- Wiebe, J., T. Wilson, and C. Cardie. 2005. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation, 1(2).Google Scholar
- Wiebe, J. 2005. Personal communication.Google Scholar
- Wilson, T., J. Wiebe, and P. Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of HLT/EMNLP. Google ScholarDigital Library
- Wilson, T. 2005. Personal communication.Google Scholar
- Yi, J., T. Nasukawa, R. Bunescu, and W. Niblack. 2003. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Proceedings of ICDM. Google ScholarDigital Library
Index Terms
- Topic identification for fine-grained opinion analysis
Recommendations
Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling
MM '14: Proceedings of the 22nd ACM international conference on MultimediaThe ever-increasing user-generated contents in social media and other web services make it highly desirable to discover opinions of users on all kinds of topics. Motivated by the assumption that individual word and paragraph in documents will deliver ...
Twitter Opinion Topic Model: Extracting Product Opinions from Tweets by Leveraging Hashtags and Sentiment Lexicon
CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge ManagementAspect-based opinion mining is widely applied to review data to aggregate or summarize opinions of a product, and the current state-of-the-art is achieved with Latent Dirichlet Allocation (LDA)-based model. Although social media data like tweets are ...
Comments