Paper The following article is Open access

Supervised Ensemble Machine Learning Aided Performance Evaluation of Sentiment Classification

, , , , and

Published under licence by IOP Publishing Ltd
, , Citation Sheikh Shah Mohammad Motiur Rahman et al 2018 J. Phys.: Conf. Ser. 1060 012036 DOI 10.1088/1742-6596/1060/1/012036

1742-6596/1060/1/012036

Abstract

Text vectorization, features extraction and machine learning algorithms play a vital role to the field of sentiment classification. Accuracy of sentiment classification varies depending on various machine learning approaches, vectorization models and features extraction methods. This paper represents multiple ways of evaluations with the necessary steps needed to achieve highest accuracy for classifying the sentiment of reviews. We apply two n-gram vectorization models - Unigram and Bigram individually. Later on, we also apply features extraction method TF-IDF with Unigram and Bigram respectively. Five ensemble machine learning algorithms namely Random Forest (RF), Extra Tree (ET), Bagging Classifier (BC), Ada Boost (ADA) and Gradient Boost (GB) are used here. The key findings in this study is to determine which combination of vectorization models (Bigram, Unigram) along with feature extraction method (TF-IDF) and ensemble classifier gives the better performance of sentiment classification.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/1060/1/012036