Mechanical and Vehicle Engineering

LSI with Support Vector Machine for Text Categorization – a practical example with Python

Published:
November 19, 2021
Authors
View
Keywords
License

Copyright (c) 2021 Joao Henrique Gomes Da Costa Cavalcanti, József Menyhárt

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

How To Cite
Selected Style: APA
Gomes Da Costa Cavalcanti, J. H., & Menyhárt, J. (2021). LSI with Support Vector Machine for Text Categorization – a practical example with Python . International Journal of Engineering and Management Sciences, 6(3), 18-29. https://doi.org/10.21791/IJEMS.2021.3.2.
Abstract

Artificial intelligence is becoming a powerful tool of modernity science, there is even a science consensus about how our society is turning to a data-driven society. Machine learning is a branch of Artificial intelligence that has the ability to learn from data and understand its behavers. Python programming language aiming the challenges of this new era is becoming one of the most popular languages for general programming and scientific computing. Keeping all this new era circumstances in mind, this article has as a goal to show one example of how to use one supervised machine learning method, Support Vector Machine, and to predict movie’s genre according to its description using the programming language of the moment, python. Firstly, Omdb official API was used to gather data about movies, then tuned Support Vector Machine model for Latent semantic indexing capable of predicting movies genres according to its plot was coded. The performance of the model occurred to be satisfactory considering the small dataset used and the occurrence of movies with hybrid genres. Testing the model with larger dataset and using multi-label classification models were purposed to improve the model.

Database Logos