Comparison of Naïve Bayes, Random Forest, and Logistic Regression Algorithms for Sentiment Analysis Online Gambling

Authors

  • Dwi Nanda Agustia Universitas Teknokrat Indonesia Author
  • Ryan Randy Suryono Universitas Teknokrat Indonesia Author

DOI:

https://doi.org/10.35314/prk93630

Keywords:

Sentiment Analysis, Online Gambling, Naïve Bayes, Random Forest, Logistic Regression, SMOTE

Abstract

This study aims to compare the performance of Naïve Bayes, Random Forest, and Logistic Regression algorithms for sentiment analysis on the topic of online gambling. The dataset consisted of 4592 entries after preprocessing and applying the SMOTE technique to address class imbalance. The evaluation results show that Random Forest achieved the best performance with an accuracy of 78%, followed by Naïve Bayes and Logistic Regression, both achieving 77%. Random Forest excelled in classifying positive and negative sentiments, while Naïve Bayes demonstrated a significant improvement in recall for neutral sentiment, increasing from 0.45 to 0.82 after the SMOTE application. Logistic Regression showed less optimal performance, particularly for neutral sentiment. This study provides essential guidance for selecting the best algorithms for sentiment analysis in specific domains such as online gambling and highlights the importance of SMOTE in handling imbalanced datasets. The findings of this study can be used by practitioners and policymakers to make more informed decisions in regulating online gambling.

Downloads

Download data is not yet available.

Downloads

Published

19-01-2025

Issue

Section

Articles

How to Cite

Comparison of Naïve Bayes, Random Forest, and Logistic Regression Algorithms for Sentiment Analysis Online Gambling. (2025). INOVTEK Polbeng - Seri Informatika, 10(1), 284-295. https://doi.org/10.35314/prk93630