Comparison of SVM and Naive Bayes Algorithms in Sentiment Analysis of User Reviews on Bukalapak

Authors

  • M Yasir Alghifari Universitas Sriwijaya Author
  • M. Rudi Sanjaya Universitas Sriwijaya Author
  • Dwi Rosa Indah Universitas Sriwijaya Author
  • Endang Lestari Ruskan Author

DOI:

https://doi.org/10.35314/dqhpkb12

Keywords:

Sentiment Analysis, SVM, Naïve Bayes, E-Commerce, TF-IDF

Abstract

Indonesia’s rapid e-commerce growth has produced a vast volume of user reviews, yet their use for insight extraction remains limited—particularly for the Bukalapak platform. This study compares the performance of Naïve Bayes and Support Vector Machine for sentiment classification on 10,000 Bukalapak reviews. The workflow includes text preprocessing (cleaning, case folding, tokenization, stopword removal, and stemming) and feature extraction using Term Frequency–Inverse Document Frequency (TF-IDF; max_features = 10,000). Evaluation employs 10-fold cross-validation with accuracy, precision, recall, and F1-score, complemented by a paired t-test for significance. Results show SVM outperforming NB (accuracy 84.48% vs. 83.96%; F1 0.8253 vs. 0.8205) with better consistency (standard deviation ±1.08% vs. ±1.24%). The t-test confirms a significant difference (p = 0.019), with SVM’s advantage most evident for the negative class (precision 0.80 vs. 0.78). Both models underperform on the neutral class due to severe class imbalance. These findings provide empirical evidence for algorithm selection in Indonesian e-commerce sentiment analysis and open avenues for future research using deep learning and class-imbalance handling techniques.

Downloads

Download data is not yet available.

Downloads

Published

15-11-2025

Issue

Section

Articles

How to Cite

Comparison of SVM and Naive Bayes Algorithms in Sentiment Analysis of User Reviews on Bukalapak. (2025). INOVTEK Polbeng - Seri Informatika, 10(3), 1623-1633. https://doi.org/10.35314/dqhpkb12