Computational Analysis of Student Stress on Social Media using Support Vector Machine and Latent Dirichlet Allocation
DOI:
https://doi.org/10.35314/8jcvxk45
Keywords:
latent dirichlet allocation, sentiment analysis, social media, stress detection, support vector machine
Abstract
This study develops a two-stage machine-learning framework to identify academic stressors among Indonesian university students using Twitter data. A Support Vector Machine (SVM) classifier was trained on manually annotated tweets and benchmarked against Naïve Bayes, logistic regression, and random forest, achieving an accuracy of 0.91 and a macro F1-score of 0.914, outperforming all baselines. Tweets classified as stress-related with ≥75% confidence were subsequently analyzed using Latent Dirichlet Allocation (LDA), which generated six coherent stressor categories. The framework reveals both structural academic pressures and culturally specific patterns, including references to “dosen killer” and emerging mental-health vocabulary. Contributions include the first Indonesia-focused stressor map derived from large-scale social media discourse and the integration of confidence filtering to enhance topic quality. While results demonstrate the feasibility of social-media–based stress detection, limitations remain regarding temporal drift, annotation bias, and demographic representativeness. Future research should incorporate real-time streaming pipelines, multimodal annotation, and longitudinal evaluation to enhance robustness and early-warning potential.
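The hand-off between the two stages described in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes the classifier has already produced a stress-class probability for each tweet (in scikit-learn this could come from `SVC(probability=True).predict_proba`, and the retained texts could then be passed to `LatentDirichletAllocation(n_components=6)`). The `ScoredTweet` type, the `filter_confident` helper, and the example tweets and probabilities are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ScoredTweet:
    text: str
    p_stress: float  # assumed: classifier's estimated probability of the stress class


def filter_confident(scored: List[ScoredTweet], threshold: float = 0.75) -> List[str]:
    """Keep only tweets whose stress probability is at or above the threshold."""
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must be in [0, 1]")
    return [t.text for t in scored if t.p_stress >= threshold]


# Hypothetical scored tweets; texts and probabilities are made up for illustration.
scored = [
    ScoredTweet("skripsi belum selesai, pusing banget", 0.93),
    ScoredTweet("dosen killer kasih tugas lagi", 0.75),
    ScoredTweet("nonton film bareng teman kampus", 0.12),
    ScoredTweet("besok ujian, agak deg-degan", 0.61),
]

# Only the confident stress-related tweets move on to topic modeling.
topic_corpus = filter_confident(scored)
print(topic_corpus)
```

Filtering before topic modeling trades corpus size for purity: borderline tweets are dropped so that the topics are estimated from text the classifier is fairly sure is stress-related.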
License
Copyright (c) 2025 INOVTEK Polbeng - Seri Informatika

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
