Application of KNN Voting Classification and Naive Bayes for Classification of Type II Diabetes Mellitus

Authors

  • I Gusti Agung Made Suparta Yasa Agung Universitas Primakara Author
  • Eddy Muntina Dharma Universitas Primakara Author
  • Nengah Widya Utami Universitas Primakara Author

DOI:

https://doi.org/10.35314/tynbfz55

Keywords:

Type II Diabetes Mellitus, Voting Classifier, K-Nearest Neighbor, Naive Bayes, Classification

Abstract

Type II diabetes mellitus (Type II DM) is a public health burden that requires fast and accurate early detection, particularly in primary care settings. Single machine-learning classifiers such as K-Nearest Neighbor (KNN) and Naive Bayes (NB) are widely used but have limitations, including the computational cost of KNN and the strong feature-independence assumption of NB. This study applies an ensemble Voting Classifier (VC) that combines KNN and NB to classify Type II DM using clinical data from 2,390 patients at Mengwi 1 Health Center. Following the CRISP-DM process, we evaluate the models under 80:20 and 70:30 train–test splits using accuracy, precision, recall, F1-score, and ROC/AUC. Compared with the KNN baseline, soft voting consistently improves performance: on the 80:20 split, accuracy increases from 80.33% to 81.59% (+1.26 percentage points) and F1-score from 79.52% to 80.91% (+1.39%); on the 70:30 split, accuracy increases from 80.47% to 82.01% (+1.54%) and F1-score from 79.65% to 81.24% (+1.59%). The soft-voting ensemble also yields higher AUCs, reaching 0.8138 (80:20) and 0.8213 (70:30), and outperforms both single models and hard voting. The novelty of this work lies in demonstrating that a lightweight KNN–NB soft-voting ensemble, designed for the computational constraints of a primary health center and evaluated with repeated cross-validation, can provide small but consistent gains over single classifiers on real DM data. These findings indicate that such an ensemble is a promising building block for clinical decision support in resource-limited primary care, although further calibration, external validation, and prospective testing are still required.

Downloads

Download data is not yet available.

Downloads

Published

21-11-2025

Issue

Section

Articles

How to Cite

Application of KNN Voting Classification and Naive Bayes for Classification of Type II Diabetes Mellitus. (2025). Journal of Innovation and Technology Polbeng Series on Informatics (INOVTEK Polbeng - Seri Informatika), 10(3), 1867-1876. https://doi.org/10.35314/tynbfz55