Comparison of Classification Algorithm Performance for Diabetes Prediction Using Orange Data Mining
DOI: https://doi.org/10.56705/ijodas.v4i3.103
Keywords: Data mining, Diabetes, KNN, Naive Bayes, Random Forest, Classification
Abstract
Diabetes contributes to a relatively high mortality rate, and deaths attributable to it are a global concern. The primary goal of this research is to predict whether an individual has diabetes, using the publicly available Diabetes Disease dataset from the UCI Repository. To identify the best classification algorithm, three algorithms commonly used for diabetes prediction are compared: KNN, Naive Bayes, and Random Forest. The comparison indicates that Random Forest is the most accurate of the three for predicting individuals with diabetes, with an accuracy of 97%.
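The comparison described in the abstract can be illustrated with a minimal sketch. It makes several assumptions that are not in the original: it uses scikit-learn rather than the Orange Data Mining workflow named in the title, it runs on synthetic data from `make_classification` instead of the UCI diabetes dataset, and the helper name `compare_classifiers`, the 5-fold cross-validation, and all hyperparameters are illustrative choices. The scores it prints say nothing about the 97% figure reported by the authors.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def compare_classifiers(X, y, folds=5, seed=0):
    """Return the mean cross-validated accuracy of each classifier.

    Hypothetical helper: the model settings below are assumptions,
    not the configuration used in the paper.
    """
    models = {
        # KNN is distance-based, so features are standardized first.
        "KNN": make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5)),
        "Naive Bayes": GaussianNB(),
        "Random Forest": RandomForestClassifier(n_estimators=100, random_state=seed),
    }
    return {
        name: cross_val_score(model, X, y, cv=folds, scoring="accuracy").mean()
        for name, model in models.items()
    }


# Synthetic stand-in data (assumption): 500 samples, 8 numeric features,
# binary target. Replace with the real dataset to reproduce the study.
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=5, random_state=0
)

scores = compare_classifiers(X, y)

# List the classifiers from highest to lowest mean accuracy.
for name, acc in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {acc:.3f}")
```

Cross-validation is used here so that each algorithm is scored on data it was not trained on; a single train/test split would also work but gives a noisier estimate on small datasets.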
License
Authors retain copyright and full publishing rights to their articles. Upon acceptance, authors grant Indonesian Journal of Data and Science a non-exclusive license to publish the work and to identify itself as the original publisher.
Self-archiving. Authors may deposit the submitted version, accepted manuscript, and version of record in institutional or subject repositories, with citation to the published article and a link to the version of record on the journal website.
Commercial permissions. Uses intended for commercial advantage or monetary compensation are not permitted under CC BY-NC 4.0. For permissions, contact the editorial office at ijodas.journal@gmail.com.
Legacy notice. Some earlier PDFs may display “Copyright © [Journal Name]” or only a CC BY-NC logo without the full license text. To ensure clarity, the authors maintain copyright, and all articles are distributed under CC BY-NC 4.0. Where any discrepancy exists, this policy and the article landing-page license statement prevail.










