Detection and Classification of Bacterial Skin Infections Using K-Nearest Neighbors Algorithm

Authors

  • Hayatou Oumarou University of Maroua

DOI:

https://doi.org/10.56705/ijaimi.v1i2.149

Keywords:

Bacterial Skin Infections, Image Processing, K-Nearest Neighbors, Hu Moments, Machine Learning

Abstract

Bacterial skin infections, including cellulitis and impetigo, pose significant health challenges requiring timely and accurate diagnosis for effective treatment. This research aims to develop an automated classification system for these infections using image processing and machine learning techniques. The study utilizes the Sobel method for image segmentation and Hu Moments for feature extraction. The classification is performed using the K-Nearest Neighbors (K-NN) algorithm with . The dataset, sourced from Kaggle, consists of imbalanced images of the two infection types. After pre-processing and feature extraction, the dataset is scaled to zero mean and unit variance. The model's performance is evaluated using cross-validation, yielding mean accuracy, precision, recall, F1-score, and ROC-AUC values of 65.95%, 65.18%, 65.95%, 63.06%, and 64.13%, respectively. Visualizations, including scatter plots, boxplots, histograms, correlation heatmaps, PCA, t-SNE, and UMAP, provide insights into the feature distributions and separability of classes. The results indicate that the combination of Sobel segmentation, Hu Moments, and K-NN can effectively classify bacterial skin infections. The study's contributions include demonstrating the applicability of these techniques to dermatological diagnostics and highlighting the potential for improved diagnostic accuracy and efficiency. However, the study acknowledges limitations such as data imbalance and variability in performance, suggesting the need for further research using advanced models like convolutional neural networks (CNNs) and enhanced data pre-processing techniques. These findings underscore the importance of machine learning in developing practical tools for clinical use, ultimately improving patient outcomes through early and accurate diagnosis.

References

K. V Swamy, “Skin Disease Classification using Image Preprocessing and Machine Learning,” 2024 IEEE International Conference on Interdisciplinary Approaches in Technology and Management for Social Innovation, IATMSI 2024. 2024, doi: 10.1109/IATMSI60426.2024.10502445.

C. R. Dhivyaa, “Skin lesion classification using decision trees and random forest algorithms,” J. Ambient Intell. Humaniz. Comput., 2020, doi: 10.1007/s12652-020-02675-8.

B. P. Sari, “Classification System for Cervical Cell Images based on Hu Moment Invariants Methods and Support Vector Machine,” 2021 Int. Conf. Intell. Technol. CONIT 2021, 2021, doi: 10.1109/CONIT51480.2021.9498353.

A. Sharma, “Prediction of the Fracture Toughness of Silicafilled Epoxy Composites using K-Nearest Neighbor (KNN) Method,” 2020 International Conference on Computational Performance Evaluation, ComPE 2020. pp. 194–198, 2020, doi: 10.1109/ComPE49325.2020.9200093.

M. Khushi, “A Comparative Performance Analysis of Data Resampling Methods on Imbalance Medical Data,” IEEE Access, vol. 9, pp. 109960–109975, 2021, doi: 10.1109/ACCESS.2021.3102399.

R. Tian, “Sobel edge detection based on weighted nuclear norm minimization image denoising,” Electron., vol. 10, no. 6, pp. 1–15, 2021, doi: 10.3390/electronics10060655.

Y. Jusman, “Classification System of Malaria Disease with Hu Moment Invariant and Support Vector Machines,” Proc. - 2022 2nd Int. Conf. Electron. Electr. Eng. Intell. Syst. ICE3IS 2022, pp. 365–368, 2022, doi: 10.1109/ICE3IS56585.2022.10010304.

S. T. Ahmed, “Enhancement of student performance prediction using modified K-nearest neighbor,” Telkomnika (Telecommunication Comput. Electron. Control., vol. 18, no. 4, pp. 1777–1783, 2020, doi: 10.12928/TELKOMNIKA.V18I4.13849.

Y. Boer, “Classification of Heart Disease: Comparative Analysis using KNN, Random Forest, Gaussian Naive Bayes, XGBoost, SVM, Decision Tree, and Logistic Regression,” 2023 5th International Conference on Cybernetics and Intelligent Systems, ICORIS 2023. 2023, doi: 10.1109/ICORIS60118.2023.10352195.

W. Kong, “Sobel Edge Detection Algorithm with Adaptive Threshold based on Improved Genetic Algorithm for Image Processing,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 2, pp. 557–562, 2023, doi: 10.14569/IJACSA.2023.0140266.

D. R. D. Varma, “Performance Monitoring of Novel Iris Detection System using Sobel Algorithm in Comparison with Canny Algorithm by Minimizing the Mean Square Error,” Proc. 3rd Int. Conf. Intell. Eng. Manag. ICIEM 2022, pp. 509–512, 2022, doi: 10.1109/ICIEM54221.2022.9853127.

T. Wu, “Image Edge Detection Based on Sobel with Morphology,” IEEE Inf. Technol. Networking, Electron. Autom. Control Conf. ITNEC 2021, pp. 1216–1220, 2021, doi: 10.1109/ITNEC52019.2021.9586895.

J. N. Archana, “Enhancement of digital chest images using a modified Sobel edge detection algorithm,” Indones. J. Electr. Eng. Comput. Sci., vol. 24, no. 3, pp. 1718–1726, 2021, doi: 10.11591/ijeecs.v24.i3.pp1718-1726.

S. AbuRass, “Enhancing Convolutional Neural Network using Hu’s Moments,” Int. J. Adv. Comput. Sci. Appl., vol. 11, no. 12, pp. 130–137, 2020, doi: 10.14569/IJACSA.2020.0111216.

J. Sun, “Class-imbalanced dynamic financial distress prediction based on Adaboost-SVM ensemble combined with SMOTE and time weighting,” Inf. Fusion, vol. 54, pp. 128–144, 2020, doi: 10.1016/j.inffus.2019.07.006.

A. Çınar, “Classification of normal sinus rhythm, abnormal arrhythmia and congestive heart failure ECG signals using LSTM and hybrid CNN-SVM deep neural networks,” Comput. Methods Biomech. Biomed. Engin., vol. 24, no. 2, pp. 203–214, 2021, doi: 10.1080/10255842.2020.1821192.

Y. Jusman, “Classification System for Leukemia Cell Images based on Hu Moment Invariants and Support Vector Machines,” Proc. - 2021 11th IEEE Int. Conf. Control Syst. Comput. Eng. ICCSCE 2021, pp. 137–141, 2021, doi: 10.1109/ICCSCE52189.2021.9530974.

A. E. Minarno, “Classification of batik patterns using K-nearest neighbor and support vector machine,” Bull. Electr. Eng. Informatics, vol. 9, no. 3, pp. 1260–1267, 2020, doi: 10.11591/eei.v9i3.1971.

S. P. Thangavel, “K-nearest neighbour technique for the effective prediction of refrigeration parameter compatible for automobile,” Therm. Sci., vol. 24, no. 1, pp. 565–569, 2020, doi: 10.2298/tsci190623436p.

P. Sharma, “Performance analysis of deep learning CNN models for disease detection in plants using image segmentation,” Inf. Process. Agric., vol. 7, no. 4, pp. 566–574, 2020, doi: 10.1016/j.inpa.2019.11.001.

Downloads

Published

2023-11-30