Predictive Analysis of Online Course Completion: Key Insights and Practical Implications

  • Riska Riska Politeknik Negeri Ujung Pandang
  • Rahmat Fuadi Syam Universitas Pancasakti Makassar

Keywords: Online Education, Course Completion, Student Engagement, Decision Tree, Predictive Modelling

Abstract

The rapid expansion of online education has brought significant attention to understanding factors that influence student engagement and course completion. This study aims to predict online course engagement using a dataset from Kaggle, encompassing user demographics, course-specific data, and engagement metrics. Employing a Decision Tree model with 5-fold cross-validation, the research identifies key predictors of course completion, including time spent on the course, the number of videos watched, and quiz scores. The model demonstrates robust performance with accuracy, precision, recall, and F1-scores consistently above 92%, indicating its effectiveness in predicting student outcomes. This predictive capability allows educators and online course providers to identify at-risk students early and implement timely interventions to enhance engagement and completion rates. The study's contributions lie in pinpointing critical engagement metrics and validating the use of Decision Trees in educational data mining. The findings align with existing educational theories that emphasize the importance of active engagement for academic success. Practical implications suggest that online platforms should focus on strategies to increase interaction with course content and provide timely feedback. Future research should explore additional datasets and machine learning models to further refine predictive accuracy and broaden the understanding of factors influencing online learning success. This research provides a foundation for developing more effective online education strategies, ultimately aiming to improve student retention and outcomes

Downloads

Download data is not yet available.

References

U. Zaky, A. Naswin, S. Sumiyatun, and ..., “Performance Analysis of the Decision Tree Classification Algorithm on the Water Quality and Potability Dataset,” Indones. J. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/113.

D. Widyawati, A. Faradibah, and ..., “Comparison Analysis of Classification Model Performance in Lung Cancer Prediction Using Decision Tree, Naive Bayes, and Support Vector Machine,” Indones. J. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/76.

S. Hidayat, H. M. T. Ramadhan, and ..., “Comparison of K-Nearest Neighbor and Decision Tree Methods using Principal Component Analysis Technique in Heart Disease Classification,” Indones. J. …, 2023, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/70.

H. Azis, L. Syafie, F. Fattah, and ..., “Unveiling Algorithm Classification Excellence: Exploring Calendula and Coreopsis Flower Datasets with Varied Segmentation Techniques,” 2024 18th Int. …, 2024, [Online]. Available: https://ieeexplore.ieee.org/abstract/document/10418246/.

H. Azis and S. R. Jabir, “Chemical Composition and Aroma Profiling: Decision Tree Modeling of Formalin Tofu,” J. Embed. Syst. Secur. …, 2023, [Online]. Available: https://journal.unm.ac.id/index.php/JESSI/article/view/1162.

A. D. Purwanto, “Decision Tree and Random Forest Classification Algorithms for Mangrove Forest Mapping in Sembilang National Park, Indonesia,” Remote Sens., vol. 15, no. 1, 2023, doi: 10.3390/rs15010016.

C. R. Dhivyaa, “Skin lesion classification using decision trees and random forest algorithms,” J. Ambient Intell. Humaniz. Comput., 2020, doi: 10.1007/s12652-020-02675-8.

D. Jalal, “Decision Tree and Support Vector Machine for Anomaly Detection in Water Distribution Networks,” 2020 International Wireless Communications and Mobile Computing, IWCMC 2020. pp. 1320–1323, 2020, doi: 10.1109/IWCMC48107.2020.9148431.

Y. Mao, “Disease Classification Based on Eye Movement Features With Decision Tree and Random Forest,” Front. Neurosci., vol. 14, 2020, doi: 10.3389/fnins.2020.00798.

F. Manzella, “The voice of COVID-19: Breath and cough recording classification with temporal decision trees and random forests,” Artif. Intell. Med., vol. 137, 2023, doi: 10.1016/j.artmed.2022.102486.

C. S. Yu, “Predicting metabolic syndrome with machine learning models using a decision tree algorithm: Retrospective cohort study,” JMIR Med. Informatics, vol. 8, no. 3, 2020, doi: 10.2196/17110.

O. J. Alajas, “Prediction of Grape Leaf Black Rot Damaged Surface Percentage Using Hybrid Linear Discriminant Analysis and Decision Tree,” 2021 International Conference on Intelligent Technologies, CONIT 2021. 2021, doi: 10.1109/CONIT51480.2021.9498518.

I. P. A. Pratama, E. S. J. Atmadji, and ..., “Evaluating the Performance of Voting Classifier in Multiclass Classification of Dry Bean Varieties,” Indones. J. …, 2024, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/124.

R. F. Syam, “Performance Comparison Analysis of Classifiers on Binary Classification Dataset,” Indones. J. Data Sci., 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/77.

A. Faradibah, D. Widyawati, A. U. T. Syahar, and ..., “Comparison Analysis of Random Forest Classifier, Support Vector Machine, and Artificial Neural Network Performance in Multiclass Brain Tumor Classification,” Indones. J. …, 2023, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/73.

A. Naswin and A. P. Wibowo, “Performance Analysis of the Decision Tree Classification Algorithm on the Pneumonia Dataset,” … Artif. Intell. Med. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijaimi/article/view/83.

Y. Boer, “Classification of Heart Disease: Comparative Analysis using KNN, Random Forest, Gaussian Naive Bayes, XGBoost, SVM, Decision Tree, and Logistic Regression,” 2023 5th International Conference on Cybernetics and Intelligent Systems, ICORIS 2023. 2023, doi: 10.1109/ICORIS60118.2023.10352195.

H. Tabrizchi, “Breast cancer diagnosis using a multi-verse optimizer-based gradient boosting decision tree,” SN Appl. Sci., vol. 2, no. 4, 2020, doi: 10.1007/s42452-020-2575-9.

S. H. Asman, “Decision tree method for fault causes classification based on rms-dwt analysis in 275 kv transmission lines network,” Appl. Sci., vol. 11, no. 9, 2021, doi: 10.3390/app11094031.

T. E. Tarigan, E. Susanti, M. I. Siami, I. Arfiani, and ..., “Performance Metrics of AdaBoost and Random Forest in Multi-Class Eye Disease Identification: An Imbalanced Dataset Approach,” … Artif. Intell. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijaimi/article/view/98.

A. Sinra and H. Angriani, “Automated Classification of COVID-19 Chest X-ray Images Using Ensemble Machine Learning Methods,” Indones. J. Data Sci., 2024, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/127.

S. Rahmah, H. Azis, D. Widyawati, and A. U. Tenripada, “Prediksi potensi donatur menggunakan model Logistic Regression,” Indones. J. Data Sci., vol. 4, no. 1, pp. 31–37, 2023.

R. Setiawan, H. Zein, R. A. Azdy, and ..., “Rice Leaf Disease Classification with Machine Learning: An Approach Using Nu-SVM,” Indones. J. …, 2023, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/114.

Published
2024-07-31
How to Cite
Riska, R., & Syam, R. F. (2024). Predictive Analysis of Online Course Completion: Key Insights and Practical Implications. Indonesian Journal of Data and Science, 5(2), 144-154. https://doi.org/10.56705/ijodas.v5i2.168