Predictive Analysis of Online Course Completion: Key Insights and Practical Implications
Abstract
The rapid expansion of online education has brought significant attention to understanding factors that influence student engagement and course completion. This study aims to predict online course engagement using a dataset from Kaggle, encompassing user demographics, course-specific data, and engagement metrics. Employing a Decision Tree model with 5-fold cross-validation, the research identifies key predictors of course completion, including time spent on the course, the number of videos watched, and quiz scores. The model demonstrates robust performance with accuracy, precision, recall, and F1-scores consistently above 92%, indicating its effectiveness in predicting student outcomes. This predictive capability allows educators and online course providers to identify at-risk students early and implement timely interventions to enhance engagement and completion rates. The study's contributions lie in pinpointing critical engagement metrics and validating the use of Decision Trees in educational data mining. The findings align with existing educational theories that emphasize the importance of active engagement for academic success. Practical implications suggest that online platforms should focus on strategies to increase interaction with course content and provide timely feedback. Future research should explore additional datasets and machine learning models to further refine predictive accuracy and broaden the understanding of factors influencing online learning success. This research provides a foundation for developing more effective online education strategies, ultimately aiming to improve student retention and outcomes
Downloads
References
U. Zaky, A. Naswin, S. Sumiyatun, and ..., “Performance Analysis of the Decision Tree Classification Algorithm on the Water Quality and Potability Dataset,” Indones. J. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/113.
D. Widyawati, A. Faradibah, and ..., “Comparison Analysis of Classification Model Performance in Lung Cancer Prediction Using Decision Tree, Naive Bayes, and Support Vector Machine,” Indones. J. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/76.
S. Hidayat, H. M. T. Ramadhan, and ..., “Comparison of K-Nearest Neighbor and Decision Tree Methods using Principal Component Analysis Technique in Heart Disease Classification,” Indones. J. …, 2023, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/70.
H. Azis, L. Syafie, F. Fattah, and ..., “Unveiling Algorithm Classification Excellence: Exploring Calendula and Coreopsis Flower Datasets with Varied Segmentation Techniques,” 2024 18th Int. …, 2024, [Online]. Available: https://ieeexplore.ieee.org/abstract/document/10418246/.
H. Azis and S. R. Jabir, “Chemical Composition and Aroma Profiling: Decision Tree Modeling of Formalin Tofu,” J. Embed. Syst. Secur. …, 2023, [Online]. Available: https://journal.unm.ac.id/index.php/JESSI/article/view/1162.
A. D. Purwanto, “Decision Tree and Random Forest Classification Algorithms for Mangrove Forest Mapping in Sembilang National Park, Indonesia,” Remote Sens., vol. 15, no. 1, 2023, doi: 10.3390/rs15010016.
C. R. Dhivyaa, “Skin lesion classification using decision trees and random forest algorithms,” J. Ambient Intell. Humaniz. Comput., 2020, doi: 10.1007/s12652-020-02675-8.
D. Jalal, “Decision Tree and Support Vector Machine for Anomaly Detection in Water Distribution Networks,” 2020 International Wireless Communications and Mobile Computing, IWCMC 2020. pp. 1320–1323, 2020, doi: 10.1109/IWCMC48107.2020.9148431.
Y. Mao, “Disease Classification Based on Eye Movement Features With Decision Tree and Random Forest,” Front. Neurosci., vol. 14, 2020, doi: 10.3389/fnins.2020.00798.
F. Manzella, “The voice of COVID-19: Breath and cough recording classification with temporal decision trees and random forests,” Artif. Intell. Med., vol. 137, 2023, doi: 10.1016/j.artmed.2022.102486.
C. S. Yu, “Predicting metabolic syndrome with machine learning models using a decision tree algorithm: Retrospective cohort study,” JMIR Med. Informatics, vol. 8, no. 3, 2020, doi: 10.2196/17110.
O. J. Alajas, “Prediction of Grape Leaf Black Rot Damaged Surface Percentage Using Hybrid Linear Discriminant Analysis and Decision Tree,” 2021 International Conference on Intelligent Technologies, CONIT 2021. 2021, doi: 10.1109/CONIT51480.2021.9498518.
I. P. A. Pratama, E. S. J. Atmadji, and ..., “Evaluating the Performance of Voting Classifier in Multiclass Classification of Dry Bean Varieties,” Indones. J. …, 2024, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/124.
R. F. Syam, “Performance Comparison Analysis of Classifiers on Binary Classification Dataset,” Indones. J. Data Sci., 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/77.
A. Faradibah, D. Widyawati, A. U. T. Syahar, and ..., “Comparison Analysis of Random Forest Classifier, Support Vector Machine, and Artificial Neural Network Performance in Multiclass Brain Tumor Classification,” Indones. J. …, 2023, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/73.
A. Naswin and A. P. Wibowo, “Performance Analysis of the Decision Tree Classification Algorithm on the Pneumonia Dataset,” … Artif. Intell. Med. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijaimi/article/view/83.
Y. Boer, “Classification of Heart Disease: Comparative Analysis using KNN, Random Forest, Gaussian Naive Bayes, XGBoost, SVM, Decision Tree, and Logistic Regression,” 2023 5th International Conference on Cybernetics and Intelligent Systems, ICORIS 2023. 2023, doi: 10.1109/ICORIS60118.2023.10352195.
H. Tabrizchi, “Breast cancer diagnosis using a multi-verse optimizer-based gradient boosting decision tree,” SN Appl. Sci., vol. 2, no. 4, 2020, doi: 10.1007/s42452-020-2575-9.
S. H. Asman, “Decision tree method for fault causes classification based on rms-dwt analysis in 275 kv transmission lines network,” Appl. Sci., vol. 11, no. 9, 2021, doi: 10.3390/app11094031.
T. E. Tarigan, E. Susanti, M. I. Siami, I. Arfiani, and ..., “Performance Metrics of AdaBoost and Random Forest in Multi-Class Eye Disease Identification: An Imbalanced Dataset Approach,” … Artif. Intell. …, 2023, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijaimi/article/view/98.
A. Sinra and H. Angriani, “Automated Classification of COVID-19 Chest X-ray Images Using Ensemble Machine Learning Methods,” Indones. J. Data Sci., 2024, [Online]. Available: https://jurnal.yoctobrain.org/index.php/ijodas/article/view/127.
S. Rahmah, H. Azis, D. Widyawati, and A. U. Tenripada, “Prediksi potensi donatur menggunakan model Logistic Regression,” Indones. J. Data Sci., vol. 4, no. 1, pp. 31–37, 2023.
R. Setiawan, H. Zein, R. A. Azdy, and ..., “Rice Leaf Disease Classification with Machine Learning: An Approach Using Nu-SVM,” Indones. J. …, 2023, [Online]. Available: https://www.jurnal.yoctobrain.org/index.php/ijodas/article/view/114.
Copyright (c) 2024 Indonesian Journal of Data and Science
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
License and Copyright Agreement
In submitting the manuscript to the journal, the authors certify that:
- They are authorized by their co-authors to enter into these arrangements.
- The work described has not been formally published before, except in the form of an abstract or as part of a published lecture, review, thesis, or overlay journal.
- The work is not under consideration for publication elsewhere.
- The work has been approved by all the author(s) and by the responsible authorities – tacitly or explicitly – of the institutes where the work has been carried out.
- They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
- They agree to the following license and copyright agreement.
Copyright
Authors who publish with Indonesian Journal of Data and Science agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. (CC BY-NC 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.