Abstract
In order to analyze the non-linear and uncertain relationships among the student-related features, curriculum-related features as well as the environment-related features, and then quantify the corresponding impacts on students’ final MOOC performance in a valid way, we first construct a Students’ performance Prediction Bayesian Network (SPBN) via the hill-climbing and maximum likelihood estimation (MLE) method. With SPBN, we can predict the students’ MOOC performance and then quantify the uncertain dependencies of all the relevant features. Furthermore, with the prediction results of SPBN, we further apply the genetic algorithm (GA) to offer the personal performance improvement suggestions for those who are about to fail the courses according to different students’ background and their current engagement behaviors, so as to avoid the MOOC exam failures in advance. The experiments conducted on the Open University Learning Analytics Dataset (OULAD) have shown that the SPBN can predict students’ performance in MOOC accurately, and the GA-based method can offer the reasonable performance improvement suggestions effectively.
Similar content being viewed by others
Notes
The OULAD dataset. https://analyse.kmi.open.ac.uk/open_dataset. 2021/8/10.
References
Albelbisi, N. A., Al-Adwan, A. S., & Habibi, A. (2021). Self-regulated learning and satisfaction: A key determinants of MOOC success. Education and Information Technologies, 26(3), 3459–3481. https://doi.org/10.1007/s10639-020-10404-z.
Borrella, I., Caballero-Caballero, S., & Ponce-Cueto, E. (2019). Predict and Intervene: Addressing the Dropout Problem in a MOOC-based Program. ACM Conference on Learing @ Scale, 24, 1–9.
Bote-Lorenzo, M. L., & Gómez-Sánchez, E. (2017). Predicting the decrease of engagement indicators in a MOOC. International Learning Analytics & Knowledge Conference, 143–7.
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324.
Burd, H., Bell, A., Hemberg, E., & O’Reilly, U.-M. (2020). Analyzing pre-existing knowledge and performance in a programming MOOC. ACM Conference on Learning @ Scale, 281–4.
Chen, T., & Guestrin, C. (2016). XGBoost: A Scalable tree boosting system. International Conference on Knowledge Discovery and Data Mining (KDD), 785–94.
Fang, J., Tang, L., Yang, J., & Peng, M. (2019). Social interaction in MOOCs: The mediating effects of immersive experience and psychological needs satisfaction. Telematics Informatics, 39, 75–91. https://doi.org/10.1016/j.tele.2019.01.006.
Fei, M., & Yeung, D.-Y. (2015). Temporal Models for Predicting Student Dropout in Massive Open Online Courses. IEEE International Conference on Data Mining Workshop, 256–63.
Guo, P. J., Kim, J., & Rubin, R. (2014). How video production affects student engagement: an empirical study of MOOC videos. ACM conference on Learnin @ Scale, 41–50.
Hao, J., Yue, K., Duan, L., Zhang, B., & Xiaodong, Fu. (2021). Predicting QoS of virtual machines via Bayesian network with XGboost-induced classes. Cluster Computing, 24(2), 1165–1184. https://doi.org/10.1007/s10586-020-03183-2.
He, J., Bailey, J., Rubinstein, B. I. P., & Zhang, R. (2015). Identifying at-risk students in massive open online courses. AAAI Conference on Artificial Intelligence, 1749–55.
Jeon, B., Park, N., & Bang, S. (2020). Dropout prediction over weeks in MOOCs via interpretable multi-layer representation learning. CoRR abs/2002.01598.
Jin, C. (2021). Dropout prediction model in MOOC based on clickstream data and student sample weight. Soft Computing, 25(14), 8971–8988. https://doi.org/10.1007/s00500-021-05795-1.
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T.-Y. (2017). LightGBM: A highly efficient gradient boosting decision tree. Neural Information Processing Systems (NIPS), 3146–54.
Kizilcec, R. F., & Halawa, S. A. (2015). Attrition and achievement gaps in online learning. ACM Conference on Learning @ Scale, 57–66.
Kotsiantis, S. B., Pierrakeas, C. J., & Pintelas, P. E. (2003). Preventing student dropout in distance learning using machine learning techniques. International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, 267–74.
Kuzilek, J., Hlosta, M., & Zdrahal, Z. (2017). Open university learning analytics dataset. Scientific Data, 4(170171)
Lee, Y., & Choi, J. (2011). A review of online course dropout research: Implications for practice and future research. Educational Technology Research and Development, 59(5), 593–618.
Maghawry, A., Hodhod, R., Omar, Y. M. K., & Kholief, M. (2021). An approach for optimizing multi-objective problems using hybrid genetic algorithms. Soft Computing, 25(1), 389–405. https://doi.org/10.1007/s00500-020-05149-3.
Moore, R. L., Yen, C.-J., & Eamonn Powers, F. (2021). Exploring the relationship between clout and cognitive processing in MOOC discussion forums. British Journal of Educational Technology, 52(1), 482–497. https://doi.org/10.1111/bjet.13033.
Moreno-Marcos, P. M., Alario-Hoyos, C., Muñoz-Merino, P. J., & Kloos, C. D. (2019). Prediction in MOOCs: A review and future research directions. IEEE Transactions on Learning Technologies, 12(3), 384–401. https://doi.org/10.1109/TLT.2018.2856808.
Mubarak, A. A., Cao, H., Zhang, W., & Zhang, W. (2021). Visual analytics of video-clickstream data and prediction of learners’ performance using deep learning models in MOOCs’ courses. Computer Applications in Engineering Education, 29(4), 710–732. https://doi.org/10.1002/cae.22328.
NeCamp, T., Gardner, J., & Brooks, C. (2019). Beyond A/B testing: Sequential randomization for developing interventions in scaled digital learning environments. International Conference on Learning Analytics & Knowledge, 539–48.
Onan, A., & Toçoglu, M. A. (2021). Weighted word embeddings and clustering-based identification of question topics in MOOC discussion forum posts. Computer Applications in Engineering Education, 29(4), 675–689. https://doi.org/10.1002/cae.22252.
Pozón-López, I., Higueras-Castillo, E., Muñoz-Leiva, F., & Liébana-Cabanillas, F. J. (2021). Perceived user satisfaction and intention to use massive open online courses (MOOCs). Journal of Computing in Higher Education, 33(1), 85–120. https://doi.org/10.1007/s12528-020-09257-9.
Prince, M. (2004). Does active learning work? A review of the research. Journal of Engineering Education, 93(3), 223–231. https://doi.org/10.1002/j.2168-9830.2004.tb00809.x.
Reparaz, C., Aznárez-Sanado, M., & Mendoza, G. (2020). Self-regulation of learning and MOOC retention. Computers in Human Behavior, 111, 106423. https://doi.org/10.1016/j.chb.2020.106423.
Sands, P., & Yadav, A. (2020). Self-regulation for high school learners in a MOOC computer science course. ACM Technical Symposium on Computer Science Education, 845–51.
Schudde, L. (2019). Short- and long-term impacts of engagement experiences with faculty and peers at community colleges. Higher Education, 42(2), 385–426. https://doi.org/10.1353/rhe.2019.0001.
Tanner, T., & Toivonen, H. (2010). Predicting and preventing student failure – using the k-nearest neighbour method to predict student performance in an online course environment. International Journal of Learning Technology, 5(4), 356–377.
Tomasevic, N., Gvozdenovic, N., & Vranes, S. (2020). An overview and comparison of supervised data mining techniques for student exam performance prediction. Computers & Education, 143,. https://doi.org/10.1016/j.compedu.2019.103676.
Wu, B. (2021). Influence of MOOC learners discussion forum social interactions on online reviews of MOOC. Education and Information Technologies, 26(3), 3483–3496. https://doi.org/10.1007/s10639-020-10412-z.
Yue, K., Fang, Q., Wang, X., Li, J., & Liu, W. (2015). A parallel and incremental approach for data-intensive learning of Bayesian networks. IEEE Transaction on Cybernetics, 45(12), 2890–2904. https://doi.org/10.1109/TCYB.2015.2388791.
Zhang, J., Hao, B., Chen, B., Li, C., Chen, H., & Sun, J. (2019). Hierarchical reinforcement learning for course recommendation in MOOCs. AAAI Conference on Artificial Intelligence, 435–42.
Acknowledgements
This paper was supported by National Nature Science Foundation of China (No. 61862068 and No.62166050), Program for innovative research team (in Science and Technology) in University of Yunnan Province, and Yunnan Expert Workstation of Xiaochun Cao.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All of the authors declare no conflicts of interest regarding to publish this paper.
We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hao, J., Gan, J. & Zhu, L. MOOC performance prediction and personal performance improvement via Bayesian network. Educ Inf Technol 27, 7303–7326 (2022). https://doi.org/10.1007/s10639-022-10926-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10639-022-10926-8