Skip to main content

Advertisement

Log in

MOOC performance prediction and personal performance improvement via Bayesian network

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

In order to analyze the non-linear and uncertain relationships among the student-related features, curriculum-related features as well as the environment-related features, and then quantify the corresponding impacts on students’ final MOOC performance in a valid way, we first construct a Students’ performance Prediction Bayesian Network (SPBN) via the hill-climbing and maximum likelihood estimation (MLE) method. With SPBN, we can predict the students’ MOOC performance and then quantify the uncertain dependencies of all the relevant features. Furthermore, with the prediction results of SPBN, we further apply the genetic algorithm (GA) to offer the personal performance improvement suggestions for those who are about to fail the courses according to different students’ background and their current engagement behaviors, so as to avoid the MOOC exam failures in advance. The experiments conducted on the Open University Learning Analytics Dataset (OULAD) have shown that the SPBN can predict students’ performance in MOOC accurately, and the GA-based method can offer the reasonable performance improvement suggestions effectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. The OULAD dataset. https://analyse.kmi.open.ac.uk/open_dataset. 2021/8/10.

References

  • Albelbisi, N. A., Al-Adwan, A. S., & Habibi, A. (2021). Self-regulated learning and satisfaction: A key determinants of MOOC success. Education and Information Technologies, 26(3), 3459–3481. https://doi.org/10.1007/s10639-020-10404-z.

    Article  Google Scholar 

  • Borrella, I., Caballero-Caballero, S., & Ponce-Cueto, E. (2019). Predict and Intervene: Addressing the Dropout Problem in a MOOC-based Program. ACM Conference on Learing @ Scale, 24, 1–9.

    Google Scholar 

  • Bote-Lorenzo, M. L., & Gómez-Sánchez, E. (2017). Predicting the decrease of engagement indicators in a MOOC. International Learning Analytics & Knowledge Conference, 143–7.

  • Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324.

    Article  MATH  Google Scholar 

  • Burd, H., Bell, A., Hemberg, E., & O’Reilly, U.-M. (2020). Analyzing pre-existing knowledge and performance in a programming MOOC. ACM Conference on Learning @ Scale, 281–4.

  • Chen, T., & Guestrin, C. (2016). XGBoost: A Scalable tree boosting system. International Conference on Knowledge Discovery and Data Mining (KDD), 785–94.

  • Fang, J., Tang, L., Yang, J., & Peng, M. (2019). Social interaction in MOOCs: The mediating effects of immersive experience and psychological needs satisfaction. Telematics Informatics, 39, 75–91. https://doi.org/10.1016/j.tele.2019.01.006.

    Article  Google Scholar 

  • Fei, M., & Yeung, D.-Y. (2015). Temporal Models for Predicting Student Dropout in Massive Open Online Courses. IEEE International Conference on Data Mining Workshop, 256–63.

  • Guo, P. J., Kim, J., & Rubin, R. (2014). How video production affects student engagement: an empirical study of MOOC videos. ACM conference on Learnin @ Scale, 41–50.

  • Hao, J., Yue, K., Duan, L., Zhang, B., & Xiaodong, Fu. (2021). Predicting QoS of virtual machines via Bayesian network with XGboost-induced classes. Cluster Computing, 24(2), 1165–1184. https://doi.org/10.1007/s10586-020-03183-2.

    Article  Google Scholar 

  • He, J., Bailey, J., Rubinstein, B. I. P., & Zhang, R. (2015). Identifying at-risk students in massive open online courses. AAAI Conference on Artificial Intelligence, 1749–55.

  • Jeon, B., Park, N., & Bang, S. (2020). Dropout prediction over weeks in MOOCs via interpretable multi-layer representation learning. CoRR abs/2002.01598.

  • Jin, C. (2021). Dropout prediction model in MOOC based on clickstream data and student sample weight. Soft Computing, 25(14), 8971–8988. https://doi.org/10.1007/s00500-021-05795-1.

    Article  Google Scholar 

  • Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T.-Y. (2017). LightGBM: A highly efficient gradient boosting decision tree. Neural Information Processing Systems (NIPS), 3146–54.

  • Kizilcec, R. F., & Halawa, S. A. (2015). Attrition and achievement gaps in online learning. ACM Conference on Learning @ Scale, 57–66.

  • Kotsiantis, S. B., Pierrakeas, C. J., & Pintelas, P. E. (2003). Preventing student dropout in distance learning using machine learning techniques. International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, 267–74.

  • Kuzilek, J., Hlosta, M., & Zdrahal, Z. (2017). Open university learning analytics dataset. Scientific Data, 4(170171)

  • Lee, Y., & Choi, J. (2011). A review of online course dropout research: Implications for practice and future research. Educational Technology Research and Development, 59(5), 593–618.

    Article  Google Scholar 

  • Maghawry, A., Hodhod, R., Omar, Y. M. K., & Kholief, M. (2021). An approach for optimizing multi-objective problems using hybrid genetic algorithms. Soft Computing, 25(1), 389–405. https://doi.org/10.1007/s00500-020-05149-3.

    Article  Google Scholar 

  • Moore, R. L., Yen, C.-J., & Eamonn Powers, F. (2021). Exploring the relationship between clout and cognitive processing in MOOC discussion forums. British Journal of Educational Technology, 52(1), 482–497. https://doi.org/10.1111/bjet.13033.

    Article  Google Scholar 

  • Moreno-Marcos, P. M., Alario-Hoyos, C., Muñoz-Merino, P. J., & Kloos, C. D. (2019). Prediction in MOOCs: A review and future research directions. IEEE Transactions on Learning Technologies, 12(3), 384–401. https://doi.org/10.1109/TLT.2018.2856808.

    Article  Google Scholar 

  • Mubarak, A. A., Cao, H., Zhang, W., & Zhang, W. (2021). Visual analytics of video-clickstream data and prediction of learners’ performance using deep learning models in MOOCs’ courses. Computer Applications in Engineering Education, 29(4), 710–732. https://doi.org/10.1002/cae.22328.

    Article  Google Scholar 

  • NeCamp, T., Gardner, J., & Brooks, C. (2019). Beyond A/B testing: Sequential randomization for developing interventions in scaled digital learning environments. International Conference on Learning Analytics & Knowledge, 539–48.

  • Onan, A., & Toçoglu, M. A. (2021). Weighted word embeddings and clustering-based identification of question topics in MOOC discussion forum posts. Computer Applications in Engineering Education, 29(4), 675–689. https://doi.org/10.1002/cae.22252.

    Article  Google Scholar 

  • Pozón-López, I., Higueras-Castillo, E., Muñoz-Leiva, F., & Liébana-Cabanillas, F. J. (2021). Perceived user satisfaction and intention to use massive open online courses (MOOCs). Journal of Computing in Higher Education, 33(1), 85–120. https://doi.org/10.1007/s12528-020-09257-9.

    Article  Google Scholar 

  • Prince, M. (2004). Does active learning work? A review of the research. Journal of Engineering Education, 93(3), 223–231. https://doi.org/10.1002/j.2168-9830.2004.tb00809.x.

    Article  Google Scholar 

  • Reparaz, C., Aznárez-Sanado, M., & Mendoza, G. (2020). Self-regulation of learning and MOOC retention. Computers in Human Behavior, 111, 106423. https://doi.org/10.1016/j.chb.2020.106423.

  • Sands, P., & Yadav, A. (2020). Self-regulation for high school learners in a MOOC computer science course. ACM Technical Symposium on Computer Science Education, 845–51.

  • Schudde, L. (2019). Short- and long-term impacts of engagement experiences with faculty and peers at community colleges. Higher Education, 42(2), 385–426. https://doi.org/10.1353/rhe.2019.0001.

    Article  Google Scholar 

  • Tanner, T., & Toivonen, H. (2010). Predicting and preventing student failure – using the k-nearest neighbour method to predict student performance in an online course environment. International Journal of Learning Technology, 5(4), 356–377.

    Article  Google Scholar 

  • Tomasevic, N., Gvozdenovic, N., & Vranes, S. (2020). An overview and comparison of supervised data mining techniques for student exam performance prediction. Computers & Education, 143,. https://doi.org/10.1016/j.compedu.2019.103676.

  • Wu, B. (2021). Influence of MOOC learners discussion forum social interactions on online reviews of MOOC. Education and Information Technologies, 26(3), 3483–3496. https://doi.org/10.1007/s10639-020-10412-z.

    Article  Google Scholar 

  • Yue, K., Fang, Q., Wang, X., Li, J., & Liu, W. (2015). A parallel and incremental approach for data-intensive learning of Bayesian networks. IEEE Transaction on Cybernetics, 45(12), 2890–2904. https://doi.org/10.1109/TCYB.2015.2388791.

    Article  Google Scholar 

  • Zhang, J., Hao, B., Chen, B., Li, C., Chen, H., & Sun, J. (2019). Hierarchical reinforcement learning for course recommendation in MOOCs. AAAI Conference on Artificial Intelligence, 435–42.

Download references

Acknowledgements

This paper was supported by National Nature Science Foundation of China (No. 61862068 and No.62166050), Program for innovative research team (in Science and Technology) in University of Yunnan Province, and Yunnan Expert Workstation of Xiaochun Cao.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianhou Gan.

Ethics declarations

Conflict of interest

All of the authors declare no conflicts of interest regarding to publish this paper.

We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hao, J., Gan, J. & Zhu, L. MOOC performance prediction and personal performance improvement via Bayesian network. Educ Inf Technol 27, 7303–7326 (2022). https://doi.org/10.1007/s10639-022-10926-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-022-10926-8

Keywords

Navigation