Abstract
With the rapid development of information technology, Smart Campus Card System (SCCS) has become an important part of digital campus construction. Although the characteristics of students' abnormal activities can be reflected in smart campus card records, studies now focus more on analyzing the relationship between normal student activities and smart campus card data. Therefore, we analyzed some features of students’ amounts of consumption on campus and their statistical characteristics, and established a dataset based on smart campus card records for the purpose of detecting students’ abnormal activities. However, extant anomaly detection methods are prone to the two issues listed below. First, the vast majority of existing unsupervised anomaly detection algorithms are trained by fitting a central piece of the training data while disregarding the anomalous data. Those approaches do not completely eliminate the impact of anomaly data. Secondly, those algorithms have poor time performance on large-scale datasets. This paper proposed a growing model-based one-class support vector machine (GMB-OCSVM) to solve the above problems. Our model outperforms other frontier models in terms of detection accuracy and execution efficiency after extensive testing. In particular, our method processes 61,000 more units of data per second than the OCSVM method in terms of execution efficiency and improves detection accuracy to 92.39%. It demonstrates that our method can effectively handle the challenges of abnormal data interference in the training process and inefficient execution, and can detect abnormal student behavior in the campus daily consumption dataset in an efficient and accurate manner, indicating that our method has some practical utility.
Similar content being viewed by others
Data Availability
The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.
References
Fan, S., Li, P., Liu, T., Chen, Y.: Population Behavior Analysis of Chinese University Students via Digital Campus Cards. In: 2015 IEEE International Conference on Data Mining Workshop (ICDMW), (2015). https://doi.org/10.1109/ICDMW.2015.45
Jiang, T., Cao, J., Su, D., Yang, X.: Analysis and Data Mining of Students' Consumption Behavior Based on a Campus Card System. In: 2017 International Conference on Smart City and Systems Engineering (ICSCSE), (2017). https://doi.org/10.1109/ICSCSE.2017.22
Liu, Y., Hu, M., Lu, X.: Social Frequency Analysis of University Students via Digital Campus Cards.In: 2016 8th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), (2016). https://doi.org/10.1109/IHMSC.2016.206
Lu, S., Zhao, J., Wang, H.: MD-MBPLS: A novel explanatory model in computational social science. Knowledge-Based Syst. (2021). https://doi.org/10.1016/j.knosys.2021.107023
Lim, H., Kim, S., Chung, K.M., Lee, K., Kim, T., Heo, J.: Is college students’ trajectory associated with academic performance? Comput. Edu. (2022). https://doi.org/10.1016/j.compedu.2021.104397
Asif, R., Merceron, A., Ali, S.A., Haider, N.G.: Analyzing undergraduate students’ performance using educational data mining. Comput. Educ. 113, 177–194 (2017). https://doi.org/10.1016/j.compedu.2017.05.007
Yao, H., Lian, D., Cao, Y., Wu, Y., Zhou, T.: Predicting academic performance for college students: A campus behavior perspective. ACM Trans. Intell. Syst. Technol (2019). https://doi.org/10.1145/3299087
Hsieh, K.-Y., Hsiao, R.C., Yang, Y.-H., Lee, K.-H., Yen, C.-F.: Relationship between self-identity confusion and internet addiction among college students: The mediating effects of psychological inflexibility and experiential avoidance. Int. J. Environ. Res. Public Health (2019). https://doi.org/10.3390/ijerph16173225
Ding, Y., Chen, X., Fu, Q., Zhong, S.: A depression recognition method for college students using deep integrated support vector algorithm. IEEE Access. 8, 75616–75629 (2020). https://doi.org/10.1109/access.2020.2987523
Akram, A., Fu, C., Li, Y., Javed, M.Y., Lin, R., Jiang, Y., Tang, Y.: Predicting students’ academic procrastination in blended learning course using homework submission data. IEEE Access. 7, 102487–102498 (2019). https://doi.org/10.1109/access.2019.2930867
Yang, Z., Su, Z., Liu, S., Liu, Z., Ke, W., Zhao, L.: Evolution features and behavior characters of friendship networks on campus life. Expert Syst. Appl (2020). https://doi.org/10.1016/j.eswa.2020.113519
Wang, Y., Wang, Q.-W., Tao, Y.-Y., Xie, W.-W.: Empirical study of consumption behavior of college students under the influence of internet-based financing services. Procedia Comput. Sci. 187, 152–157 (2021). https://doi.org/10.1016/j.procs.2021.04.046
Wu, F., Zheng, Q., Tian, F., Suo, Z., Zhou, Y., Chao, K.-M., Xu, M., Shah, N., Liu, J., Li, F.: Supporting poverty-stricken college students in smart campus. Future Gener. Comput. Syst. 111, 599–616 (2020). https://doi.org/10.1016/j.future.2019.09.017
Ma, Y., Zhang, X., Di, X., Ren, T., Yang, H., Cai, B.: Analysis and identification of students with financial difficulties: A behavioural feature perspective. Discrete Dyn. Nat. Soc (2020). https://doi.org/10.1155/2020/2071025
Govindasamy, K., Thambusamy, V.: A study on classification and clustering data mining algorithms based on students academic performance prediction. Int. J. Control Theory Appl. 10(23), 147–160 (2017)
Li, Y., Zhang, H., Liu, S.: Applying data mining techniques with data of campus card system. IOP Conf.: Ser Mater. Sci. Eng. 715, 12–21 (2020). https://doi.org/10.1088/1757-899X/715/1/012021
Ding, D., Li, J., Wang, H., Liang, Z.: Student behavior clustering method based on campus big data.In: 2017 13th International Conference on Computational Intelligence and Security (CIS), 500–503 (2017).
Campos, G.O., Zimek, A., Sander, J., Campello, R.J.G.B., Micenková, B., Schubert, E., Assent, I., Houle, M.E.: On the evaluation of unsupervised outlier detection: measures, datasets, and an empirical study. Data Min. Knowl. Disc. 30(4), 891–927 (2016). https://doi.org/10.1007/s10618-015-0444-8
Hardin, J., Rocke, D.M.: Outlier detection in the multiple cluster setting using the minimum covariance determinant estimator. Comput. Stat. Data Anal. 44(4), 625–638 (2004)
Angiulli, F., Pizzuti, C.: Fast outlier detection in high dimensional spaces. In: European conference on principles of data mining and knowledge discovery, 15–27 (2002).
Breunig, M. M., Kriegel, H.-P., Ng, R. T., Sander, J.: LOF: identifying density-based local outliers.In: Proceedings of the 2000 ACM SIGMOD international conference on Management of data, 93–104 (2000).
Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)
Liu, F. T., Ting, K. M., Zhou, Z.: Isolation Forest. In: 2008 Eighth IEEE International Conference on Data Mining, 413–422 (2008). https://doi.org/10.1109/ICDM.2008.17
Ruff, L., Kauffmann, J.R., Vandermeulen, R.A., Montavon, G., Samek, W., Kloft, M., Dietterich, T.G., Müller, K.-R.: A unifying review of deep and shallow anomaly detection. Proc. IEEE. (2021). https://doi.org/10.1109/JPROC.2021.3052449
Yang, P., Wang, D., Wei, Z., Du, X., Li, T.: An outlier detection approach based on improved self-organizing feature map clustering algorithm. IEEE Access. 7, 115914–115925 (2019). https://doi.org/10.1109/ACCESS.2019.2922004
Yi, Y., Shi, Y., Wang, W., Lei, G., Dai, J., Zheng, H.: Combining boundary detector and SND-SVM for fast learning. Int. J. Mach. Learn. Cybern. 12(3), 689–698 (2021). https://doi.org/10.1007/s13042-020-01196-2
Li, S., Lai, S., Jiang, Y., Wang, W., Yi, Y.: Graph regularized deep sparse representation for unsupervised anomaly detection. Comput. Intell. Neurosci (2021). https://doi.org/10.1155/2021/4026132
Salehi, M., Mirzaei, H., Hendrycks, D., Li, Y., Rohban, M. H., Sabokrou, M.: A Unified Survey on Anomaly, Novelty, Open-Set, and Out-of-Distribution Detection: Solutions and Future Challenges. arXiv preprint arXiv:2110.14051. (2021)
Fritzke, B.A.: A growing neural gas network learns topologies. Adv Neural Inf Process Syst 7, 625–632 (1995)
Ghesmoune, M., Lebbah, M., Azzag, H.: A new growing neural gas for clustering data streams. Neural Netw. 78, 36–50 (2016). https://doi.org/10.1016/j.neunet.2016.02.003
Sun, Q., Liu, H., Harada, T.: Online growing neural gas for anomaly detection in changing surveillance scenes. Pattern Recognit. 64, 187–201 (2017). https://doi.org/10.1016/j.patcog.2016.09.016
Wiwatcharakoses, C., Berrar, D.: SOINN+, a self-organizing incremental neural network for unsupervised learning from noisy data streams. Expert Syst. Appl. 143, 113069 (2020). https://doi.org/10.1016/j.eswa.2019.113069
Tax, D.M., Duin, R.P.: Support vector domain description. Pattern Recognit. Lett. 20(11–13), 1191–1199 (1999)
Liu, Q., Jin, Y., Heiderich, M., Rodemann, T., Yu, G.: An adaptive reference vector-guided evolutionary algorithm using growing neural gas for many-objective optimization of irregular problems. IEEE Trans. Cybern (2020). https://doi.org/10.1109/TCYB.2020.3020630
Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. Mach. Learn. Technol 2(1), 37–63 (2011). https://doi.org/10.48550/arXiv.2010.16061
Zhao, Y., Nasrullah, Z., Li, Z.: PyOD: A python toolbox for scalable outlier detection. J Mach Learn Res 20(96), 1–7 (2019)
Funding
This research was funded by the National Key R&D Program of China, grant number 2019YFC0605203, in part by Chongqing Basic Research and Frontier Exploration Project (cstc2020jcyj-msxmX0553), Scientific and Technological Research Program of Chongqing Municipal Education Commission (KJQN201904007).
Author information
Authors and Affiliations
Contributions
Conceptualization, XY, KP and PF; Data curation, XY, KP and LA; Funding acquisition, PF, PH and BW; Investigation, XY, LA and PH; Methodology, XY, KP and PH; Project administration, BW; Resources, KP, PF and PH; Software, XY and LA; Supervision, PF and PH; Visualization, XY and PH; Writing–original draft, XY; Writing–review and editing, KP, PF and BW. All authors have read and agreed to the published version of the manuscript.
Corresponding authors
Ethics declarations
Conflict of Interest
The authors declare no conflict of interest.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Yang, X., Huang, P., An, L. et al. A Growing Model-Based OCSVM for Abnormal Student Activity Detection from Daily Campus Consumption. New Gener. Comput. 40, 915–933 (2022). https://doi.org/10.1007/s00354-022-00193-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00354-022-00193-z