ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2020, Vol. 57 ›› Issue (11): 2442-2455.doi: 10.7544/issn1000-1239.2020.20190649

Previous Articles     Next Articles

Search of Students with Similar Lifestyle Based on Campus Behavior Information Network

Wang Xin’ao1, Duan Lei1, Cui Dingshan1, Lu Li1, Dun Yijie2, Qin Ruiqi1   

  1. 1(School of Computer Science, Sichuan University, Chengdu 610065);2(School of Mathematics and Computer Science, Northwest Minzu University, Lanzhou 730030)
  • Online:2020-11-01
  • Supported by: 
    This work was supported by the National Natural Science Foundation of China (61972268, 61572332), the National Key Research and Development Program of China (2018YFB0704301), the Key Rsearch and Dvelopment Program of Sichuan Province (2019YFG0491), and Sichuan Higher Education Talent Training Quality and Teaching Reform Project (JG2018-92).

Abstract: It is important to keep track of both the psychological and academic status of students in campus. Generally, student data covers a wide range of kinds such as students’ interests, hobbies, and lifestyles, and these data can be collected via smart devices such as student e-cards by many campuses. With the rapid development of new generation of information technology, in recent years, researchers have explored novel ways to improve the quality of talent cultivation by utilizing the student data, such as applying big data analysis on the data to discover subtle but meaningful information as the guidance for better student management. Among such research, search of students with similar lifestyles can exert positive effect on the improvement of student management, as potential and insightful information can be found and may further provide some warnings for students at an early stage if anything unusual is found. Existing algorithms for searching students with similar lifestyle have two deficiencies. Firstly, they cannot explain the similarities between students because related semantic information is lost in the searching process. Secondly, they fail to integrate multiple data sources, while the student behavioral data is growing dynamically and only using one dataset may lead to biased results. To break these limitations, we first propose the concept of campus behavior information network to represent student behaviors in campus. Next, based on the constructed campus behavior information network, an algorithm named SCALE is proposed for similar campus lifestyle mining. SCALE calculates the student similarity by specific meta-paths with constraints. SCALE is strong and unique, not only in keeping the similarity semantics of the original data, but also in extensively integrating multiple data sources in a scalable way while retaining the original results of calculation. Due to the large scale of datasets, parallel strategy is further designed and applied to SCALE for the sake of efficiency. Through extensive experiments on real campus behavior datasets, the effectiveness and execution efficiency of the SCALE are verified.

Key words: campus behavior information network, heterogeneous information network, student behavior analysis, meta path, similar student search

CLC Number: