Ren Jiadong, Zhang Yafei, Zhang Bing, Li Shangyang. Classification Method of Industrial Internet Intrusion Detection Based on Feature Selection[J]. Journal of Computer Research and Development, 2022, 59(5): 1148-1159. DOI: 10.7544/issn1000-1239.20211152
Citation:
Ren Jiadong, Zhang Yafei, Zhang Bing, Li Shangyang. Classification Method of Industrial Internet Intrusion Detection Based on Feature Selection[J]. Journal of Computer Research and Development, 2022, 59(5): 1148-1159. DOI: 10.7544/issn1000-1239.20211152
Ren Jiadong, Zhang Yafei, Zhang Bing, Li Shangyang. Classification Method of Industrial Internet Intrusion Detection Based on Feature Selection[J]. Journal of Computer Research and Development, 2022, 59(5): 1148-1159. DOI: 10.7544/issn1000-1239.20211152
Citation:
Ren Jiadong, Zhang Yafei, Zhang Bing, Li Shangyang. Classification Method of Industrial Internet Intrusion Detection Based on Feature Selection[J]. Journal of Computer Research and Development, 2022, 59(5): 1148-1159. DOI: 10.7544/issn1000-1239.20211152
(School of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei 066004) (Key Laboratory of Software Engineering of Hebei Province(Yanshan University), Qinhuangdao, Hebei 066004)
Funds: This work was supported by the Science and Technology Project of Hebei Education Department (BJK2022029).
Due to the diversity and differences of industrial Internet access equipment, it is difficult to maintain and vulnerable to attacks. For this security problem, it is necessary to introduce relevant defense systems to identify various intrusion attacks. The traditional intrusion detection system can detect fewer types of attacks, and the network traffic data has poor classification performance due to the redundancy of irrelevant features. Therefore, we propose a classification method for industrial Internet intrusion detection based on feature selection. At first, this method preprocesses the dataset, and determines the strength of the feature by calculating the Pearson correlation coefficient of the feature, and determines the optimal threshold for feature extraction; then, from the perspective of machine learning and deep learning, logistic regression is used. Eight models including logistic regression, support vector machine, K-nearest neighbor, decision tree, random forest, multi-layer perceptron, convolutional neural network, and spatial-temporal network are respectively subjected to binary and multi-classification experiments and evaluated. The experimental results show that the binary classification effect of random forest is the best, and the multi-classification effect of decision tree is the best. Finally, the effectiveness of this method is verified in the real industrial Internet practice.