Gu Yonghao, Huang Boqi, Wang Jigang, Tian Tian, Liu Yan, Wu Yuesheng. Trojan Traffic Detection Method Based on Semi-Supervised Deep Learning[J]. Journal of Computer Research and Development, 2022, 59(6): 1329-1342. DOI: 10.7544/issn1000-1239.20201014
Citation:
Gu Yonghao, Huang Boqi, Wang Jigang, Tian Tian, Liu Yan, Wu Yuesheng. Trojan Traffic Detection Method Based on Semi-Supervised Deep Learning[J]. Journal of Computer Research and Development, 2022, 59(6): 1329-1342. DOI: 10.7544/issn1000-1239.20201014
Gu Yonghao, Huang Boqi, Wang Jigang, Tian Tian, Liu Yan, Wu Yuesheng. Trojan Traffic Detection Method Based on Semi-Supervised Deep Learning[J]. Journal of Computer Research and Development, 2022, 59(6): 1329-1342. DOI: 10.7544/issn1000-1239.20201014
Citation:
Gu Yonghao, Huang Boqi, Wang Jigang, Tian Tian, Liu Yan, Wu Yuesheng. Trojan Traffic Detection Method Based on Semi-Supervised Deep Learning[J]. Journal of Computer Research and Development, 2022, 59(6): 1329-1342. DOI: 10.7544/issn1000-1239.20201014
1(School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876)
2(Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia (Beijing University of Posts and Telecommunications), Beijing 100876)
3(Guangdong Provincial Key Laboratory of Information Security Technology (Sun Yat-sen University), Guangzhou 510275)
Funds: This work was supported by the Fundamental Research Funds for the Central Universities (Beijing University of Posts and Telecommunications) for Action Plan (2021XD-A11-1), the ZTE Industry-Academia-Research Cooperation Funds (HC-CN-20200807013), the Opening Project of Guangdong Provincial Key Laboratory of Information Security Technology (2020B1212060078), and the National Natural Science Foundation of China (U1836108, U1936216).
The existing Trojan traffic detection technology has problems, such as the inaccuracy of manual feature extraction, the difficulty of obtaining a large number of labeled samples, the insufficient utilization of unlabeled samples, and the low detection rate of unknown samples. A semi-supervised deep learning method is proposed to detect Trojan traffic by using unlabeled network traffic for model training. Firstly, the detection method based on the mean teacher model is used to improve the detection accuracy. Then, in order to solve the problem that the model generalization ability is not enough due to the random noise in the mean teacher model, a detection method based on the virtual adversarial mean teacher is proposed. At last, the experimental results show that the proposed semi-supervised deep learning detection method has higher accuracy in the task of two classifications, multi-classification and unknown sample detection under the condition of less labeled samples. Besides, the detection method based on virtual adversarial mean teacher model has stronger generalization performance than the original mean teacher model in the task of multi-classification.