应用于高速网络的基于报文采样和应用签名的BitTorrent流量识别算法

郭振滨      裘正定

应用于高速网络的基于报文采样和应用签名的BitTorrent流量识别算法

郭振滨裘正定

Identification of BitTorrent Traffic for High Speed Network Using Packet Sampling and Application Signatures

Guo Zhenbin and Qiu Zhengding

摘要

摘要: 在高速网络上进行P2P流量识别具有极大的困难，因为基于端口号的方法已经不再准确，而基于应用签名的方法没有足够高的处理效率.提出了应用于高速网络的基于报文采样和应用签名的BitTorrent流量识别算法.建立了误检率和漏检率模型来分析报文采样率和签名率对识别准确度的作用，并指导应用签名和采样率的选择.通过开发流状态判别预处理器，在Snort平台上实现了该流量识别算法.实验结果表明该流量识别算法处理效率和准确度都是令人满意的，能应用于高速网络环境.在普通个人计算机上，对采样报文的处理效率在800Mbps以上.将该方法应用于报文处理，当采样率为0.5时漏检率为0.6%，当采样率为0.1时漏检率为5.9%，当采样率为0.05时漏检率为10.5%. 将该方法应用于流数据分析，当采样率为0.5时漏检率为0.06%，当采样率为0.1时漏检率为0.33%，当采样率为0.05时漏检率为1.1%. 该方法展现了优秀的误检性能，没有任何报文被误检.实验结果也表明误检率和漏检率模型是非常准确的.

Abstract: It is very difficult to identify peer-to-peer (P2P) traffic in high speed network environment because well-known port numbers are no longer reliable and application signatures are not efficient enough. In this paper, a BitTorrent traffic identification method for high speed network using packet sampling and application signatures is presented. Models of false negatives and false positives are developed to analyze the effects of packet sampling probability and application signatures probability on accuracy. The method is implemented with Snort by developing a flow state differentiating preprocessor. The experiment results show that the efficiency and accuracy of the method are exciting and the method can be applied to high speed network. The low limit of processing efficiency is over 800 Mbps on a personal computer hardware platform. Assuming that the method is applied to processing packets, the false negatives rate is about 0.6% with 0.5 sampling probability, about 5.9% with 0.1 sampling probability, and about 10.5% with 0.05 sampling probability. Assuming that the method is applied to analyzing flows, the false negatives rate is about 0.06% with 0.5 sampling probability, about 0.33% with 0.1 sampling probability, and about 1.1% with 0.05 sampling probability. The method shows excellent false positives with no packet falsely identified. The experiment results also show that the false negatives and false positives models are very accurate.

HTML全文

参考文献(0)

施引文献

资源附件(0)