一种基于域名请求伴随关系的恶意域名检测方法

彭成维; 云晓春; 张永铮; 李书豪

doi:10.7544/issn1000-1239.2019.20180481

一种基于域名请求伴随关系的恶意域名检测方法

Detecting Malicious Domains Using Co-Occurrence Relation Between DNS Query

摘要

摘要: 恶意域名在网络非法攻击活动中承担重要的角色.恶意域名检测能够有效地减少攻击活动所带来的经济损失.提出CoDetector恶意域名检测模型，通过挖掘域名请求之间潜在的时空伴随关系进行恶意域名检测.研究发现域名请求之间存在彼此伴随关系，而并非相互独立.因此，彼此伴随的域名之间存在紧密关联，偏向于同时是正常域名或恶意域名.1)利用域名请求的先后时间顺序对域名数据进行粗粒度的聚类操作，将彼此伴随出现的域名划分到同一簇中；2)采用嵌入学习构建映射函数，在保留域名伴随关系的同时将每一个域名投影成低维空间的特性向量；3)结合有标记的数据，训练恶意域名检测分类器，用于检测更多未知恶意域名.实验结果表明，CoDetector能够有效地检测恶意域名，具有91.64%检测精度和96.04%召回率.

Abstract: Malicious domains play a vital role in illicit online activities. Effectively detecting the malicious domains can significantly decrease the damage of evil attacks. In this paper, we propose CoDetector, a novel technique to detect malicious domains based on the co-occurrence relationships of domains in DNS (domain name system) queries. We observe that DNS queries are not isolated, whereas co-occur with each other. We base it design on the intuition that domains that tend to co-occur in DNS traffic are strongly associated and are likely to be in the same property (i.e., malicious or benign). Therefore, we first perform coarse-grained clustering of DNS traffic based on the chronological order of DNS queries. The domains co-occurring with each other will be clustered. Then, we design a mapping function that automatically projects every domain into a low-dimensional feature vector while maintaining their co-occurrence relationships. Domains that co-occur with each others are mapped to similar vectors while domains that not co-occur are mapped to distant vectors. Finally, based on the learned feature representations, we train a classifier over a labeled dataset and further apply it to detect unknown malicious domains. We evaluate CoDetector using real-world DNS traffic collected from an enterprise network over two months. The experimental results show that CoDetector can effectively detect malicious domains (91.64% precision and 96.04% recall).

HTML全文

参考文献(0)

施引文献

资源附件(0)