Granular Vectors and K Nearest Neighbor Granular Classifiers
-
摘要: K近邻(K nearest neighbor, KNN)分类器是一种经典的分类器,它简单而又有效,已经在人工智能与机器学习领域得到了广泛的应用.针对传统分类器难以处理不确定性数据的问题,研究样本单特征邻域粒化技术,构造粒的向量形式,提出一种基于粒向量的K近邻分类方法.该方法引入邻域粗糙集模型,对分类系统中的样本进行单特征邻域粒化,形成特征邻域粒子.并由多个特征邻域粒子构成一个粒向量,定义了多种粒向量运算算子,提出了2种粒向量距离:相对粒距离与绝对粒距离,证明了粒向量距离的单调性原理.进一步,基于粒向量距离定义了K近邻粒向量概念,提出了K近邻粒分类器.最后,结合UCI数据集,采用K近邻粒分类器与经典K近邻分类器进行比较测试.理论分析和实验表明:针对合适的粒化参数与k值,K近邻粒分类器具有较好的分类性能.Abstract: K nearest neighbor (KNN) classifier is a classical, simple and effective classifier. It has been widely employed in the fields of artificial intelligence and machine learning. Aiming at the problem that traditional classifiers are difficult to deal with uncertain data, we study a technique of neighborhood granulation of samples on each atom feature, construct some granular vectors, and propose a K nearest neighbor classification method based on these granular vectors in this paper. The method introduces a neighborhood rough set model to granulate samples in a classification system, and the raw data can be converted into some feature neighborhood granules. Then, a granular vector is induced by a set of neighborhood granules, and several operators of granular vectors are defined. We present two metrics of granular vectors which are relative granular distance and absolute granular distance, respectively. The monotonicity of distance of granular vectors is proved. Furthermore, the concept of K nearest neighbor granular vector is defined based on the distance of granular vectors, and K nearest neighbor granular classifier is designed. Finally, the K nearest neighbor granular classifier is compared with the classical K nearest neighbor classifier using several UCI datasets. Theoretical analysis and experimental results show that the K nearest neighbor granular classifier has better classification performance under suitable granulation parameters and k values.
-
-
期刊类型引用(9)
1. 张恩,王梦涛,郑东,禹勇,黄昱晨. 低带宽场景下防合谋多方隐私集合并集协议. 通信学报. 2025(01): 210-221 . 百度学术
2. 李子贤,刘文杰. 求解最小公倍数问题的量子安全多方计算协议. 计算机学报. 2024(06): 1393-1412 . 百度学术
3. 逯绍锋,胡玉龙,逯跃锋. 保护隐私的集合相似性度量协同计算协议. 计算机技术与发展. 2023(01): 137-143 . 百度学术
4. 魏立斐,王勤,张蕾,陈聪聪,陈玉娇,宁建廷. 半可信云服务器辅助的高效隐私交集计算协议. 软件学报. 2023(02): 932-944 . 百度学术
5. 家珠亮,赵雪玲,李顺东. 分布式多重集众数及重数的保密计算. 密码学报. 2023(01): 102-117 . 百度学术
6. 魏立斐,刘纪海,张蕾,宁建廷. 双云辅助的超阈值多方隐私集合交集计算协议. 软件学报. 2023(11): 5442-5456 . 百度学术
7. 赵宗渠,王书静,汤永利,霍亚超,杨丽. 基于理想格的两方隐私集合交集协议. 计算机应用研究. 2023(12): 3795-3799 . 百度学术
8. 徐琳,魏晓超,蔡国鹏,王皓,郑志华. 一个高效的安全两方近似模式匹配协议. 计算机研究与发展. 2022(08): 1819-1830 . 本站查看
9. 魏立斐,刘纪海,张蕾,王勤,贺崇德. 面向隐私保护的集合交集计算综述. 计算机研究与发展. 2022(08): 1782-1799 . 本站查看
其他类型引用(16)
计量
- 文章访问数: 925
- HTML全文浏览量: 3
- PDF下载量: 360
- 被引次数: 25