A Quantum Principal Component Analysis Algorithm for Clustering Problems
Abstract: Outliers in clustering problems easily distort the selection of cluster centers, and as the sample data set grows, computing the distances between sample points consumes substantial computing resources. To address both issues, a new quantum principal component analysis algorithm for clustering problems (QC-PCA) is proposed, improving cluster center selection and shortest-distance search. The principal components are identified by updating the singular values with thresholds, and the cluster center is then selected according to the potential function of the subset, thereby reducing the influence of abnormal points on the selection of the cluster center. In addition, a quantum minimum search algorithm is used to find the cluster center closest to each sample point, reducing the number of iterations required for clustering. Taking a small-scale data set as an example, the Cirq quantum programming framework is used for circuit design and simulation experiments. The experimental results show that QC-PCA achieves higher clustering accuracy than existing quantum clustering algorithms. Performance analysis shows that, compared with existing classical and quantum algorithms, QC-PCA improves the time complexity of cluster center selection and shortest-distance search to varying degrees and also consumes fewer resources.
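The two classical ideas the abstract builds on, choosing a center by a potential function and assigning a sample to its nearest center, can be illustrated with a small sketch. This is not the QC-PCA algorithm: the abstract does not give the form of the potential function, so the Gaussian potential, the `sigma` parameter, and the function names `potential`, `pick_center`, and `nearest_center` below are all assumptions made for illustration. The nearest-center step is a plain linear scan, which is the step the paper replaces with quantum minimum search.

```python
import math

def potential(points, sigma=1.0):
    """Assumed Gaussian potential: a point scores high when many
    other points lie close to it, so isolated outliers score low."""
    scores = []
    for p in points:
        s = sum(math.exp(-math.dist(p, q) ** 2 / (2 * sigma ** 2))
                for q in points)
        scores.append(s)
    return scores

def pick_center(points, sigma=1.0):
    """Return the point with the highest potential as the cluster center."""
    scores = potential(points, sigma)
    best = max(range(len(points)), key=scores.__getitem__)
    return points[best]

def nearest_center(point, centers):
    """Classical linear scan over all centers; returns the index of the
    closest one. QC-PCA uses quantum minimum search for this step."""
    return min(range(len(centers)),
               key=lambda i: math.dist(point, centers[i]))

# Three tightly grouped points plus one outlier at (5, 5).
samples = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0)]
center = pick_center(samples)
label = nearest_center((4.0, 4.0), [(0.0, 0.0), (5.0, 5.0)])
```

In this toy data the outlier contributes almost nothing to the potential of the dense group, so the chosen center stays inside that group; this is the robustness to abnormal points that the abstract describes, shown here only under the assumed potential.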