高级检索

    一种基于聚类分析的R\+*树结点重叠判定算法

    A Decision Algorithm on Judging the Overlap of Nodes for R*Tree Based on Clustering Analysis

    • 摘要: 聚类分析可以对大量空间对象进行聚类划分,优化R\+*树的结点.根据R\+*树的强制重插原则,在聚类分析基础上提出一种扩展MBR的对角线段对相交算法以判定类结点的重叠.从根本上改变以往在解决R\+*树结点重叠时仅将MBR形状改变或单纯紧致正交MBR所存在的问题,以此为判定条件可以控制聚类算法迭代次数,减少噪声点对聚类的影响.其中判定算法时间复杂性为O(nlogn)级.实验结果表明在范围查询中引入基于聚类分析的对角线段对相交判定算法的查询效率优于基于R\+*树的Gain/Loss度量的贪婪算法和基于SR树的算法的查询效率.

       

      Abstract: Clustering analysis can implement the clustering partitions for a large amount of spatial objects and optimize the nodes of the R\+*tree. The superiority of R\+*tree holds for different types of queries and operations for both rectangles and multidimensional points with effect and efficiency. Since the query efficiency of R\+*tree is mainly impacted by the factor of overlap and overlay, according to the forced reinsertion principle of the R\+*tree and based on the clustering analysis, an algorithm which applies the diagonal line segment pairs intersection is presented to judge the overlaps among the clustering nodes. The novel algorithm overcomes the existing problems of only changing the bounding form or shrinking the orthogonal bounding rectangle in solving the overlaps among nodes in most index research. With the decision algorithm, the number of iterations in clustering algorithms, can be controled, and the influence of the noise point on the clustering can be decreased. The theoretical analysis concludes that the time complexity of the algorithm is the magnitude of O(nlogn),where n is the number of the diagonal line segment pairs. Experimental results indicate that the query efficiency based on the R\+*tree, which uses the decision algorithm, outperforms that of the gain/loss-metrics-applied greedy algorithm based on the R\+*tree and the original algorithm based on SR-tree in spatial range query.

       

    /

    返回文章
    返回