高级检索

    一种最优的静态路径编码存储策略

    An Optimal Storage Strategy for Static Path Labeling Scheme

    • 摘要: 路径编码方案通过记录从XML文档根结点到当前结点的路径信息,可以快速判断结点间的各种位置关系.高效的编码存储策略可以在提高存储空间利用率的同时,减少系统的IO开销,从而进一步提升系统的整体性能.提出一种最优的静态路径编码存储策略,其基本思想是在存储编码中的数字时,每个编码中数字对应的前缀并非提前给定,而是根据其所在数字区间中数字的使用频率之和给定相应的前缀,因此可以充分利用每个不同数字的频率信息来降低所需的存储空间.最后通过实验结果验证了该方法的可行性及有效性.

       

      Abstract: By maintaining the information from root node to current node, the position relationship between nodes of an XML document can be determined efficiently by comparing their path labels, such that the overall performance of XML query processing can be improved significantly. Moreover, a good storage strategy for path labels can not only improve the utility ratio of disk space, but also reduce the costly IO operation. In this paper, an optimal storage strategy for static path labeling scheme is proposed to tackle this problem. The basic idea is that when storing the components of path labels, they are assigned with different prefixes according to the sum of frequencies of the region they belong to, thus can reduce the storage space efficiently. Compared with existing methods, the prefixes for components of path labels are not determined according to pre-specified prefixes, which are too inflexible to utilize the frequency information of different components to reduce the storage space. The experimental results verify the feasibility and effectiveness of the proposed storage strategy.

       

    /

    返回文章
    返回