• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Wang Bo and Guo Bo. Study of Aggregation Process Model and Algorithms of Autonomy Heterogeneous Data Sources[J]. Journal of Computer Research and Development, 2008, 45(9): 1546-1553.
Citation: Wang Bo and Guo Bo. Study of Aggregation Process Model and Algorithms of Autonomy Heterogeneous Data Sources[J]. Journal of Computer Research and Development, 2008, 45(9): 1546-1553.

Study of Aggregation Process Model and Algorithms of Autonomy Heterogeneous Data Sources

More Information
  • Published Date: September 14, 2008
  • Data sharing is a pervasive challenge faced in applications that need to query across multiple autonomous data sources. The task of integration becomes more complicated when data sources are distributed, heterogeneous, and high in number. One solution to the issues of distribution and scale is to perform data integration using P2P networks, but current P2P architectures are mostly flat, only specifying mappings directly between peers, and with no schemas abstraction provided. In this paper, a data sharing architecture similar to iXPeer is proposed to deal with integration on several levels of schema abstraction. Peers are grouped into local clusters according to their similarities. Peers with high similarities are clustered into one group, which can improve the query efficiency, reducing the computing cost. An aggregation model based on elements matching for autonomous data sources is proposed to construct clusters. TA, originally proposed in the context of database middleware, is applied to generate a list of best-ranked data source nodes, since TA may require time exponential in the size of the scale of cluster organization. TA is improved by adding labeling nodes, resulting in TAL, to generate the top-K cluster nodes. Experiments show that TA and TAL have good performances on top-K searching, especially for TAL, when the scale of clustered nodes is large.
  • Related Articles

    [1]Han Xixian, Li Jianzhong, and Gao Hong. PAA: An Efficient Approximate Aggregation Algorithm on Massive Data[J]. Journal of Computer Research and Development, 2014, 51(1): 41-53.
    [2]Fan Wenbin, Guo Longjiang, Li Jinbao, and Ren Meirui. MPMC: An Algorithm for Data Aggregation Scheduling in Multi-Channel and Multi-Power Wireless Sensor Networks[J]. Journal of Computer Research and Development, 2012, 49(7): 1568-1578.
    [3]Yin Dan, Gao Hong, and Zou Zhaonian. A Novel Efficient Graph Aggregation Algorithm[J]. Journal of Computer Research and Development, 2011, 48(10): 1831-1841.
    [4]An Mingyuan, Sun Xiuming, Sun Ninghui. Dynamic Data-Partitioned Online Aggregation[J]. Journal of Computer Research and Development, 2010, 47(11): 1928-1935.
    [5]Zhou Xun, Li Jianzhong, and Shi Shengfei. Distributed Aggregations for Two Queries over Uncertain Data[J]. Journal of Computer Research and Development, 2010, 47(5): 762-771.
    [6]Wang Yongli, Xu Hongbing, Dong Yisheng, Qian Jiangbo, Liu Xuejun. Algorithms for Incremental Aggregation over Distributed Data Stream[J]. Journal of Computer Research and Development, 2006, 43(3): 509-515.
    [7]Chen Xiqian, Wang Zhanchang, Cao Xiukun, Chi Zhongxian. An Efficient Indexing Scheme for Range Aggregate Queries in Spatial Data Warehouse[J]. Journal of Computer Research and Development, 2006, 43(1): 75-80.
    [8]Liang Zuopeng, Hu Kongfa, Dong Yisheng, Chen Ling. An Improved Dimension Hierarchy Aggregate Cube Storage Structure for Data Warehouses[J]. Journal of Computer Research and Development, 2005, 42(8): 1362-1368.
    [9]Song Yuqing, Zhu Yuquan, Sun Zhihui, Yang Hebiao. An Algorithm and Its Updating Algorithm Based on Frequent Pattern Tree for Mining Constrained Maximum Frequent Itemsets[J]. Journal of Computer Research and Development, 2005, 42(5): 777-783.
    [10]Cao Qiang and Xie Changsheng. Applying Aggregate I/O to Improve Performance of Network Storage[J]. Journal of Computer Research and Development, 2005, 42(4): 544-550.

Catalog

    Article views (698) PDF downloads (601) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return