• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Liu Xianmin, Li Jianzhong. Discovering Extended Conditional Functional Dependencies[J]. Journal of Computer Research and Development, 2015, 52(1): 130-140. DOI: 10.7544/issn1000-1239.2015.20130691
Citation: Liu Xianmin, Li Jianzhong. Discovering Extended Conditional Functional Dependencies[J]. Journal of Computer Research and Development, 2015, 52(1): 130-140. DOI: 10.7544/issn1000-1239.2015.20130691

Discovering Extended Conditional Functional Dependencies

More Information
  • Published Date: December 31, 2014
  • eCFD (extended conditional functional dependency) is proposed as the extension of CFD (conditional functional dependency) for data cleaning. Compared with CFD, eCFD can take more patterns of values and catch more semantic information. However, there are only few works about eCFD. This paper focuses on the problem of eCFD discovering, whose counterpart of CFD has been studied very much. As we know, this paper is the first work about eCFD discovering. To avoid inconsistencies and remove redundancies, based on the definitions of strongly validated and weakly non-redundant eCFDs, formal definition of eCFD discovering problem is given and MeCFD method is proposed to solve this problem. MeCFD first generates all basic eCFDs which are weakly non-redundant and semantically equivalent to all strongly validated eCFDs, then constructs compound eCFDs through merging basic eCFDs. Searching candidate space in depth-first order makes MeCFD use only constant memory space to maintain data partitions. Efficient pruning strategies are proposed to improve the performance of MeCFD. Theoretical analysis shows the correctness of MeCFD. Experiments over real data sets show the good scalability of MeCFD and the effectiveness of pruning strategies and optimizing methods.
  • Related Articles

    [1]Zheng Fang, Shen Li, Li Hongliang, Xie Xianghui. Lightweight Error Recovery Techniques of Many-Core Processor in High Performance Computing[J]. Journal of Computer Research and Development, 2015, 52(6): 1316-1328. DOI: 10.7544/issn1000-1239.2015.20150119
    [2]Xiong Huanliang, Zeng Guosun, Wu Canghai. A Novel Scalability Metric for Parallel Computing[J]. Journal of Computer Research and Development, 2014, 51(11): 2547-2558. DOI: 10.7544/issn1000-1239.2014.20130750
    [3]Zhang Aiqing, Mo Zeyao, Yang Zhang. Three-Level Hierarchical Software Architecture for Data-Driven Parallel Computing with Applications[J]. Journal of Computer Research and Development, 2014, 51(11): 2538-2546. DOI: 10.7544/issn1000-1239.2014.20131241
    [4]Chen Qi, Chen Zuoning, Jiang Jinhu. MDDS: A Method to Improve the Metadata Performance of Parallel File System for HPC[J]. Journal of Computer Research and Development, 2014, 51(8): 1663-1670. DOI: 10.7544/issn1000-1239.2014.20121094
    [5]Cai Yong, Li Guangyao, and Wang Hu. Parallel Computing of Central Difference Explicit Finite Element Based on GPU General Computing Platform[J]. Journal of Computer Research and Development, 2013, 50(2): 412-419.
    [6]Zhang Shihui, Kong Lingfu, and Feng Liang. An Improved Hestenes SVD Method and Its Parallel Computing and Application in Parallel Robot[J]. Journal of Computer Research and Development, 2008, 45(4): 716-724.
    [7]Tu Bibo, Hong Xuehai, Zhan Jianfeng, Fan Jianping. Workflow-Based User Environment for High Performance Computing[J]. Journal of Computer Research and Development, 2007, 44(10): 1717-1723.
    [8]Wu Xiangjun, Jin Zhiyan, Chen Dehui, Song Junqiang, Yang Xuesheng. A Parallel Computing Algorithm and Its Application in New Generation of Numerical Weather Prediction System (GRAPES)[J]. Journal of Computer Research and Development, 2007, 44(3).
    [9]Liu Jie, Chi Lihua, Hu Qingfeng, Li Xiaomei. An Improved TFQMR Algorithm for Large Linear Systems Suited to Parallel Computing[J]. Journal of Computer Research and Development, 2005, 42(7): 1235-1240.
    [10]Feng Shengzhong, Tan Guangming, Xu Lin, Sun Ninghui, Xu Zhiwei. Research on the High Performance Algorithms of Dawning 4000H Bioinformatics Specific Machine[J]. Journal of Computer Research and Development, 2005, 42(6): 1053-1058.

Catalog

    Article views (1302) PDF downloads (780) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return