• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Ma Ruxia, Meng Xiaofeng, Wang Lu, Shi Yingjie. MTruths:An Approach of Multiple Truths Finding from Web Information[J]. Journal of Computer Research and Development, 2016, 53(12): 2858-2866. DOI: 10.7544/issn1000-1239.2016.20150614
Citation: Ma Ruxia, Meng Xiaofeng, Wang Lu, Shi Yingjie. MTruths:An Approach of Multiple Truths Finding from Web Information[J]. Journal of Computer Research and Development, 2016, 53(12): 2858-2866. DOI: 10.7544/issn1000-1239.2016.20150614

MTruths:An Approach of Multiple Truths Finding from Web Information

More Information
  • Published Date: November 30, 2016
  • Web has been a massive information repository on which information is scattered in different data sources. It is common that different data sources provide conflicting information for the same entity. It is called the truth finding problem that how to find the truths from conflicting information. According to the number of attribute values, object attributes can be divided into two categories: single-valued attributes and multiple-valued attributes. Most of existing truth finding work is designed for truth finding on single-valued attributes. In this paper, a method called MTruths is proposed to resolve truth finding problem for multiple-valued attributes. We model the problem using an optimization problem. The objective is to maximize the total weight similarity between the truths and observations provided by data sources. In truth finding process, two methods are proposed to find the optimal solution: an enumeration algorithm and a greedy algorithm. Experiments on two real data sets show that the correctness of our approache and the efficiency of the greedy algorithm outperform the existing state-of-the-art techniques.
  • Related Articles

    [1]Hong Jinxin, Wu Yingjie, Cai Jianping, Sun Lan. Differentially Private High-Dimensional Binary Data Publication via Attribute Segmentation[J]. Journal of Computer Research and Development, 2022, 59(1): 182-196. DOI: 10.7544/issn1000-1239.20200701
    [2]Wu Bin, Lou Zhengzheng, Ye Yangdong. A Collaborative Filtering Recommendation Algorithm for Multi-Source Heterogeneous Data[J]. Journal of Computer Research and Development, 2019, 56(5): 1034-1047. DOI: 10.7544/issn1000-1239.2019.20180461
    [3]Zhang Xiaoran, Yuan Man. General Data Quality Assessment Model and Ontological Implementation[J]. Journal of Computer Research and Development, 2018, 55(6): 1333-1344. DOI: 10.7544/issn1000-1239.2018.20160764
    [4]Zhou Ningnan, Sheng Wanxing, Liu Ke-yan, Zhang Xiao, Wang Shan. WR Approach: Determining Accurate Attribute Values in Big Data Integration[J]. Journal of Computer Research and Development, 2016, 53(2): 449-458. DOI: 10.7544/issn1000-1239.2016.20148275
    [5]Ma Ruxia, Meng Xiaofeng. Truth Discovery Based Credibility of Data Categories on Data Sources[J]. Journal of Computer Research and Development, 2015, 52(9): 1931-1940. DOI: 10.7544/issn1000-1239.2015.20140684
    [6]Huang Junjie, Chen Xiaojiang, Liu Chen, Fang Dingyi, Wang Wei, Yin Xiaoyan, Wu Yueshan. A Source Data Congestion Control Based on Sleep Schedule[J]. Journal of Computer Research and Development, 2015, 52(8): 1852-1861. DOI: 10.7544/issn1000-1239.2015.20140668
    [7]Yu Wei, Li Shijun, Yang Sha, Hu Yahui, Liu Jing, Ding Yonggang, Wang Qian. Automatically Discovering of Inconsistency Among Cross-Source Data Based on Web Big Data[J]. Journal of Computer Research and Development, 2015, 52(2): 295-308. DOI: 10.7544/issn1000-1239.2015.20140224
    [8]Wan Changxuan, Deng Song, Liu Dexi, Jiang Tengjiao, and Liu Xiping. Non-Cooperative Structured Deep Web Selection Based on Hybrid Type Keyword Retrieval[J]. Journal of Computer Research and Development, 2014, 51(4): 905-917.
    [9]Wang Bo and Guo Bo. Study of Aggregation Process Model and Algorithms of Autonomy Heterogeneous Data Sources[J]. Journal of Computer Research and Development, 2008, 45(9): 1546-1553.
    [10]Deng Xubin, Zhu Yangyong. ReDE: A Regular Expression-Based Method for Extracting Biological Data[J]. Journal of Computer Research and Development, 2005, 42(12): 2184-2191.

Catalog

    Article views (1151) PDF downloads (542) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return