ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2015, Vol. 52 ›› Issue (9): 1931-1940.doi: 10.7544/issn1000-1239.2015.20140684

    Next Articles

Truth Discovery Based Credibility of Data Categories on Data Sources

Ma Ruxia1,2, Meng Xiaofeng1   

  1. 1(Department of Information, Renmin University of China, Beijing 100872); 2(Department of Education Technology, Capital Normal University, Beijing 100048)
  • Online:2015-09-01

Abstract: The popularization of the network and the development of e-commerce have changed the way people access information and consume. For most of people, Web has been the important source of information. Meanwhile, information quality issue is becoming increasingly prominent. There is a lot of information which is outdated, incorrect, false and bias. Particularly, the problem of conflicting information provided by different websites is obvious. It has to be solved that how to find the truth from conflicting information. As we know, there is not a method which considers the credibility of data categories on data sources during discovering truth. So, we propose a problem which is truth discovery based credibility of data categories on data sources. In this paper, two methods are proposed to detect the credibility differences of data categories on sources, and a Bayesian method is used to iteratively compute the data sources quality and data accuracy. Additional, data coverage and the difficulty of each object is considered to improve the accuracy of truth finding. The experiments on a real data set show that our algorithms can significantly improve the accuracy of truth discovery.

Key words: truth discovery, data conflicting, credibility of data categories on data sources, quality of information, data fusion

CLC Number: