ISSN 1000-1239 CN 11-1777/TP

计算机研究与发展 ›› 2013, Vol. 50 ›› Issue (6): 1147-1162.

所属专题: 2013物联网基础理论与新技术方向

• 综述 • 上一篇    下一篇

大数据的一个重要方面:数据可用性

李建中 刘显敏   

  1. (哈尔滨工业大学计算机科学与技术学院 哈尔滨 150001) (lijzh@hit.edu.cn)
  • 出版日期: 2013-06-15

An Important Aspect of Big Data: Data Usability

Li Jianzhong and Liu Xianmin   

  1. (School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001)
  • Online: 2013-06-15

摘要: 随着信息技术的发展,特别是物理信息系统、互联网、云计算和社交网络等技术的突飞猛进,大数据普遍存在,正在成为信息社会的重要财富,同时也带来了巨大的挑战.数据可用性问题就是大数据的重要挑战之一.随着数据的爆炸性增长,劣质数据也随之而来,数据可用性受到严重影响,对信息社会形成严重威胁,引起了学术界和工业界的共同关注.近年来,学术界和工业界开始研究数据可用性问题,取得了一些的研究成果,但是针对大数据可用性问题的研究工作还很少.介绍了大数据可用性的基本概念,讨论大数据可用性的挑战,探讨大数据可用性方面的研究问题,并综述数据可用性方面的研究成果.

关键词: 大数据, 数据可用性, 数据一致性, 数据完整性, 数据精确性, 数据时效性, 实体同一性

Abstract: With the rapid development of information technology, especially the great progresses of Internet, cyber physical system, Internet of things, cloud computing and social network, big data becomes ubiquitous. Big data brings not only great benefits but also crucial challenges. Improving the data usability is one of the most significant challenges. Dirty data accompanies the tremendous increase of data volume, degrades the data quality and data usability, and brings serious harm to the information societies. Fortunately, there has been widespread concern about the data usability in both industrial and academic communities, and the recent research efforts on data usability have yielded some impressive results. However, there are only few works focusing on the usability of big data. In this paper, the concepts of big data usability are introduced first, and then the challenges and research problems of the big data usability are discussed. Finally, the works related to the data usability are surveyed.

Key words: big data, data usability, data consistency, data completeness, data accuracy, data currency, entity identity