Big Data Analysis and Data Velocity

Chen Shimin

doi:10.7544/issn1000-1239.2015.20140302

Chen Shimin. Big Data Analysis and Data VelocityJ. Journal of Computer Research and Development, 2015, 52(2): 333-342. DOI: 10.7544/issn1000-1239.2015.20140302

Citation:

Chen Shimin. Big Data Analysis and Data VelocityJ. Journal of Computer Research and Development, 2015, 52(2): 333-342. DOI: 10.7544/issn1000-1239.2015.20140302

Citation:

Chen Shimin. Big Data Analysis and Data VelocityJ. Journal of Computer Research and Development, 2015, 52(2): 333-342. DOI: 10.7544/issn1000-1239.2015.20140302

Big Data Analysis and Data Velocity

Chen Shimin

Graphical Abstract

Abstract

Abstract

Big data poses three main challenges to the underlying data management systems: volume (a huge amount of data), velocity (high speed of data generation, data acquisition, and data updates), and variety (a large number of data types and data formats). In this paper, we focus on understanding the significance of velocity and discussing how to face the challenge of velocity in the context of big data analysis systems. We compare the requirements of velocity in transaction processing, data stream, and data analysis systems. Then we describe two of our recent research studies with an emphasis on the role of data velocity in big data analysis systems: 1) MaSM, supporting online data updates in data warehouse systems; 2) LogKV, supporting high-throughput data ingestion and efficient time-window based joins in an event log processing system. Comparing the two studies, we find that storing incoming data updates is only the minimum requirement. We should consider velocity as an integral part of the data acquisition and analysis life cycle. It is important to analyze the characteristics of the desired big data analysis operations, and then to optimize data organization and data distribution schemes for incoming data updates so as to maintain or even improve the efficiency of big data analysis.

FullText(HTML)

References (0)

Cited By

Turn off MathJax

Article Contents

Big Data Analysis and Data Velocity

Abstract

Catalog

Export File

Citation

Format

Content