• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Chen Shimin. Big Data Analysis and Data Velocity[J]. Journal of Computer Research and Development, 2015, 52(2): 333-342. DOI: 10.7544/issn1000-1239.2015.20140302
Citation: Chen Shimin. Big Data Analysis and Data Velocity[J]. Journal of Computer Research and Development, 2015, 52(2): 333-342. DOI: 10.7544/issn1000-1239.2015.20140302

Big Data Analysis and Data Velocity

More Information
  • Published Date: January 31, 2015
  • Big data poses three main challenges to the underlying data management systems: volume (a huge amount of data), velocity (high speed of data generation, data acquisition, and data updates), and variety (a large number of data types and data formats). In this paper, we focus on understanding the significance of velocity and discussing how to face the challenge of velocity in the context of big data analysis systems. We compare the requirements of velocity in transaction processing, data stream, and data analysis systems. Then we describe two of our recent research studies with an emphasis on the role of data velocity in big data analysis systems: 1) MaSM, supporting online data updates in data warehouse systems; 2) LogKV, supporting high-throughput data ingestion and efficient time-window based joins in an event log processing system. Comparing the two studies, we find that storing incoming data updates is only the minimum requirement. We should consider velocity as an integral part of the data acquisition and analysis life cycle. It is important to analyze the characteristics of the desired big data analysis operations, and then to optimize data organization and data distribution schemes for incoming data updates so as to maintain or even improve the efficiency of big data analysis.
  • Related Articles

    [1]Wang Duo, Liu Jinglei, Yan Mingyu, Teng Yihan, Han Dengke, Ye Xiaochun, Fan Dongrui. Acceleration Methods for Processor Microarchitecture Design Space Exploration: A Survey[J]. Journal of Computer Research and Development, 2025, 62(1): 22-57. DOI: 10.7544/issn1000-1239.202330348
    [2]Wang Kaifan, Xu Yinan, Yu Zihao, Tang Dan, Chen Guokai, Chen Xi, Gou Lingrui, Hu Xuan, Jin Yue, Li Qianruo, Li Xin, Lin Jiawei, Liu Tong, Liu Zhigang, Wang Huaqiang, Wang Huizhe, Zhang Chuanqi, Zhang Fawang, Zhang Linjuan, Zhang Zifei, Zhang Ziyue, Zhao Yangyang, Zhou Yaoyang, Zou Jiangrui, Cai Ye, Huan Dandan, Li Zusong, Zhao Jiye, He Wei, Sun Ninghui, Bao Yungang. XiangShan Open-Source High Performance RISC-V Processor Design and Implementation[J]. Journal of Computer Research and Development, 2023, 60(3): 476-493. DOI: 10.7544/issn1000-1239.202221036
    [3]Zhang Qianlong, Hou Rui, Yang Sibo, Zhao Boyan, Zhang Lixin. The Role of Architecture Simulators in the Process of CPU Design[J]. Journal of Computer Research and Development, 2019, 56(12): 2702-2719. DOI: 10.7544/issn1000-1239.2019.20190044
    [4]Yu Zihao, Liu Zhigang, Li Yiwei, Huang Bowen, Wang Sa, Sun Ninghui, Bao Yungang. Practice of Chip Agile Development: Labeled RISC-V[J]. Journal of Computer Research and Development, 2019, 56(1): 35-48. DOI: 10.7544/issn1000-1239.2019.20180771
    [5]Zhuo Xinxin, Bai Xiaoying, Xu Jing, Li Enpeng, Liu Yu, Kang Jiehui, Song Wenli. A Tool for Automatic Service Interface Testing[J]. Journal of Computer Research and Development, 2018, 55(2): 358-376. DOI: 10.7544/issn1000-1239.2018.20160721
    [6]Li Gongli, Dai Zibin, Xu Jinhui, Wang Shoucheng, Zhu Yufei, Feng Xiao. Design of Block Cipher Processor Based on Stream Architecture[J]. Journal of Computer Research and Development, 2017, 54(12): 2833-2842. DOI: 10.7544/issn1000-1239.2017.20160670
    [7]Chen Long, Ye Wei, Zhang Shikun. Onboard: A Data-Driven Agile Software Development Collaboration Tool[J]. Journal of Computer Research and Development, 2016, 53(12): 2753-2767. DOI: 10.7544/issn1000-1239.2016.20160625
    [8]Wang Yourui, Shi Wei, Wang Zhiying, Lu Hongyi, and Su Bo. A Novel Flow for Asynchronous Circuit Design Using Synchronous EDA Tools[J]. Journal of Computer Research and Development, 2012, 49(9): 2027-2035.
    [9]Li Tao, Zhang Xiaoming, and Sun Zhigang. Coarse-Grained Dataflow Network Processor:Architecture and Prototype Design[J]. Journal of Computer Research and Development, 2009, 46(8): 1278-1284.
    [10]Hu Yu, Han Yinhe, Li Xiaowei. Design-for-Testability and Test Technologies for System-on-a-Chip[J]. Journal of Computer Research and Development, 2005, 42(1): 153-162.
  • Cited by

    Periodical cited type(9)

    1. 王铎,刘景磊,严明玉,滕亦涵,韩登科,叶笑春,范东睿. 面向处理器微架构设计空间探索的加速方法综述. 计算机研究与发展. 2025(01): 22-57 . 本站查看
    2. 罗宇哲,李玲,侯朋朋,于佳耕,程丽敏,张常有,武延军,赵琛. 面向AIoT的协同智能综述. 计算机研究与发展. 2025(01): 179-206 . 本站查看
    3. 陈云霁,蔡一茂,汪玉,唐华,何杰,刘克,郝跃. 集成电路未来发展与关键问题——第347期“双清论坛(青年)”学术综述. 中国科学:信息科学. 2024(01): 1-15 .
    4. 翟建旺,凌梓超,白晨,赵康,余备. 机器学习辅助微架构功耗建模和设计空间探索综述. 计算机研究与发展. 2024(06): 1351-1369 . 本站查看
    5. 陈三伟,李进财,张婷,邱丹,江林. 面向芯片设计的Python系统级自动化工具开发. 电子技术应用. 2024(10): 14-17 .
    6. 乔寓然,王鹏,谢家志. 人工智能在运算放大器设计中的应用研究. 微电子学. 2024(04): 653-658 .
    7. 罗莉,石伟,何鸿君,潘国腾,王蕾,龚锐. 一种面向IO Die的敏捷验证方法. 计算机工程与科学. 2023(04): 571-576 .
    8. 王梦迪,王颖,刘成,常开颜,高成思,韩银和,李华伟,张磊. Puzzle:面向深度学习集成芯片的可扩展框架. 计算机研究与发展. 2023(06): 1216-1231 . 本站查看
    9. 贺旭,王耀,傅智勇,李暾,屈婉霞,万海,张吉良. 敏捷设计中基于机器学习的静态时序分析方法综述. 计算机辅助设计与图形学学报. 2023(04): 640-652 .

    Other cited types(8)

Catalog

    Article views (2331) PDF downloads (1571) Cited by(17)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return