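  • Outstanding Chinese Science and Technology Journal
  • CCF-recommended Class A Chinese-language journal
  • T1-class high-quality science and technology journal in the computing field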
Su Huayou, Wen Wen, Li Dongsheng. Optimization and Parallelization Single Particle Cryo-EM Software RELION with GPU[J]. Journal of Computer Research and Development, 2018, 55(2): 409-417. DOI: 10.7544/issn1000-1239.2018.20160873

Optimization and Parallelization Single Particle Cryo-EM Software RELION with GPU

  • Published Date: January 31, 2018
  • Single-particle cryo-electron microscopy (cryo-EM) is one of the most important methods for determining macromolecular structures. RELION (regularized likelihood optimization) is an open-source program for refining macromolecular structures by single-particle analysis of cryo-EM data. Because it is easy to use and produces high-quality results, RELION has attracted considerable attention from researchers. However, its computational requirements are too large for some large molecular structures to be solved on CPUs, which hampers the adoption of RELION. In this paper, we characterize the RELION algorithm and parallelize it on GPUs. First, we comprehensively analyze the mathematical theory, computation patterns, and performance bottlenecks of RELION. Then, we optimize the program for fine-grained many-core processors such as GPUs. We propose an efficient multi-level parallel model to exploit the computing capacity of many-core processors. To achieve high performance, we restructure the data layout so that GPU memory accesses are contiguous. To work around the limited GPU memory size, we implement an adaptive framework. The experimental results show that, compared with the CPU implementation, the GPU-based implementation speeds up the whole application by more than 36 times and the compute-intensive algorithm by about 75 times. Moreover, tests on multiple GPUs show that the GPU-based implementation scales well.
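
The abstract does not describe how the adaptive framework handles limited GPU memory. One common approach, shown here only as a minimal sketch and not as the authors' method, is to split the work into batches whose footprint fits a given memory budget. The function name `plan_batches` and its parameters are hypothetical and chosen for illustration.

```python
from typing import List, Tuple


def plan_batches(n_items: int, bytes_per_item: int,
                 budget_bytes: int) -> List[Tuple[int, int]]:
    """Split n_items into [start, stop) ranges that each fit budget_bytes.

    Hypothetical helper: bytes_per_item is the assumed device-memory
    footprint of one work item (for example, one particle image).
    """
    if bytes_per_item <= 0 or budget_bytes < bytes_per_item:
        raise ValueError("budget must hold at least one item")
    # Largest number of items that fits in the budget at once.
    per_batch = budget_bytes // bytes_per_item
    return [(start, min(start + per_batch, n_items))
            for start in range(0, n_items, per_batch)]


if __name__ == "__main__":
    # Ten items of 4 bytes each with a 12-byte budget: three items per batch.
    print(plan_batches(10, 4, 12))
```

Each range would then be copied to the device, processed, and released before the next one is loaded, so the peak footprint stays within the budget regardless of the data set size.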
