Wang Yongxian, Zhang Lilun, Che Yonggang, Xu Chuanfu, Liu Wei, Cheng Xinghua. Heterogeneous Computing and Optimization on Tianhe-2 Supercomputer System for High-Order Accurate CFD Applications[J]. Journal of Computer Research and Development, 2015, 52(4): 833-842. DOI: 10.7544/issn1000-1239.2015.20131922

Heterogeneous Computing and Optimization on Tianhe-2 Supercomputer System for High-Order Accurate CFD Applications

More Information
  • Published Date: March 31, 2015
  • Simulating large-scale computational fluid dynamics (CFD) applications on contemporary supercomputers with many-core heterogeneous architectures, such as Tianhe-2, remains a major challenge and an active research topic. This paper explores techniques for efficient parallel simulation of large-scale CFD applications with high-order accurate schemes on heterogeneous high-performance computing (HPC) platforms. We propose performance optimization strategies matched to both the characteristics of CFD applications and the architecture of heterogeneous HPC platforms. They cover task decomposition, exploitation of parallelism, optimization of multi-threaded execution, vectorization with single-instruction multiple-data (SIMD) instructions, and cooperation between CPUs and co-processors. To evaluate these techniques, numerical experiments were performed on the Tianhe-2 supercomputer system with up to 1.228×10^11 grid points and a total of 590,000 processors and/or co-processors. To the best of our knowledge, a CFD simulation with a high-order accurate scheme had never been attempted at this scale. The optimized code achieves a 2.6× speedup on the hybrid CPU and co-processor platform relative to the CPU-only platform, and the test results also show perfect scalability. This work redefines the frontier of high-performance computing for fluid dynamics simulations on heterogeneous platforms.
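The cooperation between CPUs and co-processors mentioned in the abstract can be illustrated with a minimal sketch of static load balancing. This is not the authors' implementation: the function names, the idea of splitting grid cells in proportion to measured throughput, and the sample rates are all assumptions made for illustration. The co-processor rate of 1.6 is a hypothetical value chosen only so that the ideal-case arithmetic lands on the 2.6× figure quoted in the abstract; it is not a measurement from the paper.

```python
def split_cells(n_cells, cpu_rate, coproc_rate):
    """Split n_cells between CPU and co-processor in proportion to throughput.

    cpu_rate and coproc_rate are hypothetical throughputs (cells per second).
    A proportional split lets both devices finish at roughly the same time,
    ignoring data-transfer and synchronization costs.
    """
    if n_cells < 0 or cpu_rate <= 0 or coproc_rate <= 0:
        raise ValueError("n_cells must be >= 0 and both rates must be > 0")
    cpu_cells = round(n_cells * cpu_rate / (cpu_rate + coproc_rate))
    return cpu_cells, n_cells - cpu_cells


def ideal_hybrid_speedup(cpu_rate, coproc_rate):
    """Upper bound on hybrid speedup over CPU-only, assuming perfect overlap."""
    if cpu_rate <= 0 or coproc_rate < 0:
        raise ValueError("cpu_rate must be > 0 and coproc_rate must be >= 0")
    return (cpu_rate + coproc_rate) / cpu_rate


if __name__ == "__main__":
    # Hypothetical rates: co-processors deliver 1.6x the CPU throughput.
    cpu_cells, coproc_cells = split_cells(1000, 1.0, 1.6)
    print(cpu_cells, coproc_cells)
    print(ideal_hybrid_speedup(1.0, 1.6))
```

In practice the achievable speedup would fall below this bound once communication between host and co-processor is counted, which is why a real code would measure the rates at run time rather than fix them in advance.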
