• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

千万亿次可扩展可容错自由网格数值模拟系统

黎雷生, 王朝尉, 马志涛, 霍志刚, 田荣

黎雷生, 王朝尉, 马志涛, 霍志刚, 田荣. 千万亿次可扩展可容错自由网格数值模拟系统[J]. 计算机研究与发展, 2015, 52(4): 823-832. DOI: 10.7544/issn1000-1239.2015.20131332
引用本文: 黎雷生, 王朝尉, 马志涛, 霍志刚, 田荣. 千万亿次可扩展可容错自由网格数值模拟系统[J]. 计算机研究与发展, 2015, 52(4): 823-832. DOI: 10.7544/issn1000-1239.2015.20131332
Li Leisheng, Wang Chaowei, Ma Zhitao, Huo Zhigang, Tian Rong. petaPar: A Scalable and Fault Tolerant Petascale Free Mesh Simulation System[J]. Journal of Computer Research and Development, 2015, 52(4): 823-832. DOI: 10.7544/issn1000-1239.2015.20131332
Citation: Li Leisheng, Wang Chaowei, Ma Zhitao, Huo Zhigang, Tian Rong. petaPar: A Scalable and Fault Tolerant Petascale Free Mesh Simulation System[J]. Journal of Computer Research and Development, 2015, 52(4): 823-832. DOI: 10.7544/issn1000-1239.2015.20131332

千万亿次可扩展可容错自由网格数值模拟系统

基金项目: 国家自然科学基金项目(11072241,11111140020,91130026);橡树岭国家实验室/美国国家计算科学中心主任基金项目(MAT028)
详细信息
  • 中图分类号: TP301.6; TP338.6

petaPar: A Scalable and Fault Tolerant Petascale Free Mesh Simulation System

  • 摘要: 在千万亿次计算能力的驱动下,数值软件的发展进入了一个以海量并行为基本特征的历史转折期,可扩展和可容错成为大规模数值模拟的两大关键技术.petaPar模拟程序是以对传统数值技术形成优势互补的无网格类方法为切入点,面向千万亿次级计算而开发的下一代新兴通用数值模拟程序.petaPar在统一架构下实现了光滑粒子动力学(smoothed particle hydrodynamics, SPH)和物质点法(material point method, MPM)两种最为成熟和有效的无网格/粒子算法,支持多种强度、失效模型和状态方程;其中MPM支持改进的接触算法,可以处理上百万离散物体的非连续变形和相互作用计算.系统具有以下特点:1)高可扩展.实现单核单Patch极端情形下计算和通信的完全重叠,支持动态负载均衡;2)可容错.支持无人值守变进程重启动,在系统硬件出现局部热故障时可以不中止计算;3)适应硬件体系结构异构架构的变化趋势,同时支持flat MPI和MPI+Pthreads并行模型.程序在Titan千万亿次超级计算机上进行了全系统规模的可扩展性测试,结果表明该代码可线性扩展到26万个CPU核,SPH和MPM的并行效率分别为100%和96%.
    Abstract: With the emergence of petaflops (10\+15 FLOPS) systems, numerical simulation has entered a new era—a times opening a possibility of using 10\+4 to 10\+6 processor cores in one single run of parallel computing. In order to take full advantages of the powerfulness of the petaflops and post-petaflops supercomputing infrastructures, two aspects of grand challenges including the scalability and the fault tolerance must be addressed in a domain application. petaPar is a highly scalable and fault tolerant meshfree/particle simulation code dedicated to petascale computing. Two popular particle methods, smoothed particle hydrodynamics (SPH) and material point method (MPM), are implemented in a unified object-oriented framework. The parallelization of both SPH and MPM consistently starts from the domain decomposition of a regular background grid. The scalability of the code is assured by fully overlapping the inter-MPI process communication with computation and a dynamic load balance strategy. petaPar supports both flat MPI and MPI+Pthreads hierarchial parallelization. Application-specific lightweight checkpointing is used in petaPar to deal with the issue of fault tolerance. petaPar is designed to be able to automatically self-restart from any number of MPI processes, allow a dynamic change of computing resources arisen in a scenario of, for example, nodal failure and connection timeout etc. Experiments are performed on the Titan petaflops supercomputer. It is shown that petaPar linearly scales up to 2.6×10\+5 CPU cores with the excellent parallel efficiency of 100% and 96% for the multithreaded SPH and the multithreaded MPM, respectively, and the performance of the multithreaded SPH is improved by up to 30% compared with the flat MPI implementation.
  • 期刊类型引用(8)

    1. 杨小东,周航,任宁宁,袁森,王彩芬. 支持多密文等值测试的无线体域网聚合签密方案. 计算机研究与发展. 2023(02): 341-350 . 本站查看
    2. 杨蒙蒙,江昆,温拓朴,陈会仙,黄晋,张浩,黄健强,唐雪薇,杨殿阁. 自动驾驶高精度地图众源更新技术现状与挑战. 中国公路学报. 2023(05): 244-259 . 百度学术
    3. 王妍,白洪亮,蒋方正,张英伟. 露天矿无人驾驶运输关键技术研究. 现代矿业. 2023(10): 178-181 . 百度学术
    4. 丁晓晖,曹素珍,窦凤鸽,马佳佳,王彩芬. 基于无证书聚合签名的导航信息更新方案. 计算机技术与发展. 2022(06): 112-119 . 百度学术
    5. 杜田,李欣,赖成喆,郑东. 面向无人驾驶地图更新的安全信任管理方案. 计算机工程. 2022(06): 154-166 . 百度学术
    6. 李月华. 基于自动驾驶众包地图更新技术方法. 北京测绘. 2022(05): 629-635 . 百度学术
    7. 陈虹,侯宇婷,郭鹏飞,周沫,赵菊芳,肖成龙. 可公开验证的高效无证书聚合签密方案. 计算机工程. 2022(10): 146-157 . 百度学术
    8. 陶永才,李哲,石磊,卫琳,杨淑博. 一种可信的车联网区块链数据共享模型. 小型微型计算机系统. 2021(10): 2131-2139 . 百度学术

    其他类型引用(5)

计量
  • 文章访问数:  1489
  • HTML全文浏览量:  0
  • PDF下载量:  616
  • 被引次数: 13
出版历程
  • 发布日期:  2015-03-31

目录

    /

    返回文章
    返回