黄鹏,张雄,马上,王汉奎.基于OpenMP的三维显式物质点法并行化研究[J].计算力学学报,2010,27(1):21~27 |
| 码上扫一扫! |
基于OpenMP的三维显式物质点法并行化研究 |
Parallelization of 3D explicit material point method using OpenMP |
投稿时间:2008-03-28 |
DOI:10.7511/jslx20101004 |
中文关键词: OpenMP 显式物质点法 区域分解法 循环分解方法 加速比 |
英文关键词:OpenMP Explicit MPM domain decomposition method loop splitting method speedup |
基金项目:国家自然科学基金(10872107);国家重点基础研究发展计划(2010CB32101)资助项目. |
|
摘要点击次数: 2125 |
全文下载次数: 1402 |
中文摘要: |
基于OpenMP技术开发了三维显式物质点并行程序MPM3DMP。为了避免节点更新阶段的数据竞争,采用区域分解法将背景网格分解为均匀的子域,每个线程负责一个子域的节点变量更新,然后将更新后的节点变量装配到整体。在质点更新阶段采用了循环分解方法进行并行。针对Taylor杆碰撞的三种计算模型,在双Intel Woodcrest 4核CPU服务器下进行了测试:粗模型在4核下加速比为3.82,在8核下为6.23,中模型在4核下加速比为3.79,在8核下加速比为6.23;细模型在4核下加速比为3.75,8核下加速比为6.26。因此,本文的并行程序具有较好的并行效率和可扩展性。 |
英文摘要: |
Based on the OpenMP technique, a parallel 3D explicit material point method (MPM) code, MPM3DMP, is developed in this paper. The domain decomposition method is presented for avoiding data races in updating nodal variables. The background grid is decomposed into some uniform patches, and each thread deal with a patch in domain decomposition method. After updating nodes in all patches, their nodal variables are assembled into the global one. The code for updating particle variables can be parallelized using the loop splitting method directly. To test the performance of the developed code, the Taylor bar impact is simulated by using three models with different number of particles on a computer with dual quad-core Intel Woodcrest processors. A speedup of 3.82 is achieved for the coarse model in 4 cores, and 6.23 in 8 cores, while a speedup of 3.79 is achieved for the medium-sized model in 4 cores, and 6.23 in 8 cores. A speedup of 3.75 is achieved for the fine model in 4 cores, and 6.26 in 8 cores. Thus, MPM3DMP has good parallel efficiency and extension ability. |
查看全文 查看/发表评论 下载PDF阅读器 |
|
|
|
|