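Gong Shuguang, Liu Qiliang, Lu Haishan, Zhou Zhiyong, Zhang Jia. Parallel computing and application of Element-Free Galerkin method for GPU acceleration [J]. Chinese Journal of Computational Mechanics (计算力学学报), 2015, 32(6): 745-751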
Parallel computing and application of Element-Free Galerkin method for GPU acceleration
Received: 2014-09-03    Revised: 2014-11-08
DOI: 10.7511/jslx201506006
Keywords: Element-Free Galerkin method; GPU acceleration; parallel computing; CUDA
Funding: Supported by the National Natural Science Foundation of China (51375417, 51405415).
Abstract:
To reduce the computing time of the Element-Free Galerkin (EFG) method, a GPU-accelerated parallel algorithm is proposed in which essential boundary conditions are imposed by the penalty function method, the stiffness matrix is assembled by a node pair-wise approach, and the sparse linear system, stored in CSR format, is solved by the conjugate gradient method. A unified format for the stiffness matrix and the penalty stiffness matrix is given, together with a flow chart of the GPU-accelerated parallel algorithm. The GPU code was written on the CUDA platform, and the performance of the proposed algorithm was tested and compared on an NVIDIA GeForce GTX 660 graphics card using numerical examples. The factors that affect the speedup ratio are discussed. The results verify the feasibility of the proposed algorithm: with the required computational accuracy maintained, the maximum speedup ratio reaches 17, and the solution of the linear system has a decisive influence on the speedup.
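The abstract combines three ingredients: penalty enforcement of an essential boundary condition, CSR storage, and a conjugate gradient solve. The following is a minimal serial sketch of how they fit together, written in plain Python on the CPU. It is not the authors' CUDA implementation and does not reproduce their node pair-wise assembly. The three-DOF spring chain, the penalty factor `alpha`, and all function names are assumptions chosen for illustration only.

```python
def apply_penalty(K, f, dof, prescribed, alpha):
    """Return copies of K and f with u[dof] = prescribed enforced by a penalty term."""
    K = [row[:] for row in K]
    f = f[:]
    K[dof][dof] += alpha            # penalty stiffness contribution
    f[dof] += alpha * prescribed    # matching penalty load contribution
    return K, f


def dense_to_csr(matrix):
    """Convert a dense matrix to CSR arrays (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in matrix:
        for j, a in enumerate(row):
            if a != 0.0:
                values.append(float(a))
                col_idx.append(j)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def csr_matvec(csr, x):
    """Compute y = A x for a CSR matrix; every row is independent of the others."""
    values, col_idx, row_ptr = csr
    y = [0.0] * (len(row_ptr) - 1)
    for i in range(len(y)):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def conjugate_gradient(csr, b, tol=1e-10, max_iter=1000):
    """Unpreconditioned CG for a symmetric positive definite CSR system."""
    x = [0.0] * len(b)
    r = list(b)                     # residual for the zero initial guess
    p = list(r)                     # first search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if rs_old ** 0.5 <= tol:    # stop once the residual norm is small
            break
        Ap = csr_matvec(csr, p)
        step = rs_old / dot(p, Ap)
        x = [xi + step * pi for xi, pi in zip(x, p)]
        r = [ri - step * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs_old) * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


# Hypothetical test problem: three nodes joined by two unit springs,
# node 0 fixed (u = 0) through the penalty term, unit load on node 2.
K = [[1.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 1.0]]
f = [0.0, 0.0, 1.0]
K_pen, f_pen = apply_penalty(K, f, dof=0, prescribed=0.0, alpha=1.0e4)
csr = dense_to_csr(K_pen)
u = conjugate_gradient(csr, f_pen)
print(u)
```

The exactly constrained solution of this toy problem is u = (0, 1, 2). The penalty solution differs from it by 1/alpha in each component, which shows the usual trade-off: a larger `alpha` enforces the boundary condition more tightly but worsens the conditioning of the system that CG has to solve. The row loop in `csr_matvec` has no dependence between rows, which is the property a GPU version would typically exploit. How the authors map this work to CUDA threads is not stated in the abstract.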