Kernel optimization for short-range molecular dynamics

C Hu, X Wang, J Li, X He, S Li, Y Feng, S Yang… - Computer Physics …, 2017 - Elsevier
C Hu, X Wang, J Li, X He, S Li, Y Feng, S Yang, H Bai
Computer Physics Communications, 2017Elsevier
To optimize short-range force computations in Molecular Dynamics (MD) simulations, multi-
threading and SIMD optimizations are presented in this paper. With respect to multi-
threading optimization, a Partition-and-Separate-Calculation (PSC) method is designed to
avoid write conflicts caused by using Newton's third law. Serial bottlenecks are eliminated
with no additional memory usage. The method is implemented by using the OpenMP model.
Furthermore, the PSC method is employed on Intel Xeon Phi coprocessors in both native …
Abstract
To optimize short-range force computations in Molecular Dynamics (MD) simulations, multi-threading and SIMD optimizations are presented in this paper. With respect to multi-threading optimization, a Partition-and-Separate-Calculation (PSC) method is designed to avoid write conflicts caused by using Newton’s third law. Serial bottlenecks are eliminated with no additional memory usage. The method is implemented by using the OpenMP model. Furthermore, the PSC method is employed on Intel Xeon Phi coprocessors in both native and offload models. We also evaluate the performance of the PSC method under different thread affinities on the MIC architecture. In the SIMD execution, we explain the performance influence in the PSC method, considering the “if-clause” of the cutoff radius check. The experiment results show that our PSC method is relatively more efficient compared to some traditional methods. In double precision, our 256-bit SIMD implementation is about 3 times faster than the scalar version.
Elsevier
以上显示的是最相近的搜索结果。 查看全部搜索结果