Exploring simd for molecular dynamics, using intel® xeon® processors and intel® xeon phi coprocessors

SJ Pennycook, CJ Hughes… - 2013 IEEE 27th …, 2013 - ieeexplore.ieee.org
We analyse gather-scatter performance bottlenecks in molecular dynamics codes and the
challenges that they pose for obtaining benefits from SIMD execution. This analysis informs …

Active learning for human pose estimation

B Liu, V Ferrari - … of the IEEE international conference on …, 2017 - openaccess.thecvf.com
Annotating human poses in realistic scenes is very time consuming, yet necessary for
training human pose estimators. We propose to address this problem in an active learning …

591 TFLOPS multi-trillion particles simulation on SuperMUC

W Eckhardt, A Heinecke, R Bader, M Brehm… - … Conference, ISC 2013 …, 2013 - Springer
Anticipating large-scale molecular dynamics simulations (MD) in nano-fluidics, we conduct
performance and scalability studies of an optimized version of the code ls1 mardyn. We …

Optimization methods for virtual screening on novel computational architectures

H Perez-Sanchez, W Wenzel - Current computer-aided drug …, 2011 - ingentaconnect.com
The numerous virtual screening (VS) methods that are used today in drug discovery
processes differ mainly by the way they model the receptor and/or ligand and by the …

[图书][B] Scientific computing with multicore and accelerators

J Kurzak, DA Bader, J Dongarra - 2010 - books.google.com
The hybrid/heterogeneous nature of future microprocessors and large high-performance
computing systems will result in a reliance on two major types of components …

DARPA's HPCS program: History, models, tools, languages

J Dongarra, R Graybill, W Harrod, R Lucas, E Lusk… - Advances in …, 2008 - Elsevier
The historical context with regard to the origin of the DARPA High Productivity Computing
Systems (HPCS) program is important for understanding why federal government agencies …

Pipelining computation and optimization strategies for scaling gromacs on the sunway many-core processor

Y Yu, H An, J Chen, W Liang, Q Xu, Y Chen - Algorithms and Architectures …, 2017 - Springer
The increasing gap between plentiful computing elements and limited memory bandwidth
makes it increasingly difficult and sometimes even infeasible for HPC community to port …

SW_GROMACS: accelerate GROMACS on Sunway TaihuLight

T Zhang, Y Li, P Gao, Q Shao, M Shao… - Proceedings of the …, 2019 - dl.acm.org
GROMACS is one of the most popular Molecular Dynamic (MD) applications and is widely
used in the field of chemical and bimolecular system study. Similar to other MD applications …

Developing performance-portable molecular dynamics kernels in OpenCL

SJ Pennycook, SA Jarvis - 2012 SC Companion: High …, 2012 - ieeexplore.ieee.org
This paper investigates the development of a molecular dynamics code that is highly
portable between architectures. Using OpenCL, we develop an implementation of Sandia's …

Fine-grain parallelism using multi-core, Cell/BE, and GPU systems

F Pratas, P Trancoso, L Sousa, A Stamatakis, G Shi… - Parallel Computing, 2012 - Elsevier
Currently, we are facing a situation where applications exhibit increasing computational
demands and where a large variety of parallel processor systems are available. In this paper …