A survey of parallel programming models and tools in the multi and many-core era
In this work, we present a survey of the different parallel programming models and tools
available today with special consideration to their suitability for high-performance …
Engineering efficient and effective non-metric space library
L Boytsov, B Naidan - Similarity Search and Applications: 6th International …, 2013 - Springer
We present a new similarity search library and discuss a variety of design and performance
issues related to its development. We adopt a position that engineering is equally important …
ReaDDy-a software for particle-based reaction-diffusion dynamics in crowded cellular environments
J Schöneberg, F Noé - PloS one, 2013 - journals.plos.org
We introduce the software package ReaDDy for simulation of detailed spatiotemporal
mechanisms of dynamical processes in the cell, based on reaction-diffusion dynamics with …
Java in the high performance computing arena: Research, practice and experience
The rising interest in Java for High Performance Computing (HPC) is based on the
appealing features of this language for programming multi-core cluster architectures …
Design and implementation of Java bindings in Open MPI
O Vega-Gisbert, JE Roman, JM Squyres - Parallel Computing, 2016 - Elsevier
This paper describes the Java MPI bindings that have been included in the Open MPI
distribution. Open MPI is one of the most popular implementations of MPI, the Message …
Evaluating ARM HPC clusters for scientific workloads
The power consumption of modern high‐performance computing (HPC) systems that are
built using power hungry commodity servers is one of the major hurdles for achieving …
Butterfly mixing: Accelerating incremental-update algorithms on clusters
Incremental model-update strategies are widely used in machine learning and data mining.
By “incremental update” we refer to models that are updated many times using small subsets …
Next-generation bioinformatics: using many-core processor architecture to develop a web service for sequence alignment
S Gálvez, D Díaz, P Hernández, FJ Esteban… - …, 2010 - academic.oup.com
Motivation: Bioinformatics algorithms and computing power are the main bottlenecks for
analyzing huge amount of data generated by the current technologies, such as the 'next …
Parallelizing and optimizing a bioinformatics pairwise sequence alignment algorithm for many-core architecture
D Díaz, FJ Esteban, P Hernández, JA Caballero… - Parallel Computing, 2011 - Elsevier
Current computer engineering evolves at an accelerated pace, with hardware advancing
towards new chip multiprocessors (CMP) architectures and with supporting software gearing …
[BOOK][B] High performance machine learning through codesign and rooflining
H Zhao - 2014 - search.proquest.com
Abstract: Machine learning (ML) is a cornerstone of the new data revolution. Most attempts to
scale machine learning to massive datasets focus on parallelization on computer clusters …