A survey of parallel programming models and tools in the multi and many-core era

J Diaz, C Munoz-Caro, A Nino - IEEE Transactions on parallel …, 2012 - ieeexplore.ieee.org
In this work, we present a survey of the different parallel programming models and tools
available today with special consideration to their suitability for high-performance …

Engineering efficient and effective non-metric space library

L Boytsov, B Naidan - Similarity Search and Applications: 6th International …, 2013 - Springer
We present a new similarity search library and discuss a variety of design and performance
issues related to its development. We adopt a position that engineering is equally important …

ReaDDy-a software for particle-based reaction-diffusion dynamics in crowded cellular environments

J Schöneberg, F Noé - PloS one, 2013 - journals.plos.org
We introduce the software package ReaDDy for simulation of detailed spatiotemporal
mechanisms of dynamical processes in the cell, based on reaction-diffusion dynamics with …

Java in the high performance computing arena: Research, practice and experience

GL Taboada, S Ramos, RR Expósito, J Tourino… - Science of Computer …, 2013 - Elsevier
The rising interest in Java for High Performance Computing (HPC) is based on the
appealing features of this language for programming multi-core cluster architectures …

Design and implementation of Java bindings in Open MPI

O Vega-Gisbert, JE Roman, JM Squyres - Parallel Computing, 2016 - Elsevier
This paper describes the Java MPI bindings that have been included in the Open MPI
distribution. Open MPI is one of the most popular implementations of MPI, the Message …

Evaluating ARM HPC clusters for scientific workloads

J Maqbool, S Oh, GC Fox - Concurrency and Computation …, 2015 - Wiley Online Library
The power consumption of modern high‐performance computing (HPC) systems that are
built using power hungry commodity servers is one of the major hurdles for achieving …

Butterfly mixing: Accelerating incremental-update algorithms on clusters

H Zhao, J Canny - Proceedings of the 2013 SIAM International …, 2013 - SIAM
Incremental model-update strategies are widely used in machine learning and data mining.
By “incremental update” we refer to models that are updated many times using small subsets …

Next-generation bioinformatics: using many-core processor architecture to develop a web service for sequence alignment

S Gálvez, D Díaz, P Hernández, FJ Esteban… - …, 2010 - academic.oup.com
Motivation: Bioinformatics algorithms and computing power are the main bottlenecks for
analyzing huge amount of data generated by the current technologies, such as the 'next …

Parallelizing and optimizing a bioinformatics pairwise sequence alignment algorithm for many-core architecture

D Díaz, FJ Esteban, P Hernández, JA Caballero… - Parallel Computing, 2011 - Elsevier
Current computer engineering evolves at an accelerated pace, with hardware advancing
towards new chip multiprocessors (CMP) architectures and with supporting software gearing …

[BOOK][B] High performance machine learning through codesign and rooflining

H Zhao - 2014 - search.proquest.com
Machine learning (ML) is a cornerstone of the new data revolution. Most attempts to
scale machine learning to massive datasets focus on parallelization on computer clusters …