MIMD Programs Execution Support on SIMD Machines: A Holistic Survey

D Mustafa, R Alkhasawneh, F Obeidat… - IEEE Access, 2024 - ieeexplore.ieee.org
The Single Instruction Multiple Data (SIMD) architecture, supported by various high-
performance computing platforms, efficiently utilizes data-level parallelism. The SIMD model …

Performance prediction of parallel applications: a systematic literature review

J Flores-Contreras, HA Duran-Limon… - The Journal of …, 2021 - Springer
Different techniques for estimating the execution time of parallel applications have been
studied for the last 25 years. These approaches have proposed different methods for …

Overview of SIMD auto-vectorization compiler optimization

高伟, 赵荣彩, 韩林, 庞建民, 丁锐 - 软件学报 (Journal of Software), 2015 - jos.org.cn
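SIMD extension units are accelerators integrated into general-purpose processors, designed to
exploit the data-level parallelism of multimedia and scientific computing programs. The paper
first introduces the background and current state of research on SIMD extension units, and
then, from the aspects of exploitation methods, data layout …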

Adaptive energy minimization of OpenMP parallel applications on many-core systems

RA Shafik, A Das, S Yang, G Merrett… - Proceedings of the 6th …, 2015 - dl.acm.org
Energy minimization of parallel applications is an emerging challenge for current and future
generations of many-core computing systems. In this paper, we propose a novel and …

Efficient voltage regulation for microprocessor cores stacked in vertical voltage domains

C Schaef, JT Stauth - IEEE Transactions on Power Electronics, 2015 - ieeexplore.ieee.org
Due to exponential (Moore's law) scaling of advanced CMOS technologies, the challenges
associated with delivering power to performance and mobile computing systems are …

Parallel pairwise epistasis detection on heterogeneous computing architectures

J González-Domínguez, S Ramos… - … on Parallel and …, 2015 - ieeexplore.ieee.org
Development of new methods to detect pairwise epistasis, such as SNP-SNP interactions, in
Genome-Wide Association Studies is an important task in bioinformatics as they can help to …

CAP Bench: a benchmark suite for performance and energy evaluation of low‐power many‐core processors

MA Souza, PH Penna, MM Queiroz… - Concurrency and …, 2017 - Wiley Online Library
The constant need for faster and more energy‐efficient processors has been stimulating the
development of new architectures, such as low‐power many‐core architectures …

Landing sites detection using LiDAR data on manycore systems

OG Lorenzo, J Martínez, DL Vilariño, TF Pena… - The Journal of …, 2017 - Springer
Helicopters are widely used in emergency situations, where knowing if a geographical
location is adequate for landing is a critical issue, and it is far from being a straightforward …

Performance characterization of parallel discrete event simulation on Knights Landing processor

B Williams, D Ponomarev, N Abu-Ghazaleh… - Proceedings of the 2017 …, 2017 - dl.acm.org
Performance and scalability of Parallel Discrete Event Simulation (PDES) is often limited by
fine-grain communication, especially in execution environments with high communication …

Controlled asynchronous GVT: accelerating parallel discrete event simulation on many-core clusters

A Eker, B Williams, K Chiu, D Ponomarev - Proceedings of the 48th …, 2019 - dl.acm.org
In this paper, we investigate the performance of Parallel Discrete Event Simulation (PDES)
on a cluster of many-core Intel KNL processors. Specifically, we analyze the impact of …