CPElide: Efficient Multi-Chiplet GPU Implicit Synchronization

P Dalmia, RS Kumar, MD Sinclair - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
Chiplets are transforming computer system designs, allowing system designers to combine
heterogeneous computing resources at unprecedented scales. Breaking larger, monolithic …

Parallel approaches for a decision tree-based explainability algorithm

D Loreti, G Visani - Future Generation Computer Systems, 2024 - Elsevier
While nowadays Machine Learning (ML) algorithms have achieved impressive
prediction accuracy in various fields, their ability to provide an explanation for the output …

Fine-grained or coarse-grained? Strategies for implementing parallel genetic algorithms in a programmable neuromorphic platform

S Furber, I Sugiarto - … , Computing, Electronics and …, 2021 - research.manchester.ac.uk
The genetic algorithm (GA) is one of the popular heuristic-based optimization methods that has attracted
engineers and scientists for many years. With the advancement of multi- and many-core …

Performance Optimization with an Integrated View of Compiler and Application Knowledge

R Tian - 2021 - search.proquest.com
Compiler optimization is a long-standing research field that enhances program performance
with a set of rigorous code analyses and transformations. Traditional compiler optimization …

Enhancing Parallel Scheduling of Grid Jobs in a Multicored Environment

GT Abraham, EF Osaisai, A Ineyekineye - researchgate.net
The computing Grid has emerged as a platform to solve the complex and ever-increasing
processing need of man and advances in computing technology have birthed the multicore …

Osaisai.“

GT Abraham, F Evans - Parallel Journal of Current Research - researchgate.net
Grid computing has continued to gain applicability in various spheres of comp computers
are also becoming ubiquitous. Most grid scheduling algorithms re several attempts at …

Analysis of High Level implementations for Recursive Methods on GPUs

C Cao, J Kalloor - people.eecs.berkeley.edu
Higher level DSLs have allowed for performant computation on GPUs while providing
enough abstraction to the user to avoid significant deployment overhead. However, the …

A Complete Bibliography of ACM Transactions on Parallel Computing (TOPC)

NHF Beebe - 2024 - ctan.math.utah.edu