Cloud computing landscape and research challenges regarding trust and reputation
Cloud Computing is an emerging computing paradigm. It shares massively scalable, elastic
resources (eg, data, calculations, and services) transparently among the users over a …
resources (eg, data, calculations, and services) transparently among the users over a …
Scheduling techniques for GPU architectures with processing-in-memory capabilities
Processing data in or near memory (PIM), as opposed to in conventional computational units
in a processor, can greatly alleviate the performance and energy penalties of data transfers …
in a processor, can greatly alleviate the performance and energy penalties of data transfers …
GPGPU performance and power estimation using machine learning
G Wu, JL Greathouse, A Lyashevsky… - 2015 IEEE 21st …, 2015 - ieeexplore.ieee.org
Graphics Processing Units (GPUs) have numerous configuration and design options,
including core frequency, number of parallel compute units (CUs), and available memory …
including core frequency, number of parallel compute units (CUs), and available memory …
OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance
Emerging GPGPU architectures, along with programming models like CUDA and OpenCL,
offer a cost-effective platform for many applications by providing high thread level …
offer a cost-effective platform for many applications by providing high thread level …
Improving GPGPU resource utilization through alternative thread block scheduling
High performance in GPGPU workloads is obtained by maximizing parallelism and fully
utilizing the available resources. The thousands of threads are assigned to each core in …
utilizing the available resources. The thousands of threads are assigned to each core in …
Orchestrated scheduling and prefetching for GPGPUs
In this paper, we present techniques that coordinate the thread scheduling and prefetching
decisions in a General Purpose Graphics Processing Unit (GPGPU) architecture to better …
decisions in a General Purpose Graphics Processing Unit (GPGPU) architecture to better …
Divergence-aware warp scheduling
This paper uses hardware thread scheduling to improve the performance and energy
efficiency of divergent applications on GPUs. We propose Divergence-Aware Warp …
efficiency of divergent applications on GPUs. We propose Divergence-Aware Warp …
Coordinated static and dynamic cache bypassing for GPUs
The massive parallel architecture enables graphics processing units (GPUs) to boost
performance for a wide range of applications. Initially, GPUs only employ scratchpad …
performance for a wide range of applications. Initially, GPUs only employ scratchpad …
Managing GPU concurrency in heterogeneous architectures
Heterogeneous architectures consisting of general-purpose CPUs and throughput-
optimized GPUs are projected to be the dominant computing platforms for many classes of …
optimized GPUs are projected to be the dominant computing platforms for many classes of …
Warped-slicer: Efficient intra-SM slicing through dynamic resource partitioning for GPU multiprogramming
As technology scales, GPUs are forecasted to incorporate an ever-increasing amount of
computing resources to support thread-level parallelism. But even with the best effort …
computing resources to support thread-level parallelism. But even with the best effort …