查看文章

OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance

作者

Adwait Jog, Onur Kayiran, Nachiappan Chidambaram Nachiappan, Asit K Mishra, Mahmut T Kandemir, Onur Mutlu, Ravishankar Iyer, Chita R Das

发表日期

2013/3/16

期刊

ACM SIGPLAN Notices

卷号

期号

页码范围

395-406

出版商

ACM

简介

Emerging GPGPU architectures, along with programming models like CUDA and OpenCL, offer a cost-effective platform for many applications by providing high thread level parallelism at lower energy budgets. Unfortunately, for many general-purpose applications, available hardware resources of a GPGPU are not efficiently utilized, leading to lost opportunity in improving performance. A major cause of this is the inefficiency of current warp scheduling policies in tolerating long memory latencies.

In this paper, we identify that the scheduling decisions made by such policies are agnostic to thread-block, or cooperative thread array (CTA), behavior, and as a result inefficient. We present a coordinated CTA-aware scheduling policy that utilizes four schemes to minimize the impact of long memory latencies. The first two schemes, CTA-aware two-level warp scheduling and locality aware warp scheduling, enhance per …

引用总数

被引用次数：369

20122013201420152016201720182019202020212022202320242 14 37 53 51 46 49 26 22 23 14 14 3

学术搜索中的文章

OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance

A Jog, O Kayiran, N Chidambaram Nachiappan… - ACM SIGPLAN Notices, 2013

被引用次数：369 相关文章所有 21 个版本