Graph processing on GPUs: A survey

X Shi, Z Zheng, Y Zhou, H Jin, L He, B Liu… - ACM Computing Surveys …, 2018 - dl.acm.org
In the big data era, much real-world data can be naturally represented as graphs.
Consequently, many application domains can be modeled as graph processing. Graph …

Toward real-time ray tracing: A survey on hardware acceleration and microarchitecture techniques

Y Deng, Y Ni, Z Li, S Mu, W Zhang - ACM Computing Surveys (CSUR), 2017 - dl.acm.org
Ray tracing has long been considered as the next-generation technology for graphics
rendering. Recently, there has been strong momentum to adopt ray tracing--based …

Optimizing N-dimensional, winograd-based convolution for manycore CPUs

Z Jia, A Zlateski, F Durand, K Li - Proceedings of the 23rd ACM SIGPLAN …, 2018 - dl.acm.org
Recent work on Winograd-based convolution allows for a great reduction of computational
complexity, but existing implementations are limited to 2D data and a single kernel size of 3 …

Mesh color textures

C Yuksel - Proceedings of High Performance Graphics, 2017 - dl.acm.org
The fundamental limitations of texture mapping has been a long standing problem in
computer graphics. The cost of defining and maintaining texture coordinates and the seams …

Cimple: instruction and memory level parallelism: a DSL for uncovering ILP and MLP

V Kiriansky, H Xu, M Rinard… - Proceedings of the 27th …, 2018 - dl.acm.org
Modern out-of-order processors have increased capacity to exploit instruction level
parallelism (ILP) and memory level parallelism (MLP), eg, by using wide superscalar …

Many-task computing on many-core architectures

P Valero-Lara, P Nookala, FL Pelayo, J Jansson… - … Computing: Practice and …, 2016 - scpe.org
Abstract Many-Task Computing (MTC) is a common scenario for multiple parallel systems,
such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory …

GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution

Y Kim, JE Jo, H Jang, M Rhu, H Kim, J Kim - Proceedings of the 50th …, 2017 - dl.acm.org
Graphics Processing Unit (GPU) vendors have been scaling single-GPU architectures to
satisfy the ever-increasing user demands for faster graphics processing. However, as it gets …

Data Lakes: Building Flexible Architectures for Big Data Storage

N Dulam - Distributed Learning and Broad Applications in …, 2015 - dlabi.org
Data lakes are emerging as a powerful solution for managing big data's growing volume,
variety, and velocity. Unlike traditional data storage systems, data lakes provide a flexible …

Coherent path tracing

I Sadeghi, B Chen, HW Jensen - Journal of Graphics, GPU, and …, 2009 - Taylor & Francis
Packet tracing is a popular and efficient method for accelerating ray tracing. However,
packet traversal techniques become inefficient when they are applied to path tracing since …

Effective static bin patterns for sort-middle rendering

B Kerbl, M Kenzel, D Schmalstieg… - Proceedings of High …, 2017 - dl.acm.org
To effectively utilize an ever increasing number of processors during parallel rendering,
hardware and software designers rely on sophisticated load balancing strategies. While …