Graph processing on GPUs: A survey
In the big data era, much real-world data can be naturally represented as graphs.
Consequently, many application domains can be modeled as graph processing. Graph …
Toward real-time ray tracing: A survey on hardware acceleration and microarchitecture techniques
Ray tracing has long been considered as the next-generation technology for graphics
rendering. Recently, there has been strong momentum to adopt ray tracing-based …
Optimizing N-dimensional, Winograd-based convolution for manycore CPUs
Recent work on Winograd-based convolution allows for a great reduction of computational
complexity, but existing implementations are limited to 2D data and a single kernel size of 3 …
Mesh color textures
C Yuksel - Proceedings of High Performance Graphics, 2017 - dl.acm.org
The fundamental limitations of texture mapping have been a long-standing problem in
computer graphics. The cost of defining and maintaining texture coordinates and the seams …
Cimple: instruction and memory level parallelism: a DSL for uncovering ILP and MLP
V Kiriansky, H Xu, M Rinard… - Proceedings of the 27th …, 2018 - dl.acm.org
Modern out-of-order processors have increased capacity to exploit instruction level
parallelism (ILP) and memory level parallelism (MLP), e.g., by using wide superscalar …
Many-task computing on many-core architectures
Many-Task Computing (MTC) is a common scenario for multiple parallel systems,
such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory …
GPUpd: a fast and scalable multi-GPU architecture using cooperative projection and distribution
Graphics Processing Unit (GPU) vendors have been scaling single-GPU architectures to
satisfy the ever-increasing user demands for faster graphics processing. However, as it gets …
Data Lakes: Building Flexible Architectures for Big Data Storage
N Dulam - Distributed Learning and Broad Applications in …, 2015 - dlabi.org
Data lakes are emerging as a powerful solution for managing big data's growing volume,
variety, and velocity. Unlike traditional data storage systems, data lakes provide a flexible …
Coherent path tracing
Packet tracing is a popular and efficient method for accelerating ray tracing. However,
packet traversal techniques become inefficient when they are applied to path tracing since …
Effective static bin patterns for sort-middle rendering
To effectively utilize an ever increasing number of processors during parallel rendering,
hardware and software designers rely on sophisticated load balancing strategies. While …