Feature 3dgs: Supercharging 3d gaussian splatting to enable distilled feature fields
Abstract 3D scene representations have gained immense popularity in recent years.
Methods that use Neural Radiance fields are versatile for traditional tasks such as novel …
Methods that use Neural Radiance fields are versatile for traditional tasks such as novel …
Cloud computing landscape and research challenges regarding trust and reputation
Cloud Computing is an emerging computing paradigm. It shares massively scalable, elastic
resources (eg, data, calculations, and services) transparently among the users over a …
resources (eg, data, calculations, and services) transparently among the users over a …
Cusz: An efficient gpu-based error-bounded lossy compression framework for scientific data
Error-bounded lossy compression is a state-of-the-art data reduction technique for HPC
applications because it not only significantly reduces storage overhead but also can retain …
applications because it not only significantly reduces storage overhead but also can retain …
Baymax: Qos awareness and increased utilization for non-preemptive accelerators in warehouse scale computers
Modern warehouse-scale computers (WSCs) are being outfitted with accelerators to provide
the significant compute required by emerging intelligent personal assistant (IPA) workloads …
the significant compute required by emerging intelligent personal assistant (IPA) workloads …
Warped-slicer: Efficient intra-SM slicing through dynamic resource partitioning for GPU multiprogramming
As technology scales, GPUs are forecasted to incorporate an ever-increasing amount of
computing resources to support thread-level parallelism. But even with the best effort …
computing resources to support thread-level parallelism. But even with the best effort …
Flexible software profiling of gpu architectures
M Stephenson, SK Sastry Hari, Y Lee… - Proceedings of the …, 2015 - dl.acm.org
To aid application characterization and architecture design space exploration, researchers
and engineers have developed a wide range of tools for CPUs, including simulators …
and engineers have developed a wide range of tools for CPUs, including simulators …
[图书][B] Fundamentals of parallel multicore architecture
Y Solihin - 2015 - books.google.com
Although multicore is now a mainstream architecture, there are few textbooks that cover
parallel multicore architectures. Filling this gap, Fundamentals of Parallel Multicore …
parallel multicore architectures. Filling this gap, Fundamentals of Parallel Multicore …
Exploiting inter-warp heterogeneity to improve GPGPU performance
In a GPU, all threads within a warp execute the same instruction in lockstep. For a memory
instruction, this can lead to memory divergence: the memory requests for some threads are …
instruction, this can lead to memory divergence: the memory requests for some threads are …
Zorua: A holistic approach to resource virtualization in GPUs
This paper introduces a new resource virtualization framework, Zorua, that decouples the
programmer-specified resource usage of a GPU application from the actual allocation in the …
programmer-specified resource usage of a GPU application from the actual allocation in the …
Enabling coordinated register allocation and thread-level parallelism optimization for GPUs
The key to high performance on GPUs lies in the massive threading to enable thread
switching and hide the latency of function unit and memory access. However, running with …
switching and hide the latency of function unit and memory access. However, running with …