GRAM: A framework for dynamically mixing precisions in GPU applications

NJ Higham, T Mary - Acta Numerica, 2022 - cambridge.org

Today's floating-point arithmetic landscape is broader than ever. While scientific computing
has traditionally used single precision and double precision floating-point arithmetics, half …

Cited by 121 · All 17 versions

Toward accelerated stencil computation by adapting tensor core unit on GPU

X Liu, Y Liu, H Yang, J Liao, M Li, Z Luan… - Proceedings of the 36th …, 2022 - dl.acm.org

The Tensor Core Unit (TCU) has been increasingly adopted on modern high performance
processors, specialized in boosting the performance of general matrix multiplication …

Cited by 21 · All 2 versions

Predicting Performance and Accuracy of Mixed-Precision Programs for Precision Tuning

Y Wang, C Rubio-González - Proceedings of the 46th IEEE/ACM …, 2024 - dl.acm.org

A mixed-precision program is a floating-point program that utilizes different precisions for
different operations, providing the opportunity of balancing the trade-off between accuracy …

Cited by 5 · All 5 versions

Simulation of Quantum Computers: Review and Acceleration Opportunities

A Cicero, MA Maleki, MW Azhar, AF Kockum… - arXiv preprint arXiv …, 2024 - arxiv.org

Quantum computing has the potential to revolutionize multiple fields by solving complex
problems that can not be solved in reasonable time with current classical computers …

Neural network precision tuning using stochastic arithmetic

Q Ferro, S Graillat, T Hilaire, F Jézéquel… - … Workshop on Numerical …, 2022 - Springer

Neural networks can be costly in terms of memory and execution time. Reducing their cost
has become an objective, especially when integrated in an embedded system with limited …

Cited by 6 · All 13 versions

BinFPE: accurate floating-point exception detection for GPU applications

I Laguna, X Li, G Gopalakrishnan - … Workshop on the State Of the Art in …, 2022 - dl.acm.org

When modern heterogeneous HPC systems perform numerical computations, floating-point
exceptional quantities such as NaN and infinity in the GPU context, remain insufficiently …

Cited by 7 · All 3 versions

FPChecker: Floating-point exception detection tool and benchmark for parallel and distributed HPC

I Laguna, T Tirpankar, X Li… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org

Floating-point arithmetic is fundamental to many areas including high-performance
computing and machine learning. In order to ensure the numerical integrity of the overall …

Cited by 5 · All 2 versions

Towards a SYCL API for Approximate Computing

L Carpentieri, B Cosenza - … of the 2023 International Workshop on …, 2023 - dl.acm.org

Approximate computing is a well-known method [7] to achieve higher performance or lower
energy consumption while accepting a loss of output accuracy. Many applications such as …

Cited by 4 · All 2 versions

Convergence-aware operator-wise mixed-precision training

W Dai, Z Jia, Y Bai, Q Sun - CCF Transactions on High Performance …, 2024 - Springer

With the support of more precision formats in emerging hardware architectures, mixed-
precision has become a popular approach to accelerate deep learning (DL) training …

MixPert: Optimizing Mixed-Precision Floating-Point Emulation on GPU Integer Tensor Cores

Z Lin, A Sun, X Zhang, Y Lu - Proceedings of the 25th ACM SIGPLAN …, 2024 - dl.acm.org

Featuring mixed-precision tensor operations, accelerators significantly enhance
performance for many error-tolerant computing tasks, but their applicability is limited in …