[HTML][HTML] Applications and techniques for fast machine learning in science

AMC Deiana, N Tran, J Agar, M Blott… - Frontiers in big …, 2022 - frontiersin.org
In this community review report, we discuss applications and techniques for fast machine
learning (ML) in science—the concept of integrating powerful ML methods into the real-time …

Ctrl: A conditional transformer language model for controllable generation

NS Keskar, B McCann, LR Varshney, C Xiong… - arXiv preprint arXiv …, 2019 - arxiv.org
Large-scale language models show promising text generation capabilities, but users cannot
easily control particular aspects of the generated text. We release CTRL, a 1.63 billion …

Integer quantization for deep learning inference: Principles and empirical evaluation

H Wu, P Judd, X Zhang, M Isaev… - arXiv preprint arXiv …, 2020 - arxiv.org
Quantization techniques can reduce the size of Deep Neural Networks and improve
inference latency and throughput by taking advantage of high throughput integer …

Quartznet: Deep automatic speech recognition with 1d time-channel separable convolutions

S Kriman, S Beliaev, B Ginsburg… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
We propose a new end-to-end neural acoustic model for automatic speech recognition. The
model is composed of multiple blocks with residual connections between them. Each block …

A fractional gradient descent algorithm robust to the initial weights of multilayer perceptron

X Xie, YF Pu, J Wang - Neural Networks, 2023 - Elsevier
For multilayer perceptron (MLP), the initial weights will significantly influence its
performance. Based on the enhanced fractional derivative extend from convex optimization …

Jasper: An end-to-end convolutional neural acoustic model

J Li, V Lavrukhin, B Ginsburg, R Leary… - arXiv preprint arXiv …, 2019 - arxiv.org
In this paper, we report state-of-the-art results on LibriSpeech among end-to-end speech
recognition models without any external training data. Our model, Jasper, uses only 1D …

Real-time neural radiance caching for path tracing

T Müller, F Rousselle, J Novák, A Keller - arXiv preprint arXiv:2106.12372, 2021 - arxiv.org
We present a real-time neural radiance caching method for path-traced global illumination.
Our system is designed to handle fully dynamic scenes, and makes no assumptions about …

Performance vs. hardware requirements in state-of-the-art automatic speech recognition

AL Georgescu, A Pappalardo, H Cucu… - EURASIP Journal on Audio …, 2021 - Springer
The last decade brought significant advances in automatic speech recognition (ASR) thanks
to the evolution of deep learning methods. ASR systems evolved from pipeline-based …

Prostate158-An expert-annotated 3T MRI dataset and algorithm for prostate cancer detection

LC Adams, MR Makowski, G Engel, M Rattunde… - Computers in Biology …, 2022 - Elsevier
Background The development of deep learning (DL) models for prostate segmentation on
magnetic resonance imaging (MRI) depends on expert-annotated data and reliable …

Citrinet: Closing the gap between non-autoregressive and autoregressive end-to-end models for automatic speech recognition

S Majumdar, J Balam, O Hrinchuk, V Lavrukhin… - arXiv preprint arXiv …, 2021 - arxiv.org
We propose Citrinet-a new end-to-end convolutional Connectionist Temporal Classification
(CTC) based automatic speech recognition (ASR) model. Citrinet is deep residual neural …