Outlier Reduction with Gated Attention for Improved Post-training Quantization in Large Sequence-to-sequence Speech Foundation Models
This paper explores improving post-training quantization (PTQ) applied after knowledge distillation in the Whisper speech foundation model family. We address the challenge of …
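The abstract is truncated here, so as background only, the following is a minimal sketch of symmetric per-tensor weight PTQ in NumPy, illustrating why weight outliers (the problem the gated-attention method in the title targets) hurt low-bit quantization. The `quantize_tensor`/`dequantize_tensor` helpers and the int8 setting are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def quantize_tensor(w: np.ndarray, num_bits: int = 8):
    """Symmetric per-tensor post-training quantization (illustrative)."""
    qmax = 2 ** (num_bits - 1) - 1          # e.g. 127 for int8
    scale = np.abs(w).max() / qmax          # one scale for the whole tensor
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize_tensor(q: np.ndarray, scale: float) -> np.ndarray:
    return q.astype(np.float32) * scale

# A single large outlier inflates the scale and crushes the resolution
# available to the remaining small weights -- the failure mode that
# outlier-reduction methods aim to mitigate before quantization.
w = np.concatenate([np.random.randn(1023) * 0.02, [8.0]]).astype(np.float32)
q, s = quantize_tensor(w)
print("max abs error:", np.abs(dequantize_tensor(q, s) - w).max())
```

With the 8.0 outlier present, the small weights collapse to zero after rounding; shrinking such outliers (here, via gated attention) lets the same bit budget represent the bulk of the distribution more faithfully.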
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
We propose a novel one-pass approach for jointly compressing and quantizing multiple ASR systems using an all-in-one neural model. A single compression cycle allows multiple …
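The snippet above also ends mid-sentence, but the core idea named in the title, one shared "all-in-one" model serving several compressed systems, can be sketched generically. Below is a minimal, assumption-laden illustration in the style of slimmable networks, where one weight matrix is sliced to emulate systems of different sizes after a single training cycle. The `SlimmableLinear` class and `width_frac` parameter are hypothetical stand-ins, not the paper's actual architecture:

```python
import torch
import torch.nn as nn

class SlimmableLinear(nn.Module):
    """One set of weights evaluated at several widths (illustrative of a
    weight-sharing 'all-in-one' model; not the paper's exact method)."""
    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.02)
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x: torch.Tensor, width_frac: float = 1.0) -> torch.Tensor:
        # Slice the shared weight to emulate a smaller, compressed system.
        out = int(self.weight.shape[0] * width_frac)
        return x @ self.weight[:out].t() + self.bias[:out]

layer = SlimmableLinear(256, 512)
x = torch.randn(4, 256)
# The same parameters serve several target sizes; no per-system retraining.
for frac in (1.0, 0.5, 0.25):
    print(frac, layer(x, frac).shape)
```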