Personalized speech recognition on mobile devices

C Zhang, P Patras, H Haddadi - IEEE Communications surveys …, 2019 - ieeexplore.ieee.org

The rapid uptake of mobile devices and the rising popularity of mobile applications and
services pose unprecedented demands on mobile and wireless networking infrastructure …

被引用次数：1818 相关文章所有 8 个版本

[PDF] google.com

Deep learning on mobile and embedded devices: State-of-the-art, challenges, and future directions

Y Chen, B Zheng, Z Zhang, Q Wang, C Shen… - ACM Computing …, 2020 - dl.acm.org

Recent years have witnessed an exponential increase in the use of mobile and embedded
devices. With the great success of deep learning in many fields, there is an emerging trend …

被引用次数：133 相关文章所有 4 个版本

[PDF] arxiv.org

Streaming end-to-end speech recognition for mobile devices

Y He, TN Sainath, R Prabhavalkar… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

End-to-end (E2E) models, which directly predict output character sequences given input
speech, are good candidates for on-device speech recognition. E2E models, however …

被引用次数：736 相关文章所有 9 个版本

[PDF] iczhiku.com

Hello edge: Keyword spotting on microcontrollers

Y Zhang, N Suda, L Lai, V Chandra - arXiv preprint arXiv:1711.07128, 2017 - arxiv.org

Keyword spotting (KWS) is a critical component for enabling speech based user interactions
on smart devices. It requires real-time response and high accuracy for good user …

被引用次数：528 相关文章所有 6 个版本

[PDF] arxiv.org

Large-scale visual speech recognition

B Shillingford, Y Assael, MW Hoffman, T Paine… - arXiv preprint arXiv …, 2018 - arxiv.org

This work presents a scalable solution to open-vocabulary visual speech recognition. To
achieve this, we constructed the largest existing visual speech recognition dataset …

被引用次数：199 相关文章所有 7 个版本

[PDF] isca-archive.org

[PDF][PDF] Shallow-Fusion End-to-End Contextual Biasing.

D Zhao, TN Sainath, D Rybach, P Rondon, D Bhatia… - Interspeech, 2019 - isca-archive.org

Contextual biasing to a specific domain, including a user's song names, app names and
contact names, is an important component of any production-level automatic speech …

被引用次数：168 相关文章所有 4 个版本

[PDF] arxiv.org

Two-pass end-to-end speech recognition

TN Sainath, R Pang, D Rybach, Y He… - arXiv preprint arXiv …, 2019 - arxiv.org

The requirements for many applications of state-of-the-art speech recognition systems
include not only low word error rate (WER) but also low latency. Specifically, for many use …

被引用次数：164 相关文章所有 11 个版本

[PDF] arxiv.org

Deep context: end-to-end contextual speech recognition

G Pundak, TN Sainath, R Prabhavalkar… - 2018 IEEE spoken …, 2018 - ieeexplore.ieee.org

In automatic speech recognition (ASR) what a user says depends on the particular context
she is in. Typically, this context is represented as a set of word n-grams. In this work, we …

被引用次数：202 相关文章所有 6 个版本

[PDF] arxiv.org

Federated evaluation and tuning for on-device personalization: System design & applications

M Paulik, M Seigel, H Mason, D Telaar… - arXiv preprint arXiv …, 2021 - arxiv.org

We describe the design of our federated task processing system. Originally, the system was
created to support two specific federated tasks: evaluation and tuning of on-device ML …

被引用次数：103 相关文章所有 5 个版本

[PDF] acm.org

Towards individuated reading experiences: Different fonts increase reading speed for different individuals

S Wallace, Z Bylinskii, J Dobres, B Kerr… - ACM Transactions on …, 2022 - dl.acm.org

In our age of ubiquitous digital displays, adults often read in short, opportunistic interludes.
In this context of Interlude Reading, we consider if manipulating font choice can improve …

被引用次数：43 相关文章所有 2 个版本