Deep learning in mobile and wireless networking: A survey

C Zhang, P Patras, H Haddadi - IEEE Communications surveys …, 2019 - ieeexplore.ieee.org
The rapid uptake of mobile devices and the rising popularity of mobile applications and
services pose unprecedented demands on mobile and wireless networking infrastructure …

Deep learning on mobile and embedded devices: State-of-the-art, challenges, and future directions

Y Chen, B Zheng, Z Zhang, Q Wang, C Shen… - ACM Computing …, 2020 - dl.acm.org
Recent years have witnessed an exponential increase in the use of mobile and embedded
devices. With the great success of deep learning in many fields, there is an emerging trend …

Streaming end-to-end speech recognition for mobile devices

Y He, TN Sainath, R Prabhavalkar… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
End-to-end (E2E) models, which directly predict output character sequences given input
speech, are good candidates for on-device speech recognition. E2E models, however …

Hello edge: Keyword spotting on microcontrollers

Y Zhang, N Suda, L Lai, V Chandra - arXiv preprint arXiv:1711.07128, 2017 - arxiv.org
Keyword spotting (KWS) is a critical component for enabling speech based user interactions
on smart devices. It requires real-time response and high accuracy for good user …

Large-scale visual speech recognition

B Shillingford, Y Assael, MW Hoffman, T Paine… - arXiv preprint arXiv …, 2018 - arxiv.org
This work presents a scalable solution to open-vocabulary visual speech recognition. To
achieve this, we constructed the largest existing visual speech recognition dataset …

[PDF][PDF] Shallow-Fusion End-to-End Contextual Biasing.

D Zhao, TN Sainath, D Rybach, P Rondon, D Bhatia… - Interspeech, 2019 - isca-archive.org
Contextual biasing to a specific domain, including a user's song names, app names and
contact names, is an important component of any production-level automatic speech …

Two-pass end-to-end speech recognition

TN Sainath, R Pang, D Rybach, Y He… - arXiv preprint arXiv …, 2019 - arxiv.org
The requirements for many applications of state-of-the-art speech recognition systems
include not only low word error rate (WER) but also low latency. Specifically, for many use …

Deep context: end-to-end contextual speech recognition

G Pundak, TN Sainath, R Prabhavalkar… - 2018 IEEE spoken …, 2018 - ieeexplore.ieee.org
In automatic speech recognition (ASR) what a user says depends on the particular context
she is in. Typically, this context is represented as a set of word n-grams. In this work, we …

Federated evaluation and tuning for on-device personalization: System design & applications

M Paulik, M Seigel, H Mason, D Telaar… - arXiv preprint arXiv …, 2021 - arxiv.org
We describe the design of our federated task processing system. Originally, the system was
created to support two specific federated tasks: evaluation and tuning of on-device ML …

Towards individuated reading experiences: Different fonts increase reading speed for different individuals

S Wallace, Z Bylinskii, J Dobres, B Kerr… - ACM Transactions on …, 2022 - dl.acm.org
In our age of ubiquitous digital displays, adults often read in short, opportunistic interludes.
In this context of Interlude Reading, we consider if manipulating font choice can improve …