Deep spoken keyword spotting: An overview

I López-Espejo, ZH Tan, JHL Hansen, J Jensen - IEEE Access, 2021 - ieeexplore.ieee.org
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …

Training keyword spotting models on non-iid data with federated learning

A Hard, K Partridge, C Nguyen, N Subrahmanya… - arXiv preprint arXiv …, 2020 - arxiv.org
We demonstrate that a production-quality keyword-spotting model can be trained on-device
using federated learning and achieve comparable false accept and false reject rates to a …

Wake word detection with streaming transformers

Y Wang, H Lv, D Povey, L Xie… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Modern wake word detection systems usually rely on neural networks for acoustic modeling.
Transformers have recently shown superior performance over LSTM and convolutional …

Speed-robust keyword spotting via soft self-attention on multi-scale features

C Ding, J Li, M Zong, B Li - 2022 IEEE Spoken Language …, 2023 - ieeexplore.ieee.org
In this work, we focus on the robustness of keyword spotting (KWS) at various speech
speeds. First, to enable small-footprint KWS, we graft a depthwise separable convolution …

WeKWS: A production first small-footprint end-to-end keyword spotting toolkit

J Wang, M Xu, J Hou, B Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Keyword spotting (KWS) enables speech-based user interaction and gradually becomes an
indispensable component of smart devices. Recently, end-to-end (E2E) methods have be …

Wake word detection with alignment-free lattice-free MMI

Y Wang, H Lv, D Povey, L Xie, S Khudanpur - arXiv preprint arXiv …, 2020 - arxiv.org
Always-on spoken language interfaces, e.g. personal digital assistants, rely on a wake word
to start processing spoken input. We present novel methods to train a hybrid DNN/HMM …

Tase: Task-aware speech enhancement for wake-up word detection in voice assistants

G Cámbara, F López, D Bonet, P Gómez, C Segura… - Applied Sciences, 2022 - mdpi.com
Wake-up word spotting in noisy environments is a critical task for an excellent user
experience with voice assistants. Unwanted activation of the device is often due to the …

VE-KWS: Visual modality enhanced end-to-end keyword spotting

A Zhang, H Wang, P Guo, Y Fu, L Xie… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
The performance of the keyword spotting (KWS) system based on audio modality, commonly
measured in false alarms and false rejects, degrades significantly under the far field and …

Two-stage streaming keyword detection and localization with multi-scale depthwise temporal convolution

J Hou, L Xie, S Zhang - Neural Networks, 2022 - Elsevier
A keyword spotting (KWS) system running on smart devices should accurately detect the
appearances and predict the locations of predefined keywords from audio streams, with …

Speech enhancement for wake-up-word detection in voice assistants

D Bonet, G Cámbara, F López, P Gómez… - arXiv preprint arXiv …, 2021 - arxiv.org
Keyword spotting and in particular Wake-Up-Word (WUW) detection is a very important task
for voice assistants. A very common issue of voice assistants is that they get easily activated …