Deep spoken keyword spotting: An overview
Spoken keyword spotting (KWS) deals with the identification of keywords in audio streams
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …
and has become a fast-growing technology thanks to the paradigm shift introduced by deep …
Adversarial music: Real world audio adversary against wake-word detection system
Abstract Voice Assistants (VAs) such as Amazon Alexa or Google Assistant rely on wake-
word detection to respond to people's commands, which could potentially be vulnerable to …
word detection to respond to people's commands, which could potentially be vulnerable to …
Unacceptable, where is my privacy? exploring accidental triggers of smart speakers
Voice assistants like Amazon's Alexa, Google's Assistant, or Apple's Siri, have become the
primary (voice) interface in smart speakers that can be found in millions of households. For …
primary (voice) interface in smart speakers that can be found in millions of households. For …
Monophone-based background modeling for two-stage on-device wake word detection
Accurate on-device wake word detection is crucial to products with far-field voice control
such as the Amazon Echo. It is quite challenging to build a wake word system with both low …
such as the Amazon Echo. It is quite challenging to build a wake word system with both low …
End-to-end streaming keyword spotting
We present a system for keyword spotting that, except for a front-end component for feature
generation, it is entirely contained in a deep neural network (DNN) model trained" end-to …
generation, it is entirely contained in a deep neural network (DNN) model trained" end-to …
Query-by-example on-device keyword spotting
A keyword spotting (KWS) system determines the existence of, usually predefined, keyword
in a continuous speech stream. This paper presents a query-by-example on-device KWS …
in a continuous speech stream. This paper presents a query-by-example on-device KWS …
Open-vocabulary keyword spotting with audio and text embeddings
Keyword Spotting (KWS) systems allow detecting a set of spoken (pre-defined) keywords.
Open-vocabulary KWS systems search for the keywords in the set of word hypotheses …
Open-vocabulary KWS systems search for the keywords in the set of word hypotheses …
[PDF][PDF] GAN-Based Data Generation for Speech Emotion Recognition.
In this work, we propose a GAN-based method to generate synthetic data for speech
emotion recognition. Specifically, we investigate the usage of GANs for capturing the data …
emotion recognition. Specifically, we investigate the usage of GANs for capturing the data …
Data augmentation for robust keyword spotting under playback interference
Accurate on-device keyword spotting (KWS) with low false accept and false reject rate is
crucial to customer experience for far-field voice control of conversational agents. It is …
crucial to customer experience for far-field voice control of conversational agents. It is …
Towards data-efficient modeling for wake word spotting
Y Gao, Y Mishchenko, A Shah… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Wake word (WW) spotting is challenging in far-field not only because of the interference in
signal transmission but also the complexity in acoustic environment. Traditional WW model …
signal transmission but also the complexity in acoustic environment. Traditional WW model …