Smart home personal assistants: a security and privacy review

JS Edu, JM Such, G Suarez-Tangil - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Smart Home Personal Assistants (SPA) are an emerging innovation that is changing the
means by which home users interact with technology. However, several elements expose …

Privacy norms for smart home personal assistants

N Abdi, X Zhan, KM Ramokapane, J Such - Proceedings of the 2021 CHI …, 2021 - dl.acm.org
Smart Home Personal Assistants (SPA) have a complex ecosystem that enables them to
carry out various tasks on behalf of the user with just voice commands. SPA capabilities are …

Speech processing for digital home assistants: Combining signal processing with deep-learning techniques

R Haeb-Umbach, S Watanabe… - IEEE Signal …, 2019 - ieeexplore.ieee.org
Once a popular theme of futuristic science fiction or far-fetched technology forecasts, digital
home assistants with a spoken language interface have become a ubiquitous commodity …

Deep spiking neural networks for large vocabulary automatic speech recognition

J Wu, E Yılmaz, M Zhang, H Li, KC Tan - Frontiers in neuroscience, 2020 - frontiersin.org
Artificial neural networks (ANN) have become the mainstream acoustic modeling technique
for large vocabulary automatic speech recognition (ASR). A conventional ANN features a …

Adversarial music: Real world audio adversary against wake-word detection system

J Li, S Qu, X Li, J Szurley, JZ Kolter… - Advances in Neural …, 2019 - proceedings.neurips.cc
Abstract Voice Assistants (VAs) such as Amazon Alexa or Google Assistant rely on wake-
word detection to respond to people's commands, which could potentially be vulnerable to …

Unacceptable, where is my privacy? exploring accidental triggers of smart speakers

L Schönherr, M Golla, T Eisenhofer, J Wiele… - arXiv preprint arXiv …, 2020 - arxiv.org
Voice assistants like Amazon's Alexa, Google's Assistant, or Apple's Siri, have become the
primary (voice) interface in smart speakers that can be found in millions of households. For …

End-to-end streaming keyword spotting

R Alvarez, HJ Park - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org
We present a system for keyword spotting that, except for a front-end component for feature
generation, it is entirely contained in a deep neural network (DNN) model trained" end-to …

Multi-task learning for speaker verification and voice trigger detection

S Sigtia, E Marchi, S Kajarekar, D Naik… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Automatic speech transcription and speaker recognition are usually treated as separate
tasks even though they are interdependent. In this study, we investigate training a single …

Wake word detection with streaming transformers

Y Wang, H Lv, D Povey, L Xie… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Modern wake word detection systems usually rely on neural networks for acoustic modeling.
Transformers has recently shown superior performance over LSTM and convolutional …

Accurate detection of wake word start and end using a CNN

C Jose, Y Mishchenko, T Senechal, A Shah… - arXiv preprint arXiv …, 2020 - arxiv.org
Small footprint embedded devices require keyword spotters (KWS) with small model size
and detection latency for enabling voice assistants. Such a keyword is often referred to …