Neural model reprogramming with similarity based mapping for low-resource spoken command recognition

H Yen, PJ Ku, CHH Yang, H Hu, SM Siniscalchi… - arXiv preprint arXiv …, 2021 - arxiv.org
In this study, we propose a novel adversarial reprogramming (AR) approach for low-
resource spoken command recognition (SCR), and build an AR-SCR system. The AR …

CRNN-CTC based mandarin keywords spotting

H Yan, Q He, W Xie - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Deep learning based approaches have greatly improved the performance of spoken
keyword spotting (KWS). However, KWS of different languages should have their own …

Voice activation systems for embedded devices: Systematic literature review

A Kolesau, D Šešok - Informatica, 2020 - content.iospress.com
A large number of modern mobile devices, embedded devices and smart home devices are
equipped with voice control. Automatic recognition of the entire audio stream, however, is …

Unsupervised pre-training for voice activation

A Kolesau, D Šešok - Applied Sciences, 2020 - mdpi.com
Featured Application: The proposed way to use unsupervised pre-training in voice activation
could be beneficial in cases of limited data resources, e.g., in low-resource domains or for …

Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition

H Yen, PJ Ku, SM Siniscalchi, CH Lee - arXiv preprint arXiv:2406.02488, 2024 - arxiv.org
We propose a novel language-universal approach to end-to-end automatic spoken keyword
recognition (SKR) leveraging upon (i) a self-supervised pre-trained model, and (ii) a set of …

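Speech keyword detection method based on feature-space trajectory information

田颖慧, 贺前华, 郑若伟, 危卓, 李艳雄 - 电子学报 (Acta Electronica Sinica), 2023 - ejournal.org.cn
The mainstream technology for speech keyword detection today is deep learning, which requires
large-scale labeled samples for training and is difficult to apply to the more common low-resource
scenarios. This paper proposes a low-resource speech keyword …, based on audio feature-space trajectory information, …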

Voice activation for low-resource languages

A Kolesau, D Šešok - Applied Sciences, 2021 - mdpi.com
Voice activation systems are used to find a pre-defined word or phrase in the audio stream.
Industry solutions, such as “OK, Google” for Android devices, are trained with millions of …

Improving the effectiveness of voice activation systems with machine learning methods

A Kolesau, D Šešok - 2022 - etalpykla.vilniustech.lt
Modern devices are more frequently equipped with voice control. Using speech to operate is
a natural and efficient approach. However, incorporating voice control poses the following …

Fast speech keyword recognition based on improved filler model

Y Wang, J Yang, L Zhang - 2017 IEEE 2nd Advanced …, 2017 - ieeexplore.ieee.org
Most traditional template-matching-based keyword recognition methods do not need training
data and rely only on frame matching. However, the recognition speed is relatively slow and it …