Neural model reprogramming with similarity based mapping for low-resource spoken command recognition
In this study, we propose a novel adversarial reprogramming (AR) approach for low-
resource spoken command recognition (SCR), and build an AR-SCR system. The AR …
resource spoken command recognition (SCR), and build an AR-SCR system. The AR …
CRNN-CTC based mandarin keywords spotting
Deep learning based approaches have greatly improved the performance of spoken
keyword spotting (KWS). However, KWS of different languages should have their own …
keyword spotting (KWS). However, KWS of different languages should have their own …
Voice activation systems for embedded devices: Systematic literature review
A Kolesau, D Šešok - Informatica, 2020 - content.iospress.com
A large number of modern mobile devices, embedded devices and smart home devices are
equipped with a voice control. Automatic recognition of the entire audio stream, however, is …
equipped with a voice control. Automatic recognition of the entire audio stream, however, is …
Unsupervised pre-training for voice activation
A Kolesau, D Šešok - Applied Sciences, 2020 - mdpi.com
Featured Application The proposed way to use unsupervised pre-training in voice activation
could be beneficial in cases of limited data resources, eg, in low-resource domains or for …
could be beneficial in cases of limited data resources, eg, in low-resource domains or for …
Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
We propose a novel language-universal approach to end-to-end automatic spoken keyword
recognition (SKR) leveraging upon (i) a self-supervised pre-trained model, and (ii) a set of …
recognition (SKR) leveraging upon (i) a self-supervised pre-trained model, and (ii) a set of …
[PDF][PDF] 基于特征空间轨迹信息的语音关键词检测方法
田颖慧, 贺前华, 郑若伟, 危卓, 李艳雄 - 电子学报, 2023 - ejournal.org.cn
当前语音关键词检测的主流技术为深度学习, 需要大规模标注样本进行训练,
难以应用于更普遍的低资源场景. 本文提出一种基于音频特征空间轨迹信息的低资源语音关键词 …
难以应用于更普遍的低资源场景. 本文提出一种基于音频特征空间轨迹信息的低资源语音关键词 …
Voice activation for low-resource languages
A Kolesau, D Šešok - Applied Sciences, 2021 - mdpi.com
Voice activation systems are used to find a pre-defined word or phrase in the audio stream.
Industry solutions, such as “OK, Google” for Android devices, are trained with millions of …
Industry solutions, such as “OK, Google” for Android devices, are trained with millions of …
Improving the effectiveness of voice activation systems with machine learning methods
A Kolesau, D Šešok - 2022 - etalpykla.vilniustech.lt
Modern devices are more frequently equipped with voice control. Using speech to operate is
a natural and efficient approach. However, incorporating voice control poses the following …
a natural and efficient approach. However, incorporating voice control poses the following …
Fast speech keyword recognition based on improved filler model
Y Wang, J Yang, L Zhang - 2017 IEEE 2nd Advanced …, 2017 - ieeexplore.ieee.org
Most traditional template matching based keyword recognition methods don't need training
data, just rely on frame matching. However, the recognition speed is relatively slow and it …
data, just rely on frame matching. However, the recognition speed is relatively slow and it …