[HTML][HTML] A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

L Alzubaidi, J Bai, A Al-Sabaawi, J Santamaría… - Journal of Big Data, 2023 - Springer
Data scarcity is a major challenge when training deep learning (DL) models. DL demands a
large amount of data to achieve exceptional performance. Unfortunately, many applications …

Unsupervised domain adaptation for speech recognition via uncertainty driven self-training

S Khurana, N Moritz, T Hori… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
The performance of automatic speech recognition (ASR) systems typically degrades
significantly when the training and test data domains are mismatched. In this paper, we …

Momentum pseudo-labeling: Semi-supervised asr with continuously improving pseudo-labels

Y Higuchi, N Moritz, J Le Roux… - IEEE Journal of Selected …, 2022 - ieeexplore.ieee.org
End-to-end automatic speech recognition (ASR) has become a popular alternative to
traditional module-based systems, simplifying the model-building process with a single deep …

Modulation recognition using signal enhancement and multistage attention mechanism

S Lin, Y Zeng, Y Gong - IEEE Transactions on Wireless …, 2022 - ieeexplore.ieee.org
Robustness against noise is critical for modulation recognition (MR) approaches deployed
in real-world communication systems. In MR systems, a corrupted signal is normally …

Momentum pseudo-labeling for semi-supervised speech recognition

Y Higuchi, N Moritz, JL Roux, T Hori - arXiv preprint arXiv:2106.08922, 2021 - arxiv.org
Pseudo-labeling (PL) has been shown to be effective in semi-supervised automatic speech
recognition (ASR), where a base model is self-trained with pseudo-labels generated from …

Alternative pseudo-labeling for semi-supervised automatic speech recognition

H Zhu, D Gao, G Cheng, D Povey… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
When labeled data is insufficient, pseudo-labeling based semi-supervised learning can
significantly improve the performance of automatic speech recognition. However, pseudo …

CTC variations through new WFST topologies

A Laptev, S Majumdar, B Ginsburg - arXiv preprint arXiv:2110.03098, 2021 - arxiv.org
This paper presents novel Weighted Finite-State Transducer (WFST) topologies to
implement Connectionist Temporal Classification (CTC)-like algorithms for automatic …

Multi-channel attentive feature fusion for radio frequency fingerprinting

Y Zeng, Y Gong, J Liu, S Lin, Z Han… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Radio frequency (RF) fingerprinting is a promising device authentication technique for
securing the Internet of Things. It exploits the intrinsic and unique hardware impairments of …

A novel self-training approach for low-resource speech recognition

S Singh, F Hou, R Wang - arXiv preprint arXiv:2308.05269, 2023 - arxiv.org
In this paper, we propose a self-training approach for automatic speech recognition (ASR)
for low-resource settings. While self-training approaches have been extensively developed …

Bayes risk ctc: Controllable ctc alignment in sequence-to-sequence tasks

J Tian, B Yan, J Yu, C Weng, D Yu… - arXiv preprint arXiv …, 2022 - arxiv.org
Sequence-to-Sequence (seq2seq) tasks transcribe the input sequence to a target sequence.
The Connectionist Temporal Classification (CTC) criterion is widely used in multiple …