Alternative pseudo-labeling for semi-supervised automatic speech recognition

H Zhu, D Gao, G Cheng, D Povey… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
When labeled data is insufficient, pseudo-labeling based semi-supervised learning can
significantly improve the performance of automatic speech recognition. However, pseudo …

Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek

G Paraskevopoulos, T Kouzelis… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Modern speech recognition systems exhibit rapid performance degradation under domain
shift. This issue is especially prevalent in data-scarce settings, such as low-resource …

Efficient verification of arbitrary entangled states with homogeneous local measurements

YC Liu, Y Li, J Shang, X Zhang - Advanced Quantum …, 2023 - Wiley Online Library
Quantum state verification (QSV) is the task of relying on local measurements only to verify
that a given quantum device does produce the desired target state. Up to now, certain types …

Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models

Y Hu, C Chen, CHH Yang, C Qin, PY Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose an unsupervised adaptation framework, Self-TAught Recognizer (STAR), which
leverages unlabeled data to enhance the robustness of automatic speech recognition (ASR) …

Partitioning Attention Weight: Mitigating Adverse Effect of Incorrect Pseudo-Labels for Self-Supervised ASR

JH Lee, JH Chang - IEEE/ACM Transactions on Audio, Speech …, 2023 - ieeexplore.ieee.org
The performance of automatic speech recognition (ASR) models has been significantly
improved owing to advances in deep learning and end-to-end approaches. However, these …

M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper

J Zhou, S Zhao, J He, H Wang, W Zeng, Y Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
State-of-the-art models like OpenAI's Whisper exhibit strong performance in multilingual
automatic speech recognition (ASR), but they still face challenges in accurately recognizing …

Unsupervised Online Continual Learning for Automatic Speech Recognition

SV Eeckt - arXiv preprint arXiv:2406.12503, 2024 - arxiv.org
Adapting Automatic Speech Recognition (ASR) models to new domains leads to
Catastrophic Forgetting (CF) of previously learned information. This paper addresses CF in …

Sequence distribution matching for unsupervised domain adaptation in ASR

Q Li, H Zhu, L Luo, G Cheng, P Zhang… - … on Chinese Spoken …, 2022 - ieeexplore.ieee.org
Unsupervised domain adaptation (UDA) aims to improve the cross-domain model
performance without labeled target domain data. Distribution matching is a widely used UDA …

[PDF][PDF] Μέθοδοι μηχανικής μάθηση βασισμένες στη γνωσιακή επιστήμη για μείωση διαστατικότητας και προσαρμογή μεταξύ πεδίων μοντέλων φωνής και γλώσσας σε …

Π Γεώργιος - 2024 - dspace.lib.ntua.gr
In the recent years, a dominant strategy has arised in machine learning, ie, scaling-up model
capacity and training data, with impressive results. However, the development of techniques …