Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition Y Hu, N Hou, C Chen, ES Chng ICASSP 2022, 2022 | 39 | 2022 |
Noise-Robust Speech Recognition with 10 Minutes Unparalleled In-domain Data C Chen, N Hou, Y Hu, S Shirol, ES Chng ICASSP 2022, 2022 | 37 | 2022 |
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning QS Zhu, L Zhou, J Zhang, SJ Liu, YC Hu, LR Dai ICASSP 2023, 2023 | 23 | 2023 |
Interactive audio-text representation for automated audio captioning with contrastive learning C Chen, N Hou, Y Hu, H Zou, X Qi, ES Chng Interspeech 2022, 2022 | 20 | 2022 |
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning C Chen, Y Hu, Q Zhang, H Zou, B Zhu, ES Chng AAAI 2023, 2023 | 18 | 2023 |
Dual-path style learning for end-to-end noise-robust speech recognition Y Hu, N Hou, C Chen, ES Chng Interspeech 2023, 2023 | 17 | 2023 |
Self-Critical Sequence Training for Automatic Speech Recognition C Chen, Y Hu, N Hou, X Qi, H Zou, ES Chng ICASSP 2022, 2022 | 17 | 2022 |
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Y Hu, C Chen, R Li, Q Zhu, ES Chng ICASSP 2023, 2023 | 16 | 2023 |
Hyporadise: An open baseline for generative speech recognition with large language models C Chen*, Y Hu*, CHH Yang, SM Siniscalchi, PY Chen, ES Chng NeurIPS 2023, 2023 | 14 | 2023 |
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model C Chen, Y Hu, W Weng, ES Chng ICASSP 2023, 2023 | 14 | 2023 |
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation Y Hu, C Chen, H Zou, X Zhong, ES Chng ICASSP 2023, 2023 | 11 | 2023 |
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR Y Hu, C Chen, Q Zhu, ES Chng IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023 | 10 | 2023 |
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 D Liu, M Du, X Li, Y Hu, L Dai IWSLT 2021, 2021 | 9 | 2021 |
Unsupervised Noise Adaptation using Data Simulation C Chen, Y Hu, H Zou, L Sun, ES Chng ICASSP 2023, 2023 | 8 | 2023 |
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning H Zou, M Shen, C Chen, Y Hu, D Rajan, ES Chng ACL 2023, 2023 | 7 | 2023 |
A Neural State-Space Model Approach to Efficient Speech Separation C Chen, CHH Yang, K Li, Y Hu, PJ Ku, ES Chng Interspeech 2023, 2023 | 5 | 2023 |
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition Y Hu, R Li, C Chen, H Zou, Q Zhu, ES Chng IJCAI 2023, 2023 | 4 | 2023 |
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition Y Hu, C Chen, R Li, H Zou, ES Chng ACL 2023, 2023 | 4 | 2023 |
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition C Chen, R Li, Y Hu, SM Siniscalchi, PY Chen, E Chng, CHH Yang ICLR 2024, 2024 | 3 | 2024 |
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition Y Hu, C Chen, CHH Yang, R Li, C Zhang, PY Chen, ES Chng ICLR 2024, 2024 | 3 | 2024 |