关注
Yuchen Hu
标题
引用次数
引用次数
年份
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Y Hu, N Hou, C Chen, ES Chng
ICASSP 2022, 2022
392022
Noise-Robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
C Chen, N Hou, Y Hu, S Shirol, ES Chng
ICASSP 2022, 2022
372022
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
QS Zhu, L Zhou, J Zhang, SJ Liu, YC Hu, LR Dai
ICASSP 2023, 2023
232023
Interactive audio-text representation for automated audio captioning with contrastive learning
C Chen, N Hou, Y Hu, H Zou, X Qi, ES Chng
Interspeech 2022, 2022
202022
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
C Chen, Y Hu, Q Zhang, H Zou, B Zhu, ES Chng
AAAI 2023, 2023
182023
Dual-path style learning for end-to-end noise-robust speech recognition
Y Hu, N Hou, C Chen, ES Chng
Interspeech 2023, 2023
172023
Self-Critical Sequence Training for Automatic Speech Recognition
C Chen, Y Hu, N Hou, X Qi, H Zou, ES Chng
ICASSP 2022, 2022
172022
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Y Hu, C Chen, R Li, Q Zhu, ES Chng
ICASSP 2023, 2023
162023
Hyporadise: An open baseline for generative speech recognition with large language models
C Chen*, Y Hu*, CHH Yang, SM Siniscalchi, PY Chen, ES Chng
NeurIPS 2023, 2023
142023
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
C Chen, Y Hu, W Weng, ES Chng
ICASSP 2023, 2023
142023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation
Y Hu, C Chen, H Zou, X Zhong, ES Chng
ICASSP 2023, 2023
112023
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
Y Hu, C Chen, Q Zhu, ES Chng
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
102023
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
D Liu, M Du, X Li, Y Hu, L Dai
IWSLT 2021, 2021
92021
Unsupervised Noise Adaptation using Data Simulation
C Chen, Y Hu, H Zou, L Sun, ES Chng
ICASSP 2023, 2023
82023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
H Zou, M Shen, C Chen, Y Hu, D Rajan, ES Chng
ACL 2023, 2023
72023
A Neural State-Space Model Approach to Efficient Speech Separation
C Chen, CHH Yang, K Li, Y Hu, PJ Ku, ES Chng
Interspeech 2023, 2023
52023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
Y Hu, R Li, C Chen, H Zou, Q Zhu, ES Chng
IJCAI 2023, 2023
42023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
Y Hu, C Chen, R Li, H Zou, ES Chng
ACL 2023, 2023
42023
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
C Chen, R Li, Y Hu, SM Siniscalchi, PY Chen, E Chng, CHH Yang
ICLR 2024, 2024
32024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
Y Hu, C Chen, CHH Yang, R Li, C Zhang, PY Chen, ES Chng
ICLR 2024, 2024
32024
系统目前无法执行此操作,请稍后再试。
文章 1–20