GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... Proc. Interspeech 2021, 2021 | 191 | 2021 |
speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment J Zhang, Z Zhang, Y Wang, Z Yan, Q Song, Y Huang, K Li, D Povey, ... Proc. Interspeech 2021, 2021 | 75 | 2021 |
Attention-based End-to-end Speech Recognition On Voice Search C Shan, J Zhang, Y Wang, L Xie Proc. Interspeech 2018, 4764-4768, 2018 | 73 | 2018 |
Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition K Wang, J Zhang, S Sun, Y Wang, F Xiang, L Xie Proc. Interspeech 2018, 1581-1585, 2018 | 50 | 2018 |
Data Augmentation For Children's Speech Recognition--The "Ethiopian" System For The SLT 2021 Children Speech Recognition Challenge G Chen, X Na, Y Wang, Z Yan, J Zhang, S Ma, Y Wang SLT 2021 Children Speech Recognition Challenge, 2021 | 22 | 2021 |
AV-SepFormer: Cross-Attention SepFormer for Audio-Visual Target Speaker Extraction J Lin, X Cai, H Dinkel, J Chen, Z Yan, Y Wang, J Zhang, Z Wu, Y Wang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 14 | 2023 |
A Computer-assist Algorithm To Detect Repetitive Stuttering Automatically J Zhang, B Dong, Y Yan 2013 International Conference on Asian Language Processing, 249-252, 2013 | 12 | 2013 |
The Smallrice Submission To The DCASE2021 Task 4 Challenge: A Lightweight Approach For Semi-supervised Sound Event Detection With Unsupervised Data Augmentation H Dinkel, X Cai, Z Yan, Y Wang, J Zhang, Y Wang Tech. Rep., DCASE2021 Challenge, 2021 | 11 | 2021 |
Sequence-to-sequence Models For Small-footprint Keyword Spotting H Zhang, J Zhang, Y Wang arXiv preprint arXiv:1811.00348, 2018 | 10 | 2018 |
CED: Consistent ensemble distillation for audio tagging H Dinkel, Y Wang, Z Yan, J Zhang, Y Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
Pseudo Strong Labels For Large Scale Weakly Supervised Audio Tagging H Dinkel, Z Yan, Y Wang, J Zhang, Y Wang 2022 IEEE International Conference on Acoustics, Speech and Signal …, 2022 | 8 | 2022 |
A Lightweight Approach For Semi-supervised Sound Event Detection With Unsupervised Data Augmentation H Dinkel, X Cai, Z Yan, Y Wang, J Zhang, Y Wang Tech. Rep., DCASE2021 Challenge, 15-19, 2021 | 7 | 2021 |
Attention-Based End-to-End Speech Recognition in Mandarin C Shan, J Zhang, Y Wang, L Xie 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 7 | 2018 |
A Novel Discriminative Method For Pronunciation Quality Assessment J Zhang, F Pan, B Dong, Y Yan 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 7 | 2013 |
A Large Multi-modal Ensemble For Sound Event Detection H Dinkel, Z Yan, Y Wang, M Song, J Zhang, W Wang Detection and Classification of Acoustic Scenes and Events (DCASE) Challenge, 2022 | 6 | 2022 |
Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model K Wang, J Zhang, Y Wang, L Xie Proc. Interspeech 2018, 2429-2433, 2018 | 6 | 2018 |
Focus on the sound around you: Monaural target speaker extraction via distance and speaker information J Lin, P Wang, H Dinkel, J Chen, Z Wu, Z Yan, Y Wang, J Zhang, Y Wang arXiv preprint arXiv:2306.16241, 2023 | 5 | 2023 |
Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers H Dinkel, Y Wang, Z Yan, J Zhang, Y Wang 2023 IEEE International Conference on Acoustics, Speech and Signal …, 2023 | 3 | 2023 |
UniKW-AT: Unified Keyword Spotting and Audio Tagging H Dinkel, Y Wang, Z Yan, J Zhang, Y Wang Proc. Interspeech 2022, 2022 | 3 | 2022 |
End-to-end Models with auditory attention in Multi-channel Keyword Spotting H Zhang, J Zhang, Y Wang arXiv preprint arXiv:1811.00350, 2018 | 3 | 2018 |