Exploring the encoding layer and loss function in end-to-end speaker and language recognition system W Cai, J Chen, M Li arXiv preprint arXiv:1804.05160, 2018 | 407 | 2018 |
On-the-fly data loader and utterance-level aggregation for speaker and language recognition W Cai, J Chen, J Zhang, M Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1038-1051, 2020 | 92 | 2020 |
Analysis of length normalization in end-to-end speaker verification system W Cai, J Chen, M Li arXiv preprint arXiv:1806.03209, 2018 | 45 | 2018 |
End-to-end language identification using NetFV and NetVLAD J Chen, W Cai, D Cai, Z Cai, H Zhong, M Li 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018 | 15 | 2018 |
Bring dialogue-context into RNN-T for streaming ASR J Hou, J Chen, W Li, Y Tang, J Zhang, Z Ma Interspeech 2022, 2048-2052, 2022 | 7 | 2022 |
HMM-Free Encoder Pre-Training for Streaming RNN Transducer L Huang, J Sun, Y Tang, J Hou, J Chen, J Zhang, Z Ma Interspeech 2021, 2021 | 3 | 2021 |
The Sogou-TIIC speech translation system for IWSLT 2018 Y Wang, L Shi, L Wei, W Zhu, J Chen, Z Wang, S Wen, W Chen, Y Wang, ... Proceedings of the 15th International Conference on Spoken Language …, 2018 | 1 | 2018 |