Add 2022: the first audio deep synthesis detection challenge J Yi, R Fu, J Tao, S Nie, H Ma, C Wang, T Wang, Z Tian, Y Bai, C Fan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 150 | 2022 |
Self-attention transducers for end-to-end speech recognition Z Tian, J Yi, J Tao, Y Bai, Z Wen arXiv preprint arXiv:1909.13037, 2019 | 77 | 2019 |
Synchronous transformers for end-to-end speech recognition Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 72 | 2020 |
Spike-triggered non-autoregressive transformer for end-to-end speech recognition Z Tian, J Yi, J Tao, Y Bai, S Zhang, Z Wen arXiv preprint arXiv:2005.07903, 2020 | 62 | 2020 |
Language-adversarial transfer learning for low-resource speech recognition J Yi, J Tao, Z Wen, Y Bai IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (3), 621-630, 2018 | 62 | 2018 |
Half-truth: A partially fake audio detection dataset J Yi, Y Bai, J Tao, H Ma, Z Tian, C Wang, T Wang, R Fu arXiv preprint arXiv:2104.03617, 2021 | 61 | 2021 |
Fast end-to-end speech recognition via non-autoregressive models and cross-modal knowledge transferring from BERT Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1897-1911, 2021 | 57 | 2021 |
Listen attentively, and spell once: Whole sentence generation via a non-autoregressive architecture for low-latency speech recognition Y Bai, J Yi, J Tao, Z Tian, Z Wen, S Zhang arXiv preprint arXiv:2005.04862, 2020 | 45 | 2020 |
Adversarial transfer learning for punctuation restoration J Yi, J Tao, Y Bai, Z Tian, C Fan arXiv preprint arXiv:2004.00248, 2020 | 43 | 2020 |
Learn spelling from teachers: Transferring knowledge from language models to sequence-to-sequence speech recognition Y Bai, J Yi, J Tao, Z Tian, Z Wen arXiv preprint arXiv:1907.06017, 2019 | 37 | 2019 |
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting. Y Bai, J Yi, J Tao, Z Wen, Z Tian, C Zhao, C Fan INTERSPEECH, 2190-2194, 2019 | 34 | 2019 |
Adversarial multilingual training for low-resource speech recognition J Yi, J Tao, Z Wen, Y Bai 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 31 | 2018 |
Continual learning for fake audio detection H Ma, J Yi, J Tao, Y Bai, Z Tian, C Wang arXiv preprint arXiv:2104.07286, 2021 | 30 | 2021 |
End-to-end keywords spotting based on connectionist temporal classification for mandarin Y Bai, J Yi, H Ni, Z Wen, B Liu, Y Li, J Tao 2016 10th international symposium on Chinese spoken language processing …, 2016 | 30 | 2016 |
Rnn-transducer with language bias for end-to-end mandarin-english code-switching speech recognition S Zhang, J Yi, Z Tian, J Tao, Y Bai 2021 12th international symposium on Chinese spoken language processing …, 2021 | 27 | 2021 |
Focal Loss for Punctuation Prediction. J Yi, J Tao, Z Tian, Y Bai, C Fan Interspeech, 721-725, 2020 | 23 | 2020 |
Deep imitator: Handwriting calligraphy imitation via deep attention networks B Zhao, J Tao, M Yang, Z Tian, C Fan, Y Bai Pattern Recognition 104, 107080, 2020 | 22 | 2020 |
Polyvoice: Language models for speech to speech translation Q Dong, Z Huang, Q Tian, C Xu, T Ko, Y Zhao, S Feng, T Li, K Wang, ... arXiv preprint arXiv:2306.02982, 2023 | 17 | 2023 |
Fsr: Accelerating the inference process of transducer-based models by applying fast-skip regularization Z Tian, J Yi, Y Bai, J Tao, S Zhang, Z Wen arXiv preprint arXiv:2104.02882, 2021 | 16 | 2021 |
Noise prior knowledge learning for speech enhancement via gated convolutional generative adversarial network C Fan, B Liu, J Tao, J Yi, Z Wen, Y Bai 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 15 | 2019 |