Title | Authors | Venue | Cited by | Year |
Zipformer: A faster and better encoder for automatic speech recognition | Z Yao, L Guo, X Yang, W Kang, F Kuang, Y Yang, Z Jin, L Lin, D Povey | The Twelfth International Conference on Learning Representations, 2023 | 33 | 2023 |
Knowledge distillation for neural transducers from large self-supervised pre-trained models | X Yang, Q Li, PC Woodland | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 18 | 2022 |
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context | W Kang, X Yang, Z Yao, F Kuang, Y Yang, L Guo, L Lin, D Povey | ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 12 | 2024 |
Fast and parallel decoding for transducer | W Kang, L Guo, F Kuang, L Lin, M Luo, Z Yao, X Yang, P Żelasko, ... | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 10 | 2023 |
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition | X Yang, Q Li, C Zhang, PC Woodland | arXiv preprint arXiv:2303.10917, 2023 | 6 | 2023 |
PromptASR for contextualized ASR with controllable style | X Yang, W Kang, Z Yao, Y Yang, L Guo, F Kuang, L Lin, D Povey | ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Predicting multi-codebook vector quantization indexes for knowledge distillation | L Guo, X Yang, Q Wang, Y Kong, Z Yao, F Cui, F Kuang, W Kang, L Lin, ... | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Blank-regularized CTC for frame skipping in neural transducer | Y Yang, X Yang, L Guo, Z Yao, W Kang, F Kuang, L Lin, X Chen, D Povey | arXiv preprint arXiv:2305.11558, 2023 | 4 | 2023 |
Delay-penalized transducer for low-latency streaming ASR | W Kang, Z Yao, F Kuang, L Guo, X Yang, L Lin, P Żelasko, D Povey | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Delay-penalized CTC implemented based on Finite State Transducer | Z Yao, W Kang, F Kuang, L Guo, X Yang, Y Yang, L Lin, D Povey | arXiv preprint arXiv:2305.11539, 2023 | 1 | 2023 |
SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM | Q Wang, Y Yuan, X Yang, R Zhang, K Zhao, W Liu, J Luan, D Povey, ... | arXiv preprint arXiv:2406.06571, 2024 | | 2024 |
Knowledge Distillation for End-to-End Automatic Speech Recognition | X Yang | | | |