关注
Wei Kang
Wei Kang
Senior engineer, Xiaomi Corp.
在 xiaomi.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Pruned RNN-T for fast, memory-efficient ASR training
F Kuang, L Guo, W Kang, L Lin, M Luo, Z Yao, D Povey
arXiv preprint arXiv:2206.13236, 2022
462022
Zipformer: A faster and better encoder for automatic speech recognition
Z Yao, L Guo, X Yang, W Kang, F Kuang, Y Yang, Z Jin, L Lin, D Povey
The Twelfth International Conference on Learning Representations, 2023
352023
Libriheavy: a 50,000 hours asr corpus with punctuation casing and context
W Kang, X Yang, Z Yao, F Kuang, Y Yang, L Guo, L Lin, D Povey
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
122024
Fast and parallel decoding for transducer
W Kang, L Guo, F Kuang, L Lin, M Luo, Z Yao, X Yang, P Żelasko, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
PromptASR for contextualized ASR with controllable style
X Yang, W Kang, Z Yao, Y Yang, L Guo, F Kuang, L Lin, D Povey
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Predicting multi-codebook vector quantization indexes for knowledge distillation
L Guo, X Yang, Q Wang, Y Kong, Z Yao, F Cui, F Kuang, W Kang, L Lin, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
42023
Blank-regularized ctc for frame skipping in neural transducer
Y Yang, X Yang, L Guo, Z Yao, W Kang, F Kuang, L Lin, X Chen, D Povey
arXiv preprint arXiv:2305.11558, 2023
42023
Delay-penalized transducer for low-latency streaming asr
W Kang, Z Yao, F Kuang, L Guo, X Yang, L Lin, P Żelasko, D Povey
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Method and apparatus for training neural network, and method and apparatus for audio processing
W Kang, P Daniel, F Kuang, GUO Liyong, YAO Zengwei, L Lin, ...
US Patent App. 18/080,713, 2023
12023
Delay-penalized CTC implemented based on Finite State Transducer
Z Yao, W Kang, F Kuang, L Guo, X Yang, Y Yang, L Lin, D Povey
arXiv preprint arXiv:2305.11539, 2023
12023
Method and apparatus for audio processing, electronic device and storage medium
LUO Mingshuang, F Kuang, GUO Liyong, L Lin, W Kang, YAO Zengwei, ...
US Patent App. 18/078,483, 2023
2023
Method of training speech recognition model, electronic device and storage medium
YAO Zengwei, GUO Liyong, P Daniel, L Lin, F Kuang, W Kang, ...
US Patent App. 18/078,460, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–12