关注
Linhao Dong
Linhao Dong
Bytedance AI-Lab
在 bytedance.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Speech-transformer: a no-recurrence sequence-to-sequence model for speech recognition
L Dong, S Xu, B Xu
2018 IEEE international conference on acoustics, speech and signal …, 2018
11842018
Syllable-based sequence-to-sequence speech recognition with the transformer in mandarin chinese
S Zhou, L Dong, S Xu, B Xu
arXiv preprint arXiv:1804.10752, 2018
1342018
Cif: Continuous integrate-and-fire for end-to-end speech recognition
L Dong, B Xu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1192020
Self-attention aligner: A latency-control end-to-end model for asr using self-attention network and chunk-hopping
L Dong, F Wang, B Xu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
952019
A comparison of modeling units in sequence-to-sequence speech recognition with the transformer on mandarin chinese
S Zhou, L Dong, S Xu, B Xu
International Conference on Neural Information Processing, 210-220, 2018
682018
Extending recurrent neural aligner for streaming end-to-end speech recognition in mandarin
L Dong, S Zhou, W Chen, B Xu
arXiv preprint arXiv:1806.06342, 2018
382018
Improving end-to-end contextual speech recognition with fine-grained contextual knowledge selection
M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
352022
Cif-based collaborative decoding for end-to-end contextual speech recognition
M Han, L Dong, S Zhou, B Xu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
212021
A comparison of label-synchronous and frame-synchronous end-to-end models for speech recognition
L Dong, C Yi, J Wang, S Zhou, S Xu, X Jia, B Xu
arXiv preprint arXiv:2005.10113, 2020
152020
Language-specific acoustic boundary learning for mandarin-english code-switching speech recognition
Z Fan, L Dong, C Shen, Z Liang, J Zhang, L Lu, Z Ma
arXiv preprint arXiv:2306.05279, 2023
52023
Sequence-level speaker change detection with difference-based continuous integrate-and-fire
Z Fan, L Dong, M Cai, Z Ma, B Xu
IEEE Signal Processing Letters 29, 1551-1554, 2022
52022
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring.
Y Zou, L Dong, B Xu
INTERSPEECH, 2055-2059, 2019
52019
Syllable-based acoustic modeling with CTC for multi-scenarios Mandarin speech recognition
Y Zhao, L Dong, S Xu, B Xu
2018 International Joint Conference on Neural Networks (IJCNN), 1-8, 2018
42018
Token-level speaker change detection using speaker difference and speech content via continuous integrate-and-fire
Z Fan, Z Liang, L Dong, Y Liu, S Zhou, M Cai, J Zhang, Z Ma, B Xu
arXiv preprint arXiv:2211.09381, 2022
22022
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Y Bai, J Chen, J Chen, W Chen, Z Chen, C Ding, L Dong, Q Dong, Y Du, ...
arXiv preprint arXiv:2407.04675, 2024
12024
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR
Z Fan, L Dong, J Zhang, L Lu, Z Ma
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
Cif-pt: Bridging speech and text representations for spoken language understanding via continuous integrate-and-fire pre-training
L Dong, Z An, P Wu, J Zhang, L Lu, Z Ma
arXiv preprint arXiv:2305.17499, 2023
12023
Method, apparatus, device, and storage medium for speaker change point detection
D Linhao, Z Fan, Z Ma
US Patent 12,039,981, 2024
2024
Voice recognition method and apparatus, medium, and electronic device
D Linhao, Z Ma
US Patent App. 18/288,531, 2024
2024
Method and device of generating acoustic features, speech model training, and speech recognition
D Linhao, Z Ma
US Patent App. 18/427,538, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20