Watch, attend and parse: An end-to-end neural network based approach to handwritten mathematical expression recognition J Zhang, J Du, S Zhang, D Liu, Y Hu, J Hu, S Wei, L Dai Pattern Recognition 71, 196-206, 2017 | 229 | 2017 |
Deep-FSMN for large vocabulary continuous speech recognition S Zhang, M Lei, Z Yan, L Dai 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 126 | 2018 |
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models S Zhang, H Jiang, M Xu, J Hou, L Dai ACL2015, 495, 2015 | 99* | 2015 |
Feedforward sequential memory networks: A new structure to learn long-term dependency S Zhang, C Liu, H Jiang, S Wei, L Dai, Y Hu arXiv preprint arXiv:1512.08301, 2015 | 87 | 2015 |
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 67 | 2022 |
Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition. S Zhang, M Lei, Z Yan Interspeech, 2180-2184, 2019 | 56* | 2019 |
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge HB Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng ... ICASSP, 2022 | 48* | 2022 |
Simplified self-attention for transformer-based end-to-end speech recognition H Luo, S Zhang, M Lei, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 75-81, 2021 | 45 | 2021 |
MDERank: A masked document embedding rank approach for unsupervised keyphrase extraction L Zhang, Q Chen, W Wang, C Deng, S Zhang, B Li, W Wang, X Cao arXiv preprint arXiv:2110.06651, 2021 | 42 | 2021 |
Qwen-audio: Advancing universal audio understanding via unified large-scale audio-language models Y Chu, J Xu, X Zhou, Q Yang, S Zhang, Z Yan, C Zhou, J Zhou arXiv preprint arXiv:2311.07919, 2023 | 41 | 2023 |
Paraformer: Fast and accurate parallel transformer for non-autoregressive end-to-end speech recognition Z Gao, S Zhang, I McLoughlin, Z Yan arXiv preprint arXiv:2206.08317, 2022 | 41 | 2022 |
Robust audio-visual speech recognition using bimodal DFSMN with multi-condition training and dropout regularization S Zhang, M Lei, B Ma, L Xie ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019 | 40 | 2019 |
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition. J Hou, S Zhang, LR Dai Interspeech, 3692-3696, 2017 | 40 | 2017 |
Improving deep neural networks for LVCSR using dropout and shrinking structure S Zhang, Y Bao, P Zhou, H Jiang, L Dai 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 40 | 2014 |
Investigation of modeling units for mandarin speech recognition using dfsmn-ctc-smbr S Zhang, M Lei, Y Liu, W Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 36 | 2019 |
Prosospeech: Enhancing prosody with quantized vector pre-training in text-to-speech Y Ren, M Lei, Z Huang, S Zhang, Q Chen, Z Yan, Z Zhao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 31 | 2022 |
Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition. S Zhang, H Jiang, S Xiong, S Wei, LR Dai Interspeech, 3389-3393, 2016 | 31 | 2016 |
Streaming chunk-aware multihead attention for online end-to-end speech recognition S Zhang, Z Gao, H Luo, M Lei, J Gao, Z Yan, L Xie arXiv preprint arXiv:2006.01712, 2020 | 30 | 2020 |
San-m: Memory equipped self-attention for end-to-end speech recognition Z Gao, S Zhang, M Lei, I McLoughlin arXiv preprint arXiv:2006.01713, 2020 | 28 | 2020 |
Improving the microstructure and mechanical properties of Zr-Ti alloy by nickel addition C Liu, J Qin, Z Feng, S Zhang, M Ma, X Zhang, R Liu Journal of Alloys and Compounds 737, 405-411, 2018 | 25 | 2018 |