E-Branchformer: Branchformer with Enhanced merging for speech recognition K Kim, F Wu, Y Peng, J Pan, P Sridhar, KJ Han, S Watanabe 2022 IEEE Spoken Language Technology Workshop (SLT), 84-91, 2023 | 67 | 2023 |
Multistream CNN for robust acoustic modeling KJ Han, J Pan, VKN Tadala, T Ma, D Povey ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 43 | 2021 |
Performance-efficiency trade-offs in unsupervised pre-training for speech recognition F Wu, K Kim, J Pan, KJ Han, KQ Weinberger, Y Artzi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 37 | 2022 |
ASAPP-ASR: Multistream CNN and self-attentive SRU for SOTA speech recognition J Pan, J Shapiro, J Wohlwend, KJ Han, T Lei, T Ma arXiv preprint arXiv:2005.10469, 2020 | 33 | 2020 |
Leveraging Pre-trained Language Model for Speech Sentiment Analysis S Shon, P Brusco, J Pan, KJ Han, S Watanabe arXiv preprint arXiv:2106.06598, 2021 | 15 | 2021 |
SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition J Pan, T Lei, K Kim, KJ Han, S Watanabe ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |
Audio-based Piano Performance Evaluation for Beginners with Convolutional Neural Network and Attention Mechanism W Wang, J Pan, H Yi, Z Song, M Li IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021 | 10 | 2021 |
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning J Pan, J Wu, Y Gaur, S Sivasankaran, Z Chen, S Liu, J Li arXiv preprint arXiv:2311.02248, 2023 | 8 | 2023 |
WavLLM: Towards Robust and Adaptive Speech Large Language Model S Hu, L Zhou, S Liu, S Chen, H Hao, J Pan, X Liu, J Li, S Sivasankaran, ... arXiv preprint arXiv:2404.00656, 2024 | 6 | 2024 |
An Audio Based Piano Performance Evaluation Method Using Deep Neural Network Based Acoustic Modeling. J Pan, M Li, Z Song, X Li, X Liu, H Yi, M Zhu INTERSPEECH, 3088-3092, 2017 | 5 | 2017 |
Speech sentiment analysis using a speech sentiment classifier pretrained with pseudo sentiment labels P Brusco, J Pan, KJ Han US Patent 11,521,639, 2022 | 3 | 2022 |
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach J Chen, J Xue, P Wang, J Pan, J Li 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 1 | 2023 |
Contextual feature vectors for processing speech F Wu, KIM Kwangyoun, J Pan, KJ Han, KQ Weinberger, Y Artzi US Patent App. 17/493,716, 2022 | | 2022 |