Large-scale ASR domain adaptation using self- and semi-supervised learning D Hwang, A Misra, Z Huo, N Siddhartha, S Garg, D Qiu, KC Sim, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 44 | 2022 |
Pseudo label is better than human label D Hwang, KC Sim, Z Huo, T Strohman INTERSPEECH 2022, 2022 | 29 | 2022 |
A Comparison of Supervised and Unsupervised Pre-Training of End-to-End Models A Misra, D Hwang, Z Huo, S Garg, N Siddhartha, A Narayanan, KC Sim Proc. Interspeech 2021, 731-735, 2021 | 21 | 2021 |
Efficient domain adaptation for speech foundation models B Li, D Hwang, Z Huo, J Bai, G Prakash, TN Sainath, KC Sim, Y Zhang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 14 | 2023 |
A unified cascaded encoder ASR model for dynamic model sizes S Ding, W Wang, D Zhao, TN Sainath, Y He, R David, R Botros, X Wang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
Incremental layer-wise self-supervised learning for efficient speech domain adaptation on device Z Huo, D Hwang, KC Sim, S Garg, A Misra, N Siddhartha, T Strohman, ... INTERSPEECH 2022, 2021 | 10 | 2021 |
Modular domain adaptation for conformer-based streaming ASR Q Li, B Li, D Hwang, TN Sainath, PM Mengibar INTERSPEECH 2023, 2023 | 8 | 2023 |
Comparison of soft and hard target RNN-T distillation for large-scale ASR D Hwang, KC Sim, Y Zhang, T Strohman ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
Batch rendering method for 2D vector graphics path using GPU D Hwang, K Chang-Hun US Patent App. 14/397,303, 2015 | 5 | 2015 |
Resource-efficient transfer learning from speech foundation model using hierarchical feature fusion Z Huo, KC Sim, B Li, D Hwang, TN Sainath, T Strohman ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models R Prabhavalkar, Z Meng, W Wang, A Stooke, X Cai, Y He, A Narayanan, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Efficient Cascaded Streaming ASR System via Frame Rate Reduction X Cai, D Qiu, S Ding, D Hwang, WWA Bruguier, R Prabhavalkar, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 2 | 2023 |
Massive End-to-end Models for Short Search Queries W Wang, R Prabhavalkar, D Hwang, Q Li, KC Sim, B Li, J Qin, X Cai, ... CoRR abs/2309.12963, 2023 | 2 | 2023 |
TransformerFAM: Feedback attention is working memory D Hwang, W Wang, Z Huo, KC Sim, PM Mengibar arXiv preprint arXiv:2404.09173, 2024 | 1 | 2024 |
Revisiting the Entropy Semiring for Neural Speech Recognition O Chang, D Hwang, O Siohan ICLR 2023, 2023 | 1 | 2023 |
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods Z Huo, KC Sim, D Hwang, T Munkhdalai, T Sainath, P Moreno INTERSPEECH 2023 | 1* | |
Massive End-to-end Speech Recognition Models with Time Reduction W Wang, R Prabhavalkar, H Shan, Z Meng, D Hwang, Q Li, KC Sim, B Li, ... Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 | | 2024 |
FAdam: Adam is a natural gradient optimizer using diagonal empirical Fisher information D Hwang arXiv preprint arXiv:2405.12807, 2024 | | 2024 |
Improving Speech Recognition for African American English with Audio Classification S Garg, Z Huo, KC Sim, S Schwartz, M Chua, A Aksënova, T Munkhdalai, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Unified Cascaded Encoder ASR model for Dynamic Model Sizes S Ding, Y He, X Wang, W Wang, T Strohman, TN Sainath, ... US Patent App. 18/182,925, 2023 | | 2023 |