Neural inverse text normalization M Sunkara, C Shivade, S Bodapati, K Kirchhoff ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 37 | 2021 |
Multimodal semi-supervised learning framework for punctuation prediction in conversational speech M Sunkara, S Ronanki, D Bekal, S Bodapati, K Kirchhoff arXiv preprint arXiv:2008.00702, 2020 | 33 | 2020 |
Robust prediction of punctuation and truecasing for medical ASR M Sunkara, S Ronanki, K Dixit, S Bodapati, K Kirchhoff arXiv preprint arXiv:2007.02025, 2020 | 33 | 2020 |
Best of both worlds: Robust accented speech recognition with adversarial transfer learning N Das, S Bodapati, M Sunkara, S Srinivasan, DH Chau arXiv preprint arXiv:2103.05834, 2021 | 27 | 2021 |
Personalization of CTC speech recognition models S Dingliwal, M Sunkara, S Ronanki, J Farris, K Kirchhoff, S Bodapati 2022 IEEE Spoken Language Technology Workshop (SLT), 302-309, 2023 | 25 | 2023 |
Listen, know and spell: Knowledge-infused subword modeling for improving ASR performance of OOV named entities N Das, M Sunkara, D Bekal, DH Chau, S Bodapati, K Kirchhoff ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 20 | 2022 |
Adapting long context NLM for ASR rescoring in conversational agents A Shenoy, S Bodapati, M Sunkara, S Ronanki, K Kirchhoff arXiv preprint arXiv:2104.11070, 2021 | 18 | 2021 |
Remember the context! ASR slot error correction through memorization D Bekal, A Shenoy, M Sunkara, S Bodapati, K Kirchhoff 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 11 | 2021 |
Adaptation approaches for nearest neighbor language models R Bhardwaj, G Polovets, M Sunkara arXiv preprint arXiv:2211.07828, 2022 | 6 | 2022 |
“What’s The Context?”: Long Context NLM Adaptation for ASR Rescoring in Conversational Agents A Shenoy, S Bodapati, M Sunkara, S Ronanki, K Kirchhoff CoRR, abs/2104.11070, 2021 | 2 | 2021 |
Mask the bias: Improving domain-adaptive generalization of CTC-based ASR with internal language model estimation N Das, M Sunkara, S Bodapati, J Cai, D Kulshreshtha, J Farris, K Kirchhoff ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
CERET: Cost-effective extrinsic refinement for text generation J Cai, H Su, M Sunkara, I Shalyminov, S Mansour arXiv preprint arXiv:2406.05588, 2024 | | 2024 |
SpeechVerse: A Large-scale Generalizable Audio Language Model N Das, S Dingliwal, S Ronanki, R Paturi, D Huang, P Mathur, J Yuan, ... arXiv preprint arXiv:2405.08295, 2024 | | 2024 |
Masked Audio Text Encoders are Effective Multi-Modal Rescorers J Cai, M Sunkara, X Li, A Bhatia, X Pan, S Bodapati arXiv preprint arXiv:2305.07677, 2023 | | 2023 |
Multimodal based punctuation and/or casing prediction ML Sunkara, S Ronanki, DB Kannangola, SB Bodapati, K Kirchhoff US Patent 11,580,965, 2023 | | 2023 |
Masked audio text encoders are effective few-shot rescorers J Cai, M Sunkara, X Li, A Bhatia, X Pan, S Bodapati | | 2023 |
Towards Personalization of CTC Speech Recognition Models with Contextual Adapters and Adaptive Boosting S Dingliwal, M Sunkara, S Bodapati, S Ronanki, J Farris, K Kirchhoff arXiv preprint arXiv:2210.09510, 2022 | | 2022 |