Transformer transducer: A streamable speech recognition model with transformer encoders and RNN-T loss Q Zhang, H Lu, H Sak, A Tripathi, E McDermott, S Koo, S Kumar ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 504 | 2020 |
Toward domain-invariant speech recognition via large scale training A Narayanan, A Misra, KC Sim, G Pundak, A Tripathi, M Elfeky, P Haghani, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 441-447, 2018 | 115 | 2018 |
Temporal modeling using dilated convolution and gating for voice-activity-detection SY Chang, B Li, G Simko, TN Sainath, A Tripathi, A van den Oord, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 82 | 2018 |
Speech recognition for medical conversations CC Chiu, A Tripathi, K Chou, C Co, N Jaitly, D Jaunzeikare, A Kannan, ... arXiv preprint arXiv:1711.07274, 2017 | 68 | 2017 |
Turn-to-diarize: Online speaker diarization constrained by transformer transducer speaker turn detection W Xia, H Lu, Q Wang, A Tripathi, Y Huang, IL Moreno, H Sak ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 59 | 2022 |
Monotonic recurrent neural network transducer and decoding strategies A Tripathi, H Lu, H Sak, H Soltau 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 49 | 2019 |
Transformer transducer: One model unifying streaming and non-streaming speech recognition A Tripathi, J Kim, Q Zhang, H Lu, H Sak arXiv preprint arXiv:2010.03192, 2020 | 46 | 2020 |
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition. KC Sim, A Narayanan, A Misra, A Tripathi, G Pundak, TN Sainath, ... Interspeech, 892-896, 2018 | 44 | 2018 |
End-to-end multi-talker overlapping speech recognition A Tripathi, H Lu, H Sak ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 40 | 2020 |
Multilingual Speech Recognition with Self-Attention Structured Parameterization. Y Zhu, P Haghani, A Tripathi, B Ramabhadran, B Farris, H Xu, H Lu, ... INTERSPEECH, 4741-4745, 2020 | 29 | 2020 |
Reducing streaming ASR model delay with self alignment J Kim, H Lu, A Tripathi, Q Zhang, H Sak arXiv preprint arXiv:2105.05005, 2021 | 21 | 2021 |
Gender prediction of Indian names A Tripathi, M Faruqui IEEE Technology Students' Symposium, 137-141, 2011 | 15 | 2011 |
Contrastive siamese network for semi-supervised speech recognition S Khorram, J Kim, A Tripathi, H Lu, Q Zhang, H Sak ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 12 | 2022 |
Transformer transducer: one model unifying streaming and non-streaming speech recognition A Tripathi, H Sak, H Lu, Q Zhang, JY Kim US Patent 11,741,947, 2023 | 5 | 2023 |
End-to-End Audio-Visual Speech Recognition for Overlapping Speech. R Rose, O Siohan, A Tripathi, O Braga Interspeech, 3016-3020, 2021 | 5 | 2021 |
End-to-end multi-talker overlapping speech recognition A Tripathi, H Lu, H Sak US Patent 11,521,595, 2022 | 4 | 2022 |
Space optimized multicast in delay tolerant networks A Tripathi International Journal of Computing and Network Technology 1 (02), 2013 | 4 | 2013 |
Speaker-Turn-Based Online Speaker Diarization with Constrained Spectral Clustering Q Wang, H Lu, E Clark, IL Moreno, H Sak, W Xia, T Joglekar, A Tripathi US Patent App. 17/644,261, 2023 | 2 | 2023 |
Cross-training: A semi-supervised training scheme for speech recognition S Khorram, A Tripathi, J Kim, H Lu, Q Zhang, R Prabhavalkar, H Sak ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
End-To-End Multi-Talker Overlapping Speech Recognition A Tripathi, H Liu, H Sak US Patent App. 18/055,553, 2023 | | 2023 |