English conversational telephone speech recognition by humans and machines G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ... arXiv preprint arXiv:1703.02136, 2017 | 472 | 2017 |
Applying machine learning to facilitate autism diagnostics: pitfalls and promises D Bone, MS Goodwin, MP Black, CC Lee, K Audhkhasi, S Narayanan Journal of autism and developmental disorders 45, 1121-1136, 2015 | 284 | 2015 |
Direct acoustics-to-word models for english conversational speech recognition K Audhkhasi, B Ramabhadran, G Saon, M Picheny, D Nahamoo arXiv preprint arXiv:1703.07754, 2017 | 166 | 2017 |
Avlnet: Learning audio-visual language representations from instructional videos A Rouditchenko, A Boggust, D Harwath, B Chen, D Joshi, S Thomas, ... arXiv preprint arXiv:2006.09199, 2020 | 141 | 2020 |
Building competitive direct acoustics-to-word models for english conversational speech recognition K Audhkhasi, B Kingsbury, B Ramabhadran, G Saon, M Picheny 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 140 | 2018 |
End-to-End ASR-free Keyword Search from Speech K Audhkhasi, A Rosenberg, A Sethy, B Ramabhadran, B Kingsbury arXiv preprint arXiv:1701.04313, 2017 | 133 | 2017 |
Noise-enhanced convolutional neural networks K Audhkhasi, O Osoba, B Kosko Neural Networks 78, 15-23, 2016 | 122 | 2016 |
Multilingual representations for low resource speech recognition and keyword search J Cui, B Kingsbury, B Ramabhadran, A Sethy, K Audhkhasi, Z Tüske, ... Proc. ASRU, 2015 | 114 | 2015 |
Joint modeling of accents and acoustics for multi-accent speech recognition X Yang, K Audhkhasi, A Rosenberg, S Thomas, B Ramabhadran, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 86 | 2018 |
Invariant representations for noisy speech recognition D Serdyuk, K Audhkhasi, P Brakel, B Ramabhadran, S Thomas, Y Bengio arXiv preprint arXiv:1612.01928, 2016 | 81 | 2016 |
Formant-based technique for automatic filled-pause detection in spontaneous spoken English K Audhkhasi, K Kandhway, OD Deshmukh, A Verma 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 80 | 2009 |
Single headed attention based sequence-to-sequence model for state-of-the-art results on switchboard Z Tüske, G Saon, K Audhkhasi, B Kingsbury arXiv preprint arXiv:2001.07263, 2020 | 78 | 2020 |
Which ASR should I choose for my dialogue system? F Morbini, K Audhkhasi, K Sagae, R Artstein, D Can, P Georgiou, ... SIGDIAL 2013, 2013 | 75 | 2013 |
Knowledge distillation across ensembles of multilingual models for low-resource languages J Cui, B Kingsbury, B Ramabhadran, G Saon, T Sercu, K Audhkhasi, ... 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 74 | 2017 |
End-to-end speech recognition and keyword search on low-resource languages A Rosenberg, K Audhkhasi, A Sethy, B Ramabhadran, M Picheny 2017 ieee international conference on acoustics, speech and signal …, 2017 | 71 | 2017 |
Leveraging unpaired text data for training end-to-end speech-to-intent systems Y Huang, HK Kuo, S Thomas, Z Kons, K Audhkhasi, B Kingsbury, R Hoory, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 66 | 2020 |
Noise-enhanced convolutional neural networks K Audhkhasi, B Kosko, O Osoba US Patent 11,256,982, 2022 | 57 | 2022 |
Alignment-length synchronous decoding for RNN transducer G Saon, Z Tüske, K Audhkhasi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 57 | 2020 |
Guiding CTC posterior spike timings for improved posterior fusion and knowledge distillation G Kurata, K Audhkhasi arXiv preprint arXiv:1904.08311, 2019 | 55 | 2019 |
External word embedding neural network language models K Audhkhasi, B Ramabhadran, A Sethy US Patent 10,019,438, 2018 | 54 | 2018 |