Speaker Adaptation in DNN-Based Speech Synthesis Using d-Vectors. R Doddipatla, N Braunschweiler, R Maia Interspeech, 3404-3408, 2017 | 61 | 2017 |
On End-to-end Multi-channel Time Domain Speech Separation in Reverberant Environments J Zhang, C Zorilă, R Doddipatla, J Barker ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 45 | 2020 |
An investigation into the effectiveness of enhancement in ASR training and test for CHiME-5 dinner party transcription C Zorilă, C Boeddeker, R Doddipatla, R Haeb-Umbach 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 47-53, 2019 | 43 | 2019 |
Monaural source separation: From anechoic to reverberant environments T Cord-Landwehr, C Boeddeker, T Von Neumann, C Zorilă, R Doddipatla, ... 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 1-5, 2022 | 28 | 2022 |
A computationally efficient approach to warp factor estimation in VTLN using EM algorithm and sufficient statistics. PT Akhil, SP Rath, S Umesh, DR Sanand Interspeech, 1713-1716, 2008 | 25 | 2008 |
Multi-pass sentence-end detection of lecture speech M Hasan, R Doddipatla, T Hain Fifteenth Annual Conference of the International Speech Communication …, 2014 | 24 | 2014 |
Factors in emotion recognition with deep learning models using speech and text on multiple corpora N Braunschweiler, R Doddipatla, S Keizer, S Stoyanchev IEEE Signal Processing Letters 29, 722-726, 2022 | 23 | 2022 |
Speaker Dependent Bottleneck Layer Training for Speaker Adaptation in Automatic Speech Recognition R Doddipatla, M Hasan, T Hain Fifteenth Annual Conference of the International Speech Communication …, 2014 | 22 | 2014 |
Head-synchronous decoding for transformer-based streaming ASR M Li, C Zorilă, R Doddipatla ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Transformer-based online speech recognition with decoder-end adaptive computation steps M Li, C Zorilă, R Doddipatla 2021 IEEE Spoken Language Technology Workshop (SLT), 1-7, 2021 | 21 | 2021 |
VTLN using analytically determined linear-transformation on conventional MFCC DR Sanand, S Umesh IEEE Transactions on Audio, Speech, and Language Processing 20 (5), 1573-1584, 2012 | 21 | 2012 |
Study of Jacobian compensation using linear transformation of conventional MFCC for VTLN. DR Sanand, S Umesh Interspeech, 1233-1236, 2008 | 21 | 2008 |
Learning noise invariant features through transfer learning for robust end-to-end speech recognition S Zhang, CT Do, R Doddipatla, S Renals ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 16 | 2020 |
Teacher-student MixIT for unsupervised and semi-supervised speech separation J Zhang, C Zorilă, R Doddipatla, J Barker arXiv preprint arXiv:2106.07843, 2021 | 15 | 2021 |
Time-domain speech extraction with spatial information and multi speaker conditioning mechanism J Zhang, C Zorilă, R Doddipatla, J Barker ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 14 | 2021 |
Noise-matched training of CRF based sentence end detection models. M Hasan, R Doddipatla, T Hain Interspeech, 349-353, 2015 | 14 | 2015 |
A study on cross-corpus speech emotion recognition and data augmentation N Braunschweiler, R Doddipatla, S Keizer, S Stoyanchev 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 24-30, 2021 | 13 | 2021 |
Linear transformation approach to VTLN using dynamic frequency warping. DR Sanand, DD Kumar, S Umesh Interspeech, 1138-1141, 2007 | 12 | 2007 |
Action state update approach to dialogue management S Stoyanchev, S Keizer, R Doddipatla ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 11 | 2021 |
The USFD spoken language translation system for IWSLT 2014 RWM Ng, M Doulaty, R Doddipatla, W Aziz, K Shah, O Saz, M Hasan, ... arXiv preprint arXiv:1509.03870, 2015 | 11 | 2015 |