CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020 | 303 | 2020 |
Probing the information encoded in x-vectors D Raj, D Snyder, D Povey, S Khudanpur 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 113 | 2019 |
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ... 2021 IEEE spoken language technology workshop (SLT), 897-904, 2021 | 85 | 2021 |
Dover-lap: A method for combining overlap-aware diarization outputs D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021 | 69 | 2021 |
Learning local and global contexts using a convolutional recurrent network model for relation classification in biomedical text D Raj, S Sahu, A Anand Proceedings of the 21st conference on computational natural language …, 2017 | 48 | 2017 |
Sequential multi-frame neural beamforming for speech separation and enhancement ZQ Wang, H Erdogan, S Wisdom, K Wilson, D Raj, S Watanabe, Z Chen, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 905-911, 2021 | 47 | 2021 |
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ... arXiv preprint arXiv:2102.01363, 2021 | 37 | 2021 |
Multi-class spectral clustering with overlaps for speaker diarization D Raj, Z Huang, S Khudanpur 2021 IEEE Spoken Language Technology Workshop (SLT), 582-589, 2021 | 33 | 2021 |
Target-speaker voice activity detection with improved i-vector estimation for unknown number of speaker M He, D Raj, Z Huang, J Du, Z Chen, S Watanabe arXiv preprint arXiv:2108.03342, 2021 | 30 | 2021 |
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ... arXiv preprint arXiv:2306.13734, 2023 | 29 | 2023 |
GPU-accelerated guided source separation for meeting transcription D Raj, D Povey, S Khudanpur arXiv preprint arXiv:2212.05271, 2022 | 28 | 2022 |
Using ASR methods for OCR A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ... 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019 | 23 | 2019 |
Uncertain fuzzy self-organization based clustering: interval type-2 approach to adaptive resonance theory S Majheed, A Gupta, D Raj, FCH Rhee Information Sciences, 2017 | 21* | 2017 |
Continuous streaming multi-talker asr with dual-path transducers D Raj, L Lu, Z Chen, Y Gaur, J Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 18 | 2022 |
The JHU multi-microphone multi-speaker ASR system for the CHiME-6 challenge A Arora, D Raj, AS Subramanian, K Li, B Ben-Yair, M Maciejewski, ... arXiv preprint arXiv:2006.07898, 2020 | 16 | 2020 |
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings Z Huang, D Raj, P García, S Khudanpur ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 14 | 2023 |
Analysis of Data Generated from Multidimensional Type-1 and Type-2 Fuzzy Membership Functions D Raj, A Gupta, B Garg, K Tanna, FCH Rhee IEEE Transactions on Fuzzy Systems, 0 | 12* | |
Low-latency speech separation guided diarization for telephone conversations G Morrone, S Cornell, D Raj, L Serafini, E Zovato, A Brutti, S Squartini 2022 IEEE Spoken Language Technology Workshop (SLT), 641-646, 2023 | 8 | 2023 |
Injecting text and cross-lingual supervision in few-shot learning from self-supervised models M Wiesner, D Raj, S Khudanpur ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 8 | 2022 |
Joint speaker diarization and speech recognition based on region proposal networks Z Huang, M Delcroix, LP Garcia, S Watanabe, D Raj, S Khudanpur Computer Speech & Language 72, 101316, 2022 | 6 | 2022 |