Desh Raj 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	973	956
h 指数	15	15
i10 指数	17	17

260

130

195

201720182019202020212022202320243 12 20 78 178 242 258 179

开放获取的出版物数量

查看全部

2 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Sanjeev KhudanpurThe Johns Hopkins University在 jhu.edu 的电子邮件经过验证
Shinji WatanabeCarnegie Mellon University在 cmu.edu 的电子邮件经过验证
Daniel PoveyChief Speech Scientist, Xiaomi Corp.在 xiaomi.com 的电子邮件经过验证
Leibny Paola GarciaJohns Hopkins University在 jhu.edu 的电子邮件经过验证
Zili HuangJohns Hopkins University在 jhu.edu 的电子邮件经过验证
Jan "Yenda" TrmalAssociate Research Scientist at Johns Hopkins University在 jhu.edu 的电子邮件经过验证
Zhuo ChenBytedance (formerly Microsoft, Columbia University)在 columbia.edu 的电子邮件经过验证
David SnyderApple Inc.在 apple.com 的电子邮件经过验证
Takuya YoshiokaAssemblyAI在 assemblyai.com 的电子邮件经过验证
Naoyuki KandaMicrosoft在 microsoft.com 的电子邮件经过验证
Yusuke FujitaLY Corp.在 linecorp.com 的电子邮件经过验证
Shota HoriguchiNTT Corporation在 ntt.com 的电子邮件经过验证
Xuankai ChangCarnegie Mellon University, Student在 andrew.cmu.edu 的电子邮件经过验证
Vimal ManoharMeta Platforms Inc.在 meta.com 的电子邮件经过验证
Aswin Shanmugam SubramanianMicrosoft在 microsoft.com 的电子邮件经过验证
Christoph BoeddekerPaderborn University在 mail.upb.de 的电子邮件经过验证
Zhaoheng NiMeta Reality Labs在 meta.com 的电子邮件经过验证
Neville RyantUniversity of Pennsylvania在 ldc.upenn.edu 的电子邮件经过验证
John HersheyGoogle (formerly MERL, IBM, MSR, UCSD)在 google.com 的电子邮件经过验证
Hakan ErdoganGoogle在 google.com 的电子邮件经过验证

关注

Desh Raj

Meta AI

在 meta.com 的电子邮件经过验证 - 首页

Speech Recognition Deep Learning Natural Language Processing


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020	303	2020
Probing the information encoded in x-vectors D Raj, D Snyder, D Povey, S Khudanpur 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	113	2019
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ... 2021 IEEE spoken language technology workshop (SLT), 897-904, 2021	85	2021
Dover-lap: A method for combining overlap-aware diarization outputs D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021	69	2021
Learning local and global contexts using a convolutional recurrent network model for relation classification in biomedical text D Raj, S Sahu, A Anand Proceedings of the 21st conference on computational natural language …, 2017	48	2017
Sequential multi-frame neural beamforming for speech separation and enhancement ZQ Wang, H Erdogan, S Wisdom, K Wilson, D Raj, S Watanabe, Z Chen, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 905-911, 2021	47	2021
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ... arXiv preprint arXiv:2102.01363, 2021	37	2021
Multi-class spectral clustering with overlaps for speaker diarization D Raj, Z Huang, S Khudanpur 2021 IEEE Spoken Language Technology Workshop (SLT), 582-589, 2021	33	2021
Target-speaker voice activity detection with improved i-vector estimation for unknown number of speaker M He, D Raj, Z Huang, J Du, Z Chen, S Watanabe arXiv preprint arXiv:2108.03342, 2021	30	2021
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ... arXiv preprint arXiv:2306.13734, 2023	29	2023
GPU-accelerated guided source separation for meeting transcription D Raj, D Povey, S Khudanpur arXiv preprint arXiv:2212.05271, 2022	28	2022
Using ASR methods for OCR A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ... 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019	23	2019
Uncertain fuzzy self-organization based clustering: interval type-2 approach to adaptive resonance theory S Majheed, A Gupta, D Raj, FCH Rhee Information Sciences, 2017	21*	2017
Continuous streaming multi-talker asr with dual-path transducers D Raj, L Lu, Z Chen, Y Gaur, J Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	18	2022
The JHU multi-microphone multi-speaker ASR system for the CHiME-6 challenge A Arora, D Raj, AS Subramanian, K Li, B Ben-Yair, M Maciejewski, ... arXiv preprint arXiv:2006.07898, 2020	16	2020
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings Z Huang, D Raj, P García, S Khudanpur ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	14	2023
Analysis of Data Generated from Multidimensional Type-1 and Type-2 Fuzzy Membership Functions D Raj, A Gupta, B Garg, K Tanna, FCH Rhee IEEE Transactions on Fuzzy Systems, 0	12*
Low-latency speech separation guided diarization for telephone conversations G Morrone, S Cornell, D Raj, L Serafini, E Zovato, A Brutti, S Squartini 2022 IEEE Spoken Language Technology Workshop (SLT), 641-646, 2023	8	2023
Injecting text and cross-lingual supervision in few-shot learning from self-supervised models M Wiesner, D Raj, S Khudanpur ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	8	2022
Joint speaker diarization and speech recognition based on region proposal networks Z Huang, M Delcroix, LP Garcia, S Watanabe, D Raj, S Khudanpur Computer Speech & Language 72, 101316, 2022	6	2022

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用