关注
Kunal Dhawan
Kunal Dhawan
Research Scientist, NVIDIA
在 cs.cmu.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition
S Ganji, K Dhawan, R Sinha
Speech Communication 110, 76-89, 2019
272019
Unified Model for Code-Switching Speech Recognition and Language Identification Based on Concatenated Tokenizer
K Dhawan, KD Rekesh, B Ginsburg
Proceedings of the 6th Workshop on Computational Approaches to Linguistic …, 2023
9*2023
Novel textual features for language modeling of intra-sentential code-switching data
S Ganji, K Dhawan, R Sinha
Computer Speech & Language 64, 101099, 2020
92020
Hindi-English code-switching speech corpus
G Sreeram, K Dhawan, R Sinha
arXiv preprint arXiv:1810.00662, 2018
92018
Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition
KC Puvvada, NR Koluguri, K Dhawan, J Balam, B Ginsburg
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
82024
Enhancing speaker diarization with large language models: A contextual beam search approach
TJ Park, K Dhawan, N Koluguri, J Balam
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
82024
Joint language identification of code-switching speech using attention-based E2E network
G Sreeram, K Dhawan, K Priyadarshi, R Sinha
2020 International Conference on Signal Processing and Communications (SPCOM …, 2020
82020
Phonetic word embeddings
R Sharma, K Dhawan, B Pailla
arXiv preprint arXiv:2109.14796, 2021
42021
Investigating target set reduction for end-to-end speech recognition of Hindi-english code-switching data
K Dhawan, G Sreeram, K Priyadarshi, R Sinha
2020 National Conference on Communications (NCC), 1-5, 2020
42020
Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis
R Langman, A Jukić, K Dhawan, NR Koluguri, B Ginsburg
arXiv preprint arXiv:2406.05298, 2024
22024
Property-aware multi-speaker data simulation: A probabilistic modelling technique for synthetic data generation
TJ Park, H Huang, C Hooper, N Koluguri, K Dhawan, A Jukic, J Balam, ...
arXiv preprint arXiv:2310.12371, 2023
22023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System
TJ Park, H Huang, A Jukic, K Dhawan, KC Puvvada, N Koluguri, N Karpov, ...
arXiv preprint arXiv:2310.12378, 2023
22023
Evaluating speech production-based acoustic features for COVID-19 classification using cough signals
BT Nellore, G Sreeram, K Dhawan, PB Reddy
2021 IEEE 18th India Council International Conference (INDICON), 1-5, 2021
12021
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations
K Dhawan, NR Koluguri, A Jukić, R Langman, J Balam, B Ginsburg
arXiv preprint arXiv:2407.03495, 2024
2024
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data
KC Puvvada, P Żelasko, H Huang, O Hrinchuk, NR Koluguri, K Dhawan, ...
arXiv preprint arXiv:2406.19674, 2024
2024
Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features
K Dhawan, C Vaz, R Travadi, S Narayanan
arXiv preprint arXiv:1907.06859, 2019
2019
系统目前无法执行此操作,请稍后再试。
文章 1–16