CLAP: Learning audio concepts from natural language supervision B Elizalde, S Deshmukh, M Al Ismail, H Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 274 | 2023 |
Pengi: An audio language model for audio tasks S Deshmukh, B Elizalde, R Singh, H Wang Advances in Neural Information Processing Systems 36, 18090-18108, 2023 | 63 | 2023 |
Detection of COVID-19 through the analysis of vocal fold oscillations M Al Ismail, S Deshmukh, R Singh ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 53 | 2021 |
Audio Retrieval with WavText5K and CLAP Training S Deshmukh, B Elizalde, H Wang Proc. Interspeech 2023, 2948-2952, 2022 | 44 | 2022 |
Interpreting glottal flow dynamics for detecting COVID-19 from voice S Deshmukh, M Al Ismail, R Singh ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 37 | 2021 |
Improving weakly supervised sound event detection with self-supervised auxiliary tasks S Deshmukh, B Raj, R Singh Proc. Interspeech 2021, 596-600, 2021 | 23* | 2021 |
Natural language supervision for general-purpose audio representations B Elizalde, S Deshmukh, H Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 20 | 2024 |
Attacker behaviour profiling using stochastic ensemble of hidden Markov models S Deshmukh, R Rade, DF Kazi arXiv preprint arXiv:1905.11824, 2019 | 14 | 2019 |
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session LM Heller, B Elizalde, B Raj, S Deshmukh arXiv preprint arXiv:2302.09719, 2023 | 12 | 2023 |
NaRLE: Natural language models using reinforcement learning with emotion feedback R Zhou, S Deshmukh, J Greer, C Lee arXiv preprint arXiv:2110.02148, 2021 | 11 | 2021 |
Prompting audios using acoustic properties for emotion representation H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 10* | 2024 |
LoFT: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ... arXiv preprint arXiv:2310.04445, 2023 | 10 | 2023 |
Tackling toxic online communication with recurrent capsule networks S Deshmukh, R Rade 2018 Conference on Information and Communication Technology (CICT), 1-7, 2018 | 10 | 2018 |
Multi-view learning for speech emotion recognition with categorical emotion, categorical sentiment, and dimensional scores D Tompkins, D Emmanouilidou, S Deshmukh, B Elizalde ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
Temporal and stochastic modelling of attacker behaviour R Rade, S Deshmukh, R Nene, AS Wadekar, A Unny Advances in Data Science: Third International Conference on Intelligent …, 2019 | 6 | 2019 |
Training audio captioning models without audio S Deshmukh, B Elizalde, D Emmanouilidou, B Raj, R Singh, H Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
Multi-modal Language Models in Bioacoustics with Zero-shot Transfer: A Case Study Z Miao, B Elizalde, S Deshmukh, J Kitzes, H Wang, R Dodhia, JL Ferres | 3* | 2024 |
PAM: Prompting audio-language models for audio quality assessment S Deshmukh, D Alharthi, B Elizalde, H Gamper, MA Ismail, R Singh, B Raj, ... arXiv preprint arXiv:2402.00282, 2024 | 3 | 2024 |
Domain Adaptation for Contrastive Audio-Language Models S Deshmukh, R Singh, B Raj arXiv preprint arXiv:2402.09585, 2024 | 2 | 2024 |
Training framework for automated tasks involving multiple machine learning models CY Lee, R Zhou, N Nishikant, SS Deshmukh, JD Greer US Patent App. 17/516,940, 2023 | 2 | 2023 |