Soham Deshmukh
Microsoft, Carnegie Mellon University
Verified email at microsoft.com
Title
Cited by
Year
CLAP: Learning audio concepts from natural language supervision
B Elizalde, S Deshmukh, M Al Ismail, H Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 244; Year: 2023
Pengi: An audio language model for audio tasks
S Deshmukh, B Elizalde, R Singh, H Wang
Advances in Neural Information Processing Systems 36, 18090-18108, 2023
Cited by: 57; Year: 2023
Detection of COVID-19 through the analysis of vocal fold oscillations
M Al Ismail, S Deshmukh, R Singh
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 51; Year: 2021
Audio Retrieval with WavText5K and CLAP Training
S Deshmukh, B Elizalde, H Wang
Proc. Interspeech 2023, 2948--2952, 2022
Cited by: 39; Year: 2022
Interpreting glottal flow dynamics for detecting covid-19 from voice
S Deshmukh, M Al Ismail, R Singh
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 37; Year: 2021
Improving weakly supervised sound event detection with self-supervised auxiliary tasks
S Deshmukh, B Raj, R Singh
Proc. Interspeech 2021, 596--600, 2021
Cited by: 23*; Year: 2021
Natural language supervision for general-purpose audio representations
B Elizalde, S Deshmukh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 16; Year: 2024
Attacker behaviour profiling using stochastic ensemble of hidden Markov models
S Deshmukh, R Rade, DF Kazi
arXiv preprint arXiv:1905.11824, 2019
Cited by: 14; Year: 2019
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session
LM Heller, B Elizalde, B Raj, S Deshmukh
arXiv preprint arXiv:2302.09719, 2023
Cited by: 11; Year: 2023
NaRLE: Natural language models using reinforcement learning with emotion feedback
R Zhou, S Deshmukh, J Greer, C Lee
arXiv preprint arXiv:2110.02148, 2021
Cited by: 11; Year: 2021
Tackling toxic online communication with recurrent capsule networks
S Deshmukh, R Rade
2018 Conference on Information and Communication Technology (CICT), 1-7, 2018
Cited by: 10; Year: 2018
Loft: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model
MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ...
arXiv preprint arXiv:2310.04445, 2023
Cited by: 9; Year: 2023
Prompting audios using acoustic properties for emotion representation
H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 8*; Year: 2024
Temporal and stochastic modelling of attacker behaviour
R Rade, S Deshmukh, R Nene, AS Wadekar, A Unny
Advances in Data Science: Third International Conference on Intelligent …, 2019
Cited by: 6; Year: 2019
Multi-view learning for speech emotion recognition with categorical emotion, categorical sentiment, and dimensional scores
D Tompkins, D Emmanouilidou, S Deshmukh, B Elizalde
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 5; Year: 2023
Training audio captioning models without audio
S Deshmukh, B Elizalde, D Emmanouilidou, B Raj, R Singh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 4; Year: 2024
Multi-modal Language Models in Bioacoustics with Zero-shot Transfer: A Case Study
Z Miao, B Elizalde, S Deshmukh, J Kitzes, H Wang, R Dodhia, JL Ferres
Cited by: 2*; Year: 2024
Pam: Prompting audio-language models for audio quality assessment
S Deshmukh, D Alharthi, B Elizalde, H Gamper, MA Ismail, R Singh, B Raj, ...
arXiv preprint arXiv:2402.00282, 2024
Cited by: 2; Year: 2024
Training framework for automated tasks involving multiple machine learning models
CY Lee, R Zhou, N Nishikant, SS Deshmukh, JD Greer
US Patent App. 17/516,940, 2023
Cited by: 2; Year: 2023
Adapting Task-Oriented Dialogue Models for Email Conversations
S Deshmukh, C Lee
arXiv preprint arXiv:2208.09439, 2022
Cited by: 1; Year: 2022