Soham Deshmukh
Microsoft, Carnegie Mellon University
Verified email at microsoft.com
Title
Cited by
Year
CLAP: Learning audio concepts from natural language supervision
B Elizalde, S Deshmukh, M Al Ismail, H Wang
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 184; Year: 2023
Detection of COVID-19 through the analysis of vocal fold oscillations
M Al Ismail, S Deshmukh, R Singh
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 49; Year: 2021
Pengi: An audio language model for audio tasks
S Deshmukh, B Elizalde, R Singh, H Wang
Advances in Neural Information Processing Systems 36, 18090-18108, 2023
Cited by: 37; Year: 2023
Interpreting glottal flow dynamics for detecting covid-19 from voice
S Deshmukh, M Al Ismail, R Singh
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 35; Year: 2021
Audio Retrieval with WavText5K and CLAP Training
S Deshmukh, B Elizalde, H Wang
Proc. Interspeech 2023, 2948-2952, 2022
Cited by: 31; Year: 2022
Improving weakly supervised sound event detection with self-supervised auxiliary tasks
S Deshmukh, B Raj, R Singh
Proc. Interspeech 2021, 596-600, 2021
Cited by: 22*; Year: 2021
Attacker behaviour profiling using stochastic ensemble of hidden markov models
S Deshmukh, R Rade, DF Kazi
arXiv preprint arXiv:1905.11824, 2019
Cited by: 14; Year: 2019
NaRLE: Natural language models using reinforcement learning with emotion feedback
R Zhou, S Deshmukh, J Greer, C Lee
arXiv preprint arXiv:2110.02148, 2021
Cited by: 11; Year: 2021
Natural language supervision for general-purpose audio representations
B Elizalde, S Deshmukh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 10; Year: 2024
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session
LM Heller, B Elizalde, B Raj, S Deshmukh
arXiv preprint arXiv:2302.09719, 2023
Cited by: 10; Year: 2023
Tackling toxic online communication with recurrent capsule networks
S Deshmukh, R Rade
2018 Conference on Information and Communication Technology (CICT), 1-7, 2018
Cited by: 10; Year: 2018
Prompting audios using acoustic properties for emotion representation
H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 9*; Year: 2024
LoFT: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model
MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ...
arXiv preprint arXiv:2310.04445, 2023
Cited by: 6; Year: 2023
Temporal and stochastic modelling of attacker behaviour
R Rade, S Deshmukh, R Nene, AS Wadekar, A Unny
Advances in Data Science: Third International Conference on Intelligent …, 2019
Cited by: 5; Year: 2019
Training audio captioning models without audio
S Deshmukh, B Elizalde, D Emmanouilidou, B Raj, R Singh, H Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 4; Year: 2024
Multi-view learning for speech emotion recognition with categorical emotion, categorical sentiment, and dimensional scores
D Tompkins, D Emmanouilidou, S Deshmukh, B Elizalde
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 4; Year: 2023
PAM: Prompting Audio-Language Models for Audio Quality Assessment
S Deshmukh, D Alharthi, B Elizalde, H Gamper, MA Ismail, R Singh, B Raj, ...
arXiv preprint arXiv:2402.00282, 2024
Cited by: 1; Year: 2024
Zero-Shot Transfer for Wildlife Bioacoustics Detection
Z Miao, B Elizalde, S Deshmukh, J Kitzes, H Wang, R Dodhia, JML Ferres
Cited by: 1; Year: 2023
Training framework for automated tasks involving multiple machine learning models
CY Lee, R Zhou, N Nishikant, SS Deshmukh, JD Greer
US Patent App. 17/516,940, 2023
Cited by: 1; Year: 2023
Adapting task-oriented dialogue models for email conversations
S Deshmukh, C Lee
arXiv preprint arXiv:2208.09439, 2022
Cited by: 1; Year: 2022