Dear: Debiasing vision-language models with additive residuals A Seth, M Hemani, C Agarwal Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 34 | 2023 |
Decorrelating feature spaces for learning general-purpose audio representations S Ghosh, A Seth, S Umesh IEEE Journal of Selected Topics in Signal Processing 16 (6), 1402-1414, 2022 | 7 | 2022 |
Dual script E2E framework for multilingual and code-switching ASR MG Kumar, J Kuriakose, A Thyagachandran, A Seth, LD Prasad, ... arXiv preprint arXiv:2106.01400, 2021 | 7 | 2021 |
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities S Ghosh, S Kumar, A Seth, CKR Evuru, U Tyagi, S Sakshi, O Nieto, ... arXiv preprint arXiv:2406.11768, 2024 | 4 | 2024 |
Compa: Addressing the gap in compositional reasoning in audio-language models S Ghosh, A Seth, S Kumar, U Tyagi, CK Evuru, S Ramaneswaran, ... arXiv preprint arXiv:2310.08753, 2023 | 4 | 2023 |
Delores: Decorrelating latent spaces for low-resource audio representation learning S Ghosh, A Seth, M Singh, S Umesh arXiv preprint arXiv:2203.13628, 2022 | 4 | 2022 |
Deep clustering for general-purpose audio representations S Ghosh, SV Katta, A Seth, S Umesh arXiv preprint arXiv:2110.08895, 2021 | 4 | 2021 |
MAST: Multiscale audio spectrogram transformers S Ghosh, A Seth, S Umesh, D Manocha ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
SLICER: Learning universal audio representations using low-resource self-supervised pre-training A Seth, S Ghosh, S Umesh, D Manocha ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Analyzing the factors affecting usefulness of selfsupervised pre-trained representations for speech recognition LV Prasad, A Seth, S Ghosh, S Umesh arXiv preprint arXiv:2203.16973, 2022 | 2 | 2022 |
Technology pipeline for large scale cross-lingual dubbing of lecture videos into multiple indian languages A Prakash, A Kumar, A Seth, B Mukherjee, I Gupta, J Kuriakose, ... arXiv preprint arXiv:2211.01338, 2022 | 1 | 2022 |
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition S Ghosh, S Kumar, A Seth, P Chiniya, U Tyagi, R Duraiswami, ... arXiv preprint arXiv:2406.04432, 2024 | | 2024 |
FusDom: Combining in-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning A Seth, S Ghosh, S Umesh, D Manocha ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Stable Distillation: Regularizing Continued Pre-Training for Low-Resource Automatic Speech Recognition A Seth, S Ghosh, S Umesh, D Manocha ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |
Unfused: Unsupervised Finetuning Using Self Supervised Distillation A Seth, S Ghosh, S Umesh, D Manocha 2023 IEEE International Conference on Acoustics, Speech, and Signal …, 2023 | | 2023 |
Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition LVSV Durga Prasad, A Seth, S Ghosh, S Umesh arXiv e-prints, arXiv: 2203.16973, 2022 | | 2022 |
Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi ARK Kumar, N Ravi, A Seth, A Seth, A Singh | | 2022 |
MOMENTUM CONTRASTIVE LEARNING FOR GENERAL-PURPOSE AUDIO REPRESENTATIONS S Ghosh, A Seth, S Umesh | | |
DECAR: Deep Clustering for learning general-purpose Audio Representations S Ghosh, A Seth, S Katta, S Umesh | | |