Ashish seth 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	72	72
h 指数	4	4
i10 指数	1	1

2022202320248 23 41

关注

Ashish seth

Master's Student in Indian Institute Of Technology, Madras

在 smail.iitm.ac.in 的电子邮件经过验证

Deep Learning Machine Learning Speech Technology Speech Recognition


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Dear: Debiasing vision-language models with additive residuals A Seth, M Hemani, C Agarwal Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	34	2023
Decorrelating feature spaces for learning general-purpose audio representations S Ghosh, A Seth, S Umesh IEEE Journal of Selected Topics in Signal Processing 16 (6), 1402-1414, 2022	7	2022
Dual script E2E framework for multilingual and code-switching ASR MG Kumar, J Kuriakose, A Thyagachandran, A Seth, LD Prasad, ... arXiv preprint arXiv:2106.01400, 2021	7	2021
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities S Ghosh, S Kumar, A Seth, CKR Evuru, U Tyagi, S Sakshi, O Nieto, ... arXiv preprint arXiv:2406.11768, 2024	4	2024
Compa: Addressing the gap in compositional reasoning in audio-language models S Ghosh, A Seth, S Kumar, U Tyagi, CK Evuru, S Ramaneswaran, ... arXiv preprint arXiv:2310.08753, 2023	4	2023
Delores: Decorrelating latent spaces for low-resource audio representation learning S Ghosh, A Seth, M Singh, S Umesh arXiv preprint arXiv:2203.13628, 2022	4	2022
Deep clustering for general-purpose audio representations S Ghosh, SV Katta, A Seth, S Umesh arXiv preprint arXiv:2110.08895, 2021	4	2021
MAST: Multiscale audio spectrogram transformers S Ghosh, A Seth, S Umesh, D Manocha ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	3	2023
SLICER: Learning universal audio representations using low-resource self-supervised pre-training A Seth, S Ghosh, S Umesh, D Manocha ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	2	2023
Analyzing the factors affecting usefulness of selfsupervised pre-trained representations for speech recognition LV Prasad, A Seth, S Ghosh, S Umesh arXiv preprint arXiv:2203.16973, 2022	2	2022
Technology pipeline for large scale cross-lingual dubbing of lecture videos into multiple indian languages A Prakash, A Kumar, A Seth, B Mukherjee, I Gupta, J Kuriakose, ... arXiv preprint arXiv:2211.01338, 2022	1	2022
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition S Ghosh, S Kumar, A Seth, P Chiniya, U Tyagi, R Duraiswami, ... arXiv preprint arXiv:2406.04432, 2024		2024
FusDom: Combining in-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning A Seth, S Ghosh, S Umesh, D Manocha ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Stable Distillation: Regularizing Continued Pre-Training for Low-Resource Automatic Speech Recognition A Seth, S Ghosh, S Umesh, D Manocha ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Unfused: Unsupervised Finetuning Using Self Supervised Distillation A Seth, S Ghosh, S Umesh, D Manocha 2023 IEEE International Conference on Acoustics, Speech, and Signal …, 2023		2023
Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition LVSV Durga Prasad, A Seth, S Ghosh, S Umesh arXiv e-prints, arXiv: 2203.16973, 2022		2022
Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi ARK Kumar, N Ravi, A Seth, A Seth, A Singh		2022
MOMENTUM CONTRASTIVE LEARNING FOR GENERAL-PURPOSE AUDIO REPRESENTATIONS S Ghosh, A Seth, S Umesh
DECAR: Deep Clustering for learning general-purpose Audio Representations S Ghosh, A Seth, S Katta, S Umesh

系统目前无法执行此操作，请稍后再试。

文章 1–19

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

引用