End-to-end speech summarization using restricted self-attention R Sharma, S Palaskar, AW Black, F Metze ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 30* | 2022 |
SLUE phase-2: A benchmark suite of diverse spoken language understanding tasks S Shon, S Arora, CJ Lin, A Pasad, F Wu, R Sharma, WL Wu, HY Lee, ... arXiv preprint arXiv:2212.10525, 2022 | 24 | 2022 |
Reproducing Whisper-style training using an open-source toolkit and publicly available data Y Peng, J Tian, B Yan, D Berrebbi, X Chang, X Li, J Shi, S Arora, W Chen, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 19 | 2023 |
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 14 | 2024 |
A summary of the first workshop on language technology for language documentation and revitalization G Neubig, S Rijhwani, A Palmer, J MacKenzie, H Cruz, X Li, M Lee, ... arXiv preprint arXiv:2004.13203, 2020 | 14 | 2020 |
Speech recognition in Kannada using HTK and Julius: a comparative study RS Sharma, SH Paladugu, KJ Priya, D Gupta 2019 International Conference on Communication and Signal Processing (ICCSP …, 2019 | 14 | 2019 |
Dynamic-SUPERB: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ... ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
LoFT: Local proxy fine-tuning for improving transferability of adversarial attacks against large language model MA Shah, R Sharma, H Dhamyal, R Olivier, A Shah, D Alharthi, ... arXiv preprint arXiv:2310.04445, 2023 | 9 | 2023 |
Speech summarization of long spoken document: Improving memory efficiency of speech/text encoders T Kano, A Ogawa, M Delcroix, R Sharma, K Matsuura, S Watanabe ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
UniverSLU: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network S Arora, H Futami, J Jung, Y Peng, R Sharma, Y Kashiwagi, E Tsunoo, ... arXiv preprint arXiv:2310.02973, 2023 | 5 | 2023 |
BASS: Block-wise Adaptation for Speech Summarization R Sharma, S Arora, K Zheng, S Watanabe, R Singh, B Raj Proc. INTERSPEECH 2023, 1454-1458, 2023 | 4 | 2023 |
XNOR-Former: Learning accurate approximations in long speech transformers R Sharma, B Raj arXiv preprint arXiv:2210.16643, 2022 | 4 | 2022 |
ESPnet-SUMM: Introducing a novel large dataset, toolkit, and a cross-corpora evaluation of speech summarization systems R Sharma, W Chen, T Kano, R Sharma, S Arora, S Watanabe, A Ogawa, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 3 | 2023 |
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction R Sharma, T Vuong, M Lindsey, H Dhamyal, R Singh, B Raj Proceedings of the 39th International Conference on Machine Learning 2022 …, 2022 | 3 | 2022 |
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models J Jung, R Sharma, W Chen, B Raj, S Watanabe ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
Unifying the discrete and continuous emotion labels for speech emotion recognition R Sharma, H Dhamyal, B Raj, R Singh arXiv preprint arXiv:2210.16642, 2022 | 2 | 2022 |
On the Evaluation of Speech Foundation Models for Spoken Language Understanding S Arora, A Pasad, CM Chien, J Han, R Sharma, J Jung, H Dhamyal, ... arXiv preprint arXiv:2406.10083, 2024 | 1 | 2024 |
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech D Alharthi, R Sharma, H Dhamyal, S Maiti, B Raj, R Singh arXiv preprint arXiv:2310.00706, 2023 | 1 | 2023 |
Augmenting text for spoken language understanding with Large Language Models R Sharma, S Kim, D Lazar, T Le, A Shrivastava, K Ahn, P Kansal, L Sari, ... arXiv preprint arXiv:2309.09390, 2023 | 1 | 2023 |
Egocentric audio-visual noise suppression R Sharma, W He, J Lin, E Lakomkin, Y Liu, K Kalgaonkar ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |