On The Usefulness of Self-Attention for Automatic Speech Recognition with Transformers S Zhang, E Loweimi, P Bell, S Renals 2021 IEEE Spoken Language Technology Workshop (SLT), 89-96, 2021 | 36 | 2021 |
Windowed attention mechanisms for speech recognition S Zhang, E Loweimi, P Bell, S Renals ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 21 | 2019 |
Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition S Zhang, CT Do, R Doddipatla, S Renals ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 16 | 2020 |
Stochastic Attention Head Removal: A Simple and Effective Method for Improving Automatic Speech Recognition with Transformers S Zhang, E Loweimi, P Bell, S Renals arXiv e-prints, arXiv:2011.04004, 2020 | 13* | 2020 |
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech T Parcollet, H Nguyen, S Evain, MZ Boito, A Pupier, S Mdhaffar, H Le, ... Computer Speech & Language, 101622, 2024 | 8 | 2024 |
Transformer-Based Streaming ASR with Cumulative Attention M Li, S Zhang, C Zorilă, R Doddipatla ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 7 | 2022 |
Trainable Dynamic Subsampling for End-to-End Speech Recognition. S Zhang, E Loweimi, Y Xu, P Bell, S Renals INTERSPEECH, 1413-1417, 2019 | 6 | 2019 |
On the (In)Efficiency of Acoustic Feature Extractors for Self-Supervised Speech Representation Learning T Parcollet, S Zhang, R van Dalen, AGCP Ramos, S Bhattacharya Interspeech 2023, 2023 | 5 | 2023 |
SummaryMixing: A Linear-Complexity Alternative to Self-Attention for Speech Recognition and Understanding T Parcollet, R van Dalen, S Zhang, S Bhattacharya | 5* | |
Real-Time Personalised Speech Enhancement Transformers with Dynamic Cross-attended Speaker Representations S Zhang, M Chadwick, AGCP Ramos, T Parcollet, R van Dalen, ... | 4* | |
Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers S Zhang, CT Do, R Doddipatla, E Loweimi, P Bell, S Renals ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 2 | 2021 |
Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness CT Do, S Zhang, T Hain 2020 28th European Signal Processing Conference (EUSIPCO), 321-325, 2021 | 2 | 2021 |
Open-Source Conversational AI with SpeechBrain 1.0 M Ravanelli, T Parcollet, A Moumen, S de Langen, C Subakan, ... arXiv preprint arXiv:2407.00463, 2024 | | 2024 |
Resource Efficient Self-Supervised Learning for Speech Embeddings A Mehrotra, AGCP Ramos, S Zhang, S Zaiem, T Parcollet, S Bhattacharya | | |