Torchaudio: Building blocks for audio and speech processing YY Yang, M Hira, Z Ni, A Astafurov, C Chen, C Puhrsch, D Pollack, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 177 | 2022 |
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration C Li, J Shi, W Zhang, AS Subramanian, X Chang, N Kamo, M Hira, ... 2021 IEEE Spoken Language Technology Workshop (SLT), 785-792, 2021 | 77 | 2021 |
ESPnet-ST-v2: Multipurpose spoken language translation toolkit B Yan, J Shi, Y Tang, H Inaguma, Y Peng, S Dalmia, P Polák, ... arXiv preprint arXiv:2304.04596, 2023 | 8 | 2023 |
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch J Hwang, M Hira, C Chen, X Zhang, Z Ni, G Sun, P Ma, R Huang, V Pratap, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-9, 2023 | 6 | 2023 |
Less Peaky and More Accurate CTC Forced Alignment by Label Priors R Huang, X Zhang, Z Ni, L Sun, M Hira, J Hwang, V Manohar, V Pratap, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | | 2024 |