Title | Authors | Venue | Cited by | Year |
Superb: Speech processing universal performance benchmark | S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... | arXiv preprint arXiv:2105.01051, 2021 | 727 | 2021 |
Mockingjay: Unsupervised speech representation learning with deep bidirectional transformer encoders | AT Liu, S Yang, PH Chi, P Hsu, H Lee | ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 412 | 2020 |
Distilhubert: Speech representation learning by layer-wise distillation of hidden-unit bert | HJ Chang, S Yang, H Lee | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 136 | 2022 |
SUPERB-SG: Enhanced speech processing universal PERformance benchmark for semantic and generative capabilities | HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ... | arXiv preprint arXiv:2203.06849, 2022 | 78 | 2022 |
An exploration of self-supervised pretrained representations for end-to-end speech recognition | X Chang, T Maekaku, P Guo, J Shi, YJ Lu, AS Subramanian, T Wang, ... | 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 74 | 2021 |
Investigating self-supervised learning for speech enhancement and separation | Z Huang, S Watanabe, S Yang, P García, S Khudanpur | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 57 | 2022 |
S3prl-vc: Open-source voice conversion framework with self-supervised speech representations | WC Huang, SW Yang, T Hayashi, HY Lee, S Watanabe, T Toda | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 37 | 2022 |
Understanding self-attention of self-supervised audio transformers | S Yang, AT Liu, H Lee | arXiv preprint arXiv:2006.03265, 2020 | 31 | 2020 |
Superb@ slt 2022: Challenge on generalization and efficiency of self-supervised speech representation learning | T Feng, A Dong, CF Yeh, S Yang, TQ Lin, J Shi, KW Chang, Z Huang, ... | 2022 IEEE Spoken Language Technology Workshop (SLT), 1096-1103, 2023 | 28 | 2023 |
DUAL: Discrete spoken unit adaptive learning for textless spoken question answering | GT Lin, YS Chuang, HL Chung, S Yang, HJ Chen, S Dong, SW Li, ... | arXiv preprint arXiv:2203.04911, 2022 | 21* | 2022 |
A comparative study of self-supervised speech representation based voice conversion | WC Huang, SW Yang, T Hayashi, T Toda | IEEE Journal of Selected Topics in Signal Processing 16 (6), 1308-1318, 2022 | 12 | 2022 |
Speechnet: A universal modularized model for speech processing tasks | YC Chen, PH Chi, S Yang, KW Chang, J Lin, SF Huang, DR Liu, CL Liu, ... | arXiv preprint arXiv:2105.03070, 2021 | 12 | 2021 |
Speech representation learning through self-supervised pretraining and multi-task finetuning | YC Chen, S Yang, CK Lee, S See, H Lee | arXiv preprint arXiv:2110.09930, 2021 | 11 | 2021 |
S3prl: The self-supervised speech pre-training and representation learning toolkit | AT Liu, Y Shu-wen | online GitHub repository, 2020 | 7 | 2020 |
Self-supervised representation learning for speech processing | H Lee, A Mohamed, S Watanabe, T Sainath, K Livescu, SW Li, S Yang, ... | Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 5 | 2022 |
A Large-Scale Evaluation of Speech Foundation Models | S Yang, HJ Chang, Z Huang, AT Liu, CI Lai, H Wu, J Shi, X Chang, ... | IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | | 2024 |
ISSUE ON SELF-SUPERVISED LEARNING FOR SPEECH AND AUDIO PROCESSING (SLSAP) | HY Lee, S Watanabe, K Livescu, A Mohamed, T Sainath, JD Havtorn, ... | | | |