emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation Z Ma, Z Zheng, J Ye, J Li, Z Gao, S Zhang, X Chen ACL Findings 2024, 2023 | 20 | 2023 |
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets Z Ma, Z Zheng, C Tang, Y Wang, X Chen INTERSPEECH 2023, 2022 | 18 | 2022 |
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer W Chen, Y Liang, Z Ma, Z Zheng, X Chen IJCAI 2024, 2024 | 6 | 2024 |
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition Z Ma, W Wu, Z Zheng, Y Guo, Q Chen, S Zhang, X Chen ICASSP 2024, 2023 | 6 | 2023 |
Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning G Yang, Z Ma, Z Zheng, Y Song, Z Niu, X Chen ASRU 2023, 2023 | 4 | 2023 |
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition Z Zheng, Z Ma, Y Wang, X Chen INTERSPEECH 2023, 2023 | 4 | 2023 |
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation Z Ma, Z Zheng, G Yang, Y Wang, C Zhang, X Chen INTERSPEECH 2023, 2023 | 4 | 2023 |
Front-End Adapter: Adapting Front-End Input of Speech Based Self-Supervised Learning for Speech Recognition X Chen, Z Ma, C Tang, Y Wang, Z Zheng ICASSP 2023, 1-5, 2023 | 3 | 2023 |
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark Z Ma, M Chen, H Zhang, Z Zheng, W Chen, X Li, J Ye, X Chen, T Hain INTERSPEECH 2024, arXiv: 2406.07162, 2024 | 2 | 2024 |
BAT: Learning to Reason about Spatial Sounds with Large Language Models Z Zheng, P Peng, Z Ma, X Chen, E Choi, D Harwath ICML 2024, 2024 | 2 | 2024 |
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition Y Wang, C Tang, Z Ma, Z Zheng, X Chen, WQ Zhang ASRU 2023, 2022 | 1 | 2022 |