SALMONN: Towards Generic Hearing Abilities for Large Language Models C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang The Twelfth International Conference on Learning Representations, 2024 | 95 | 2024 |
Connecting Speech Encoder and Large Language Model for ASR W Yu, C Tang, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 19 | 2024 |
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets Z Ma, Z Zhen, C Tang, Y Wang, X Chen Proc. Interspeech 2023, 2022 | 19 | 2022 |
Fine-grained audio-visual joint representations for multimodal large language models G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang arXiv preprint arXiv:2310.05863, 2023 | 7 | 2023 |
Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition X Chen, Z Ma, C Tang, Y Wang, Z Zheng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 4 | 2023 |
Extending large language models for speech and audio captioning C Tang, W Yu, G Sun, X Chen, T Tan, W Li, L Lu, Z Ma, C Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 2 | 2024 |
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models G Sun, W Yu, C Tang, X Chen, T Tan, W Li, L Lu, MA Zejun, Y Wang, ... Forty-first International Conference on Machine Learning, 2024 | 2 | 2024 |
Exploring Effective Fusion Algorithms for Speech Based Self-Supervised Learning Models C Tang, Y Wang, X Chen, WQ Zhang National Conference on Man-Machine Speech Communication, 2022, 2022 | 2 | 2022 |
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition Y Wang, C Tang, Z Ma, Z Zheng, X Chen, WQ Zhang 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2022 | 1 | 2022 |
Can Large Language Models Understand Spatial Audio? C Tang, W Yu, G Sun, X Chen, T Tan, W Li, J Zhang, L Lu, Z Ma, Y Wang, ... Proc. Interspeech 2024, 2024 | | 2024 |