ML-SUPERB: Multilingual speech universal performance benchmark J Shi, D Berrebbi, W Chen, HL Chung, EP Hu, WP Huang, X Chang, ... arXiv preprint arXiv:2305.10615, 2023 | 43 | 2023 |
On the utility of self-supervised models for prosody-related tasks GT Lin, CL Feng, WP Huang, Y Tseng, TH Lin, CA Li, H Lee, NG Ward 2022 IEEE Spoken Language Technology Workshop (SLT), 1104-1111, 2023 | 39 | 2023 |
Why we should report the details in subjective evaluation of tts more rigorously CH Chiang, WP Huang, H Lee arXiv preprint arXiv:2306.02044, 2023 | 7 | 2023 |
Findings of the 2023 ML-Superb Challenge: Pre-Training And Evaluation Over More Languages And Beyond J Shi, W Chen, D Berrebbi, HH Wang, WP Huang, EP Hu, HL Chuang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 6 | 2023 |
Few-shot cross-lingual tts using transferable phoneme embedding WP Huang, PC Chen, SF Huang, H Lee arXiv preprint arXiv:2206.15427, 2022 | 2 | 2022 |
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models CY Kuan, WP Huang, H Lee arXiv preprint arXiv:2406.08402, 2024 | 1 | 2024 |
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation CY Kuan, CK Yang, WP Huang, KH Lu, H Lee arXiv preprint arXiv:2407.09886, 2024 | | 2024 |
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech GT Lin, WP Huang, H Lee arXiv preprint arXiv:2406.11064, 2024 | | 2024 |
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization WP Huang, SF Huang, H Lee 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | | 2023 |