Fre-GAN: Adversarial frequency-consistent audio synthesis JH Kim, SH Lee, JH Lee, SW Lee Proc. Interspeech, 2021 | 62 | 2021 |
Multi-SpectroGAN: High-diversity and high-fidelity spectrogram generation with adversarial style combination for speech synthesis SH Lee, HW Yoon, HR Noh, JH Kim, SW Lee Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13198 …, 2021 | 57 | 2021 |
VoiceMixer: Adversarial voice style mixup SH Lee, JH Kim, H Chung, SW Lee Advances in Neural Information Processing Systems 34, 294-308, 2021 | 32 | 2021 |
Fre-GAN 2: Fast and efficient frequency-consistent audio synthesis SH Lee, JH Kim, KE Lee, SW Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
PVAE-TTS: Adaptive text-to-speech via progressive style adaptation JH Lee, SH Lee, JH Kim, SW Lee ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 14 | 2022 |
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner. Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe INTERSPEECH, 16-20, 2022 | 9 | 2022 |
CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis JH Kim, HS Yang, YC Ju, IH Kim, BY Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints JH Kim, SH Lee, JH Lee, HG Jung, SW Lee IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2021 | 7 | 2021 |
FACTSpeech: Speaking a Foreign Language Pronunciation Using Only Your Native Characters HS Yang, JH Kim, YC Ju, IH Kim, BY Kim, SJ Choi, HY Kim Proc. Interspeech, 2023 | 3 | 2023 |
FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder TD Nguyen*, JH Kim*, Y Jang, J Kim, JS Chung ICASSP, 2024 | 2 | 2024 |
Text-To-Speech Synthesis In The Wild J Jung, W Zhang, S Maiti, Y Wu, X Wang, JH Kim, Y Matsunaga, S Um, ... arXiv preprint arXiv:2409.08711, 2024 | 1 | 2024 |
VoxSim: A perceptual voice similarity dataset J Ahn, Y Kim, Y Choi, D Kwak, JH Kim, S Mun, JS Chung Interspeech, 2024 | 1 | 2024 |
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text Y Jang*, JH Kim*, J Ahn, D Kwak, HS Yang, YC Ju, IH Kim, BY Kim, ... CVPR, 2024 | 1 | 2024 |
Let There Be Sound: Reconstructing High Quality Speech from Silent Videos JH Kim*, J Kim*, JS Chung Proceedings of the AAAI Conference on Artificial Intelligence, 2024 | 1 | 2024 |
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching C Jung, S Lee, JH Kim, JS Chung Interspeech, 2024 | | 2024 |