关注
Ji-Hoon Kim
Ji-Hoon Kim
在 kaist.ac.kr 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Fre-GAN: Adversarial frequency-consistent audio synthesis
JH Kim, SH Lee, JH Lee, SW Lee
Proc. Interspeech, 2021
622021
Multi-spectrogan: High-diversity and high-fidelity spectrogram generation with adversarial style combination for speech synthesis
SH Lee, HW Yoon, HR Noh, JH Kim, SW Lee
Proceedings of the AAAI Conference on Artificial Intelligence 35 (14), 13198 …, 2021
572021
Voicemixer: Adversarial voice style mixup
SH Lee, JH Kim, H Chung, SW Lee
Advances in Neural Information Processing Systems 34, 294-308, 2021
322021
Fre-gan 2: Fast and efficient frequency-consistent audio synthesis
SH Lee, JH Kim, KE Lee, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
142022
PVAE-TTS: Adaptive text-to-speech via progressive style adaptation
JH Lee, SH Lee, JH Kim, SW Lee
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
142022
TriniTTS: Pitch-controllable End-to-end TTS without External Aligner.
Y Ju, I Kim, H Yang, JH Kim, B Kim, S Maiti, S Watanabe
INTERSPEECH, 16-20, 2022
92022
CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
JH Kim, HS Yang, YC Ju, IH Kim, BY Kim
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints
JH Kim, SH Lee, JH Lee, HG Jung, SW Lee
IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2021
72021
FACTSpeech: Speaking a Foreign Language Pronunciation Using Only Your Native Characters
HS Yang, JH Kim, YC Ju, IH Kim, BY Kim, SJ Choi, HY Kim
Proc. Interspeech, 2023
32023
FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder
TD Nguyen*, JH Kim*, Y Jang, J Kim, JS Chung
ICASSP, 2024
22024
Text-To-Speech Synthesis In The Wild
J Jung, W Zhang, S Maiti, Y Wu, X Wang, JH Kim, Y Matsunaga, S Um, ...
arXiv preprint arXiv:2409.08711, 2024
12024
VoxSim: A perceptual voice similarity dataset
J Ahn, Y Kim, Y Choi, D Kwak, JH Kim, S Mun, JS Chung
Interspeech, 2024
12024
Faces that Speak: Jointly Synthesising Talking Face and Speech from Text
Y Jang*, JH Kim*, J Ahn, D Kwak, HS Yang, YC Ju, IH Kim, BY Kim, ...
CVPR, 2024
12024
Let There Be Sound: Reconstructing High Quality Speech from Silent Videos
JH Kim*, J Kim*, JS Chung
Proceedings of the AAAI Conference on Artificial Intelligence, 2024
12024
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching
C Jung, S Lee, JH Kim, JS Chung
Interspeech, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–15