Myeonghun Jeong
Title
Cited by
Year
Diff-TTS: A denoising diffusion model for text-to-speech
M Jeong, H Kim, SJ Cheon, BJ Choi, NS Kim
arXiv preprint arXiv:2104.01409, 2021
Cited by 185, 2021
Transfer learning framework for low-resource text-to-speech using a large-scale unlabeled speech corpus
M Kim, M Jeong, BJ Choi, S Ahn, JY Lee, NS Kim
arXiv preprint arXiv:2203.15447, 2022
Cited by 25, 2022
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech
BJ Choi, M Jeong, JY Lee, NS Kim
IEEE Signal Processing Letters 29, 2502-2506, 2022
Cited by 15, 2022
Transduce and speak: Neural transducer for text-to-speech with semantic token prediction
M Kim, M Jeong, BJ Choi, D Lee, NS Kim
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
Cited by 7, 2023
Towards single integrated spoofing-aware speaker verification embeddings
SH Mun, H Shim, H Tak, X Wang, X Liu, M Sahidullah, M Jeong, MH Han, ...
arXiv preprint arXiv:2305.19051, 2023
Cited by 6, 2023
Adversarial speaker-consistency learning using untranscribed speech data for zero-shot multi-speaker text-to-speech
BJ Choi, M Jeong, M Kim, SH Mun, NS Kim
2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022
Cited by 5, 2022
Efficient Parallel Audio Generation Using Group Masked Language Modeling
M Jeong, M Kim, JY Lee, NS Kim
IEEE Signal Processing Letters, 2024
Cited by 3, 2024
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction
M Kim, M Jeong, BJ Choi, S Kim, JY Lee, NS Kim
arXiv preprint arXiv:2401.01498, 2024
Cited by 2, 2024
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech
M Jeong, M Kim, BJ Choi, J Yoon, W Jang, NS Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by 1, 2024
High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model
JY Lee, M Jeong, M Kim, JH Lee, HY Cho, NS Kim
arXiv preprint arXiv:2406.17310, 2024
2024
MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance
S Kim, M Jeong, H Lee, M Kim, BJ Choi, NS Kim
arXiv preprint arXiv:2406.05965, 2024
2024
High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model
J Yeop Lee, M Jeong, M Kim, JH Lee, HY Cho, NS Kim
arXiv e-prints, arXiv: 2406.17310, 2024
2024
Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech
BJ Choi, M Jeong, M Kim, NS Kim
IEEE Signal Processing Letters, 2024
2024
Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison
MH Han, SH Mun, M Kim, M Jeong, SH Ahn, NS Kim
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Articles 1–14