Diff-TTS: A denoising diffusion model for text-to-speech M Jeong, H Kim, SJ Cheon, BJ Choi, NS Kim arXiv preprint arXiv:2104.01409, 2021 | 185 | 2021 |
Transfer learning framework for low-resource text-to-speech using a large-scale unlabeled speech corpus M Kim, M Jeong, BJ Choi, S Ahn, JY Lee, NS Kim arXiv preprint arXiv:2203.15447, 2022 | 25 | 2022 |
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech BJ Choi, M Jeong, JY Lee, NS Kim IEEE Signal Processing Letters 29, 2502-2506, 2022 | 15 | 2022 |
Transduce and speak: Neural transducer for text-to-speech with semantic token prediction M Kim, M Jeong, BJ Choi, D Lee, NS Kim 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 7 | 2023 |
Towards single integrated spoofing-aware speaker verification embeddings SH Mun, H Shim, H Tak, X Wang, X Liu, M Sahidullah, M Jeong, MH Han, ... arXiv preprint arXiv:2305.19051, 2023 | 6 | 2023 |
Adversarial speaker-consistency learning using untranscribed speech data for zero-shot multi-speaker text-to-speech BJ Choi, M Jeong, M Kim, SH Mun, NS Kim 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022 | 5 | 2022 |
Efficient Parallel Audio Generation Using Group Masked Language Modeling M Jeong, M Kim, JY Lee, NS Kim IEEE Signal Processing Letters, 2024 | 3 | 2024 |
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction M Kim, M Jeong, BJ Choi, S Kim, JY Lee, NS Kim arXiv preprint arXiv:2401.01498, 2024 | 2 | 2024 |
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech M Jeong, M Kim, BJ Choi, J Yoon, W Jang, NS Kim IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 1 | 2024 |
High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model JY Lee, M Jeong, M Kim, JH Lee, HY Cho, NS Kim arXiv preprint arXiv:2406.17310, 2024 | | 2024 |
MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance S Kim, M Jeong, H Lee, M Kim, BJ Choi, NS Kim arXiv preprint arXiv:2406.05965, 2024 | | 2024 |
High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model J Yeop Lee, M Jeong, M Kim, JH Lee, HY Cho, NS Kim arXiv e-prints, arXiv: 2406.17310, 2024 | | 2024 |
Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech BJ Choi, M Jeong, M Kim, NS Kim IEEE Signal Processing Letters, 2024 | | 2024 |
Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison MH Han, SH Mun, M Kim, M Jeong, SH Ahn, NS Kim ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | | 2023 |