High quality streaming speech synthesis with low, sentence-length-independent latency N Ellinas, G Vamvoukakis, K Markopoulos, A Chalamandaris, G Maniati, ... arXiv preprint arXiv:2111.09052, 2021 | 42 | 2021 |
Excitation modeling based on waveform interpolation for HMM-based speech synthesis. JS Sung, DH Hong, KH Oh, NS Kim Interspeech, 813-816, 2010 | 22 | 2010 |
Factored MLLR adaptation NS Kim, JS Sung, DH Hong IEEE Signal Processing Letters 18 (2), 99-102, 2010 | 21 | 2010 |
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis G Maniati, A Vioni, N Ellinas, K Nikitaras, K Klapsas, JS Sung, G Jho, ... arXiv preprint arXiv:2204.03040, 2022 | 20 | 2022 |
Cross-lingual low resource speaker adaptation using phonological features G Maniati, N Ellinas, K Markopoulos, G Vamvoukakis, JS Sung, H Park, ... arXiv preprint arXiv:2111.09075, 2021 | 13 | 2021 |
Speech reinforcement based on partial specific loudness. JW Shin, W Lim, JS Sung, NS Kim Interspeech, 978-981, 2007 | 10 | 2007 |
Vibrato learning in multi-singer singing voice synthesis R Liu, X Wen, C Lu, L Song, JS Sung 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 9 | 2021 |
Prosodic clustering for phoneme-level prosody control in end-to-end speech synthesis A Vioni, M Christidou, N Ellinas, G Vamvoukakis, P Kakoulidis, T Kim, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |
Statistical approaches to excitation modeling in HMM-based speech synthesis JS Sung, DH Hong, HW Koo, NS Kim IEICE Transactions on Information and Systems 96 (2), 379-382, 2013 | 8 | 2013 |
Bunched LPCNet2: Efficient neural vocoders covering devices from cloud to edge S Park, K Choo, J Lee, AV Porov, K Osipov, JS Sung arXiv preprint arXiv:2203.14416, 2022 | 7 | 2022 |
Word-level style control for expressive, non-attentive speech synthesis K Klapsas, N Ellinas, JS Sung, H Park, S Raptis Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021 | 7 | 2021 |
Factored MLLR adaptation for singing voice generation JS Sung, DH Hong, SJ Kang, NS Kim Twelfth Annual Conference of the International Speech Communication Association, 2011 | 7 | 2011 |
Factored maximum penalized likelihood kernel regression for HMM-based style-adaptive speech synthesis JS Sung, DH Hong, NS Kim IEEE Journal of Selected Topics in Signal Processing 8 (2), 251-261, 2014 | 5 | 2014 |
Factored MLLR Adaptation Algorithm for HMM-based Expressive TTS. JS Sung, DH Hong, HW Koo, NS Kim Interspeech, 975-978, 2012 | 5 | 2012 |
A Novel Audio Fingerprinting Scheme based on Subband Envelop Hashing Y Liu, HS Yun, JS Sung, NS Kim Proceedings: APSIPA ASC 2009: Asia-Pacific Signal and Information Processing …, 2009 | 5 | 2009 |
Controllable speech synthesis by learning discrete phoneme-level prosodic representations N Ellinas, M Christidou, A Vioni, JS Sung, A Chalamandaris, P Tsiakoulis, ... Speech Communication 146, 22-31, 2023 | 4 | 2023 |
Factored maximum likelihood kernelized regression for HMM-based singing voice synthesis. JS Sung, DH Hong, HW Koo, NS Kim Interspeech, 359-363, 2013 | 4 | 2013 |
Investigating content-aware neural text-to-speech MOS prediction using prosodic and linguistic features A Vioni, G Maniati, N Ellinas, JS Sung, I Hwang, A Chalamandaris, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3 | 2023 |
Fine-grained noise control for multispeaker speech synthesis K Nikitaras, G Vamvoukakis, N Ellinas, K Klapsas, K Markopoulos, ... arXiv preprint arXiv:2204.05070, 2022 | 3 | 2022 |
Improved prosodic clustering for multispeaker and speaker-independent phoneme-level prosody control M Christidou, A Vioni, N Ellinas, G Vamvoukakis, K Markopoulos, ... Speech and Computer: 23rd International Conference, SPECOM 2021, St …, 2021 | 3 | 2021 |