VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network J Yang, J Lee, Y Kim, H Cho, I Kim Proc. Interspeech 2020, 200-204, 2020 | 88 | 2020 |
NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis HS Choi, J Yang, J Lee, H Kim ICLR 2023, 2022 | 43 | 2022 |
GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis J Yang, JS Bae, T Bak, Y Kim, HY Cho Proc. Interspeech 2021, 2202-2206, 2021 | 35 | 2021 |
Avocodo: Generative Adversarial Network for Artifact-free Vocoder T Bak, J Lee, H Bae, J Yang, JS Bae, YS Joo AAAI 2023, 2022 | 31 | 2022 |
HanFont: large-scale adaptive Hangul font recognizer using CNN and font clustering J Yang, H Kim, H Kwak, I Kim International Journal on Document Analysis and Recognition (IJDAR) 22, 407-416, 2019 | 7 | 2019 |
Emotion-aware music recommendation J Yang, WJ Chae, SY Kim, H Choi Design, User Experience, and Usability: Novel User Experiences: 5th …, 2016 | 7 | 2016 |
Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech JS Bae, J Yang, TJ Bak, YS Joo Proc. Interspeech 2022, 2022 | 6 | 2022 |
Varianceflow: High-Quality and Controllable Text-to-Speech using Variance Information via Normalizing Flow Y Lee, J Yang, K Jung ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 3 | 2022 |
Large-Scale Hangul Font Recognition Using Deep Learning JH Yang, HB Kwak, IJ Kim Annual Conference on Human and Language Technology, 8-12, 2017 | 3 | 2017 |
DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance J Yang, J Lee, HS Choi, S Ji, H Kim, J Lee Interspeech 2024, 2024 | | 2024 |
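Improving Deep Learning-Based TTS Sound Quality Using Generative Adversarial Networks and Data Augmentation J Choi, J Yang, I Kim KIISE Transactions on Computing Practices 26 (5), 256-260, 2020 | | 2020 |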
Deep Learning-Based Speech Synthesis with Noise Attention J Yang, I Kim KIISE Conference Proceedings, 904-906, 2019 | | 2019 |