Se Jin Park: Academic Profile - Scholarly Resource Search

Cited by

	Total	Since 2019
Citations	166	166
h-index	4	4
i10-index	4	4

Citations per year
Year	2021	2022	2023	2024
Citations	3	29	70	64

Co-authors

Yong Man Ro, Professor of Electrical Engineering, KAIST (verified email at kaist.ac.kr)
Minsu Kim, Meta (verified email at meta.com)
Joanna Hong, Ph.D. at Korea Advanced Institute of Science and Technology (verified email at kaist.ac.kr)
Jeongsoo Choi, KAIST (verified email at kaist.ac.kr)
Jeong Hun Yeo, Korea Advanced Institute of Science and Technology (verified email at kaist.ac.kr)
Chae Won Kim, Korea Advanced Institute of Science and Technology (verified email at kaist.ac.kr)
Hyeongseop Rha, Integrated PhD program in KAIST (verified email at kaist.ac.kr)

Se Jin Park

Korea Advanced Institute of Science and Technology (KAIST)

Verified email at kaist.ac.kr

Research interests: multimodal learning, image/video generation, speech processing


Title	Cited by	Year
SyncTalkFace: Talking face generation with precise lip-syncing via audio-lip memory. SJ Park, M Kim, J Hong, J Choi, YM Ro. Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 2062-2070, 2022	61	2022
Multi-modality associative bridging through memory: Speech sound recollected from face video. M Kim, J Hong, SJ Park, YM Ro. Proceedings of the IEEE/CVF International Conference on Computer Vision, 296-306, 2021	44	2021
CroMM-VSR: Cross-modal memory augmented visual speech recognition. M Kim, J Hong, SJ Park, YM Ro. IEEE Transactions on Multimedia 24, 4342-4355, 2021	27	2021
Speech reconstruction with reminiscent sound via visual voice memory. J Hong, M Kim, SJ Park, YM Ro. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3654-3667, 2021	19	2021
Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model. J Hong, SJ Park, YM Ro. arXiv preprint arXiv:2310.14946, 2023	3	2023
Test-time adaptation for real image denoising via meta-transfer learning. A Gunawan, MA Nugroho, SJ Park. arXiv preprint arXiv:2207.02066, 2022	3	2022
Multilingual visual speech recognition with a single model by learning with discrete visual speech units. M Kim, JH Yeo, J Choi, SJ Park, YM Ro. arXiv preprint arXiv:2401.09802, 2024	2	2024
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. J Choi, SJ Park, M Kim, YM Ro. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024	2	2024
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion. SJ Park, J Hong, M Kim, YM Ro. arXiv preprint arXiv:2310.05934, 2023	2	2023
Reprogramming audio-driven talking face synthesis into text-driven. J Choi, M Kim, SJ Park, YM Ro. arXiv preprint arXiv:2306.16003, 2023	2	2023
Exploring Phonetic Context-Aware Lip-Sync for Talking Face Generation. SJ Park, M Kim, J Choi, YM Ro. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. SJ Park, CW Kim, H Rha, M Kim, J Hong, JH Yeo, YM Ro. arXiv preprint arXiv:2406.07867, 2024		2024
Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models. J Choi, M Kim, SJ Park, YM Ro. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation. S Han, SJ Park, CW Kim, YM Ro. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation. M Kim, J Yeo, SJ Park, H Rha, YM Ro. ACM Multimedia 2024, 0
Multilingual Visual Speech Recognition with a Single Model using Visual Speech Unit. M Kim, J Yeo, J Choi, SJ Park, YM Ro
