CH-SIMS: A Chinese multimodal sentiment analysis dataset with fine-grained annotation of modality W Yu, H Xu, F Meng, Y Zhu, Y Ma, J Wu, J Zou, K Yang ACL2020, 3718-3727, 2020 | 244 | 2020 |
CM-BERT: Cross-modal BERT for text-audio sentiment analysis K Yang, H Xu, K Gao ACMMM2020(Oral), 521-528, 2020 | 111 | 2020 |
ALIP: Adaptive language-image pre-training with synthetic caption K Yang, J Deng, X An, J Li, Z Feng, J Guo, J Yang, T Liu ICCV2023, 2922-2931, 2023 | 22 | 2023 |
Unicom: Universal and Compact Representation Learning for Image Retrieval X An, J Deng, K Yang, J Li, Z Feng, J Guo, J Yang, T Liu ICLR2023, 2023 | 17 | 2023 |
RWKV-CLIP: A Robust Vision-Language Representation Learner T Gu, K Yang, X An, Z Feng, D Liu, W Cai, J Deng arXiv preprint arXiv:2406.06973, 2024 | 1 | 2024 |
A self-adjusting fusion representation learning model for unaligned text-audio sequences K Yang, R Zhang, H Xu, K Gao arXiv preprint arXiv:2212.11772, 2022 | 1 | 2022 |
High-Fidelity Facial Albedo Estimation via Texture Quantization Z Ran, X Ren, X An, K Yang, X Dai, Z Feng, J Guo, L Zhu, J Deng arXiv preprint arXiv:2406.13149, 2024 | | 2024 |
1st Place Solution to the 1st SkatingVerse Challenge T Sun, Y Fu, K Yang, J Wu, Z Feng Technical report, 2024 | | 2024 |
LaPA: Latent Prompt Assist Model For Medical Visual Question Answering T Gu, K Yang, D Liu, W Cai CVPRW2024(Oral), 2024 | | 2024 |
Multi-label Cluster Discrimination for Visual Representation Learning X An, K Yang, J Yang, Z Feng, J Guo, T Liu, J Deng ECCV2024, 2024 | | 2024 |