Haokun Wen
Verified email at stu.hit.edu.cn
Title
Cited by
Year
Comprehensive linguistic-visual composition network for image retrieval
H Wen, X Song, X Yang, Y Zhan, L Nie
Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021
Cited by 60, 2021
Personalized fashion compatibility modeling via metapath-guided heterogeneous graph learning
W Guan, F Jiao, X Song, H Wen, CH Yeh, X Chang
Proceedings of the 45th international ACM SIGIR conference on research and …, 2022
Cited by 42, 2022
Multimodal compatibility modeling via exploring the consistent and complementary correlations
W Guan, H Wen, X Song, CH Yeh, X Chang, L Nie
Proceedings of the 29th ACM international conference on multimedia, 2299-2307, 2021
Cited by 34, 2021
Generative attribute manipulation scheme for flexible fashion search
X Yang, X Song, X Han, H Wen, J Nie, L Nie
Proceedings of the 43rd international ACM SIGIR conference on research and …, 2020
Cited by 28, 2020
Attribute-wise explainable fashion compatibility modeling
X Yang, X Song, F Feng, H Wen, LY Duan, L Nie
ACM Transactions on Multimedia Computing, Communications, and Applications …, 2021
Cited by 27, 2021
Partially supervised compatibility modeling
W Guan, H Wen, X Song, C Wang, CH Yeh, X Chang, L Nie
IEEE Transactions on Image Processing 31, 4733-4745, 2022
Cited by 24, 2022
Target-guided composed image retrieval
H Wen, X Zhang, X Song, Y Wei, L Nie
Proceedings of the 31st ACM International Conference on Multimedia, 915-923, 2023
Cited by 18, 2023
Egocentric early action prediction via multimodal transformer-based dual action prediction
W Guan, X Song, K Wang, H Wen, H Ni, Y Wang, X Chang
IEEE Transactions on Circuits and Systems for Video Technology 33 (9), 4472-4483, 2023
Cited by 7, 2023
Self-Training Boosted Multi-Factor Matching Network for Composed Image Retrieval
H Wen, X Song, J Yin, J Wu, W Guan, L Nie
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Cited by 5, 2023
CLIP-based composed image retrieval with comprehensive fusion and data augmentation
H Lin, H Wen, X Chen, X Song
Australasian Joint Conference on Artificial Intelligence, 190-202, 2023
Cited by 4, 2023
Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval
H Lin, H Wen, X Song, M Liu, Y Hu, L Nie
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
Cited by 2, 2024
Finetuning Language Models for Multimodal Question Answering
X Zhang, W Xie, Z Dai, J Rao, H Wen, X Luo, M Zhang, M Zhang
Proceedings of the 31st ACM International Conference on Multimedia, 9420-9424, 2023
Cited by 1, 2023
Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
H Wen, X Song, X Chen, Y Wei, L Nie, TS Chua
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
2024
Pseudo-triplet Guided Few-shot Composed Image Retrieval
B Hou, H Lin, H Wen, M Liu, X Song
arXiv preprint arXiv:2407.06001, 2024
2024
Interactive Garment Recommendation with User in the Loop
F Becattini, X Chen, A Puccia, H Wen, X Song, L Nie, A Del Bimbo
arXiv preprint arXiv:2402.11627, 2024
2024
Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning
X Zhang, H Wen, J Wu, P Qin, L Nie
ACM Multimedia 2024, 0
Articles 1–16