关注
Bin Zhu
Bin Zhu
Assistant Professor, Singapore Management University
在 smu.edu.sg 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
R2GAN: Cross-modal recipe retrieval with generative adversarial network
B Zhu, CW Ngo, J Chen, Y Hao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
1432019
CookGAN: Causality based Text-to-Image Synthesis
B Zhu, CW Ngo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2020
782020
A study of multi-task and region-wise deep learning for food ingredient recognition
J Chen, B Zhu, CW Ngo, TS Chua, YG Jiang
IEEE Transactions on Image Processing 30, 1514-1526, 2020
732020
Epic-kitchens visor benchmark: Video segmentations and object relations
A Darkhalil, D Shan, B Zhu, J Ma, A Kar, R Higgins, S Fidler, D Fouhey, ...
Advances in Neural Information Processing Systems 35, 13745-13758, 2022
682022
Person-level action recognition in complex events via tsd-tsm networks
Y Hao, ZN Liu, H Zhang, B Zhu, J Chen, YG Jiang, CW Ngo
Proceedings of the 28th ACM International Conference on Multimedia, 4699-4702, 2020
112020
Learning from web recipe-image pairs for food recognition: Problem, baselines and performance
B Zhu, CW Ngo, WK Chan
IEEE Transactions on Multimedia 24, 1175-1185, 2021
102021
Cross-domain cross-modal food transfer
B Zhu, CW Ngo, J Chen
Proceedings of the 28th ACM International Conference on Multimedia, 3762-3770, 2020
92020
Mix-dann and dynamic-modal-distillation for video domain adaptation
Y Yin, B Zhu, J Chen, L Cheng, YG Jiang
Proceedings of the 30th ACM International Conference on Multimedia, 3224-3233, 2022
82022
CgT-GAN: CLIP-guided Text GAN for Image Captioning
J Yu, H Li, Y Hao, B Zhu, T Xu, X He
Proceedings of the 31st ACM International Conference on Multimedia, 2252-2263, 2023
62023
Unsupervised video hashing with multi-granularity contextualization and multi-structure preservation
Y Hao, J Duan, H Zhang, B Zhu, P Zhou, X He
Proceedings of the 30th ACM International Conference on Multimedia, 3754-3763, 2022
62022
Learning to match anchor-target video pairs with dual attentional holographic networks
Y Hao, CW Ngo, B Zhu
IEEE Transactions on Image Processing 30, 8130-8143, 2021
52021
Pyramid fusion dark channel prior for single image dehazing
Q Liang, B Zhu, CW Ngo
arXiv preprint arXiv:2105.10192, 2021
52021
Foodlmm: A versatile food assistant using large multi-modal model
Y Yin, H Qi, B Zhu, J Chen, YG Jiang, CW Ngo
arXiv preprint arXiv:2312.14991, 2023
42023
Cross-lingual adaptation for recipe retrieval with mixup
B Zhu, CW Ngo, J Chen, WK Chan
Proceedings of the 2022 International Conference on Multimedia Retrieval …, 2022
42022
Text-driven Video Prediction
X Song, J Chen, B Zhu, Y Jiang
ACM Transactions on Multimedia Computing, Communications, and Applications …, 2024
32024
CAR: consolidation, augmentation and regulation for recipe retrieval
F Song, B Zhu, Y Hao, S Wang, X He
arXiv preprint arXiv:2312.04763, 2023
22023
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios
G Liu, Y Jiao, J Chen, B Zhu, YG Jiang
IEEE Transactions on Multimedia, 2024
12024
RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models
P Jiao, X Wu, B Zhu, J Chen, CW Ngo, Y Jiang
arXiv preprint arXiv:2407.12730, 2024
2024
Model Inversion Attacks Through Target-Specific Conditional Diffusion Models
O Li, Y Hao, Z Wang, B Zhu, S Wang, Z Zhang, F Feng
arXiv preprint arXiv:2407.11424, 2024
2024
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective
F Song, B Zhu, Y Hao, S Wang
European Conference on Computer Vision (ECCV), 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20