R2GAN: Cross-modal recipe retrieval with generative adversarial network B Zhu, CW Ngo, J Chen, Y Hao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 143 | 2019 |
CookGAN: Causality based Text-to-Image Synthesis B Zhu, CW Ngo Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2020 | 78 | 2020 |
A study of multi-task and region-wise deep learning for food ingredient recognition J Chen, B Zhu, CW Ngo, TS Chua, YG Jiang IEEE Transactions on Image Processing 30, 1514-1526, 2020 | 73 | 2020 |
Epic-kitchens visor benchmark: Video segmentations and object relations A Darkhalil, D Shan, B Zhu, J Ma, A Kar, R Higgins, S Fidler, D Fouhey, ... Advances in Neural Information Processing Systems 35, 13745-13758, 2022 | 68 | 2022 |
Person-level action recognition in complex events via tsd-tsm networks Y Hao, ZN Liu, H Zhang, B Zhu, J Chen, YG Jiang, CW Ngo Proceedings of the 28th ACM International Conference on Multimedia, 4699-4702, 2020 | 11 | 2020 |
Learning from web recipe-image pairs for food recognition: Problem, baselines and performance B Zhu, CW Ngo, WK Chan IEEE Transactions on Multimedia 24, 1175-1185, 2021 | 10 | 2021 |
Cross-domain cross-modal food transfer B Zhu, CW Ngo, J Chen Proceedings of the 28th ACM International Conference on Multimedia, 3762-3770, 2020 | 9 | 2020 |
Mix-dann and dynamic-modal-distillation for video domain adaptation Y Yin, B Zhu, J Chen, L Cheng, YG Jiang Proceedings of the 30th ACM International Conference on Multimedia, 3224-3233, 2022 | 8 | 2022 |
CgT-GAN: CLIP-guided Text GAN for Image Captioning J Yu, H Li, Y Hao, B Zhu, T Xu, X He Proceedings of the 31st ACM International Conference on Multimedia, 2252-2263, 2023 | 6 | 2023 |
Unsupervised video hashing with multi-granularity contextualization and multi-structure preservation Y Hao, J Duan, H Zhang, B Zhu, P Zhou, X He Proceedings of the 30th ACM International Conference on Multimedia, 3754-3763, 2022 | 6 | 2022 |
Learning to match anchor-target video pairs with dual attentional holographic networks Y Hao, CW Ngo, B Zhu IEEE Transactions on Image Processing 30, 8130-8143, 2021 | 5 | 2021 |
Pyramid fusion dark channel prior for single image dehazing Q Liang, B Zhu, CW Ngo arXiv preprint arXiv:2105.10192, 2021 | 5 | 2021 |
Foodlmm: A versatile food assistant using large multi-modal model Y Yin, H Qi, B Zhu, J Chen, YG Jiang, CW Ngo arXiv preprint arXiv:2312.14991, 2023 | 4 | 2023 |
Cross-lingual adaptation for recipe retrieval with mixup B Zhu, CW Ngo, J Chen, WK Chan Proceedings of the 2022 International Conference on Multimedia Retrieval …, 2022 | 4 | 2022 |
Text-driven Video Prediction X Song, J Chen, B Zhu, Y Jiang ACM Transactions on Multimedia Computing, Communications, and Applications …, 2024 | 3 | 2024 |
CAR: consolidation, augmentation and regulation for recipe retrieval F Song, B Zhu, Y Hao, S Wang, X He arXiv preprint arXiv:2312.04763, 2023 | 2 | 2023 |
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios G Liu, Y Jiao, J Chen, B Zhu, YG Jiang IEEE Transactions on Multimedia, 2024 | 1 | 2024 |
RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models P Jiao, X Wu, B Zhu, J Chen, CW Ngo, Y Jiang arXiv preprint arXiv:2407.12730, 2024 | | 2024 |
Model Inversion Attacks Through Target-Specific Conditional Diffusion Models O Li, Y Hao, Z Wang, B Zhu, S Wang, Z Zhang, F Feng arXiv preprint arXiv:2407.11424, 2024 | | 2024 |
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective F Song, B Zhu, Y Hao, S Wang European Conference on Computer Vision (ECCV), 2024 | | 2024 |