A systematic survey of prompt engineering on vision-language foundation models J Gu, Z Han, S Chen, A Beirami, B He, G Zhang, R Liao, Y Qin, V Tresp, ... arXiv preprint arXiv:2307.12980, 2023 | 64 | 2023 |
Time-dependent entity embedding is not all you need: A re-evaluation of temporal knowledge graph completion models under a unified framework Z Han*, G Zhang*, Y Ma, V Tresp Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 19 | 2021 |
Cl-crossvqa: A continual learning benchmark for cross-domain visual question answering Y Zhang, H Chen, A Frikha, Y Yang, D Krompass, G Zhang, J Gu, V Tresp arXiv preprint arXiv:2211.10567, 2022 | 8 | 2022 |
Multi-event Video-Text Retrieval G Zhang, J Ren, J Gu, V Tresp Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 5 | 2023 |
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning G Zhang, Y Zhang, K Zhang, V Tresp Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 1 | 2024 |
SPOT! Revisiting Video-Language Models for Event Understanding G Zhang, J Bi, J Gu, V Tresp arXiv preprint arXiv:2311.12919, 2023 | 1 | 2023 |
Localizing Events in Videos with Multimodal Queries G Zhang, MLA Fok, Y Xia, Y Tang, D Cremers, P Torr, V Tresp, J Gu arXiv preprint arXiv:2406.10079, 2024 | | 2024 |
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning Supplementary Materials G Zhang, Y Zhang, K Zhang, V Tresp, AD WikiTiLo Middle East 11, 16, 0 | | |