A comprehensive survey of scene graphs: Generation and application

X Chang, P Ren, P Xu, Z Li, X Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Scene graph is a structured representation of a scene that can clearly express the objects,
attributes, and relationships between objects in the scene. As computer vision technology …

Graph representation learning meets computer vision: A survey

L Jiao, J Chen, F Liu, S Yang, C You… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
A graph structure is a powerful mathematical abstraction, which can not only represent
information about individuals but also capture the interactions between individuals for …

[HTML][HTML] Cpt: Colorful prompt tuning for pre-trained vision-language models

Y Yao, A Zhang, Z Zhang, Z Liu, TS Chua, M Sun - AI Open, 2024 - Elsevier
Abstract Vision-Language Pre-training (VLP) models have shown promising capabilities in
grounding natural language in image data, facilitating a broad range of cross-modal tasks …

Panoptic scene graph generation

J Yang, YZ Ang, Z Guo, K Zhou, W Zhang… - European Conference on …, 2022 - Springer
Existing research addresses scene graph generation (SGG)—a critical technology for scene
understanding in images—from a detection perspective, ie., objects are detected using …

Unbiased scene graph generation from biased training

K Tang, Y Niu, J Huang, J Shi… - Proceedings of the …, 2020 - openaccess.thecvf.com
Today's scene graph generation (SGG) task is still far from practical, mainly due to the
severe training bias, eg, collapsing diverse" human walk on/sit on/lay on beach" into" human …

Graphadapter: Tuning vision-language models with dual knowledge graph

X Li, D Lian, Z Lu, J Bai, Z Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Adapter-style efficient transfer learning (ETL) has shown excellent performance in the tuning
of vision-language models (VLMs) under the low-data regime, where only a few additional …

Bipartite graph network with adaptive message passing for unbiased scene graph generation

R Li, S Zhang, B Wan, X He - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Scene graph generation is an important visual understanding task with a broad range of
vision applications. Despite recent tremendous progress, it remains challenging due to the …

[图书][B] Deep learning on graphs

Y Ma, J Tang - 2021 - books.google.com
Deep learning on graphs has become one of the hottest topics in machine learning. The
book consists of four parts to best accommodate our readers with diverse backgrounds and …

Mukea: Multimodal knowledge extraction and accumulation for knowledge-based visual question answering

Y Ding, J Yu, B Liu, Y Hu, M Cui… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Abstract Knowledge-based visual question answering requires the ability of associating
external knowledge for open-ended cross-modal scene understanding. One limitation of …

The devil is in the labels: Noisy label correction for robust scene graph generation

L Li, L Chen, Y Huang, Z Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Unbiased SGG has achieved significant progress over recent years. However, almost all
existing SGG models have overlooked the ground-truth annotation qualities of prevailing …