Reltr: Relation transformer for scene graph generation
Different objects in the same scene are more or less related to each other, but only a limited
number of these relationships are noteworthy. Inspired by Detection Transformer, which …
number of these relationships are noteworthy. Inspired by Detection Transformer, which …
Prototype-based embedding network for scene graph generation
Abstract Current Scene Graph Generation (SGG) methods explore contextual information to
predict relationships among entity pairs. However, due to the diverse visual appearance of …
predict relationships among entity pairs. However, due to the diverse visual appearance of …
[HTML][HTML] Scene graph generation: A comprehensive survey
Deep learning techniques have led to remarkable breakthroughs in the field of object
detection and have spawned a lot of scene-understanding tasks in recent years. Scene …
detection and have spawned a lot of scene-understanding tasks in recent years. Scene …
Sgtr: End-to-end scene graph generation with transformer
Abstract Scene Graph Generation (SGG) remains a challenging visual understanding task
due to its compositional property. Most previous works adopt a bottom-up two-stage or a …
due to its compositional property. Most previous works adopt a bottom-up two-stage or a …
Visually-prompted language model for fine-grained scene graph generation in an open world
Abstract Scene Graph Generation (SGG) aims to extract< subject, predicate, object>
relationships in images for vision understanding. Although recent works have made steady …
relationships in images for vision understanding. Although recent works have made steady …
A survey of deep learning for low-shot object detection
Object detection has achieved a huge breakthrough with deep neural networks and massive
annotated data. However, current detection methods cannot be directly transferred to the …
annotated data. However, current detection methods cannot be directly transferred to the …
Multilateral semantic relations modeling for image text retrieval
Z Wang, Z Gao, K Guo, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Image-text retrieval is a fundamental task to bridge vision and language by exploiting
various strategies to fine-grained alignment between regions and words. This is still tough …
various strategies to fine-grained alignment between regions and words. This is still tough …
Egtr: Extracting graph from transformer for scene graph generation
Abstract Scene Graph Generation (SGG) is a challenging task of detecting objects and
predicting relationships between objects. After DETR was developed one-stage SGG …
predicting relationships between objects. After DETR was developed one-stage SGG …
Pair then relation: Pair-net for panoptic scene graph generation
Panoptic Scene Graph (PSG) is a challenging task in Scene Graph Generation (SGG) that
aims to create a more comprehensive scene graph representation using panoptic …
aims to create a more comprehensive scene graph representation using panoptic …
Zero-shot visual relation detection via composite visual cues from large language models
Pretrained vision-language models, such as CLIP, have demonstrated strong generalization
capabilities, making them promising tools in the realm of zero-shot visual recognition. Visual …
capabilities, making them promising tools in the realm of zero-shot visual recognition. Visual …