4d panoptic scene graph generation

J Yang, J Cen, W Peng, S Liu, F Hong… - Advances in …, 2024 - proceedings.neurips.cc
We are living in a three-dimensional space while moving forward through a fourth
dimension: time. To allow artificial intelligence to develop a comprehensive understanding …

Aims: All-inclusive multi-level segmentation for anything

L Qi, J Kuen, W Guo, J Gu, Z Lin, B Du… - Advances in Neural …, 2024 - proceedings.neurips.cc
Despite the progress of image segmentation for accurate visual entity segmentation,
completing the diverse requirements of image editing applications for different-level region …

A Review and Efficient Implementation of Scene Graph Generation Metrics

J Lorenz, R Schön, K Ludwig… - Proceedings of the …, 2024 - openaccess.thecvf.com
Scene graph generation has emerged as a prominent research field in computer vision
witnessing significant advancements in the recent years. However despite these strides …

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Z Hayder, X He - Proceedings of the IEEE/CVF Conference …, 2024 - openaccess.thecvf.com
Scene graph generation aims to capture detailed spatial and semantic relationships
between objects in an image which is challenging due to incomplete labeling long-tailed …

Vlprompt: Vision-language prompting for panoptic scene graph generation

Z Zhou, M Shi, H Caesar - arXiv preprint arXiv:2311.16492, 2023 - arxiv.org
Panoptic Scene Graph Generation (PSG) aims at achieving a comprehensive image
understanding by simultaneously segmenting objects and predicting relations among …

Losh: Long-short text joint prediction network for referring video object segmentation

L Yuan, M Shi, Z Yue, Q Chen - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Referring video object segmentation (RVOS) aims to segment the target instance referred by
a given text expression in a video clip. The text expression normally contains sophisticated …

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Z Zhou, Z Zhu, H Caesar, M Shi - arXiv preprint arXiv:2407.11213, 2024 - arxiv.org
Panoptic Scene Graph Generation (PSG) aims to segment objects and recognize their
relations, enabling the structured understanding of an image. Previous methods focus on …

From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation

H Shi, L Li, J Xiao, Y Zhuang, L Chen - arXiv preprint arXiv:2407.09191, 2024 - arxiv.org
Panoptic Scene Graph Generation (PSG) aims to generate a comprehensive graph-structure
representation based on panoptic segmentation masks. Despite remarkable progress in …

[PDF][PDF] DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation Supplementary Material

Z Hayder, X He - openaccess.thecvf.com
In this section, we first provide a summary highlighting the key contributions of our method,
along with comparisons to [4] and [7]. Following this, we offer an additional comparison with …

[PDF][PDF] A Space Information-Enhanced Dense Video Caption for Indoor Human Action Recognition

B CHEN, Y NAKAMURA, S FUKUSHIMA, Y ARAKAWA - app.ait.kyushu-u.ac.jp
Dense video captioning tasks are used to detect interesting events and provide descriptive
text for these events from untrimmed videos. This technology has the potential to be used in …