Object detection in 20 years: A survey

Z Zou, K Chen, Z Shi, Y Guo, J Ye - Proceedings of the IEEE, 2023 - ieeexplore.ieee.org
Object detection, as of one the most fundamental and challenging problems in computer
vision, has received great attention in recent years. Over the past two decades, we have …

Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Panoptic segmentation

A Kirillov, K He, R Girshick… - Proceedings of the …, 2019 - openaccess.thecvf.com
We propose and study a task we name panoptic segmentation (PS). Panoptic segmentation
unifies the typically distinct tasks of semantic segmentation (assign a class label to each …

Action genome: Actions as compositions of spatio-temporal scene graphs

J Ji, R Krishna, L Fei-Fei… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Action recognition has typically treated actions and activities as monolithic events that occur
in videos. However, there is evidence from Cognitive Science and Neuroscience that people …

Drg: Dual relation graph for human-object interaction detection

C Gao, J Xu, Y Zou, JB Huang - … Conference, Glasgow, UK, August 23–28 …, 2020 - Springer
We tackle the challenging problem of human-object interaction (HOI) detection. Existing
methods either recognize the interaction of each human-object pair in isolation or perform …

Soft-NMS--improving object detection with one line of code

N Bodla, B Singh, R Chellappa… - Proceedings of the …, 2017 - openaccess.thecvf.com
Non-maximum suppression is an integral part of the object detection pipeline. First, it sorts
all detection boxes on the basis of their scores. The detection box M with the maximum score …

Scene graph generation by iterative message passing

D Xu, Y Zhu, CB Choy, L Fei-Fei - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Understanding a visual scene goes beyond recognizing individual objects in isolation.
Relationships between objects also constitute rich semantic information about the scene. In …

Repulsion loss: Detecting pedestrians in a crowd

X Wang, T Xiao, Y Jiang, S Shao… - Proceedings of the …, 2018 - openaccess.thecvf.com
Detecting individual pedestrians in a crowd remains a challenging problem since the
pedestrians often gather together and occlude each other in real-world scenarios. In this …

Visual translation embedding network for visual relation detection

H Zhang, Z Kyaw, SF Chang… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Visual relations, such as" person ride bike" and" bike next to car", offer a comprehensive
scene understanding of an image, and have already shown their great utility in connecting …

Image retrieval using scene graphs

J Johnson, R Krishna, M Stark, LJ Li… - Proceedings of the …, 2015 - openaccess.thecvf.com
This paper develops a novel framework for semantic image retrieval based on the notion of
a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" …