Object detection in 20 years: A survey
Object detection, as of one the most fundamental and challenging problems in computer
vision, has received great attention in recent years. Over the past two decades, we have …
vision, has received great attention in recent years. Over the past two decades, we have …
Computer vision for autonomous vehicles: Problems, datasets and state of the art
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …
Panoptic segmentation
We propose and study a task we name panoptic segmentation (PS). Panoptic segmentation
unifies the typically distinct tasks of semantic segmentation (assign a class label to each …
unifies the typically distinct tasks of semantic segmentation (assign a class label to each …
Action genome: Actions as compositions of spatio-temporal scene graphs
Action recognition has typically treated actions and activities as monolithic events that occur
in videos. However, there is evidence from Cognitive Science and Neuroscience that people …
in videos. However, there is evidence from Cognitive Science and Neuroscience that people …
Drg: Dual relation graph for human-object interaction detection
We tackle the challenging problem of human-object interaction (HOI) detection. Existing
methods either recognize the interaction of each human-object pair in isolation or perform …
methods either recognize the interaction of each human-object pair in isolation or perform …
Soft-NMS--improving object detection with one line of code
N Bodla, B Singh, R Chellappa… - Proceedings of the …, 2017 - openaccess.thecvf.com
Non-maximum suppression is an integral part of the object detection pipeline. First, it sorts
all detection boxes on the basis of their scores. The detection box M with the maximum score …
all detection boxes on the basis of their scores. The detection box M with the maximum score …
Scene graph generation by iterative message passing
Understanding a visual scene goes beyond recognizing individual objects in isolation.
Relationships between objects also constitute rich semantic information about the scene. In …
Relationships between objects also constitute rich semantic information about the scene. In …
Repulsion loss: Detecting pedestrians in a crowd
Detecting individual pedestrians in a crowd remains a challenging problem since the
pedestrians often gather together and occlude each other in real-world scenarios. In this …
pedestrians often gather together and occlude each other in real-world scenarios. In this …
Visual translation embedding network for visual relation detection
Visual relations, such as" person ride bike" and" bike next to car", offer a comprehensive
scene understanding of an image, and have already shown their great utility in connecting …
scene understanding of an image, and have already shown their great utility in connecting …
Image retrieval using scene graphs
This paper develops a novel framework for semantic image retrieval based on the notion of
a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" …
a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" …