Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Multiple object tracking: A literature review

W Luo, J Xing, A Milan, X Zhang, W Liu, TK Kim - Artificial intelligence, 2021 - Elsevier
Abstract Multiple Object Tracking (MOT) has gained increasing attention due to its academic
and commercial potential. Although different approaches have been proposed to tackle this …

Siamese masked autoencoders

A Gupta, J Wu, J Deng, FF Li - Advances in Neural …, 2023 - proceedings.neurips.cc
Establishing correspondence between images or scenes is a significant challenge in
computer vision, especially given occlusions, viewpoint changes, and varying object …

Motiontrack: Learning robust short-term and long-term motions for multi-object tracking

Z Qin, S Zhou, L Wang, J Duan… - Proceedings of the …, 2023 - openaccess.thecvf.com
The main challenge of Multi-Object Tracking (MOT) lies in maintaining a continuous
trajectory for each target. Existing methods often learn reliable motion patterns to match the …

Argoverse: 3d tracking and forecasting with rich maps

MF Chang, J Lambert, P Sangkloy… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present Argoverse, a dataset designed to support autonomous vehicle perception tasks
including 3D tracking and motion forecasting. Argoverse includes sensor data collected by a …

Deep learning in video multi-object tracking: A survey

G Ciaparrone, FL Sánchez, S Tabik, L Troiano… - Neurocomputing, 2020 - Elsevier
Abstract The problem of Multiple Object Tracking (MOT) consists in following the trajectory of
different objects in a sequence, usually a video. In recent years, with the rise of Deep …

Kimera: From SLAM to spatial perception with 3D dynamic scene graphs

A Rosinol, A Violette, M Abate… - … Journal of Robotics …, 2021 - journals.sagepub.com
Humans are able to form a complex mental model of the environment they move in. This
mental model captures geometric and semantic aspects of the scene, describes the …

Learning correspondence from the cycle-consistency of time

X Wang, A Jabri, AA Efros - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
We introduce a self-supervised method for learning visual correspondence from unlabeled
video. The main idea is to use cycle-consistency in time as free supervisory signal for …

Monitoring COVID-19 social distancing with person detection and tracking via fine-tuned YOLO v3 and Deepsort techniques

NS Punn, SK Sonbhadra, S Agarwal, G Rai - arXiv preprint arXiv …, 2020 - arxiv.org
The rampant coronavirus disease 2019 (COVID-19) has brought global crisis with its deadly
spread to more than 180 countries, and about 3,519,901 confirmed cases along with …

Vision meets drones: A challenge

P Zhu, L Wen, X Bian, H Ling, Q Hu - arXiv preprint arXiv:1804.07437, 2018 - arxiv.org
In this paper we present a large-scale visual object detection and tracking benchmark,
named VisDrone2018, aiming at advancing visual understanding tasks on the drone …