Towards natural language-guided drones: GeoText-1652 benchmark with spatial relation matching
Navigating drones through natural language commands remains challenging due to the
dearth of accessible multi-modal datasets and the stringent precision requirements for …
dearth of accessible multi-modal datasets and the stringent precision requirements for …
[HTML][HTML] A Review on Deep Learning for UAV Absolute Visual Localization
A Couturier, MA Akhloufi - Drones, 2024 - mdpi.com
In the past few years, the use of Unmanned Aerial Vehicles (UAVs) has expanded and now
reached mainstream levels for applications such as infrastructure inspection, agriculture …
reached mainstream levels for applications such as infrastructure inspection, agriculture …
Learning cross-view geo-localization embeddings via dynamic weighted decorrelation regularization
In the domain of cross-view geo-localization, the challenge lies in accurately matching
images captured from distinct perspectives, such as aerial drone imagery and satellite …
images captured from distinct perspectives, such as aerial drone imagery and satellite …
Hybrid CNN-transformer features for visual place recognition
Y Wang, Y Qiu, P Cheng… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Visual place recognition is a challenging problem in robotics and autonomous systems
because the scene undergoes appearance and viewpoint changes in a changing world …
because the scene undergoes appearance and viewpoint changes in a changing world …
Uav's status is worth considering: A fusion representations matching method for geo-localization
Visual geo-localization plays a crucial role in positioning and navigation for unmanned
aerial vehicles, whose goal is to match the same geographic target from different views. This …
aerial vehicles, whose goal is to match the same geographic target from different views. This …
Pseudo-mono for monocular 3d object detection in autonomous driving
C Tao, J Cao, C Wang, Z Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Current monocular 3D object detection algorithms generally suffer from inaccurate depth
estimation, which leads to reduction of detection accuracy. The depth error from image-to …
estimation, which leads to reduction of detection accuracy. The depth error from image-to …
Superpixel-based multiscale CNN approach toward multiclass object segmentation from UAV-captured aerial images
Unmanned aerial vehicles (UAVs) are promising remote sensors capable of reforming
remote sensing applications. However, for artificial-intelligence-guided tasks, such as land …
remote sensing applications. However, for artificial-intelligence-guided tasks, such as land …
MCCG: A ConvNeXt-based multiple-classifier method for cross-view geo-localization
T Shen, Y Wei, L Kang, S Wan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The key to crossview geolocalization is to match images of the same target from different
viewpoints, eg, images from drones and satellites. It is a challenging problem due to the …
viewpoints, eg, images from drones and satellites. It is a challenging problem due to the …
Parformer: Transformer-based multi-task network for pedestrian attribute recognition
Pedestrian attribute recognition (PAR) has received increasing attention because of its wide
application in video surveillance and pedestrian analysis. Extracting robust feature …
application in video surveillance and pedestrian analysis. Extracting robust feature …
SUES-200: A multi-height multi-scene cross-view image benchmark across drone and satellite
Cross-view image matching aims to match images of the same target scene acquired from
different platforms. With the rapid development of drone technology, cross-view matching by …
different platforms. With the rapid development of drone technology, cross-view matching by …