Towards natural language-guided drones: GeoText-1652 benchmark with spatial relation matching

M Chu, Z Zheng, W Ji, T Wang, TS Chua - European Conference on …, 2025 - Springer
Navigating drones through natural language commands remains challenging due to the
dearth of accessible multi-modal datasets and the stringent precision requirements for …

[HTML][HTML] A Review on Deep Learning for UAV Absolute Visual Localization

A Couturier, MA Akhloufi - Drones, 2024 - mdpi.com
In the past few years, the use of Unmanned Aerial Vehicles (UAVs) has expanded and now
reached mainstream levels for applications such as infrastructure inspection, agriculture …

Learning cross-view geo-localization embeddings via dynamic weighted decorrelation regularization

T Wang, Z Zheng, Z Zhu, Y Sun, C Yan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In the domain of cross-view geo-localization, the challenge lies in accurately matching
images captured from distinct perspectives, such as aerial drone imagery and satellite …

Hybrid CNN-transformer features for visual place recognition

Y Wang, Y Qiu, P Cheng… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Visual place recognition is a challenging problem in robotics and autonomous systems
because the scene undergoes appearance and viewpoint changes in a changing world …

Uav's status is worth considering: A fusion representations matching method for geo-localization

R Zhu, M Yang, L Yin, F Wu, Y Yang - Sensors, 2023 - mdpi.com
Visual geo-localization plays a crucial role in positioning and navigation for unmanned
aerial vehicles, whose goal is to match the same geographic target from different views. This …

Pseudo-mono for monocular 3d object detection in autonomous driving

C Tao, J Cao, C Wang, Z Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Current monocular 3D object detection algorithms generally suffer from inaccurate depth
estimation, which leads to reduction of detection accuracy. The depth error from image-to …

Superpixel-based multiscale CNN approach toward multiclass object segmentation from UAV-captured aerial images

TK Behera, S Bakshi, M Nappi… - IEEE Journal of Selected …, 2023 - ieeexplore.ieee.org
Unmanned aerial vehicles (UAVs) are promising remote sensors capable of reforming
remote sensing applications. However, for artificial-intelligence-guided tasks, such as land …

MCCG: A ConvNeXt-based multiple-classifier method for cross-view geo-localization

T Shen, Y Wei, L Kang, S Wan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The key to crossview geolocalization is to match images of the same target from different
viewpoints, eg, images from drones and satellites. It is a challenging problem due to the …

Parformer: Transformer-based multi-task network for pedestrian attribute recognition

X Fan, Y Zhang, Y Lu, H Wang - IEEE Transactions on Circuits …, 2023 - ieeexplore.ieee.org
Pedestrian attribute recognition (PAR) has received increasing attention because of its wide
application in video surveillance and pedestrian analysis. Extracting robust feature …

SUES-200: A multi-height multi-scene cross-view image benchmark across drone and satellite

R Zhu, L Yin, M Yang, F Wu, Y Yang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Cross-view image matching aims to match images of the same target scene acquired from
different platforms. With the rapid development of drone technology, cross-view matching by …