RS5M and GeoRSCLIP: A large scale vision-language dataset and a large vision-language model for remote sensing

Z Zhang, T Zhao, Y Guo, J Yin - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Pretrained vision-language models (VLMs) utilizing extensive image–text paired data have
demonstrated unprecedented image–text association capabilities, achieving remarkable …

Applications of knowledge distillation in remote sensing: A survey

Y Himeur, N Aburaed, O Elharrouss, I Varlamis… - Information …, 2024 - Elsevier
With the ever-growing complexity of models in the field of remote sensing (RS), there is an
increasing demand for solutions that balance model accuracy with computational efficiency …

From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing

X Sun, B Peng, C Zhang, F Jin, Q Niu, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Remote sensing has evolved from simple image acquisition to complex systems capable of
integrating and processing visual and textual data. This review examines the development …

DMCH: A Deep Metric and Category-Level Semantic Hashing Network for Retrieval in Remote Sensing

H Huang, Q Cheng, Z Shao, X Huang, L Shao - Remote Sensing, 2023 - mdpi.com
The effectiveness of hashing methods in big data retrieval has been proved due to their
merit in computational and storage efficiency. Recently, encouraged by the strong …

Remote Sensing Image-Text Retrieval with Implicit-Explicit Relation Reasoning

L Yang, T Zhou, W Ma, M Du, L Liu, F Li… - … on Geoscience and …, 2024 - ieeexplore.ieee.org
Remote sensing image-text retrieval (RSITR) has become a research hotspot in recent years
for its wide application. Existing methods in this context, based either on local or global …

An Enhanced Feature Extraction Framework for Cross-Modal Image–Text Retrieval

J Zhang, L Wang, F Zheng, X Wang, H Zhang - Remote Sensing, 2024 - mdpi.com
In general, remote sensing images depict intricate scenes. In cross-modal retrieval tasks
involving remote sensing images, the accompanying text includes numerus information with …

[HTML][HTML] CTDUNet: A Multimodal CNN–Transformer Dual U-Shaped Network with Coordinate Space Attention for Camellia oleifera Pests and Diseases Segmentation …

R Guo, R Zhang, H Zhou, T Xie, Y Peng, X Chen, G Yu… - Plants, 2024 - mdpi.com
Camellia oleifera is a crop of high economic value, yet it is particularly susceptible to various
diseases and pests that significantly reduce its yield and quality. Consequently, the precise …