RS5M and GeoRSCLIP: A large scale vision-language dataset and a large vision-language model for remote sensing
Pretrained vision-language models (VLMs) utilizing extensive image–text paired data have
demonstrated unprecedented image–text association capabilities, achieving remarkable …
demonstrated unprecedented image–text association capabilities, achieving remarkable …
Applications of knowledge distillation in remote sensing: A survey
With the ever-growing complexity of models in the field of remote sensing (RS), there is an
increasing demand for solutions that balance model accuracy with computational efficiency …
increasing demand for solutions that balance model accuracy with computational efficiency …
From Pixels to Prose: Advancing Multi-Modal Language Models for Remote Sensing
Remote sensing has evolved from simple image acquisition to complex systems capable of
integrating and processing visual and textual data. This review examines the development …
integrating and processing visual and textual data. This review examines the development …
DMCH: A Deep Metric and Category-Level Semantic Hashing Network for Retrieval in Remote Sensing
The effectiveness of hashing methods in big data retrieval has been proved due to their
merit in computational and storage efficiency. Recently, encouraged by the strong …
merit in computational and storage efficiency. Recently, encouraged by the strong …
Remote Sensing Image-Text Retrieval with Implicit-Explicit Relation Reasoning
Remote sensing image-text retrieval (RSITR) has become a research hotspot in recent years
for its wide application. Existing methods in this context, based either on local or global …
for its wide application. Existing methods in this context, based either on local or global …
An Enhanced Feature Extraction Framework for Cross-Modal Image–Text Retrieval
J Zhang, L Wang, F Zheng, X Wang, H Zhang - Remote Sensing, 2024 - mdpi.com
In general, remote sensing images depict intricate scenes. In cross-modal retrieval tasks
involving remote sensing images, the accompanying text includes numerus information with …
involving remote sensing images, the accompanying text includes numerus information with …
[HTML][HTML] CTDUNet: A Multimodal CNN–Transformer Dual U-Shaped Network with Coordinate Space Attention for Camellia oleifera Pests and Diseases Segmentation …
R Guo, R Zhang, H Zhou, T Xie, Y Peng, X Chen, G Yu… - Plants, 2024 - mdpi.com
Camellia oleifera is a crop of high economic value, yet it is particularly susceptible to various
diseases and pests that significantly reduce its yield and quality. Consequently, the precise …
diseases and pests that significantly reduce its yield and quality. Consequently, the precise …