Applications of machine vision in agricultural robot navigation: A review
T Wang, B Chen, Z Zhang, H Li, M Zhang - Computers and Electronics in …, 2022 - Elsevier
Many tasks in smart agriculture have further requirements for the autonomous navigation of
agricultural robots. Due to irreplaceable visual information and low-cost hardware costs …
agricultural robots. Due to irreplaceable visual information and low-cost hardware costs …
Few-shot object detection: A survey
Deep learning approaches have recently raised the bar in many fields, from Natural
Language Processing to Computer Vision, by leveraging large amounts of data. However …
Language Processing to Computer Vision, by leveraging large amounts of data. However …
Simple open-vocabulary object detection
Combining simple architectures with large-scale pre-training has led to massive
improvements in image classification. For object detection, pre-training and scaling …
improvements in image classification. For object detection, pre-training and scaling …
Glipv2: Unifying localization and vision-language understanding
We present GLIPv2, a grounded VL understanding model, that serves both localization tasks
(eg, object detection, instance segmentation) and Vision-Language (VL) understanding …
(eg, object detection, instance segmentation) and Vision-Language (VL) understanding …
Multimodal foundation models: From specialists to general-purpose assistants
Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …
methods to data compression. Recent advances in statistical machine learning have opened …
Detecting twenty-thousand classes using image-level supervision
Current object detectors are limited in vocabulary size due to the small scale of detection
datasets. Image classifiers, on the other hand, reason about much larger vocabularies, as …
datasets. Image classifiers, on the other hand, reason about much larger vocabularies, as …
Grounded language-image pre-training
This paper presents a grounded language-image pre-training (GLIP) model for learning
object-level, language-aware, and semantic-rich visual representations. GLIP unifies object …
object-level, language-aware, and semantic-rich visual representations. GLIP unifies object …
Regionclip: Region-based language-image pretraining
Contrastive language-image pretraining (CLIP) using image-text pairs has achieved
impressive results on image classification in both zero-shot and transfer learning settings …
impressive results on image classification in both zero-shot and transfer learning settings …
Florence: A new foundation model for computer vision
Automated visual understanding of our diverse and open world demands computer vision
models to generalize well with minimal customization for specific tasks, similar to human …
models to generalize well with minimal customization for specific tasks, similar to human …
Learning to prompt for open-vocabulary object detection with vision-language model
Recently, vision-language pre-training shows great potential in open-vocabulary object
detection, where detectors trained on base classes are devised for detecting new classes …
detection, where detectors trained on base classes are devised for detecting new classes …