Towards open vocabulary learning: A survey
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …
advancements in various core tasks like segmentation, tracking, and detection. However …
Semi-supervised vocabulary-informed learning
Despite significant progress in object categorization, in recent years, a number of important
challenges remain; mainly, ability to learn from limited labeled data and ability to recognize …
challenges remain; mainly, ability to learn from limited labeled data and ability to recognize …
Open vocabulary object detection with pseudo bounding-box labels
Despite great progress in object detection, most existing methods work only on a limited set
of object categories, due to the tremendous human effort needed for bounding-box …
of object categories, due to the tremendous human effort needed for bounding-box …
Edadet: Open-vocabulary object detection using early dense alignment
Vision-language models such as CLIP have boosted the performance of open-vocabulary
object detection, where the detector is trained on base categories but required to detect …
object detection, where the detector is trained on base categories but required to detect …
Open vocabulary semantic segmentation with patch aligned contrastive learning
Abstract We introduce Patch Aligned Contrastive Learning (PACL), a modified compatibility
function for CLIP's contrastive loss, intending to train an alignment between the patch tokens …
function for CLIP's contrastive loss, intending to train an alignment between the patch tokens …
Non-contrastive learning meets language-image pre-training
Contrastive language-image pre-training (CLIP) serves as a de-facto standard to align
images and texts. Nonetheless, the loose correlation between images and texts of web …
images and texts. Nonetheless, the loose correlation between images and texts of web …
General object foundation model for images and videos at scale
We present GLEE in this work an object-level foundation model for locating and identifying
objects in images and videos. Through a unified framework GLEEaccomplishes detection …
objects in images and videos. Through a unified framework GLEEaccomplishes detection …
Learning to detect and segment for open vocabulary object detection
T Wang - Proceedings of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Open vocabulary object detection has been greately advanced by the recent development of
vision-language pre-trained model, which helps recognizing the novel objects with only …
vision-language pre-trained model, which helps recognizing the novel objects with only …
Open-vocabulary object detection using captions
Despite the remarkable accuracy of deep neural networks in object detection, they are costly
to train and scale due to supervision requirements. Particularly, learning more object …
to train and scale due to supervision requirements. Particularly, learning more object …