[HTML][HTML] Review of large vision models and visual prompt engineering
Visual prompt engineering is a fundamental methodology in the field of visual and image
artificial general intelligence. As the development of large vision models progresses, the …
artificial general intelligence. As the development of large vision models progresses, the …
A survey of human-in-the-loop for machine learning
Abstract Machine learning has become the state-of-the-art technique for many tasks
including computer vision, natural language processing, speech processing tasks, etc …
including computer vision, natural language processing, speech processing tasks, etc …
Segment anything
Abstract We introduce the Segment Anything (SA) project: a new task, model, and dataset for
image segmentation. Using our efficient model in a data collection loop, we built the largest …
image segmentation. Using our efficient model in a data collection loop, we built the largest …
Segment everything everywhere all at once
In this work, we present SEEM, a promotable and interactive model for segmenting
everything everywhere all at once in an image. In SEEM, we propose a novel and versatile …
everything everywhere all at once in an image. In SEEM, we propose a novel and versatile …
Medical sam adapter: Adapting segment anything model for medical image segmentation
The Segment Anything Model (SAM) has recently gained popularity in the field of image
segmentation due to its impressive capabilities in various segmentation tasks and its prompt …
segmentation due to its impressive capabilities in various segmentation tasks and its prompt …
Seggpt: Segmenting everything in context
We present SegGPT, a generalist model for segmenting everything in context. We unify
various segmentation tasks into a generalist in-context learning framework that …
various segmentation tasks into a generalist in-context learning framework that …
Semantic-sam: Segment and recognize anything at any granularity
In this paper, we introduce Semantic-SAM, a universal image segmentation model to enable
segment and recognize anything at any desired granularity. Our model offers two key …
segment and recognize anything at any desired granularity. Our model offers two key …
Simpleclick: Interactive image segmentation with simple vision transformers
Click-based interactive image segmentation aims at extracting objects with a limited user
clicking. A hierarchical backbone is the de-facto architecture for current methods. Recently …
clicking. A hierarchical backbone is the de-facto architecture for current methods. Recently …
Focalclick: Towards practical interactive image segmentation
Interactive segmentation allows users to extract target masks by making positive/negative
clicks. Although explored by many previous works, there is still a gap between academic …
clicks. Although explored by many previous works, there is still a gap between academic …
Ai alignment: A comprehensive survey
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …