UVIS: Unsupervised Video Instance Segmentation

S Huang, S Suri, K Gupta… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video instance segmentation requires classifying segmenting and tracking every object
across video frames. Unlike existing approaches that rely on masks boxes or category labels …

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan… - arXiv preprint arXiv …, 2023 - arxiv.org
Dyna-style model-based reinforcement learning contains two phases: model rollouts to
generate sample for policy learning and real environment exploration using current policy …

Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion

Y Liang, T Xu, K Hu, G Jiang, F Huang, H Xu - arXiv preprint arXiv …, 2024 - arxiv.org
Can we generate a control policy for an agent using just one demonstration of desired
behaviors as a prompt, as effortlessly as creating an image from a textual description? In this …

DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control

ZJ Cui, H Pan, A Iyer, S Haldar, L Pinto - arXiv preprint arXiv:2409.12192, 2024 - arxiv.org
Imitation learning has proven to be a powerful tool for training complex visuomotor policies.
However, current methods often require hundreds to thousands of expert demonstrations to …

TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies

R Zheng, Y Liang, S Huang, J Gao, H Daumé III… - arXiv preprint arXiv …, 2024 - arxiv.org
Although large vision-language-action (VLA) models pretrained on extensive robot datasets
offer promising generalist policies for robotic learning, they still struggle with spatial …