Scaling robot learning with semantically imagined experience

W Huang, C Wang, R Zhang, Y Li, J Wu… - arXiv preprint arXiv …, 2023 - arxiv.org

Large language models (LLMs) are shown to possess a wealth of actionable knowledge that
can be extracted for robot manipulation in the form of reasoning and planning. Despite the …

被引用次数：237 相关文章所有 6 个版本

[PDF] arxiv.org

Dall-e-bot: Introducing web-scale diffusion models to robotics

I Kapelyukh, V Vosylius, E Johns - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

We introduce the first work to explore web-scale diffusion models for robotics. DALL-E-Bot
enables a robot to rearrange objects in a scene, by first inferring a text description of those …

被引用次数：89 相关文章所有 5 个版本

[PDF] neurips.cc

Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning

H He, C Bai, K Xu, Z Yang, W Zhang… - Advances in neural …, 2024 - proceedings.neurips.cc

Diffusion models have demonstrated highly-expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …

被引用次数：32 相关文章所有 5 个版本

[PDF] arxiv.org

A systematic survey of prompt engineering on vision-language foundation models

J Gu, Z Han, S Chen, A Beirami, B He, G Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Prompt engineering is a technique that involves augmenting a large pre-trained model with
task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be …

被引用次数：71 相关文章所有 3 个版本

[PDF] mlr.press

Generative skill chaining: Long-horizon skill planning with diffusion models

UA Mishra, S Xue, Y Chen… - Conference on Robot …, 2023 - proceedings.mlr.press

Long-horizon tasks, usually characterized by complex subtask dependencies, present a
significant challenge in manipulation planning. Skill chaining is a practical approach to …

被引用次数：20 相关文章所有 6 个版本

[PDF] arxiv.org

Octo: An open-source generalist robot policy

OM Team, D Ghosh, H Walke, K Pertsch… - arXiv preprint arXiv …, 2024 - arxiv.org

Large policies pretrained on diverse robot datasets have the potential to transform robotic
learning: instead of training new policies from scratch, such generalist robot policies may be …

被引用次数：40 相关文章所有 3 个版本

[PDF] arxiv.org

Zero-shot robotic manipulation with pretrained image-editing diffusion models

K Black, M Nakamoto, P Atreya, H Walke… - arXiv preprint arXiv …, 2023 - arxiv.org

If generalist robots are to operate in truly unstructured environments, they need to be able to
recognize and reason about novel objects and scenarios. Such objects and scenarios might …

被引用次数：32 相关文章所有 4 个版本

[PDF] arxiv.org

Roboagent: Generalization and efficiency in robot manipulation via semantic augmentations and action chunking

H Bharadhwaj, J Vakil, M Sharma, A Gupta… - arXiv preprint arXiv …, 2023 - arxiv.org

The grand aim of having a single robot that can manipulate arbitrary objects in diverse
settings is at odds with the paucity of robotics datasets. Acquiring and growing such datasets …

被引用次数：36 相关文章所有 3 个版本

[PDF] arxiv.org

Language-conditioned learning for robotic manipulation: A survey

H Zhou, X Yao, Y Meng, S Sun, Z BIng, K Huang… - arXiv preprint arXiv …, 2023 - arxiv.org

Language-conditioned robotic manipulation represents a cutting-edge area of research,
enabling seamless communication and cooperation between humans and robotic agents …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

A survey of reinforcement learning from human feedback

T Kaufmann, P Weng, V Bengs… - arXiv preprint arXiv …, 2023 - arxiv.org

Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …

被引用次数：38 相关文章所有 4 个版本