Embodied communication: How robots and people communicate through physical interaction
A Kalinowska, PM Pilarski… - Annual review of control …, 2023 - annualreviews.org
Early research on physical human–robot interaction (pHRI) has necessarily focused on
device design—the creation of compliant and sensorized hardware, such as exoskeletons …
device design—the creation of compliant and sensorized hardware, such as exoskeletons …
Open problems and fundamental limitations of reinforcement learning from human feedback
Reinforcement learning from human feedback (RLHF) is a technique for training AI systems
to align with human goals. RLHF has emerged as the central method used to finetune state …
to align with human goals. RLHF has emerged as the central method used to finetune state …
Roboclip: One demonstration is enough to learn robot policies
Reward specification is a notoriously difficult problem in reinforcement learning, requiring
extensive expert supervision to design robust reward functions. Imitation learning (IL) …
extensive expert supervision to design robust reward functions. Imitation learning (IL) …
Few-shot preference learning for human-in-the-loop rl
DJ Hejna III, D Sadigh - Conference on Robot Learning, 2023 - proceedings.mlr.press
While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …
designing sufficiently informative reward functions for complex tasks has proven to be …
Inverse preference learning: Preference-based rl without a reward function
Reward functions are difficult to design and often hard to align with human intent. Preference-
based Reinforcement Learning (RL) algorithms address these problems by learning reward …
based Reinforcement Learning (RL) algorithms address these problems by learning reward …
Interactive imitation learning in robotics: A survey
Interactive Imitation Learning in Robotics: A Survey Page 1 Interactive Imitation Learning in
Robotics: A Survey Page 2 Other titles in Foundations and Trends® in Robotics A Survey on …
Robotics: A Survey Page 2 Other titles in Foundations and Trends® in Robotics A Survey on …
A survey of reinforcement learning from human feedback
Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning
(RL) that learns from human feedback instead of relying on an engineered reward function …
(RL) that learns from human feedback instead of relying on an engineered reward function …
Promptable behaviors: Personalizing multi-objective rewards from human preferences
Customizing robotic behaviors to be aligned with diverse human preferences is an
underexplored challenge in the field of embodied AI. In this paper we present Promptable …
underexplored challenge in the field of embodied AI. In this paper we present Promptable …
Guided reinforcement learning: A review and evaluation for efficient and effective real-world robotics [survey]
Recent successes aside, reinforcement learning (RL) still faces significant challenges in its
application to the real-world robotics domain. Guiding the learning process with additional …
application to the real-world robotics domain. Guiding the learning process with additional …
Active preference-based Gaussian process regression for reward learning and optimization
Designing reward functions is a difficult task in AI and robotics. The complex task of directly
specifying all the desirable behaviors a robot needs to optimize often proves challenging for …
specifying all the desirable behaviors a robot needs to optimize often proves challenging for …