Using simulation to improve sample-efficiency of Bayesian optimization for bipedal robots

Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems

L Von Rueden, S Mayer, K Beckh… - … on Knowledge and …, 2021 - ieeexplore.ieee.org

Despite its great success, machine learning can have its limits when dealing with insufficient
training data. A potential solution is the additional integration of prior knowledge into the …

被引用次数：736 相关文章所有 9 个版本

Resource provisioning for IoT services in the fog computing environment: An autonomic approach

M Etemadi, M Ghobaei-Arani, A Shahidinejad - Computer Communications, 2020 - Elsevier

In the recent years, the Internet of Things (IoT) services has been increasingly applied to
promote the quality of the human life and this trend is predicted to stretch for into future. With …

被引用次数：110 相关文章

[PDF] mlr.press

Truly proximal policy optimization

Y Wang, H He, X Tan - Uncertainty in artificial intelligence, 2020 - proceedings.mlr.press

Proximal policy optimization (PPO) is one of the most successful deep reinforcement
learning methods, achieving state-of-the-art performance across a wide range of …

被引用次数：137 相关文章所有 6 个版本

[PDF] neurips.cc

Bayesian optimization of risk measures

S Cakmak, R Astudillo Marban… - Advances in Neural …, 2020 - proceedings.neurips.cc

We consider Bayesian optimization of objective functions of the form $\rho [F (x, W)] $, where
$ F $ is a black-box expensive-to-evaluate function and $\rho $ denotes either the VaR or …

被引用次数：58 相关文章所有 6 个版本

[PDF] jmlr.org

Bayesian optimization for policy search via online-offline experimentation

B Letham, E Bakshy - Journal of Machine Learning Research, 2019 - jmlr.org

Online field experiments are the gold-standard way of evaluating changes to real-world
interactive machine learning systems. Yet our ability to explore complex, multi-dimensional …

被引用次数：65 相关文章所有 6 个版本

[HTML] sagepub.com

Multi-fidelity black-box optimization for time-optimal quadrotor maneuvers

G Ryou, E Tal, S Karaman - The International Journal of …, 2021 - journals.sagepub.com

We consider the problem of generating a time-optimal quadrotor trajectory for highly
maneuverable vehicles, such as quadrotor aircraft. The problem is challenging because the …

被引用次数：47 相关文章所有 9 个版本

[HTML] mdpi.com

[HTML][HTML] Learning an efficient gait cycle of a biped robot based on reinforcement learning and artificial neural networks

CR Gil, H Calvo, H Sossa - Applied Sciences, 2019 - mdpi.com

Featured Application The final product is an algorithm that allows a simulated robot to learn
sequences of poses (single configurations of its joints) in order to learn to walk (and …

被引用次数：46 相关文章所有 7 个版本

[HTML] sciencedirect.com

[HTML][HTML] Transfer learning of fuzzy classifiers for optimized joint representation of simulated and measured data in anomaly detection of motor phase currents

E Lughofer, P Zorn, E Marth - Applied Soft Computing, 2022 - Elsevier

The generation of simulation data from physical models trying to mimic parts of the real-
world process as accurately as possible has received much attention in industry during the …

被引用次数：9 相关文章所有 4 个版本

[PDF] arxiv.org

Quadruped-Frog: Rapid Online Optimization of Continuous Quadruped Jumping

G Bellegarda, M Shafiee, ME Özberk… - arXiv preprint arXiv …, 2024 - arxiv.org

Legged robots are becoming increasingly agile in exhibiting dynamic behaviors such as
running and jumping. Usually, such behaviors are either optimized and engineered offline …

被引用次数：4 相关文章所有 2 个版本

[PDF] mlr.press

Bayesian optimization in variational latent spaces with dynamic compression

R Antonova, A Rai, T Li… - Conference on Robot …, 2020 - proceedings.mlr.press

Data-efficiency is crucial for autonomous robots to adapt to new tasks and environments. In
this work, we focus on robotics problems with a budget of only 10-20 trials. This is a very …

被引用次数：26 相关文章所有 5 个版本