Informed machine learning–a taxonomy and survey of integrating prior knowledge into learning systems

L Von Rueden, S Mayer, K Beckh… - … on Knowledge and …, 2021 - ieeexplore.ieee.org
Despite its great success, machine learning can have its limits when dealing with insufficient
training data. A potential solution is the additional integration of prior knowledge into the …

Resource provisioning for IoT services in the fog computing environment: An autonomic approach

M Etemadi, M Ghobaei-Arani, A Shahidinejad - Computer Communications, 2020 - Elsevier
In the recent years, the Internet of Things (IoT) services has been increasingly applied to
promote the quality of the human life and this trend is predicted to stretch for into future. With …

Truly proximal policy optimization

Y Wang, H He, X Tan - Uncertainty in artificial intelligence, 2020 - proceedings.mlr.press
Proximal policy optimization (PPO) is one of the most successful deep reinforcement
learning methods, achieving state-of-the-art performance across a wide range of …

Bayesian optimization of risk measures

S Cakmak, R Astudillo Marban… - Advances in Neural …, 2020 - proceedings.neurips.cc
We consider Bayesian optimization of objective functions of the form $\rho [F (x, W)] $, where
$ F $ is a black-box expensive-to-evaluate function and $\rho $ denotes either the VaR or …

Bayesian optimization for policy search via online-offline experimentation

B Letham, E Bakshy - Journal of Machine Learning Research, 2019 - jmlr.org
Online field experiments are the gold-standard way of evaluating changes to real-world
interactive machine learning systems. Yet our ability to explore complex, multi-dimensional …

Multi-fidelity black-box optimization for time-optimal quadrotor maneuvers

G Ryou, E Tal, S Karaman - The International Journal of …, 2021 - journals.sagepub.com
We consider the problem of generating a time-optimal quadrotor trajectory for highly
maneuverable vehicles, such as quadrotor aircraft. The problem is challenging because the …

[HTML][HTML] Learning an efficient gait cycle of a biped robot based on reinforcement learning and artificial neural networks

CR Gil, H Calvo, H Sossa - Applied Sciences, 2019 - mdpi.com
Featured Application The final product is an algorithm that allows a simulated robot to learn
sequences of poses (single configurations of its joints) in order to learn to walk (and …

[HTML][HTML] Transfer learning of fuzzy classifiers for optimized joint representation of simulated and measured data in anomaly detection of motor phase currents

E Lughofer, P Zorn, E Marth - Applied Soft Computing, 2022 - Elsevier
The generation of simulation data from physical models trying to mimic parts of the real-
world process as accurately as possible has received much attention in industry during the …

Quadruped-Frog: Rapid Online Optimization of Continuous Quadruped Jumping

G Bellegarda, M Shafiee, ME Özberk… - arXiv preprint arXiv …, 2024 - arxiv.org
Legged robots are becoming increasingly agile in exhibiting dynamic behaviors such as
running and jumping. Usually, such behaviors are either optimized and engineered offline …

Bayesian optimization in variational latent spaces with dynamic compression

R Antonova, A Rai, T Li… - Conference on Robot …, 2020 - proceedings.mlr.press
Data-efficiency is crucial for autonomous robots to adapt to new tasks and environments. In
this work, we focus on robotics problems with a budget of only 10-20 trials. This is a very …