Simulation optimization: a review of algorithms and applications

S Amaran, NV Sahinidis, B Sharda, SJ Bury - Annals of Operations …, 2016 - Springer
Simulation optimization (SO) refers to the optimization of an objective function subject to
constraints, both of which can be evaluated through a stochastic simulation. To address …

[HTML][HTML] A survey of applications of artificial intelligence and machine learning in future mobile networks-enabled systems

İ Yazici, I Shayea, J Din - … Science and Technology, an International Journal, 2023 - Elsevier
Different fields have been thriving with the advents in mobile communication systems in
recent years. These fields reap benefits of data collected by Internet of Things (IoT) in next …

Instruction-driven history-aware policies for robotic manipulations

PL Guhur, S Chen, RG Pinel… - … on Robot Learning, 2023 - proceedings.mlr.press
In human environments, robots are expected to accomplish a variety of manipulation tasks
given simple natural language instructions. Yet, robotic manipulation is extremely …

[图书][B] Control systems and reinforcement learning

S Meyn - 2022 - books.google.com
A high school student can create deep Q-learning code to control her robot, without any
understanding of the meaning of'deep'or'Q', or why the code sometimes fails. This book is …

Survey of model-based reinforcement learning: Applications on robotics

AS Polydoros, L Nalpantidis - Journal of Intelligent & Robotic Systems, 2017 - Springer
Reinforcement learning is an appealing approach for allowing robots to learn new tasks.
Relevant literature reveals a plethora of methods, but at the same time makes clear the lack …

Structured world models from human videos

R Mendonca, S Bahl, D Pathak - arXiv preprint arXiv:2308.10901, 2023 - arxiv.org
We tackle the problem of learning complex, general behaviors directly in the real world. We
propose an approach for robots to efficiently learn manipulation skills using only a handful of …

A survey on policy search for robotics

MP Deisenroth, G Neumann… - Foundations and Trends …, 2013 - nowpublishers.com
Policy search is a subfield in reinforcement learning which focuses on finding good
parameters for a given policy parametrization. It is well suited for robotics as it can cope with …

A survey of actor-critic reinforcement learning: Standard and natural policy gradients

I Grondman, L Busoniu, GAD Lopes… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Policy-gradient-based actor-critic algorithms are amongst the most popular algorithms in the
reinforcement learning framework. Their advantage of being able to search for optimal …

[图书][B] Algorithms for reinforcement learning

C Szepesvári - 2022 - books.google.com
Reinforcement learning is a learning paradigm concerned with learning to control a system
so as to maximize a numerical performance measure that expresses a long-term objective …

Towards vision-based deep reinforcement learning for robotic motion control

F Zhang, J Leitner, M Milford, B Upcroft… - arXiv preprint arXiv …, 2015 - arxiv.org
This paper introduces a machine learning based system for controlling a robotic manipulator
with visual perception only. The capability to autonomously learn robot controllers solely …