Reinforcement learning in healthcare: A survey

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

Maven: Multi-agent variational exploration

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc
Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation

TD Kulkarni, K Narasimhan, A Saeedi… - Advances in neural …, 2016 - proceedings.neurips.cc
Learning goal-directed behavior in environments with sparse feedback is a major challenge
for reinforcement learning algorithms. One of the key difficulties is insufficient exploration …

A survey of algorithms for black-box safety validation of cyber-physical systems

A Corso, R Moss, M Koren, R Lee… - Journal of Artificial …, 2021 - jair.org
Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-
critical applications, but require rigorous testing before deployment. The complexity of these …

[图书][B] Adversarial machine learning

Y Vorobeychik, M Kantarcioglu - 2022 - books.google.com
The increasing abundance of large high-quality datasets, combined with significant
technical advances over the last several decades have made machine learning into a major …

[PDF][PDF] A literature review on artificial intelligence

SA Oke - International journal of information and management …, 2008 - Citeseer
Research on artificial intelligence in the last two decades has greatly improved performance
of both manufacturing and service systems. Currently, there is a dire need for an article that …

Deep exploration via randomized value functions

I Osband, B Van Roy, DJ Russo, Z Wen - Journal of Machine Learning …, 2019 - jmlr.org
We study the use of randomized value functions to guide deep exploration in reinforcement
learning. This offers an elegant means for synthesizing statistically and computationally …

Model-based rl in contextual decision processes: Pac bounds and exponential improvements over model-free approaches

W Sun, N Jiang, A Krishnamurthy… - … on learning theory, 2019 - proceedings.mlr.press
We study the sample complexity of model-based reinforcement learning (henceforth RL) in
general contextual decision processes that require strategic exploration to find a near …

Schema networks: Zero-shot transfer with a generative causal model of intuitive physics

K Kansky, T Silver, DA Mély, M Eldawy… - International …, 2017 - proceedings.mlr.press
The recent adaptation of deep neural network-based methods to reinforcement learning and
planning domains has yielded remarkable progress on individual tasks. Nonetheless …