Efficient solution algorithms for factored MDPs

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

被引用次数：603 相关文章所有 5 个版本

[PDF] neurips.cc

Maven: Multi-agent variational exploration

A Mahajan, T Rashid, M Samvelyan… - Advances in neural …, 2019 - proceedings.neurips.cc

Centralised training with decentralised execution is an important setting for cooperative
deep multi-agent reinforcement learning due to communication constraints during execution …

被引用次数：382 相关文章所有 12 个版本

[PDF] arxiv.org

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

被引用次数：610 相关文章所有 10 个版本

[PDF] neurips.cc

Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation

TD Kulkarni, K Narasimhan, A Saeedi… - Advances in neural …, 2016 - proceedings.neurips.cc

Learning goal-directed behavior in environments with sparse feedback is a major challenge
for reinforcement learning algorithms. One of the key difficulties is insufficient exploration …

被引用次数：1390 相关文章所有 11 个版本

[PDF] jair.org Full View

A survey of algorithms for black-box safety validation of cyber-physical systems

A Corso, R Moss, M Koren, R Lee… - Journal of Artificial …, 2021 - jair.org

Autonomous cyber-physical systems (CPS) can improve safety and efficiency for safety-
critical applications, but require rigorous testing before deployment. The complexity of these …

被引用次数：176 相关文章所有 9 个版本

[图书][B] Adversarial machine learning

Y Vorobeychik, M Kantarcioglu - 2022 - books.google.com

The increasing abundance of large high-quality datasets, combined with significant
technical advances over the last several decades have made machine learning into a major …

被引用次数：299 相关文章所有 4 个版本

[PDF] psu.edu

[PDF][PDF] A literature review on artificial intelligence

SA Oke - International journal of information and management …, 2008 - Citeseer

Research on artificial intelligence in the last two decades has greatly improved performance
of both manufacturing and service systems. Currently, there is a dire need for an article that …

被引用次数：143 相关文章所有 4 个版本

[PDF] jmlr.org

Deep exploration via randomized value functions

I Osband, B Van Roy, DJ Russo, Z Wen - Journal of Machine Learning …, 2019 - jmlr.org

We study the use of randomized value functions to guide deep exploration in reinforcement
learning. This offers an elegant means for synthesizing statistically and computationally …

被引用次数：326 相关文章所有 10 个版本

[PDF] mlr.press

Model-based rl in contextual decision processes: Pac bounds and exponential improvements over model-free approaches

W Sun, N Jiang, A Krishnamurthy… - … on learning theory, 2019 - proceedings.mlr.press

We study the sample complexity of model-based reinforcement learning (henceforth RL) in
general contextual decision processes that require strategic exploration to find a near …

被引用次数：238 相关文章所有 7 个版本

[PDF] mlr.press

Schema networks: Zero-shot transfer with a generative causal model of intuitive physics

K Kansky, T Silver, DA Mély, M Eldawy… - International …, 2017 - proceedings.mlr.press

The recent adaptation of deep neural network-based methods to reinforcement learning and
planning domains has yielded remarkable progress on individual tasks. Nonetheless …

被引用次数：276 相关文章所有 7 个版本