Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library
The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

On reliability of reinforcement learning based production scheduling systems: a comparative survey

C Waubert de Puiseau, R Meyes, T Meisen - Journal of Intelligent …, 2022 - Springer
The deep reinforcement learning (DRL) community has published remarkable results on
complex strategic planning problems, most famously in virtual scenarios for board and video …

Challenges of real-world reinforcement learning: definitions, benchmarks and analysis

G Dulac-Arnold, N Levine, DJ Mankowitz, J Li… - Machine Learning, 2021 - Springer
Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is
beginning to show some successes in real-world scenarios. However, much of the research …

Inverse reward design

D Hadfield-Menell, S Milli, P Abbeel… - Advances in neural …, 2017 - proceedings.neurips.cc
Autonomous agents optimize the reward function we give them. What they don't know is how
hard it is for us to design a reward function that actually captures what we want. When …

Reinforcement learning in economics and finance

A Charpentier, R Elie, C Remlinger - Computational Economics, 2021 - Springer
Reinforcement learning algorithms describe how an agent can learn an optimal action policy
in a sequential decision process, through repeated experience. In a given environment, the …

Policy gradient for rectangular robust markov decision processes

N Kumar, E Derman, M Geist… - Advances in Neural …, 2023 - proceedings.neurips.cc
Policy gradient methods have become a standard for training reinforcement learning agents
in a scalable and efficient manner. However, they do not account for transition uncertainty …

An empirical investigation of the challenges of real-world reinforcement learning

G Dulac-Arnold, N Levine, DJ Mankowitz, J Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is
beginning to show some successes in real-world scenarios. However, much of the research …

Efficient risk-averse reinforcement learning

I Greenberg, Y Chow… - Advances in Neural …, 2022 - proceedings.neurips.cc
In risk-averse reinforcement learning (RL), the goal is to optimize some risk measure of the
returns. A risk measure often focuses on the worst returns out of the agent's experience. As a …

Convex reinforcement learning in finite trials

M Mutti, R De Santi, P De Bartolomeis… - Journal of Machine …, 2023 - jmlr.org
Convex Reinforcement Learning (RL) is a recently introduced framework that generalizes
the standard RL objective to any convex (or concave) function of the state distribution …

Being optimistic to be conservative: Quickly learning a CVaR policy

R Keramati, C Dann, A Tamkin… - Proceedings of the AAAI …, 2020 - ojs.aaai.org
While maximizing expected return is the goal in most reinforcement learning approaches,
risk-sensitive objectives such as conditional value at risk (CVaR) are more suitable for many …