Conditionally elicitable dynamic risk measures for deep reinforcement learning

A Coache, S Jaimungal, Á Cartea - SIAM Journal on Financial Mathematics, 2023 - SIAM
We propose a novel framework to solve risk-sensitive reinforcement learning problems
where the agent optimizes time-consistent dynamic spectral risk measures. Based on the …

Temporal robustness of stochastic signals

L Lindemann, A Rodionova, G Pappas - Proceedings of the 25th ACM …, 2022 - dl.acm.org
We study the temporal robustness of stochastic signals. This topic is of particular interest in
interleaving processes such as multi-agent systems where communication and individual …

Reinforcement learning with dynamic convex risk measures

A Coache, S Jaimungal - Mathematical Finance, 2024 - Wiley Online Library
We develop an approach for solving time‐consistent risk‐sensitive stochastic optimization
problems using model‐free reinforcement learning (RL). Specifically, we assume agents …

STL robustness risk over discrete-time stochastic processes

L Lindemann, N Matni… - 2021 60th IEEE …, 2021 - ieeexplore.ieee.org
We present a framework to interpret signal temporal logic (STL) formulas over discrete-time
stochastic processes in terms of the induced risk. Each realization of a stochastic process …

Risk of stochastic systems for temporal logic specifications

L Lindemann, L Jiang, N Matni, GJ Pappas - ACM Transactions on …, 2023 - dl.acm.org
The wide availability of data coupled with the computational advances in artificial
intelligence and machine learning promise to enable many future technologies such as …

An empirical estimator for the mean that dominates the empirical average

N Koumpis, D Kalogerias - arXiv preprint arXiv:2402.10418, 2024 - arxiv.org
We propose a simple empirical representation of expectations such that: For a number of
samples above a certain threshold, drawn from any probability distribution with finite fourth …

Transcendental equation solver: A novel neural network for solving transcendental equation

J Liu, G Wang, W Li, L Sun, L Zhang, L Yu - Applied Soft Computing, 2022 - Elsevier
In this paper, we propose a novel method called transcendental equation solver (TES) for
solving transcendental equations. The TES comprises a generator defined by a neural …

A critical comparison on attitude estimation: From gaussian approximate filters to coordinate‐free dual optimal control

NP Koumpis, PA Panagiotou… - IET Control Theory & …, 2021 - Wiley Online Library
This paper conveys attitude and rate estimation without rate sensors by performing a critical
comparison, validated by extensive simulations. The two dominant approaches to facilitate …

Time Consistent Reinforcement Learning for Optimal Consumption Under Epstein-Zin Preferences

MF Dixon, I Gvozdanovic, D O'Kane - Available at SSRN 4388762, 2023 - papers.ssrn.com
We present a class of least squares reinforcement learning algorithms for optimal
consumption under elasticity of intertemporal substitution and risk aversion preferences. The …

Uncertainty Principles in Risk-Aware Statistical Estimation

NP Koumpis, DS Kalogerias - 2021 60th IEEE Conference on …, 2021 - ieeexplore.ieee.org
We present a new uncertainty principle for risk-aware statistical estimation, effectively
quantifying the inherent trade-off between mean squared error (mse) and risk, the latter …