Conditionally elicitable dynamic risk measures for deep reinforcement learning
We propose a novel framework to solve risk-sensitive reinforcement learning problems
where the agent optimizes time-consistent dynamic spectral risk measures. Based on the …
where the agent optimizes time-consistent dynamic spectral risk measures. Based on the …
Temporal robustness of stochastic signals
We study the temporal robustness of stochastic signals. This topic is of particular interest in
interleaving processes such as multi-agent systems where communication and individual …
interleaving processes such as multi-agent systems where communication and individual …
Reinforcement learning with dynamic convex risk measures
A Coache, S Jaimungal - Mathematical Finance, 2024 - Wiley Online Library
We develop an approach for solving time‐consistent risk‐sensitive stochastic optimization
problems using model‐free reinforcement learning (RL). Specifically, we assume agents …
problems using model‐free reinforcement learning (RL). Specifically, we assume agents …
STL robustness risk over discrete-time stochastic processes
L Lindemann, N Matni… - 2021 60th IEEE …, 2021 - ieeexplore.ieee.org
We present a framework to interpret signal temporal logic (STL) formulas over discrete-time
stochastic processes in terms of the induced risk. Each realization of a stochastic process …
stochastic processes in terms of the induced risk. Each realization of a stochastic process …
Risk of stochastic systems for temporal logic specifications
The wide availability of data coupled with the computational advances in artificial
intelligence and machine learning promise to enable many future technologies such as …
intelligence and machine learning promise to enable many future technologies such as …
An empirical estimator for the mean that dominates the empirical average
N Koumpis, D Kalogerias - arXiv preprint arXiv:2402.10418, 2024 - arxiv.org
We propose a simple empirical representation of expectations such that: For a number of
samples above a certain threshold, drawn from any probability distribution with finite fourth …
samples above a certain threshold, drawn from any probability distribution with finite fourth …
Transcendental equation solver: A novel neural network for solving transcendental equation
In this paper, we propose a novel method called transcendental equation solver (TES) for
solving transcendental equations. The TES comprises a generator defined by a neural …
solving transcendental equations. The TES comprises a generator defined by a neural …
A critical comparison on attitude estimation: From gaussian approximate filters to coordinate‐free dual optimal control
NP Koumpis, PA Panagiotou… - IET Control Theory & …, 2021 - Wiley Online Library
This paper conveys attitude and rate estimation without rate sensors by performing a critical
comparison, validated by extensive simulations. The two dominant approaches to facilitate …
comparison, validated by extensive simulations. The two dominant approaches to facilitate …
Time Consistent Reinforcement Learning for Optimal Consumption Under Epstein-Zin Preferences
MF Dixon, I Gvozdanovic, D O'Kane - Available at SSRN 4388762, 2023 - papers.ssrn.com
We present a class of least squares reinforcement learning algorithms for optimal
consumption under elasticity of intertemporal substitution and risk aversion preferences. The …
consumption under elasticity of intertemporal substitution and risk aversion preferences. The …
Uncertainty Principles in Risk-Aware Statistical Estimation
NP Koumpis, DS Kalogerias - 2021 60th IEEE Conference on …, 2021 - ieeexplore.ieee.org
We present a new uncertainty principle for risk-aware statistical estimation, effectively
quantifying the inherent trade-off between mean squared error (mse) and risk, the latter …
quantifying the inherent trade-off between mean squared error (mse) and risk, the latter …