Fisher-Rao Gradient Flows of Linear Programs and State-Action Natural Policy Gradients
Kakade's natural policy gradient method has been studied extensively in the last years
showing linear convergence with and without regularization. We study another natural …
showing linear convergence with and without regularization. We study another natural …
Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data
We introduce a novel generative model for the representation of joint probability distributions
of a possibly large number of discrete random variables. The approach uses measure …
of a possibly large number of discrete random variables. The approach uses measure …
Sigma Flows for Image and Data Labeling and Learning Structured Prediction
This paper introduces the sigma flow model for the prediction of structured labelings of data
observed on Riemannian manifolds, including Euclidean image domains as special case …
observed on Riemannian manifolds, including Euclidean image domains as special case …
Essentially Sharp Estimates on the Entropy Regularization Error in Discrete Discounted Markov Decision Processes
We study the error introduced by entropy regularization of infinite-horizon discrete
discounted Markov decision processes. We show that this error decreases exponentially in …
discounted Markov decision processes. We show that this error decreases exponentially in …