Geometry of information structures, strategic measures and associated stochastic control topologies
In many areas of applied mathematics, decentralization of information is a ubiquitous
attribute affecting how to approach a stochastic optimization, decision and estimation, or …
attribute affecting how to approach a stochastic optimization, decision and estimation, or …
Q-learning in regularized mean-field games
In this paper, we introduce a regularized mean-field game and study learning of this game
under an infinite-horizon discounted reward function. Regularization is introduced by adding …
under an infinite-horizon discounted reward function. Regularization is introduced by adding …
Average Cost Optimality of Partially Observed MDPs: Contraction of Nonlinear Filters and Existence of Optimal Solutions and Approximations
The average cost optimality is known to be a challenging problem for partially observable
stochastic control, with few results available beyond the finite state, action, and …
stochastic control, with few results available beyond the finite state, action, and …
Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls
We study finite-time horizon continuous-time linear-convex reinforcement learning problems
in an episodic setting. In this problem, the unknown linear jump-diffusion process is …
in an episodic setting. In this problem, the unknown linear jump-diffusion process is …
Continuity of discounted values and the structure of optimal policies for periodic‐review inventory systems with setup costs
EA Feinberg, DN Kraemer - Naval Research Logistics (NRL), 2023 - Wiley Online Library
This paper proves continuity of value functions in discounted periodic‐review single‐
commodity total‐cost inventory control problems with continuous inventory levels, fixed …
commodity total‐cost inventory control problems with continuous inventory levels, fixed …
Weak Feller property of non-linear filters
Weak Feller property of controlled and control-free Markov chains leads to many desirable
properties. In control-free setups this leads to the existence of invariant probability measures …
properties. In control-free setups this leads to the existence of invariant probability measures …
Robust markov decision processes with data-driven, distance-based ambiguity sets
S Ramani, A Ghate - SIAM Journal on Optimization, 2022 - SIAM
We consider finite-and infinite-horizon Markov decision processes (MDPs) with unknown
state-transition probabilities. They are assumed to belong to certain ambiguity sets, and the …
state-transition probabilities. They are assumed to belong to certain ambiguity sets, and the …
Regularity and stability of feedback relaxed controls
C Reisinger, Y Zhang - SIAM Journal on Control and Optimization, 2021 - SIAM
This paper proposes a relaxed control regularization with general exploration rewards to
design robust feedback controls for multidimensional continuous-time stochastic exit time …
design robust feedback controls for multidimensional continuous-time stochastic exit time …
Information manipulation in partially observable markov decision processes
A common approach to solve partially observable Markov decision processes (POMDPs) is
transforming them into Makov decision processes (MDPs) defined on information states …
transforming them into Makov decision processes (MDPs) defined on information states …
Control capacity
Feedback control actively dissipates uncertainty from a dynamical system by means of
actuation. We develop a notion of “control capacity” that gives a fundamental limit (in bits) on …
actuation. We develop a notion of “control capacity” that gives a fundamental limit (in bits) on …