Geometry of information structures, strategic measures and associated stochastic control topologies

N Saldi, S Yüksel - Probability Surveys, 2022 - projecteuclid.org
In many areas of applied mathematics, decentralization of information is a ubiquitous
attribute affecting how to approach a stochastic optimization, decision and estimation, or …

Q-learning in regularized mean-field games

B Anahtarci, CD Kariksiz, N Saldi - Dynamic Games and Applications, 2023 - Springer
In this paper, we introduce a regularized mean-field game and study learning of this game
under an infinite-horizon discounted reward function. Regularization is introduced by adding …

Average Cost Optimality of Partially Observed MDPs: Contraction of Nonlinear Filters and Existence of Optimal Solutions and Approximations

YE Demirci, AD Kara, S Yüksel - SIAM Journal on Control and Optimization, 2024 - SIAM
The average cost optimality is known to be a challenging problem for partially observable
stochastic control, with few results available beyond the finite state, action, and …

Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls

X Guo, A Hu, Y Zhang - SIAM Journal on Control and Optimization, 2023 - SIAM
We study finite-time horizon continuous-time linear-convex reinforcement learning problems
in an episodic setting. In this problem, the unknown linear jump-diffusion process is …

Continuity of discounted values and the structure of optimal policies for periodic‐review inventory systems with setup costs

EA Feinberg, DN Kraemer - Naval Research Logistics (NRL), 2023 - Wiley Online Library
This paper proves continuity of value functions in discounted periodic‐review single‐
commodity total‐cost inventory control problems with continuous inventory levels, fixed …

Weak Feller property of non-linear filters

AD Kara, N Saldi, S Yüksel - Systems & Control Letters, 2019 - Elsevier
Weak Feller property of controlled and control-free Markov chains leads to many desirable
properties. In control-free setups this leads to the existence of invariant probability measures …

Robust markov decision processes with data-driven, distance-based ambiguity sets

S Ramani, A Ghate - SIAM Journal on Optimization, 2022 - SIAM
We consider finite-and infinite-horizon Markov decision processes (MDPs) with unknown
state-transition probabilities. They are assumed to belong to certain ambiguity sets, and the …

Regularity and stability of feedback relaxed controls

C Reisinger, Y Zhang - SIAM Journal on Control and Optimization, 2021 - SIAM
This paper proposes a relaxed control regularization with general exploration rewards to
design robust feedback controls for multidimensional continuous-time stochastic exit time …

Information manipulation in partially observable markov decision processes

S Liu, Q Zhu - arXiv preprint arXiv:2312.07862, 2023 - arxiv.org
A common approach to solve partially observable Markov decision processes (POMDPs) is
transforming them into Makov decision processes (MDPs) defined on information states …

Control capacity

G Ranade, A Sahai - IEEE Transactions on Information Theory, 2018 - ieeexplore.ieee.org
Feedback control actively dissipates uncertainty from a dynamical system by means of
actuation. We develop a notion of “control capacity” that gives a fundamental limit (in bits) on …