Robustness to incorrect system models in stochastic control

N Saldi, S Yüksel - Probability Surveys, 2022 - projecteuclid.org

In many areas of applied mathematics, decentralization of information is a ubiquitous
attribute affecting how to approach a stochastic optimization, decision and estimation, or …

被引用次数：28 相关文章所有 8 个版本

[PDF] arxiv.org

Q-learning in regularized mean-field games

B Anahtarci, CD Kariksiz, N Saldi - Dynamic Games and Applications, 2023 - Springer

In this paper, we introduce a regularized mean-field game and study learning of this game
under an infinite-horizon discounted reward function. Regularization is introduced by adding …

被引用次数：79 相关文章所有 10 个版本

Average Cost Optimality of Partially Observed MDPs: Contraction of Nonlinear Filters and Existence of Optimal Solutions and Approximations

YE Demirci, AD Kara, S Yüksel - SIAM Journal on Control and Optimization, 2024 - SIAM

The average cost optimality is known to be a challenging problem for partially observable
stochastic control, with few results available beyond the finite state, action, and …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls

X Guo, A Hu, Y Zhang - SIAM Journal on Control and Optimization, 2023 - SIAM

We study finite-time horizon continuous-time linear-convex reinforcement learning problems
in an episodic setting. In this problem, the unknown linear jump-diffusion process is …

被引用次数：26 相关文章所有 3 个版本

[PDF] arxiv.org

Continuity of discounted values and the structure of optimal policies for periodic‐review inventory systems with setup costs

EA Feinberg, DN Kraemer - Naval Research Logistics (NRL), 2023 - Wiley Online Library

This paper proves continuity of value functions in discounted periodic‐review single‐
commodity total‐cost inventory control problems with continuous inventory levels, fixed …

被引用次数：7 相关文章所有 4 个版本

[PDF] arxiv.org

Weak Feller property of non-linear filters

AD Kara, N Saldi, S Yüksel - Systems & Control Letters, 2019 - Elsevier

Weak Feller property of controlled and control-free Markov chains leads to many desirable
properties. In control-free setups this leads to the existence of invariant probability measures …

被引用次数：36 相关文章所有 3 个版本

[PDF] washington.edu

Robust markov decision processes with data-driven, distance-based ambiguity sets

S Ramani, A Ghate - SIAM Journal on Optimization, 2022 - SIAM

We consider finite-and infinite-horizon Markov decision processes (MDPs) with unknown
state-transition probabilities. They are assumed to belong to certain ambiguity sets, and the …

被引用次数：9 相关文章所有 4 个版本

[PDF] arxiv.org

Regularity and stability of feedback relaxed controls

C Reisinger, Y Zhang - SIAM Journal on Control and Optimization, 2021 - SIAM

This paper proposes a relaxed control regularization with general exploration rewards to
design robust feedback controls for multidimensional continuous-time stochastic exit time …

被引用次数：27 相关文章所有 6 个版本

[PDF] arxiv.org

Information manipulation in partially observable markov decision processes

S Liu, Q Zhu - arXiv preprint arXiv:2312.07862, 2023 - arxiv.org

A common approach to solve partially observable Markov decision processes (POMDPs) is
transforming them into Makov decision processes (MDPs) defined on information states …

被引用次数：3 相关文章所有 2 个版本

[PDF] ieee.org

Control capacity

G Ranade, A Sahai - IEEE Transactions on Information Theory, 2018 - ieeexplore.ieee.org

Feedback control actively dissipates uncertainty from a dynamical system by means of
actuation. We develop a notion of “control capacity” that gives a fundamental limit (in bits) on …

被引用次数：30 相关文章所有 9 个版本