[HTML] Newton's method for reinforcement learning and model predictive control
D Bertsekas - Results in Control and Optimization, 2022 - Elsevier
The purpose of this paper is to propose and develop a new conceptual framework for
approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework …
Lessons from AlphaZero for optimal, model predictive, and adaptive control
D Bertsekas - arXiv preprint arXiv:2108.10315, 2021 - arxiv.org
In this paper we aim to provide analysis and insights (often based on visualization), which
explain the beneficial effects of on-line decision making on top of off-line training. In …
A new approach for pricing discounted American options
TS Zaevski - Communications in Nonlinear Science and Numerical …, 2021 - Elsevier
The purpose of this paper is to present a new numerical approach for finding the early
exercise boundary of discounted American options, which payment structure is generalized …
Discounted perpetual game put options
TS Zaevski - Chaos, Solitons & Fractals, 2020 - Elsevier
The aim of this study is to explore the behavior of perpetual game put options, also known as
cancellable puts. Their main characteristic is the opportunity of the buyer and the seller to …
Spectral Analysis and Preconditioned Iterative Solvers for Large Structured Linear Systems
N Barakitis - arXiv preprint arXiv:2205.00339, 2022 - arxiv.org
In this thesis, the numerical solution of three different classes of problems have been
studied. Specifically, new techniques have been proposed and their theoretical analysis has …
On a Boundary Updating Method for the Scalar Stefan Problem
EF Magirou, P Vassalos, N Barakitis - arXiv preprint arXiv:2202.06418, 2022 - arxiv.org
We report on a general purpose method for the scalar Stefan problem inspired by the
standard boundary updating method used in several existence proofs. By suitably modifying …
[PDF] Short time asymptotics for American maximum options with a dividend-paying asset
R Hou, Y Xu, J Fan, Y Zhu - AIMS Mathematics, 2022 - aimspress.com
We investigate the asymptotic behaviors of American maximum options with dividend-paying
assets near maturity. Using the exercise conditions of American options, we obtain the …
[PDF] AN APPROACH FOR PRICING AMERICAN-STYLE DERIVATIVES
TS Zaevski - math.bas.bg
Derivatives are one of the most important instruments against the financial risks. They are
based on some underlying object which can be a single asset, portfolio of assets, financial …
[PDF] Results in Control and Optimization
D Bertsekas - mit.edu
The purpose of this paper is to propose and develop a new conceptual framework for
approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework …