Topological value iteration algorithms
Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs)
because it puts the majority of its effort into backing up the entire state space, which turns out …
because it puts the majority of its effort into backing up the entire state space, which turns out …
GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning
This paper describes GPU based algorithms to compute state transition models for
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …
Online planning for large markov decision processes with hierarchical decomposition
Markov decision processes (MDPs) provide a rich framework for planning under uncertainty.
However, exactly solving a large MDP is usually intractable due to the “curse of …
However, exactly solving a large MDP is usually intractable due to the “curse of …
[HTML][HTML] Real-time dynamic programming for Markov decision processes with imprecise probabilities
Abstract Markov Decision Processes have become the standard model for probabilistic
planning. However, when applied to many practical problems, the estimates of transition …
planning. However, when applied to many practical problems, the estimates of transition …
Power flow management in electric vehicles charging station using reinforcement learning
This paper investigates optimal power flow management problem in an electric vehicle
charging station. The charging station is powered by solar PV and is tied to the grid and a …
charging station. The charging station is powered by solar PV and is tied to the grid and a …
Trajectory planning with look-ahead for unmanned sea surface vehicles to handle environmental disturbances
We present a look-ahead based trajectory planning algorithm for computation of dynamically
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …
Continuous search in constraint programming
This work presents the concept of Continuous Search (CS), which objective is to allow any
user to eventually get their constraint solver achieving a top performance on their problems …
user to eventually get their constraint solver achieving a top performance on their problems …
Lookahead-bounded q-learning
We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new, provably
convergent variant of Q-learning that seeks to improve the performance of standard Q …
convergent variant of Q-learning that seeks to improve the performance of standard Q …
Efficient Constraint Generation for Stochastic Shortest Path Problems
J Schmalz, F Trevizan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …
On verification and controller synthesis for probabilistic systems at runtime
M Ujma - 2015 - ora.ox.ac.uk
Probabilistic model checking is a technique employed for verifying the correctness of
computer systems that exhibit probabilistic behaviour. A related technique is controller …
computer systems that exhibit probabilistic behaviour. A related technique is controller …