Topological value iteration algorithms

P Dai, DS Weld, J Goldsmith - Journal of Artificial Intelligence Research, 2011 - jair.org
Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs)
because it puts the majority of its effort into backing up the entire state space, which turns out …

GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning

A Thakur, P Svec, SK Gupta - Robotics and Autonomous Systems, 2012 - Elsevier
This paper describes GPU based algorithms to compute state transition models for
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …

Online planning for large markov decision processes with hierarchical decomposition

A Bai, F Wu, X Chen - ACM Transactions on Intelligent Systems and …, 2015 - dl.acm.org
Markov decision processes (MDPs) provide a rich framework for planning under uncertainty.
However, exactly solving a large MDP is usually intractable due to the “curse of …

[HTML][HTML] Real-time dynamic programming for Markov decision processes with imprecise probabilities

KV Delgado, LN De Barros, DB Dias, S Sanner - Artificial Intelligence, 2016 - Elsevier
Abstract Markov Decision Processes have become the standard model for probabilistic
planning. However, when applied to many practical problems, the estimates of transition …

Power flow management in electric vehicles charging station using reinforcement learning

AO Erick, KA Folly - 2020 IEEE Congress on Evolutionary …, 2020 - ieeexplore.ieee.org
This paper investigates optimal power flow management problem in an electric vehicle
charging station. The charging station is powered by solar PV and is tied to the grid and a …

Trajectory planning with look-ahead for unmanned sea surface vehicles to handle environmental disturbances

P Svec, M Schwartz, A Thakur… - 2011 IEEE/RSJ …, 2011 - ieeexplore.ieee.org
We present a look-ahead based trajectory planning algorithm for computation of dynamically
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …

Continuous search in constraint programming

A Arbelaez, Y Hamadi, M Sebag - 2010 22nd IEEE International …, 2010 - ieeexplore.ieee.org
This work presents the concept of Continuous Search (CS), which objective is to allow any
user to eventually get their constraint solver achieving a top performance on their problems …

Lookahead-bounded q-learning

I El Shar, D Jiang - International Conference on Machine …, 2020 - proceedings.mlr.press
We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new, provably
convergent variant of Q-learning that seeks to improve the performance of standard Q …

Efficient Constraint Generation for Stochastic Shortest Path Problems

J Schmalz, F Trevizan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …

On verification and controller synthesis for probabilistic systems at runtime

M Ujma - 2015 - ora.ox.ac.uk
Probabilistic model checking is a technique employed for verifying the correctness of
computer systems that exhibit probabilistic behaviour. A related technique is controller …