Bayesian Real-Time Dynamic Programming.

P Dai, DS Weld, J Goldsmith - Journal of Artificial Intelligence Research, 2011 - jair.org

Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs)
because it puts the majority of its effort into backing up the entire state space, which turns out …

被引用次数：78 相关文章所有 15 个版本

[PDF] iitp.ac.in

GPU based generation of state transition models using simulations for unmanned surface vehicle trajectory planning

A Thakur, P Svec, SK Gupta - Robotics and Autonomous Systems, 2012 - Elsevier

This paper describes GPU based algorithms to compute state transition models for
unmanned surface vehicles (USVs) using 6 degree of freedom (DOF) dynamics simulations …

被引用次数：61 相关文章所有 9 个版本

[PDF] ustc.edu.cn

Online planning for large markov decision processes with hierarchical decomposition

A Bai, F Wu, X Chen - ACM Transactions on Intelligent Systems and …, 2015 - dl.acm.org

Markov decision processes (MDPs) provide a rich framework for planning under uncertainty.
However, exactly solving a large MDP is usually intractable due to the “curse of …

被引用次数：46 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] Real-time dynamic programming for Markov decision processes with imprecise probabilities

KV Delgado, LN De Barros, DB Dias, S Sanner - Artificial Intelligence, 2016 - Elsevier

Abstract Markov Decision Processes have become the standard model for probabilistic
planning. However, when applied to many practical problems, the estimates of transition …

被引用次数：36 相关文章所有 7 个版本

[PDF] academia.edu

Power flow management in electric vehicles charging station using reinforcement learning

AO Erick, KA Folly - 2020 IEEE Congress on Evolutionary …, 2020 - ieeexplore.ieee.org

This paper investigates optimal power flow management problem in an electric vehicle
charging station. The charging station is powered by solar PV and is tied to the grid and a …

被引用次数：18 相关文章所有 5 个版本

[PDF] psu.edu

Trajectory planning with look-ahead for unmanned sea surface vehicles to handle environmental disturbances

P Svec, M Schwartz, A Thakur… - 2011 IEEE/RSJ …, 2011 - ieeexplore.ieee.org

We present a look-ahead based trajectory planning algorithm for computation of dynamically
feasible trajectories for Unmanned Sea Surface Vehicles (USSV) operating in high seas …

被引用次数：44 相关文章所有 8 个版本

[PDF] hal.science

Continuous search in constraint programming

A Arbelaez, Y Hamadi, M Sebag - 2010 22nd IEEE International …, 2010 - ieeexplore.ieee.org

This work presents the concept of Continuous Search (CS), which objective is to allow any
user to eventually get their constraint solver achieving a top performance on their problems …

被引用次数：45 相关文章所有 13 个版本

[PDF] mlr.press

Lookahead-bounded q-learning

I El Shar, D Jiang - International Conference on Machine …, 2020 - proceedings.mlr.press

We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new, provably
convergent variant of Q-learning that seeks to improve the performance of standard Q …

被引用次数：11 相关文章所有 5 个版本

[PDF] aaai.org

Efficient Constraint Generation for Stochastic Shortest Path Problems

J Schmalz, F Trevizan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-
go by applying Bellman backups, where state-of-the-art methods employ heuristics to select …

被引用次数：1 相关文章所有 4 个版本

[PDF] ox.ac.uk

On verification and controller synthesis for probabilistic systems at runtime

M Ujma - 2015 - ora.ox.ac.uk

Probabilistic model checking is a technique employed for verifying the correctness of
computer systems that exhibit probabilistic behaviour. A related technique is controller …

被引用次数：19 相关文章所有 6 个版本