Symbolic physics learner: Discovering governing equations via monte carlo tree search
Nonlinear dynamics is ubiquitous in nature and commonly seen in various science and
engineering disciplines. Distilling analytical expressions that govern nonlinear dynamics …
engineering disciplines. Distilling analytical expressions that govern nonlinear dynamics …
Online tree-based planning for active spacecraft fault estimation and collision avoidance
Autonomous robots operating in uncertain or hazardous environments subject to state safety
constraints must be able to identify and isolate faulty components in a time-optimal manner …
constraints must be able to identify and isolate faulty components in a time-optimal manner …
Sample efficient reinforcement learning via low-rank matrix estimation
We consider the question of learning $ Q $-function in a sample efficient manner for
reinforcement learning with continuous state and action spaces under a generative model. If …
reinforcement learning with continuous state and action spaces under a generative model. If …
Monte Carlo tree search with spectral expansion for planning with dynamical systems
The ability of a robot to plan complex behaviors with real-time computation, rather than
adhering to predesigned or offline-learned routines, alleviates the need for specialized …
adhering to predesigned or offline-learned routines, alleviates the need for specialized …
Scalable Online planning for multi-agent MDPs
We present a scalable tree search planning algorithm for large multi-agent sequential
decision problems that require dynamic collaboration. Teams of agents need to coordinate …
decision problems that require dynamic collaboration. Teams of agents need to coordinate …
Optimality guarantees for particle belief approximation of POMDPs
Partially observable Markov decision processes (POMDPs) provide a flexible representation
for real-world decision and control problems. However, POMDPs are notoriously difficult to …
for real-world decision and control problems. However, POMDPs are notoriously difficult to …
Monte carlo tree search methods for the earth-observing satellite scheduling problem
AP Herrmann, H Schaub - Journal of Aerospace Information Systems, 2022 - arc.aiaa.org
This work explores on-board planning for the single spacecraft, multiple ground station Earth-
observing satellite scheduling problem through artificial neural network function …
observing satellite scheduling problem through artificial neural network function …
On the convergence of policy iteration-based reinforcement learning with monte carlo policy evaluation
A Winnicki, R Srikant - International Conference on Artificial …, 2023 - proceedings.mlr.press
A common technique in reinforcement learning is to evaluate the value function from Monte
Carlo simulations of a given policy, and use the estimated value function to obtain a new …
Carlo simulations of a given policy, and use the estimated value function to obtain a new …
Can large language models play games? a case study of a self-play approach
Large Language Models (LLMs) harness extensive data from the Internet, storing a broad
spectrum of prior knowledge. While LLMs have proven beneficial as decision-making aids …
spectrum of prior knowledge. While LLMs have proven beneficial as decision-making aids …
Neural tree expansion for multi-robot planning in non-cooperative environments
We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online
planning in non-cooperative environments, where each robot attempts to maximize its …
planning in non-cooperative environments, where each robot attempts to maximize its …