[PDF][PDF] Structure in reinforcement learning: A survey and open problems
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …
Networks (DNNs) for function approximation, has demonstrated considerable success in …
Sample efficient reinforcement learning via low-rank matrix estimation
We consider the question of learning $ Q $-function in a sample efficient manner for
reinforcement learning with continuous state and action spaces under a generative model. If …
reinforcement learning with continuous state and action spaces under a generative model. If …
Non-asymptotic analysis of monte carlo tree search
In this work, we consider the popular tree-based search strategy within the framework of
reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of infinite …
reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of infinite …
Overcoming the long horizon barrier for sample-efficient reinforcement learning with latent low-rank structure
The practicality of reinforcement learning algorithms has been limited due to poor scaling
with respect to the problem size, as the sample complexity of learning an ε-optimal policy is …
with respect to the problem size, as the sample complexity of learning an ε-optimal policy is …
Conditional imitation learning for multi-agent games
While advances in multi-agent learning have enabled the training of increasingly complex
agents, most existing techniques produce a final policy that is not designed to adapt to a …
agents, most existing techniques produce a final policy that is not designed to adapt to a …
Curvature explains loss of plasticity
Loss of plasticity is a phenomenon in which neural networks lose their ability to learn from
new experience. Despite being empirically observed in several problem settings, little is …
new experience. Despite being empirically observed in several problem settings, little is …
Structure in Deep Reinforcement Learning: A Survey and Open Problems
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …
Networks (DNNs) for function approximation, has demonstrated considerable success in …
Dissecting Deep RL with High Update Ratios: Combatting Value Overestimation and Divergence
We show that deep reinforcement learning can maintain its ability to learn without resetting
network parameters in settings where the number of gradient updates greatly exceeds the …
network parameters in settings where the number of gradient updates greatly exceeds the …
Automatic generation of meta-path graph for concept recommendation in moocs
J Gong, C Wang, Z Zhao, X Zhang - Electronics, 2021 - mdpi.com
In MOOCs, generally speaking, curriculum designing, course selection, and knowledge
concept recommendation are the three major steps that systematically instruct users to learn …
concept recommendation are the three major steps that systematically instruct users to learn …
Tensor and matrix low-rank value-function approximation in reinforcement learning
S Rozada, S Paternain… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Value function (VF) approximation is a central problem in reinforcement learning (RL).
Classical non-parametric VF estimation suffers from the curse of dimensionality. As a result …
Classical non-parametric VF estimation suffers from the curse of dimensionality. As a result …