Gymnasium: A standard interface for reinforcement learning environments
Gymnasium is an open-source library providing an API for reinforcement learning
environments. Its main contribution is a central abstraction for wide interoperability between …
environments. Its main contribution is a central abstraction for wide interoperability between …
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning
Benchmarks play a crucial role in the development and analysis of reinforcement learning
(RL) algorithms. We identify that existing benchmarks used for research into open-ended …
(RL) algorithms. We identify that existing benchmarks used for research into open-ended …
BlackJAX: Composable Bayesian inference in JAX
BlackJAX is a library implementing sampling and variational inference algorithms commonly
used in Bayesian computation. It is designed for ease of use, speed, and modularity by …
used in Bayesian computation. It is designed for ease of use, speed, and modularity by …
NAVIX: Scaling MiniGrid Environments with JAX
As Deep Reinforcement Learning (Deep RL) research moves towards solving large-scale
worlds, efficient environment simulations become crucial for rapid experimentation …
worlds, efficient environment simulations become crucial for rapid experimentation …
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
CA Voelcker, M Hussing - arXiv preprint arXiv:2410.08870, 2024 - arxiv.org
Empirical, benchmark-driven testing is a fundamental paradigm in the current RL
community. While using off-the-shelf benchmarks in reinforcement learning (RL) research is …
community. While using off-the-shelf benchmarks in reinforcement learning (RL) research is …
SYMPOL: Symbolic Tree-Based On-Policy Reinforcement Learning
Reinforcement learning (RL) has seen significant success across various domains, but its
adoption is often limited by the black-box nature of neural network policies, making them …
adoption is often limited by the black-box nature of neural network policies, making them …
Dantzig-Wolfe Decomposition and Deep Reinforcement Learning
F Chouaki, RS Jeffers - openreview.net
The 3D bin packing problem is an NP-hard optimisation problem. RL solutions found in the
literature tackle simplified versions of the full problem due to its large action space and long …
literature tackle simplified versions of the full problem due to its large action space and long …