MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation

AM Roth, J Liang, R Sriram, E Tabassi… - arXiv preprint arXiv …, 2022 - arxiv.org
We present Multiple Scenario Verifiable Reinforcement Learning via Policy Extraction
(MSVIPER), a new method for policy distillation to decision trees for improved robot …