Pessimistic Bayesianism for conservative optimization and imitation
M Cohen - 2023 - ora.ox.ac.uk
Subject to several assumptions, sufficiently advanced reinforcement learners would likely
face an incentive and likely have an ability to intervene in the provision of their reward, with …