A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Dense reward for free in reinforcement learning from human feedback

AJ Chan, H Sun, S Holt, M van der Schaar - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement Learning from Human Feedback (RLHF) has been credited as the key
advance that has allowed Large Language Models (LLMs) to effectively follow instructions …

Explainable reinforcement learning (XRL): a systematic literature review and taxonomy

Y Bekkemoen - Machine Learning, 2024 - Springer
In recent years, reinforcement learning (RL) systems have shown impressive performance
and remarkable achievements. Many achievements can be attributed to combining RL with …

Learning long-term crop management strategies with cyclesgym

M Turchetta, L Corinzia, S Sussex… - Advances in neural …, 2022 - proceedings.neurips.cc
To improve the sustainability and resilience of modern food systems, designing improved
crop management strategies is crucial. The increasing abundance of data on agricultural …

Measuring interpretability of neural policies of robots with disentangled representation

TH Wang, W Xiao, T Seyde… - Conference on Robot …, 2023 - proceedings.mlr.press
The advancement of robots, particularly those functioning in complex human-centric
environments, relies on control solutions that are driven by machine learning …

Allsim: Simulating and benchmarking resource allocation policies in multi-user systems

J Berrevoets, D Jarrett, A Chan… - Advances in Neural …, 2024 - proceedings.neurips.cc
Numerous real-world systems, ranging from healthcare to energy grids, involve users
competing for finite and potentially scarce resources. Designing policies for resource …

Forecasting treatment outcomes over time using alternating deep sequential models

F Wu, G Zhao, Y Zhou, X Qian… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Medical decision making often relies on accurately forecasting future patient trajectories.
Conventional approaches for patient progression modeling often do not explicitly model …

[HTML][HTML] On the fusion of soft-decision-trees and concept-based models

DM Rodríguez, MP Cuéllar, DP Morales - Applied Soft Computing, 2024 - Elsevier
In the field of eXplainable Artificial Intelligence (XAI), the generation of interpretable models
that are able to match the performance of state-of-the-art deep learning methods is one of …

{AIRS}: Explanation for Deep Reinforcement Learning based Security Applications

J Yu, W Guo, Q Qin, G Wang, T Wang… - 32nd USENIX Security …, 2023 - usenix.org
Recently, we have witnessed the success of deep reinforcement learning (DRL) in many
security applications, ranging from malware mutation to selfish blockchain mining. Like all …

Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions

W Ren, T Zhao, Y Huang, V Honavar - arXiv preprint arXiv:2501.03540, 2025 - arxiv.org
Tabular data remains one of the most prevalent data types across a wide range of real-world
applications, yet effective representation learning for this domain poses unique challenges …