[PDF][PDF] A reinforcement learning-informed pattern mining framework for multivariate time series classification
Multivariate time series (MTS) classification is a challenging and important task in various
domains and real-world applications. Much of prior work on MTS can be roughly divided into …
domains and real-world applications. Much of prior work on MTS can be roughly divided into …
Off-policy evaluation for human feedback
Off-policy evaluation (OPE) is important for closing the gap between offline training and
evaluation of reinforcement learning (RL), by estimating performance and/or rank of target …
evaluation of reinforcement learning (RL), by estimating performance and/or rank of target …
Hope: Human-centric off-policy evaluation for e-learning and healthcare
Reinforcement learning (RL) has been extensively researched for enhancing human-
environment interactions in various human-centric tasks, including e-learning and …
environment interactions in various human-centric tasks, including e-learning and …
Offline learning of closed-loop deep brain stimulation controllers for parkinson disease treatment
Q Gao, SL Schmidt, A Chowdhury, G Feng… - Proceedings of the …, 2023 - dl.acm.org
Deep brain stimulation (DBS) has shown great promise toward treating motor symptoms
caused by Parkinson's disease (PD), by delivering electrical pulses to the Basal Ganglia …
caused by Parkinson's disease (PD), by delivering electrical pulses to the Basal Ganglia …
Robust reinforcement learning through efficient adversarial herding
Although reinforcement learning (RL) is considered the gold standard for policy design, it
may not always provide a robust solution in various scenarios. This can result in severe …
may not always provide a robust solution in various scenarios. This can result in severe …
Reconstructing missing ehrs using time-aware within-and cross-visit information for septic shock early prediction
Real-world Electronic Health Records (EHRs) are often plagued by a high rate of missing
data. In our EHRs, for example, the missing rates can be as high as 90% for some features …
data. In our EHRs, for example, the missing rates can be as high as 90% for some features …
Variational Latent Branching Model for Off-Policy Evaluation
Model-based methods have recently shown great potential for off-policy evaluation (OPE);
offline trajectories induced by behavioral policies are fitted to transitions of Markov decision …
offline trajectories induced by behavioral policies are fitted to transitions of Markov decision …
Robust exploration with adversary via Langevin Monte Carlo
In the realm of Deep Q-Networks (DQNs), numerous exploration strategies have
demonstrated efficacy within controlled environments. However, these methods encounter …
demonstrated efficacy within controlled environments. However, these methods encounter …
[PDF][PDF] Learning for Control and Decision Making toward Medical Autonomy
Q Gao - 2024 - dukespace.lib.duke.edu
Artificial intelligence (AI) and deep learning (DL) have recently shown success in domains
related to healthcare and its decision-making systems. However, most of the existing …
related to healthcare and its decision-making systems. However, most of the existing …
Robust Reinforcement Learning with Structured Adversarial Ensemble
Although reinforcement learning (RL) is considered the gold standard for policy design, it
may not always provide a robust solution in various scenarios. This can result in severe …
may not always provide a robust solution in various scenarios. This can result in severe …