Toward self‐driving processes: A deep reinforcement learning approach to control
Advanced model‐based controllers are well established in process industries. However,
such controllers require regular maintenance to maintain acceptable performance. It is a …
such controllers require regular maintenance to maintain acceptable performance. It is a …
Reinforcement learning design for cancer clinical trials
Y Zhao, MR Kosorok, D Zeng - Statistics in medicine, 2009 - Wiley Online Library
We develop reinforcement learning trials for discovering individualized treatment regimens
for life‐threatening diseases such as cancer. A temporal‐difference learning method called …
for life‐threatening diseases such as cancer. A temporal‐difference learning method called …
Ad recommendation systems for life-time value optimization
G Theocharous, PS Thomas… - Proceedings of the 24th …, 2015 - dl.acm.org
The main objective in the ad recommendation problem is to find a strategy that, for each
visitor of the website, selects the ad that has the highest probability of being clicked. This …
visitor of the website, selects the ad that has the highest probability of being clicked. This …
[PDF][PDF] Lifetime value marketing using reinforcement learning
G Theocharous, A Hallak - 2013 - Citeseer
In many marketing applications, companies use technology for interacting with their
customers and making product or services recommendations. Today, these marketing …
customers and making product or services recommendations. Today, these marketing …
Freekd: Free-direction knowledge distillation for graph neural networks
Knowledge distillation (KD) has demonstrated its effectiveness to boost the performance of
graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher …
graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher …
Optimizing debt collections using constrained reinforcement learning
The problem of optimally managing the collections process by taxation authorities is one of
prime importance, not only for the revenue it brings but also as a means to administer a fair …
prime importance, not only for the revenue it brings but also as a means to administer a fair …
Fire now, fire later: alarm-based systems for prescriptive process monitoring
Predictive process monitoring is a family of techniques to analyze events produced during
the execution of a business process in order to predict the future state or the final outcome of …
the execution of a business process in order to predict the future state or the final outcome of …
Extracting actionable knowledge from decision trees
Most data mining algorithms and tools stop at discovered customer models, producing
distribution information on customer profiles. Such techniques, when applied to industrial …
distribution information on customer profiles. Such techniques, when applied to industrial …
Recurrent reinforcement learning: a hybrid approach
Successful applications of reinforcement learning in real-world problems often require
dealing with partially observable states. It is in general very challenging to construct and …
dealing with partially observable states. It is in general very challenging to construct and …
Dynamic catalog mailing policies
DI Simester, P Sun, JN Tsitsiklis - Management science, 2006 - pubsonline.informs.org
Deciding who should receive a mail-order catalog is among the most important decisions
that mail-order-catalog firms must address. In practice, the current approach to the problem …
that mail-order-catalog firms must address. In practice, the current approach to the problem …