Toward self‐driving processes: A deep reinforcement learning approach to control

S Spielberg, A Tulsyan, NP Lawrence… - AIChE …, 2019 - Wiley Online Library
Advanced model‐based controllers are well established in process industries. However,
such controllers require regular maintenance to maintain acceptable performance. It is a …

Reinforcement learning design for cancer clinical trials

Y Zhao, MR Kosorok, D Zeng - Statistics in medicine, 2009 - Wiley Online Library
We develop reinforcement learning trials for discovering individualized treatment regimens
for life‐threatening diseases such as cancer. A temporal‐difference learning method called …

Ad recommendation systems for life-time value optimization

G Theocharous, PS Thomas… - Proceedings of the 24th …, 2015 - dl.acm.org
The main objective in the ad recommendation problem is to find a strategy that, for each
visitor of the website, selects the ad that has the highest probability of being clicked. This …

[PDF][PDF] Lifetime value marketing using reinforcement learning

G Theocharous, A Hallak - 2013 - Citeseer
In many marketing applications, companies use technology for interacting with their
customers and making product or services recommendations. Today, these marketing …

Freekd: Free-direction knowledge distillation for graph neural networks

K Feng, C Li, Y Yuan, G Wang - Proceedings of the 28th ACM SIGKDD …, 2022 - dl.acm.org
Knowledge distillation (KD) has demonstrated its effectiveness to boost the performance of
graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher …

Optimizing debt collections using constrained reinforcement learning

N Abe, P Melville, C Pendus, CK Reddy… - Proceedings of the 16th …, 2010 - dl.acm.org
The problem of optimally managing the collections process by taxation authorities is one of
prime importance, not only for the revenue it brings but also as a means to administer a fair …

Fire now, fire later: alarm-based systems for prescriptive process monitoring

SA Fahrenkrog-Petersen, N Tax, I Teinemaa… - … and Information Systems, 2022 - Springer
Predictive process monitoring is a family of techniques to analyze events produced during
the execution of a business process in order to predict the future state or the final outcome of …

Extracting actionable knowledge from decision trees

Q Yang, J Yin, C Ling, R Pan - IEEE Transactions on …, 2006 - ieeexplore.ieee.org
Most data mining algorithms and tools stop at discovered customer models, producing
distribution information on customer profiles. Such techniques, when applied to industrial …

Recurrent reinforcement learning: a hybrid approach

X Li, L Li, J Gao, X He, J Chen, L Deng, J He - arXiv preprint arXiv …, 2015 - arxiv.org
Successful applications of reinforcement learning in real-world problems often require
dealing with partially observable states. It is in general very challenging to construct and …

Dynamic catalog mailing policies

DI Simester, P Sun, JN Tsitsiklis - Management science, 2006 - pubsonline.informs.org
Deciding who should receive a mail-order catalog is among the most important decisions
that mail-order-catalog firms must address. In practice, the current approach to the problem …