Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes

Authors

Chao Qu, Xiaoyu Tan, Siqiao Xue, Xiaoming Shi, James Zhang, Hongyuan Mei

Publication date

2023/6/26

Journal

Proceedings of the AAAI Conference on Artificial Intelligence

Pages

9543-9551

Description

We consider a sequential decision-making problem in which the agent faces an environment characterized by stochastic discrete events and seeks an optimal intervention policy that maximizes its long-term reward. This problem is ubiquitous in social media, finance, and health informatics but is rarely investigated in conventional reinforcement learning research. To this end, we present a novel model-based reinforcement learning framework in which the agent's actions and observations are asynchronous stochastic discrete events occurring in continuous time. We model the dynamics of the environment by a Hawkes process with an external intervention control term and develop an algorithm that embeds this process in the Bellman equation, which guides the direction of the value gradient. We demonstrate the superiority of our method in both synthetic-simulator and real-data experiments.
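To make the phrase "Hawkes process with an external intervention control term" concrete, here is a minimal sketch of a univariate Hawkes process with an exponential kernel, sampled by Ogata's thinning method. The additive, constant `control` term, the parameter values, and the function names `intensity` and `simulate` are illustrative assumptions; the abstract does not specify the form of the control term, so this should not be read as the paper's model or algorithm.

```python
import math
import random


def intensity(t, events, mu=0.5, alpha=0.8, beta=1.5, control=0.0):
    """Conditional intensity at time t given strictly earlier event times.

    mu is the baseline rate; alpha and beta are the jump size and decay
    rate of the exponential kernel. `control` is a hypothetical additive
    intervention term (an assumption, not the paper's formulation).
    """
    excitation = sum(alpha * math.exp(-beta * (t - s)) for s in events if s < t)
    # An intensity cannot be negative, so clip at zero.
    return max(mu + excitation + control, 0.0)


def simulate(horizon, control=0.0, seed=0, mu=0.5, alpha=0.8, beta=1.5):
    """Sample event times on [0, horizon) with Ogata's thinning method."""
    rng = random.Random(seed)
    events, t = [], 0.0
    while True:
        # Between events the intensity only decays, so its value at t plus
        # one extra jump (covering an event exactly at t) is an upper bound.
        lam_bar = intensity(t, events, mu, alpha, beta, control) + alpha
        t += rng.expovariate(lam_bar)
        if t >= horizon:
            return events
        # Accept the candidate with probability intensity(t) / lam_bar.
        if rng.random() * lam_bar <= intensity(t, events, mu, alpha, beta, control):
            events.append(t)
```

Calling `simulate(10.0, control=0.3)` and `simulate(10.0, control=-0.3)` gives a rough feel for how such a term would raise or lower the event rate. In a model-based setting, the sampled events would then feed into a value estimate.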

Total citations

Cited by 14

2023: 10, 2024: 4
