A survey on recent advances and challenges in reinforcement learning methods for task-oriented dialogue policy learning

WC Kwan, HR Wang, HM Wang, KF Wong - Machine Intelligence …, 2023 - Springer
Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD)
system. Its goal is to decide the next action of the dialogue system, given the dialogue state …

A survey on dialog management: Recent advances and challenges

Y Dai, H Yu, Y Jiang, C Tang, Y Li, J Sun - arXiv preprint arXiv:2005.02233, 2020 - arxiv.org
Dialog management (DM) is a crucial component in a task-oriented dialog system. Given the
dialog history, DM predicts the dialog state and decides the next action that the dialog agent …

JoTR: A Joint Transformer and Reinforcement Learning Framework for Dialog Policy Learning

WC Kwan, H Wang, H Wang, Z Wang, X Wu… - arXiv preprint arXiv …, 2023 - arxiv.org
Dialogue policy learning (DPL) is a crucial component of dialogue modelling. Its primary role
is to determine the appropriate abstract response, commonly referred to as the" dialogue …

Coherent dialog generation with query graph

J Xu, Z Lei, H Wang, ZY Niu, H Wu, W Che… - Transactions on Asian …, 2021 - dl.acm.org
Learning to generate coherent and informative dialogs is an enduring challenge for open-
domain conversation generation. Previous work leverage knowledge graph or documents to …

Task-oriented Dialog Policy Learning via Deep Reinforcement Learning and Automatic Graph Neural Network Curriculum Learning

K Hanneman - 2024 - studenttheses.uu.nl
In a task-oriented dialog system, a core component is the dialog policy, which determines
the response action and guides the conversation system to complete the task. Optimizing …

[引用][C] Xiaomi team ten thousand words long article| Latest research progress on dialogue management model-Alibaba Cloud Developer Community

D Yinpei, Y Huihua, J Yixuan, T Chengguang…