Recent advances and challenges in task-oriented dialog systems

Z Zhang, R Takanobu, Q Zhu, ML Huang… - Science China …, 2020 - Springer
Due to the significance and value in human-computer interaction and natural language
processing, task-oriented dialog systems are attracting more and more attention in both …

The price of debiasing automatic metrics in natural language evaluation

AT Chaganty, S Mussman, P Liang - arXiv preprint arXiv:1807.02202, 2018 - arxiv.org
For evaluating generation systems, automatic metrics such as BLEU cost nothing to run but
have been shown to correlate poorly with human judgment, leading to systematic bias …

Neural user simulation for corpus-based policy optimisation for spoken dialogue systems

F Kreyssig, I Casanueva, P Budzianowski… - arXiv preprint arXiv …, 2018 - arxiv.org
User Simulators are one of the major tools that enable offline training of task-oriented
dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The …

AgentGraph: Toward universal dialogue management with structured deep reinforcement learning

L Chen, Z Chen, B Tan, S Long… - … /ACM Transactions on …, 2019 - ieeexplore.ieee.org
Dialogue policy plays an important role in task-oriented spoken dialogue systems. It
determines how to respond to users. The recently proposed deep reinforcement learning …

Budgeted policy learning for task-oriented dialogue systems

Z Zhang, X Li, J Gao, E Chen - arXiv preprint arXiv:1906.00499, 2019 - arxiv.org
This paper presents a new approach that extends Deep Dyna-Q (DDQ) by incorporating a
Budget-Conscious Scheduling (BCS) to best utilize a fixed, small amount of user interactions …

Structured dialogue policy with graph neural networks

L Chen, B Tan, S Long, K Yu - Proceedings of the 27th …, 2018 - aclanthology.org
Recently, deep reinforcement learning (DRL) has been used for dialogue policy
optimization. However, many DRL-based policies are not sample-efficient. Most recent …

Policy adaptation for deep reinforcement learning-based dialogue management

L Chen, C Chang, Z Chen, B Tan… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
Policy optimization is the core part of statistical dialogue management. Deep reinforcement
learning has been successfully used for dialogue policy optimization for a static pre-defined …

Actor-double-critic: Incorporating model-based critic for task-oriented dialogue systems

YC Wu, BH Tseng, M Gasic - Findings of the Association for …, 2020 - aclanthology.org
In order to improve the sample-efficiency of deep reinforcement learning (DRL), we
implemented imagination augmented agent (I2A) in spoken dialogue systems (SDS) …

A teacher-student framework for maintainable dialog manager

W Wang, J Zhang, H Zhang, MY Hwang… - Proceedings of the …, 2018 - aclanthology.org
Reinforcement learning (RL) is an attractive solution for task-oriented dialog systems.
However, extending RL-based systems to handle new intents and slots requires a system …

Deep reinforcement learning for on-line dialogue state tracking

Z Chen, L Chen, X Zhou, K Yu - National Conference on Man-Machine …, 2022 - Springer
Dialogue state tracking (DST) is a crucial module in dialogue management. It is usually cast
as a supervised training problem, which is not convenient for on-line optimization. In this …