Affordable on-line dialogue policy learning

Z Zhang, R Takanobu, Q Zhu, ML Huang… - Science China …, 2020 - Springer

Due to the significance and value in human-computer interaction and natural language
processing, task-oriented dialog systems are attracting more and more attention in both …

被引用次数：216 相关文章所有 5 个版本

[PDF] arxiv.org

The price of debiasing automatic metrics in natural language evaluation

AT Chaganty, S Mussman, P Liang - arXiv preprint arXiv:1807.02202, 2018 - arxiv.org

For evaluating generation systems, automatic metrics such as BLEU cost nothing to run but
have been shown to correlate poorly with human judgment, leading to systematic bias …

被引用次数：133 相关文章所有 6 个版本

[PDF] arxiv.org

Neural user simulation for corpus-based policy optimisation for spoken dialogue systems

F Kreyssig, I Casanueva, P Budzianowski… - arXiv preprint arXiv …, 2018 - arxiv.org

User Simulators are one of the major tools that enable offline training of task-oriented
dialogue systems. For this task the Agenda-Based User Simulator (ABUS) is often used. The …

被引用次数：85 相关文章所有 8 个版本

[PDF] arxiv.org

AgentGraph: Toward universal dialogue management with structured deep reinforcement learning

L Chen, Z Chen, B Tan, S Long… - … /ACM Transactions on …, 2019 - ieeexplore.ieee.org

Dialogue policy plays an important role in task-oriented spoken dialogue systems. It
determines how to respond to users. The recently proposed deep reinforcement learning …

被引用次数：46 相关文章所有 6 个版本

[PDF] arxiv.org

Budgeted policy learning for task-oriented dialogue systems

Z Zhang, X Li, J Gao, E Chen - arXiv preprint arXiv:1906.00499, 2019 - arxiv.org

This paper presents a new approach that extends Deep Dyna-Q (DDQ) by incorporating a
Budget-Conscious Scheduling (BCS) to best utilize a fixed, small amount of user interactions …

被引用次数：36 相关文章所有 3 个版本

[PDF] aclanthology.org

Structured dialogue policy with graph neural networks

L Chen, B Tan, S Long, K Yu - Proceedings of the 27th …, 2018 - aclanthology.org

Recently, deep reinforcement learning (DRL) has been used for dialogue policy
optimization. However, many DRL-based policies are not sample-efficient. Most recent …

被引用次数：34 相关文章

[PDF] hhu.de

Policy adaptation for deep reinforcement learning-based dialogue management

L Chen, C Chang, Z Chen, B Tan… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Policy optimization is the core part of statistical dialogue management. Deep reinforcement
learning has been successfully used for dialogue policy optimization for a static pre-defined …

被引用次数：30 相关文章所有 4 个版本

[PDF] aclanthology.org

Actor-double-critic: Incorporating model-based critic for task-oriented dialogue systems

YC Wu, BH Tseng, M Gasic - Findings of the Association for …, 2020 - aclanthology.org

In order to improve the sample-efficiency of deep reinforcement learning (DRL), we
implemented imagination augmented agent (I2A) in spoken dialogue systems (SDS) …

被引用次数：9 相关文章

[PDF] aclanthology.org

A teacher-student framework for maintainable dialog manager

W Wang, J Zhang, H Zhang, MY Hwang… - Proceedings of the …, 2018 - aclanthology.org

Reinforcement learning (RL) is an attractive solution for task-oriented dialog systems.
However, extending RL-based systems to handle new intents and slots requires a system …

被引用次数：12 相关文章所有 2 个版本

[PDF] arxiv.org

Deep reinforcement learning for on-line dialogue state tracking

Z Chen, L Chen, X Zhou, K Yu - National Conference on Man-Machine …, 2022 - Springer

Dialogue state tracking (DST) is a crucial module in dialogue management. It is usually cast
as a supervised training problem, which is not convenient for on-line optimization. In this …

被引用次数：6 相关文章所有 4 个版本