Gaussian processes for fast policy optimisation of pomdp-based dialogue managers

S Young, M Gašić, B Thomson… - Proceedings of the …, 2013 - ieeexplore.ieee.org

Statistical dialog systems (SDSs) are motivated by the need for a data-driven framework that
reduces the cost of laboriously handcrafting complex dialog managers and that provides …

被引用次数：1076 相关文章所有 8 个版本

[PDF] arxiv.org

A survey of available corpora for building data-driven dialogue systems

IV Serban, R Lowe, P Henderson, L Charlin… - arXiv preprint arXiv …, 2015 - arxiv.org

During the past decade, several areas of speech and language understanding have
witnessed substantial breakthroughs from the use of data-driven models. In the area of …

被引用次数：451 相关文章所有 9 个版本

[PDF] arxiv.org

Towards end-to-end learning for dialog state tracking and management using deep reinforcement learning

T Zhao, M Eskenazi - arXiv preprint arXiv:1606.02560, 2016 - arxiv.org

This paper presents an end-to-end framework for task-oriented dialog systems using a
variant of Deep Recurrent Q-Networks (DRQN). The model is able to interface with a …

被引用次数：332 相关文章所有 7 个版本

[PDF] arxiv.org

Deep dyna-q: Integrating planning for task-completion dialogue policy learning

B Peng, X Li, J Gao, J Liu, KF Wong, SY Su - arXiv preprint arXiv …, 2018 - arxiv.org

Training a task-completion dialogue agent via reinforcement learning (RL) is costly because
it requires many interactions with real users. One common alternative is to use a user …

被引用次数：240 相关文章所有 6 个版本

[PDF] aaai.org

Bbq-networks: Efficient exploration in deep reinforcement learning for task-oriented dialogue systems

Z Lipton, X Li, J Gao, L Li, F Ahmed… - Proceedings of the AAAI …, 2018 - ojs.aaai.org

We present a new algorithm that significantly improves the efficiency of exploration for deep
Q-learning agents in dialogue systems. Our agents explore via Thompson sampling …

被引用次数：195 相关文章所有 10 个版本

[PDF] arxiv.org

Unsupervised discrete sentence representation learning for interpretable neural dialog generation

T Zhao, K Lee, M Eskenazi - arXiv preprint arXiv:1804.08069, 2018 - arxiv.org

The encoder-decoder dialog model is one of the most prominent methods used to build
dialog systems in complex domains. Yet it is limited because it cannot output interpretable …

被引用次数：158 相关文章所有 4 个版本

[PDF] arxiv.org

[图书][B] Neural approaches to conversational information retrieval

J Gao, C Xiong, P Bennett, N Craswell - 2023 - Springer

A conversational information retrieval (CIR) system is an information retrieval (IR) system
with a conversational interface, which allows users to interact with the system to seek …

被引用次数：84 相关文章所有 6 个版本

[PDF] cam.ac.uk

Gaussian processes for pomdp-based dialogue manager optimization

M Gašić, S Young - IEEE/ACM Transactions on Audio, Speech …, 2013 - ieeexplore.ieee.org

A partially observable Markov decision process (POMDP) has been proposed as a dialog
model that enables automatic optimization of the dialog policy and provides robustness to …

被引用次数：162 相关文章所有 6 个版本

[PDF] arxiv.org

Sample efficient deep reinforcement learning for dialogue systems with large action spaces

G Weisz, P Budzianowski, PH Su… - IEEE/ACM Transactions …, 2018 - ieeexplore.ieee.org

In spoken dialogue systems, we aim to deploy artificial intelligence to build automated
dialogue agents that can converse with humans. A part of this effort is the policy optimization …

被引用次数：101 相关文章所有 7 个版本

[PDF] arxiv.org

Policy networks with two-stage training for dialogue systems

M Fatemi, LE Asri, H Schulz, J He… - arXiv preprint arXiv …, 2016 - arxiv.org

In this paper, we propose to use deep policy networks which are trained with an advantage
actor-critic method for statistically optimised dialogue systems. First, we show that, on …

被引用次数：112 相关文章所有 6 个版本