Pomdp-based statistical spoken dialog systems: A review

S Young, M Gašić, B Thomson… - Proceedings of the …, 2013 - ieeexplore.ieee.org
Statistical dialog systems (SDSs) are motivated by the need for a data-driven framework that
reduces the cost of laboriously handcrafting complex dialog managers and that provides …

A survey of available corpora for building data-driven dialogue systems

IV Serban, R Lowe, P Henderson, L Charlin… - arXiv preprint arXiv …, 2015 - arxiv.org
During the past decade, several areas of speech and language understanding have
witnessed substantial breakthroughs from the use of data-driven models. In the area of …

Towards end-to-end learning for dialog state tracking and management using deep reinforcement learning

T Zhao, M Eskenazi - arXiv preprint arXiv:1606.02560, 2016 - arxiv.org
This paper presents an end-to-end framework for task-oriented dialog systems using a
variant of Deep Recurrent Q-Networks (DRQN). The model is able to interface with a …

Deep dyna-q: Integrating planning for task-completion dialogue policy learning

B Peng, X Li, J Gao, J Liu, KF Wong, SY Su - arXiv preprint arXiv …, 2018 - arxiv.org
Training a task-completion dialogue agent via reinforcement learning (RL) is costly because
it requires many interactions with real users. One common alternative is to use a user …

Bbq-networks: Efficient exploration in deep reinforcement learning for task-oriented dialogue systems

Z Lipton, X Li, J Gao, L Li, F Ahmed… - Proceedings of the AAAI …, 2018 - ojs.aaai.org
We present a new algorithm that significantly improves the efficiency of exploration for deep
Q-learning agents in dialogue systems. Our agents explore via Thompson sampling …

Unsupervised discrete sentence representation learning for interpretable neural dialog generation

T Zhao, K Lee, M Eskenazi - arXiv preprint arXiv:1804.08069, 2018 - arxiv.org
The encoder-decoder dialog model is one of the most prominent methods used to build
dialog systems in complex domains. Yet it is limited because it cannot output interpretable …

[图书][B] Neural approaches to conversational information retrieval

J Gao, C Xiong, P Bennett, N Craswell - 2023 - Springer
A conversational information retrieval (CIR) system is an information retrieval (IR) system
with a conversational interface, which allows users to interact with the system to seek …

Gaussian processes for pomdp-based dialogue manager optimization

M Gašić, S Young - IEEE/ACM Transactions on Audio, Speech …, 2013 - ieeexplore.ieee.org
A partially observable Markov decision process (POMDP) has been proposed as a dialog
model that enables automatic optimization of the dialog policy and provides robustness to …

Sample efficient deep reinforcement learning for dialogue systems with large action spaces

G Weisz, P Budzianowski, PH Su… - IEEE/ACM Transactions …, 2018 - ieeexplore.ieee.org
In spoken dialogue systems, we aim to deploy artificial intelligence to build automated
dialogue agents that can converse with humans. A part of this effort is the policy optimization …

Policy networks with two-stage training for dialogue systems

M Fatemi, LE Asri, H Schulz, J He… - arXiv preprint arXiv …, 2016 - arxiv.org
In this paper, we propose to use deep policy networks which are trained with an advantage
actor-critic method for statistically optimised dialogue systems. First, we show that, on …