Hybrid reinforcement/supervised learning of dialogue policies from fixed data sets

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

被引用次数：2086 相关文章所有 3 个版本

[PDF] arxiv.org

Recent advances in deep learning based dialogue systems: A systematic survey

J Ni, T Young, V Pandelea, F Xue… - Artificial intelligence review, 2023 - Springer

Dialogue systems are a popular natural language processing (NLP) task as it is promising in
real-life applications. It is also a complicated task since many NLP tasks deserving study are …

被引用次数：285 相关文章所有 15 个版本

[PDF] nowpublishers.com

Neural approaches to conversational AI

J Gao, M Galley, L Li - The 41st international ACM SIGIR conference on …, 2018 - dl.acm.org

This tutorial surveys neural approaches to conversational AI that were developed in the last
few years. We group conversational systems into three categories:(1) question answering …

被引用次数：904 相关文章所有 16 个版本

[PDF] springer.com

Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer

In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

被引用次数：152 相关文章所有 12 个版本

[PDF] arxiv.org

A deep reinforcement learning chatbot

IV Serban, C Sankar, M Germain, S Zhang… - arXiv preprint arXiv …, 2017 - arxiv.org

We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal
Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is …

被引用次数：364 相关文章所有 4 个版本

[PDF] neurips.cc

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc

Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

被引用次数：197 相关文章所有 8 个版本

[PDF] mmeteer.com

Pomdp-based statistical spoken dialog systems: A review

S Young, M Gašić, B Thomson… - Proceedings of the …, 2013 - ieeexplore.ieee.org

Statistical dialog systems (SDSs) are motivated by the need for a data-driven framework that
reduces the cost of laboriously handcrafting complex dialog managers and that provides …

被引用次数：1075 相关文章所有 8 个版本

[PDF] neurips.cc

Emergence of language with multi-agent games: Learning to communicate with sequences of symbols

S Havrylov, I Titov - Advances in neural information …, 2017 - proceedings.neurips.cc

Learning to communicate through interaction, rather than relying on explicit supervision, is
often considered a prerequisite for developing a general AI. We study a setting where two …

被引用次数：336 相关文章所有 12 个版本

[PDF] arxiv.org

Dialogue learning with human teaching and feedback in end-to-end trainable task-oriented dialogue systems

B Liu, G Tur, D Hakkani-Tur, P Shah, L Heck - arXiv preprint arXiv …, 2018 - arxiv.org

In this work, we present a hybrid learning method for training task-oriented dialogue systems
through online user interactions. Popular methods for learning task-oriented dialogues …

被引用次数：214 相关文章所有 6 个版本

[PDF] researchgate.net

Improving recommender systems with adaptive conversational strategies

T Mahmood, F Ricci - Proceedings of the 20th ACM conference on …, 2009 - dl.acm.org

Conversational recommender systems (CRSs) assist online users in their information-
seeking and decision making tasks by supporting an interactive process. Although these …

被引用次数：421 相关文章所有 8 个版本