Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Recent advances in deep learning based dialogue systems: A systematic survey

J Ni, T Young, V Pandelea, F Xue… - Artificial intelligence review, 2023 - Springer
Dialogue systems are a popular natural language processing (NLP) task as it is promising in
real-life applications. It is also a complicated task since many NLP tasks deserving study are …

Neural approaches to conversational AI

J Gao, M Galley, L Li - The 41st international ACM SIGIR conference on …, 2018 - dl.acm.org
This tutorial surveys neural approaches to conversational AI that were developed in the last
few years. We group conversational systems into three categories:(1) question answering …

Survey on reinforcement learning for language processing

V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer
In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …

A deep reinforcement learning chatbot

IV Serban, C Sankar, M Germain, S Zhang… - arXiv preprint arXiv …, 2017 - arxiv.org
We present MILABOT: a deep reinforcement learning chatbot developed by the Montreal
Institute for Learning Algorithms (MILA) for the Amazon Alexa Prize competition. MILABOT is …

Rl unplugged: A suite of benchmarks for offline reinforcement learning

C Gulcehre, Z Wang, A Novikov… - Advances in …, 2020 - proceedings.neurips.cc
Offline methods for reinforcement learning have a potential to help bridge the gap between
reinforcement learning research and real-world applications. They make it possible to learn …

Pomdp-based statistical spoken dialog systems: A review

S Young, M Gašić, B Thomson… - Proceedings of the …, 2013 - ieeexplore.ieee.org
Statistical dialog systems (SDSs) are motivated by the need for a data-driven framework that
reduces the cost of laboriously handcrafting complex dialog managers and that provides …

Emergence of language with multi-agent games: Learning to communicate with sequences of symbols

S Havrylov, I Titov - Advances in neural information …, 2017 - proceedings.neurips.cc
Learning to communicate through interaction, rather than relying on explicit supervision, is
often considered a prerequisite for developing a general AI. We study a setting where two …

Dialogue learning with human teaching and feedback in end-to-end trainable task-oriented dialogue systems

B Liu, G Tur, D Hakkani-Tur, P Shah, L Heck - arXiv preprint arXiv …, 2018 - arxiv.org
In this work, we present a hybrid learning method for training task-oriented dialogue systems
through online user interactions. Popular methods for learning task-oriented dialogues …

Improving recommender systems with adaptive conversational strategies

T Mahmood, F Ricci - Proceedings of the 20th ACM conference on …, 2009 - dl.acm.org
Conversational recommender systems (CRSs) assist online users in their information-
seeking and decision making tasks by supporting an interactive process. Although these …