Survey on reinforcement learning for language processing
V Uc-Cetina, N Navarro-Guerrero… - Artificial Intelligence …, 2023 - Springer
In recent years some researchers have explored the use of reinforcement learning (RL)
algorithms as key components in the solution of various natural language processing (NLP) …
algorithms as key components in the solution of various natural language processing (NLP) …
A survey on recent advances and challenges in reinforcement learning methods for task-oriented dialogue policy learning
Dialogue policy learning (DPL) is a key component in a task-oriented dialogue (TOD)
system. Its goal is to decide the next action of the dialogue system, given the dialogue state …
system. Its goal is to decide the next action of the dialogue system, given the dialogue state …
Multi-agent reinforcement learning: Methods, applications, visionary prospects, and challenges
Multi-agent reinforcement learning (MARL) is a widely used Artificial Intelligence (AI)
technique. However, current studies and applications need to address its scalability, non …
technique. However, current studies and applications need to address its scalability, non …
Learning rewards from linguistic feedback
We explore unconstrained natural language feedback as a learning signal for artificial
agents. Humans use rich and varied language to teach, yet most prior work on interactive …
agents. Humans use rich and varied language to teach, yet most prior work on interactive …
AgentGraph: Toward universal dialogue management with structured deep reinforcement learning
Dialogue policy plays an important role in task-oriented spoken dialogue systems. It
determines how to respond to users. The recently proposed deep reinforcement learning …
determines how to respond to users. The recently proposed deep reinforcement learning …
Agent-aware dropout dqn for safe and efficient on-line dialogue policy learning
Hand-crafted rules and reinforcement learning (RL) are two popular choices to obtain
dialogue policy. The rule-based policy is often reliable within predefined scope but not self …
dialogue policy. The rule-based policy is often reliable within predefined scope but not self …
Structured dialogue policy with graph neural networks
Recently, deep reinforcement learning (DRL) has been used for dialogue policy
optimization. However, many DRL-based policies are not sample-efficient. Most recent …
optimization. However, many DRL-based policies are not sample-efficient. Most recent …
Efficient dialogue complementary policy learning via deep q-network policy and episodic memory policy
Deep reinforcement learning has shown great potential in training dialogue policies.
However, its favorable performance comes at the cost of many rounds of interaction. Most of …
However, its favorable performance comes at the cost of many rounds of interaction. Most of …
Automatic curriculum learning with over-repetition penalty for dialogue policy learning
Dialogue policy learning based on reinforcement learning is difficult to be applied to real
users to train dialogue agents from scratch because of the high cost. User simulators, which …
users to train dialogue agents from scratch because of the high cost. User simulators, which …
Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning
Reinforcement learning (RL) has emerged as a key technique for designing dialogue
policies. However, action space inflation in dialogue tasks has led to a heavy decision …
policies. However, action space inflation in dialogue tasks has led to a heavy decision …