An algorithmic perspective on imitation learning
As robots and other intelligent agents move from simple environments and problems to more
complex, unstructured settings, manually programming their behavior has become …
complex, unstructured settings, manually programming their behavior has become …
A survey of inverse reinforcement learning: Challenges, methods and progress
Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …
Edge-labeling graph neural network for few-shot learning
In this paper, we propose a novel edge-labeling graph neural network (EGNN), which
adapts a deep neural network on the edge-labeling graph, for few-shot learning. The …
adapts a deep neural network on the edge-labeling graph, for few-shot learning. The …
Multi-agent deep reinforcement learning for multi-robot applications: A survey
J Orr, A Dutta - Sensors, 2023 - mdpi.com
Deep reinforcement learning has produced many success stories in recent years. Some
example fields in which these successes have taken place include mathematics, games …
example fields in which these successes have taken place include mathematics, games …
Smart “predict, then optimize”
AN Elmachtoub, P Grigas - Management Science, 2022 - pubsonline.informs.org
Many real-world analytics problems involve two significant challenges: prediction and
optimization. Because of the typically complex nature of each challenge, the standard …
optimization. Because of the typically complex nature of each challenge, the standard …
Optnet: Differentiable optimization as a layer in neural networks
This paper presents OptNet, a network architecture that integrates optimization problems
(here, specifically in the form of quadratic programs) as individual layers in larger end-to-end …
(here, specifically in the form of quadratic programs) as individual layers in larger end-to-end …
Safe, multi-agent, reinforcement learning for autonomous driving
S Shalev-Shwartz, S Shammah, A Shashua - arXiv preprint arXiv …, 2016 - arxiv.org
Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated
negotiation skills with other road users when overtaking, giving way, merging, taking left and …
negotiation skills with other road users when overtaking, giving way, merging, taking left and …
Input convex neural networks
This paper presents the input convex neural network architecture. These are scalar-valued
(potentially deep) neural networks with constraints on the network parameters such that the …
(potentially deep) neural networks with constraints on the network parameters such that the …
Simple and accurate dependency parsing using bidirectional LSTM feature representations
E Kiperwasser, Y Goldberg - Transactions of the Association for …, 2016 - direct.mit.edu
We present a simple and effective scheme for dependency parsing which is based on
bidirectional-LSTMs (BiLSTMs). Each sentence token is associated with a BiLSTM vector …
bidirectional-LSTMs (BiLSTMs). Each sentence token is associated with a BiLSTM vector …
Learning with differentiable pertubed optimizers
Abstract Machine learning pipelines often rely on optimizers procedures to make discrete
decisions (eg, sorting, picking closest neighbors, or shortest paths). Although these discrete …
decisions (eg, sorting, picking closest neighbors, or shortest paths). Although these discrete …