An algorithmic perspective on imitation learning

T Osa, J Pajarinen, G Neumann… - … and Trends® in …, 2018 - nowpublishers.com
As robots and other intelligent agents move from simple environments and problems to more
complex, unstructured settings, manually programming their behavior has become …

A survey of inverse reinforcement learning: Challenges, methods and progress

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier
Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

Edge-labeling graph neural network for few-shot learning

J Kim, T Kim, S Kim, CD Yoo - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
In this paper, we propose a novel edge-labeling graph neural network (EGNN), which
adapts a deep neural network on the edge-labeling graph, for few-shot learning. The …

Multi-agent deep reinforcement learning for multi-robot applications: A survey

J Orr, A Dutta - Sensors, 2023 - mdpi.com
Deep reinforcement learning has produced many success stories in recent years. Some
example fields in which these successes have taken place include mathematics, games …

Smart “predict, then optimize”

AN Elmachtoub, P Grigas - Management Science, 2022 - pubsonline.informs.org
Many real-world analytics problems involve two significant challenges: prediction and
optimization. Because of the typically complex nature of each challenge, the standard …

Optnet: Differentiable optimization as a layer in neural networks

B Amos, JZ Kolter - International conference on machine …, 2017 - proceedings.mlr.press
This paper presents OptNet, a network architecture that integrates optimization problems
(here, specifically in the form of quadratic programs) as individual layers in larger end-to-end …

Safe, multi-agent, reinforcement learning for autonomous driving

S Shalev-Shwartz, S Shammah, A Shashua - arXiv preprint arXiv …, 2016 - arxiv.org
Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated
negotiation skills with other road users when overtaking, giving way, merging, taking left and …

Input convex neural networks

B Amos, L Xu, JZ Kolter - International conference on …, 2017 - proceedings.mlr.press
This paper presents the input convex neural network architecture. These are scalar-valued
(potentially deep) neural networks with constraints on the network parameters such that the …

Simple and accurate dependency parsing using bidirectional LSTM feature representations

E Kiperwasser, Y Goldberg - Transactions of the Association for …, 2016 - direct.mit.edu
We present a simple and effective scheme for dependency parsing which is based on
bidirectional-LSTMs (BiLSTMs). Each sentence token is associated with a BiLSTM vector …

Learning with differentiable pertubed optimizers

Q Berthet, M Blondel, O Teboul… - Advances in neural …, 2020 - proceedings.neurips.cc
Abstract Machine learning pipelines often rely on optimizers procedures to make discrete
decisions (eg, sorting, picking closest neighbors, or shortest paths). Although these discrete …