Learning structured prediction models: A large margin approach

T Osa, J Pajarinen, G Neumann… - … and Trends® in …, 2018 - nowpublishers.com

As robots and other intelligent agents move from simple environments and problems to more
complex, unstructured settings, manually programming their behavior has become …

被引用次数：972 相关文章所有 14 个版本

[PDF] sciencedirect.com

A survey of inverse reinforcement learning: Challenges, methods and progress

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier

Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

被引用次数：781 相关文章所有 6 个版本

[PDF] thecvf.com

Edge-labeling graph neural network for few-shot learning

J Kim, T Kim, S Kim, CD Yoo - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com

In this paper, we propose a novel edge-labeling graph neural network (EGNN), which
adapts a deep neural network on the edge-labeling graph, for few-shot learning. The …

被引用次数：618 相关文章所有 10 个版本

[PDF] mdpi.com

Multi-agent deep reinforcement learning for multi-robot applications: A survey

J Orr, A Dutta - Sensors, 2023 - mdpi.com

Deep reinforcement learning has produced many success stories in recent years. Some
example fields in which these successes have taken place include mathematics, games …

被引用次数：79 相关文章所有 10 个版本

[PDF] arxiv.org

Smart “predict, then optimize”

AN Elmachtoub, P Grigas - Management Science, 2022 - pubsonline.informs.org

Many real-world analytics problems involve two significant challenges: prediction and
optimization. Because of the typically complex nature of each challenge, the standard …

被引用次数：764 相关文章所有 9 个版本

[PDF] mlr.press

Optnet: Differentiable optimization as a layer in neural networks

B Amos, JZ Kolter - International conference on machine …, 2017 - proceedings.mlr.press

This paper presents OptNet, a network architecture that integrates optimization problems
(here, specifically in the form of quadratic programs) as individual layers in larger end-to-end …

被引用次数：1119 相关文章所有 8 个版本

[PDF] arxiv.org

Safe, multi-agent, reinforcement learning for autonomous driving

S Shalev-Shwartz, S Shammah, A Shashua - arXiv preprint arXiv …, 2016 - arxiv.org

Autonomous driving is a multi-agent setting where the host vehicle must apply sophisticated
negotiation skills with other road users when overtaking, giving way, merging, taking left and …

被引用次数：1150 相关文章所有 3 个版本

[PDF] mlr.press

Input convex neural networks

B Amos, L Xu, JZ Kolter - International conference on …, 2017 - proceedings.mlr.press

This paper presents the input convex neural network architecture. These are scalar-valued
(potentially deep) neural networks with constraints on the network parameters such that the …

被引用次数：730 相关文章所有 6 个版本

[PDF] mit.edu

Simple and accurate dependency parsing using bidirectional LSTM feature representations

E Kiperwasser, Y Goldberg - Transactions of the Association for …, 2016 - direct.mit.edu

We present a simple and effective scheme for dependency parsing which is based on
bidirectional-LSTMs (BiLSTMs). Each sentence token is associated with a BiLSTM vector …

被引用次数：812 相关文章所有 10 个版本

[PDF] neurips.cc

Learning with differentiable pertubed optimizers

Q Berthet, M Blondel, O Teboul… - Advances in neural …, 2020 - proceedings.neurips.cc

Abstract Machine learning pipelines often rely on optimizers procedures to make discrete
decisions (eg, sorting, picking closest neighbors, or shortest paths). Although these discrete …

被引用次数：251 相关文章所有 12 个版本