Neurosymbolic transformers for multi-agent communication

Y Qing, S Liu, J Song, H Wang, M Song - arXiv preprint arXiv:2211.06665, 2022 - arxiv.org

Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

被引用次数：24 相关文章所有 3 个版本

[PDF] mlr.press

Cooperative exploration for multi-agent deep reinforcement learning

IJ Liu, U Jain, RA Yeh… - … conference on machine …, 2021 - proceedings.mlr.press

Exploration is critical for good results in deep reinforcement learning and has attracted much
attention. However, existing multi-agent deep reinforcement learning algorithms still use …

被引用次数：109 相关文章所有 9 个版本

[PDF] neurips.cc

Learning transformer programs

D Friedman, A Wettig, D Chen - Advances in Neural …, 2024 - proceedings.neurips.cc

Recent research in mechanistic interpretability has attempted to reverse-engineer
Transformer models by carefully inspecting network weights and activations. However, these …

被引用次数：22 相关文章所有 5 个版本

[PDF] neurips.cc

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc

We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

被引用次数：86 相关文章所有 15 个版本

[PDF] acm.org

Generative AI for Self-Adaptive Systems: State of the Art and Research Roadmap

J Li, M Zhang, N Li, D Weyns, Z Jin, K Tei - ACM Transactions on …, 2024 - dl.acm.org

Self-adaptive systems (SASs) are designed to handle changes and uncertainties through a
feedback loop with four core functionalities: monitoring, analyzing, planning, and execution …

被引用次数：1 相关文章

Integrating the traffic science with representation learning for city-wide network congestion prediction

W Zheng, HF Yang, J Cai, P Wang, X Jiang, SS Du… - Information …, 2023 - Elsevier

Recent studies on traffic congestion prediction have paved a promising path towards the
reduction of potential economic and environmental loss. However, at the city-wide scale …

被引用次数：12 相关文章所有 2 个版本

[PDF] neurips.cc

Program synthesis guided reinforcement learning for partially observed environments

Y Yang, JP Inala, O Bastani, Y Pu… - Advances in neural …, 2021 - proceedings.neurips.cc

A key challenge for reinforcement learning is solving long-horizon planning problems.
Recent work has leveraged programs to guide reinforcement learning in these settings …

被引用次数：33 相关文章所有 15 个版本

[PDF] acm.org

Web question answering with neurosymbolic program synthesis

Q Chen, A Lamoreaux, X Wang, G Durrett… - Proceedings of the …, 2021 - dl.acm.org

In this paper, we propose a new technique based on program synthesis for extracting
information from webpages. Given a natural language query and a few labeled webpages …

被引用次数：32 相关文章所有 9 个版本

2021 年无人机热点回眸

段海滨，何杭轩，赵彦杰，王寅，牛轶峰，袁莞迈… - 科技导报, 2022 - kjdb.org

随着人工智能技术的发展以及机载任务平台性能的提升, 2021 年无人机技术与应用呈现新的
发展态势. 从无人机政策法规, 自主控制, 反无人机, 应用领域等多方面回眸了2021 …

被引用次数：9 相关文章所有 3 个版本

[PDF] acm.org

Programming-by-demonstration for long-horizon robot tasks

N Patton, K Rahmani, M Missula, J Biswas… - Proceedings of the ACM …, 2024 - dl.acm.org

The goal of programmatic Learning from Demonstration (LfD) is to learn a policy in a
programming language that can be used to control a robot's behavior from a set of user …

被引用次数：6 相关文章所有 9 个版本