Ai alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …

A social path to human-like artificial intelligence

EA Duéñez-Guzmán, S Sadedin, JX Wang… - Nature Machine …, 2023 - nature.com
Traditionally, cognitive and computer scientists have viewed intelligence solipsistically, as a
property of unitary agents devoid of social context. Given the success of contemporary …

Minigrid & miniworld: Modular & customizable reinforcement learning environments for goal-oriented tasks

M Chevalier-Boisvert, B Dai… - Advances in …, 2024 - proceedings.neurips.cc
We present the Minigrid and Miniworld libraries which provide a suite of goal-oriented 2D
and 3D environments. The libraries were explicitly created with a minimalistic design …

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2024 - proceedings.neurips.cc
The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

Towards a standardised performance evaluation protocol for cooperative marl

R Gorsane, O Mahjoub, RJ de Kock… - Advances in …, 2022 - proceedings.neurips.cc
Multi-agent reinforcement learning (MARL) has emerged as a useful approach to solving
decentralised decision-making problems at scale. Research in the field has been growing …

Deep reinforcement learning in the advanced cybersecurity threat detection and protection

M Sewak, SK Sahay, H Rathore - Information Systems Frontiers, 2023 - Springer
The cybersecurity threat landscape has lately become overly complex. Threat actors
leverage weaknesses in the network and endpoint security in a very coordinated manner to …

Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world

E Vinitsky, N Lichtlé, X Yang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract We introduce\textit {Nocturne}, a new 2D driving simulator for investigating multi-
agent coordination under partial observability. The focus of Nocturne is to enable research …

Gigastep-one billion steps per second multi-agent reinforcement learning

M Lechner, T Seyde, THJ Wang… - Advances in …, 2024 - proceedings.neurips.cc
Multi-agent reinforcement learning (MARL) research is faced with a trade-off: it either uses
complex environments requiring large compute resources, which makes it inaccessible to …

A review of cooperation in multi-agent learning

Y Du, JZ Leibo, U Islam, R Willis, P Sunehag - arXiv preprint arXiv …, 2023 - arxiv.org
Cooperation in multi-agent learning (MAL) is a topic at the intersection of numerous
disciplines, including game theory, economics, social sciences, and evolutionary biology …

Hardness of independent learning and sparse equilibrium computation in markov games

DJ Foster, N Golowich… - … Conference on Machine …, 2023 - proceedings.mlr.press
We consider the problem of decentralized multi-agent reinforcement learning in Markov
games. A fundamental question is whether there exist algorithms that, when run …