Foundational challenges in assuring alignment and safety of large language models

U Anwar, A Saparov, J Rando, D Paleka… - arXiv preprint arXiv …, 2024 - arxiv.org
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …

[HTML][HTML] Mathematical frameworks for the analysis of norms

A Sontuoso - Current Opinion in Psychology, 2024 - Elsevier
Research into society's informal rules of conduct, or norms, has recently experienced a
surge, extending across multiple academic disciplines. Despite this growth, the theoretical …

Cooperation on the fly: Exploring language agents for ad hoc teamwork in the avalon game

Z Shi, M Fang, S Zheng, S Deng, L Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Multi-agent collaboration with Large Language Models (LLMs) demonstrates proficiency in
basic tasks, yet its efficiency in more complex scenarios remains unexplored. In gaming …

Applying opponent and environment modelling in decentralised multi-agent reinforcement learning

A Chernyavskiy, A Skrynnik, A Panov - Cognitive Systems Research, 2025 - Elsevier
Multi-agent reinforcement learning (MARL) has recently gained popularity and achieved
much success in different kind of games such as zero-sum, cooperative or general-sum …

STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning

S Chen, Z Zhang, Y Yang, Y Du - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Centralized Training with Decentralized Execution (CTDE) has been proven to be an
effective paradigm in cooperative multi-agent reinforcement learning (MARL). One of the …

Environment Complexity and Nash Equilibria in a Sequential Social Dilemma

M Yasir, A Howes, V Mavroudis, C Hicks - arXiv preprint arXiv:2408.02148, 2024 - arxiv.org
Multi-agent reinforcement learning (MARL) methods, while effective in zero-sum or positive-
sum games, often yield suboptimal outcomes in general-sum games where cooperation is …

Enabling Multi-Robot Collaboration from Single-Human Guidance

Z Ji, L Zhang, P Sajda, B Chen - arXiv preprint arXiv:2409.19831, 2024 - arxiv.org
Learning collaborative behaviors is essential for multi-agent systems. Traditionally, multi-
agent reinforcement learning solves this implicitly through a joint reward and centralized …

[HTML][HTML] Altered gene expression profiles in the lungs of benzo [a] pyrene-exposed mice in the presence of lipopolysaccharide-induced pulmonary inflammation

Q Shi, RR Fijten, D Spina, YR Vasquez, VM Arlt… - Toxicology and applied …, 2017 - Elsevier
Patients with inflammatory lung diseases are often additionally exposed to polycyclic
aromatic hydrocarbons like B [a] P and B [a] P-induced alterations in gene expression in …

TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

X Lou, J Zhang, TJ Norman, K Huang… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Multi-Agent Policy Gradient (MAPG) has made significant progress in recent years.
However, centralized critics in state-of-the-art MAPG methods still face the centralized …

Beyond the matrix: Experimental approaches to studying cognitive agents in social-ecological systems

U Hertz, R Köster, MA Janssen, JZ Leibo - Cognition, 2025 - Elsevier
Social-ecological systems, in which agents interact with each other and their environment
are important both for sustainability applications and for under-standing how human …