Foundational challenges in assuring alignment and safety of large language models
This work identifies 18 foundational challenges in assuring the alignment and safety of large
language models (LLMs). These challenges are organized into three different categories …
language models (LLMs). These challenges are organized into three different categories …
[HTML][HTML] Mathematical frameworks for the analysis of norms
A Sontuoso - Current Opinion in Psychology, 2024 - Elsevier
Research into society's informal rules of conduct, or norms, has recently experienced a
surge, extending across multiple academic disciplines. Despite this growth, the theoretical …
surge, extending across multiple academic disciplines. Despite this growth, the theoretical …
Cooperation on the fly: Exploring language agents for ad hoc teamwork in the avalon game
Multi-agent collaboration with Large Language Models (LLMs) demonstrates proficiency in
basic tasks, yet its efficiency in more complex scenarios remains unexplored. In gaming …
basic tasks, yet its efficiency in more complex scenarios remains unexplored. In gaming …
Applying opponent and environment modelling in decentralised multi-agent reinforcement learning
A Chernyavskiy, A Skrynnik, A Panov - Cognitive Systems Research, 2025 - Elsevier
Multi-agent reinforcement learning (MARL) has recently gained popularity and achieved
much success in different kind of games such as zero-sum, cooperative or general-sum …
much success in different kind of games such as zero-sum, cooperative or general-sum …
STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning
Centralized Training with Decentralized Execution (CTDE) has been proven to be an
effective paradigm in cooperative multi-agent reinforcement learning (MARL). One of the …
effective paradigm in cooperative multi-agent reinforcement learning (MARL). One of the …
Environment Complexity and Nash Equilibria in a Sequential Social Dilemma
Multi-agent reinforcement learning (MARL) methods, while effective in zero-sum or positive-
sum games, often yield suboptimal outcomes in general-sum games where cooperation is …
sum games, often yield suboptimal outcomes in general-sum games where cooperation is …
Enabling Multi-Robot Collaboration from Single-Human Guidance
Learning collaborative behaviors is essential for multi-agent systems. Traditionally, multi-
agent reinforcement learning solves this implicitly through a joint reward and centralized …
agent reinforcement learning solves this implicitly through a joint reward and centralized …
[HTML][HTML] Altered gene expression profiles in the lungs of benzo [a] pyrene-exposed mice in the presence of lipopolysaccharide-induced pulmonary inflammation
Patients with inflammatory lung diseases are often additionally exposed to polycyclic
aromatic hydrocarbons like B [a] P and B [a] P-induced alterations in gene expression in …
aromatic hydrocarbons like B [a] P and B [a] P-induced alterations in gene expression in …
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Multi-Agent Policy Gradient (MAPG) has made significant progress in recent years.
However, centralized critics in state-of-the-art MAPG methods still face the centralized …
However, centralized critics in state-of-the-art MAPG methods still face the centralized …
Beyond the matrix: Experimental approaches to studying cognitive agents in social-ecological systems
Social-ecological systems, in which agents interact with each other and their environment
are important both for sustainability applications and for under-standing how human …
are important both for sustainability applications and for under-standing how human …