A rubric for human-like agents and NeuroAI

I Momennejad - … Transactions of the Royal Society B, 2023 - royalsocietypublishing.org
Researchers across cognitive, neuro-and computer sciences increasingly reference 'human-
like'artificial intelligence and 'neuroAI'. However, the scope and use of the terms are often …

Direct behavior specification via constrained reinforcement learning

J Roy, R Girgis, J Romoff, PL Bacon, C Pal - arXiv preprint arXiv …, 2021 - arxiv.org
The standard formulation of Reinforcement Learning lacks a practical way of specifying what
are admissible and forbidden behaviors. Most often, practitioners go about the task of …

BEDD: the MineRL BASALT evaluation and demonstrations dataset for training and benchmarking agents that solve fuzzy tasks

S Milani, A Kanervisto… - Advances in …, 2024 - proceedings.neurips.cc
The MineRL BASALT competition has served to catalyze advances in learning from human
feedback through four hard-to-specify tasks in Minecraft, such as create and photograph a …

Exploration patterns shape cognitive map learning

IK Brunec, MM Nantais, JE Sutton, RA Epstein… - Cognition, 2023 - Elsevier
Spontaneous, volitional spatial exploration is crucial for building up a cognitive map of the
environment. However, decades of research have primarily measured the fidelity of …

Automated gameplay testing and validation with curiosity-conditioned proximal trajectories

A Sestini, L Gisslén, J Bergdahl… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
This article proposes a novel deep reinforcement learning algorithm to perform automated
analysis and detection of gameplay issues in complex 3-D navigation environments. The …

Navigates like me: Understanding how people evaluate human-like AI in video games

S Milani, A Juliani, I Momennejad… - Proceedings of the …, 2023 - dl.acm.org
We aim to understand how people assess human likeness in navigation produced by
people and artificially intelligent (AI) agents in a video game. To this end, we propose a …

PNT 智能与智能PNT

刘经南, 罗亚荣, 郭迟, 高柯夫 - 测绘学报, 2022 - xb.chinasmp.com
定位(positioning), 导航(navigation), 授时(timing), 简称PNT, 是人类在长期感知,
认知宇宙与人类生存的关系后, 产生的与经济, 社会活动密切相关的时空位置概念. PNT …

How humans perceive human-like behavior in video game navigation

E Zuniga, S Milani, G Leroy, J Rzepecki… - CHI Conference on …, 2022 - dl.acm.org
The goal of this paper is to understand how people assess human-likeness in human-and AI-
generated behavior. To this end, we present a qualitative study of hundreds of crowd …

Memory, Space, and Planning: Multiscale Predictive Representations

I Momennejad - arXiv preprint arXiv:2401.09491, 2024 - arxiv.org
Memory is inherently entangled with prediction and planning. Flexible behavior in biological
and artificial agents depends on the interplay of learning from the past and predicting the …

Technical challenges of deploying reinforcement learning agents for game testing in aaa games

J Gillberg, J Bergdahl, A Sestini… - … IEEE Conference on …, 2023 - ieeexplore.ieee.org
Going from research to production, especially for large and complex software systems, is
fundamentally a hard problem. In large-scale game production, one of the main reasons is …