The impact of task underspecification in evaluating deep reinforcement learning

C Benjamins, T Eimer, F Schubert, A Mohan… - arXiv preprint arXiv …, 2022 - arxiv.org

While Reinforcement Learning (RL) has made great strides towards solving increasingly
complicated problems, many algorithms are still brittle to even slight environmental changes …

被引用次数：39 相关文章所有 8 个版本

[PDF] arxiv.org

Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale

V Jayawardana, B Freydt, A Qu, C Hickert… - arXiv preprint arXiv …, 2024 - arxiv.org

The sheer scale and diversity of transportation make it a formidable sector to decarbonize.
Here, we consider an emerging opportunity to reduce carbon emissions: the growing …

被引用次数：1 相关文章所有 2 个版本

[PDF] aaai.org

Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

L Da, P Jenkins, T Schwantes, J Dotson… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

In practice, it is essential to compare and rank candidate policies offline before real-world
deployment for safety and reliability. Prior work seeks to solve this offline policy ranking …

What is a typical signalized intersection in a city? A pipeline for intersection data imputation from OpenStreetMap

A Qu, A Valiveru, C Tang, V Jayawardana… - arXiv preprint arXiv …, 2024 - arxiv.org

Signalized intersections, arguably the most complicated type of traffic scenario, are essential
to urban mobility systems. With recent advancements in intelligent transportation …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Metacognitive AI: Framework and the Case for a Neurosymbolic Approach

H Wei, P Shakarian, C Lebiere, B Draper… - … Conference on Neural …, 2024 - Springer

Metacognition is the concept of reasoning about an agent's own internal processes and was
originally introduced in the field of developmental psychology. In this position paper, we …

被引用次数：2 相关文章

[PDF] arxiv.org

Model-Free Learning of Corridor Clearance: A Near-Term Deployment Perspective

D Suo, V Jayawardana, C Wu - IEEE Transactions on Intelligent …, 2024 - ieeexplore.ieee.org

An emerging public health application of connected and automated vehicle (CAV)
technologies is to reduce response times of emergency medical service (EMS) by indirectly …

被引用次数：2 相关文章所有 4 个版本

[PDF] arxiv.org

Temporal Transfer Learning for Traffic Optimization with Coarse-grained Advisory Autonomy

JH Cho, S Li, J Kim, C Wu - arXiv preprint arXiv:2312.09436, 2023 - arxiv.org

The recent development of connected and automated vehicle (CAV) technologies has
spurred investigations to optimize dense urban traffic. This paper considers advisory …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

IntersectionZoo: Eco-driving for Benchmarking Multi-Agent Contextual Reinforcement Learning

V Jayawardana, B Freydt, A Qu, C Hickert… - arXiv preprint arXiv …, 2024 - arxiv.org

Despite the popularity of multi-agent reinforcement learning (RL) in simulated and two-
player applications, its success in messy real-world applications has been limited. A key …

[PDF] arxiv.org

Model-Based Transfer Learning for Contextual Reinforcement Learning

JH Cho, V Jayawardana, S Li, C Wu - arXiv preprint arXiv:2408.04498, 2024 - arxiv.org

Deep reinforcement learning is a powerful approach to complex decision making. However,
one issue that limits its practical application is its brittleness, sometimes failing to train in the …

Generalizing Cooperative Eco-driving via Multi-residual Task Learning

V Jayawardana, S Li, C Wu, Y Farid… - arXiv preprint arXiv …, 2024 - arxiv.org

Conventional control, such as model-based control, is commonly utilized in autonomous
driving due to its efficiency and reliability. However, real-world autonomous driving contends …

被引用次数：5 相关文章所有 4 个版本