Reinforcement learning for problems with hidden state

T Park, N Abuzainab, W Saad - IEEE Access, 2016 - ieeexplore.ieee.org

For a seamless deployment of the Internet of Things (IoT), there is a need for self-organizing
solutions to overcome key IoT challenges that include data processing, resource …

被引用次数：161 相关文章所有 10 个版本

[PDF] researchgate.net

The misbehavior of reinforcement learning

G Mongillo, H Shteingart… - Proceedings of the …, 2014 - ieeexplore.ieee.org

Organisms modify their behavior in response to its consequences, a phenomenon referred
to as operant learning. The computational principles and neural mechanisms underlying …

被引用次数：42 相关文章所有 11 个版本

Pareto: Fair congestion control with online reinforcement learning

S Emara, F Wang, B Li, T Zeyl - IEEE Transactions on Network …, 2022 - ieeexplore.ieee.org

Modern-day computer networks are highly diverse and dynamic, calling for fair and adaptive
network congestion control algorithms with the objective of achieving the best possible …

被引用次数：19 相关文章

Lore a red team emulation tool

H Holm - IEEE Transactions on Dependable and Secure …, 2022 - ieeexplore.ieee.org

This article presents the red team emulation tool Lore, which uses boolean logic and trained
models to automatically select and execute red team actions. Lore improves the current state …

被引用次数：21 相关文章所有 3 个版本

[PDF] fei.edu.br

Heuristic reinforcement learning applied to robocup simulation agents

LA Celiberto, CHC Ribeiro, AHR Costa… - RoboCup 2007: Robot …, 2008 - Springer

This paper describes the design and implementation of robotic agents for the RoboCup
Simulation 2D category that learns using a recently proposed Heuristic Reinforcement …

被引用次数：54 相关文章所有 14 个版本

A reinforcement learning framework for online data migration in hierarchical storage systems

D Vengerov - The Journal of Supercomputing, 2008 - Springer

Multi-tier storage systems are becoming more and more widespread in the industry. They
have more tunable parameters and built-in policies than traditional storage systems, and an …

被引用次数：40 相关文章所有 8 个版本

[图书][B] Qualitative spatial abstraction in reinforcement learning

L Frommberger - 2010 - books.google.com

Reinforcement learning has developed as a successful learning approach for domains that
are not fully understood and that are too complex to be described in closed form. However …

被引用次数：22 相关文章所有 7 个版本

Learning to weigh competing moral motivations.

O FeldmanHall, A Lamba - 2023 - psycnet.apa.org

This chapter presents the perspective that one promising avenue for understanding how
humans learn to weigh competing moral motivations is by decomposing the moral inference …

被引用次数：1 相关文章所有 2 个版本

[PDF] academia.edu

[图书][B] Learning and solving partially observable markov decision processes

G Shani - 2007 - academia.edu

Abstract Partially Observable Markov Decision Processes (POMDPs) provide a rich
representation for agents acting in a stochastic domain under partial observability. POMDPs …

被引用次数：21 相关文章所有 3 个版本

[PDF] researchgate.net

Adaptive approach for the regulation of a mobile agent population in a distributed network

M Bakhouya, J Gaber - 2006 Fifth International Symposium on …, 2006 - ieeexplore.ieee.org

Mobile agent is a program that can migrate from a machine to another in a network and
perform tasks on machines that provide agent hosting capability. The agent can clone itself …

被引用次数：25 相关文章所有 7 个版本