Learning how to communicate in the Internet of Things: Finite resources and heterogeneity
For a seamless deployment of the Internet of Things (IoT), there is a need for self-organizing
solutions to overcome key IoT challenges that include data processing, resource …
solutions to overcome key IoT challenges that include data processing, resource …
The misbehavior of reinforcement learning
G Mongillo, H Shteingart… - Proceedings of the …, 2014 - ieeexplore.ieee.org
Organisms modify their behavior in response to its consequences, a phenomenon referred
to as operant learning. The computational principles and neural mechanisms underlying …
to as operant learning. The computational principles and neural mechanisms underlying …
Pareto: Fair congestion control with online reinforcement learning
Modern-day computer networks are highly diverse and dynamic, calling for fair and adaptive
network congestion control algorithms with the objective of achieving the best possible …
network congestion control algorithms with the objective of achieving the best possible …
Lore a red team emulation tool
H Holm - IEEE Transactions on Dependable and Secure …, 2022 - ieeexplore.ieee.org
This article presents the red team emulation tool Lore, which uses boolean logic and trained
models to automatically select and execute red team actions. Lore improves the current state …
models to automatically select and execute red team actions. Lore improves the current state …
Heuristic reinforcement learning applied to robocup simulation agents
This paper describes the design and implementation of robotic agents for the RoboCup
Simulation 2D category that learns using a recently proposed Heuristic Reinforcement …
Simulation 2D category that learns using a recently proposed Heuristic Reinforcement …
A reinforcement learning framework for online data migration in hierarchical storage systems
D Vengerov - The Journal of Supercomputing, 2008 - Springer
Multi-tier storage systems are becoming more and more widespread in the industry. They
have more tunable parameters and built-in policies than traditional storage systems, and an …
have more tunable parameters and built-in policies than traditional storage systems, and an …
[图书][B] Qualitative spatial abstraction in reinforcement learning
L Frommberger - 2010 - books.google.com
Reinforcement learning has developed as a successful learning approach for domains that
are not fully understood and that are too complex to be described in closed form. However …
are not fully understood and that are too complex to be described in closed form. However …
Learning to weigh competing moral motivations.
O FeldmanHall, A Lamba - 2023 - psycnet.apa.org
This chapter presents the perspective that one promising avenue for understanding how
humans learn to weigh competing moral motivations is by decomposing the moral inference …
humans learn to weigh competing moral motivations is by decomposing the moral inference …
[图书][B] Learning and solving partially observable markov decision processes
G Shani - 2007 - academia.edu
Abstract Partially Observable Markov Decision Processes (POMDPs) provide a rich
representation for agents acting in a stochastic domain under partial observability. POMDPs …
representation for agents acting in a stochastic domain under partial observability. POMDPs …
Adaptive approach for the regulation of a mobile agent population in a distributed network
M Bakhouya, J Gaber - 2006 Fifth International Symposium on …, 2006 - ieeexplore.ieee.org
Mobile agent is a program that can migrate from a machine to another in a network and
perform tasks on machines that provide agent hosting capability. The agent can clone itself …
perform tasks on machines that provide agent hosting capability. The agent can clone itself …