Learning how to communicate in the Internet of Things: Finite resources and heterogeneity

T Park, N Abuzainab, W Saad - IEEE Access, 2016 - ieeexplore.ieee.org
For a seamless deployment of the Internet of Things (IoT), there is a need for self-organizing
solutions to overcome key IoT challenges that include data processing, resource …

The misbehavior of reinforcement learning

G Mongillo, H Shteingart… - Proceedings of the …, 2014 - ieeexplore.ieee.org
Organisms modify their behavior in response to its consequences, a phenomenon referred
to as operant learning. The computational principles and neural mechanisms underlying …

Pareto: Fair congestion control with online reinforcement learning

S Emara, F Wang, B Li, T Zeyl - IEEE Transactions on Network …, 2022 - ieeexplore.ieee.org
Modern-day computer networks are highly diverse and dynamic, calling for fair and adaptive
network congestion control algorithms with the objective of achieving the best possible …

Lore a red team emulation tool

H Holm - IEEE Transactions on Dependable and Secure …, 2022 - ieeexplore.ieee.org
This article presents the red team emulation tool Lore, which uses boolean logic and trained
models to automatically select and execute red team actions. Lore improves the current state …

Heuristic reinforcement learning applied to robocup simulation agents

LA Celiberto, CHC Ribeiro, AHR Costa… - RoboCup 2007: Robot …, 2008 - Springer
This paper describes the design and implementation of robotic agents for the RoboCup
Simulation 2D category that learns using a recently proposed Heuristic Reinforcement …

A reinforcement learning framework for online data migration in hierarchical storage systems

D Vengerov - The Journal of Supercomputing, 2008 - Springer
Multi-tier storage systems are becoming more and more widespread in the industry. They
have more tunable parameters and built-in policies than traditional storage systems, and an …

[图书][B] Qualitative spatial abstraction in reinforcement learning

L Frommberger - 2010 - books.google.com
Reinforcement learning has developed as a successful learning approach for domains that
are not fully understood and that are too complex to be described in closed form. However …

Learning to weigh competing moral motivations.

O FeldmanHall, A Lamba - 2023 - psycnet.apa.org
This chapter presents the perspective that one promising avenue for understanding how
humans learn to weigh competing moral motivations is by decomposing the moral inference …

[图书][B] Learning and solving partially observable markov decision processes

G Shani - 2007 - academia.edu
Abstract Partially Observable Markov Decision Processes (POMDPs) provide a rich
representation for agents acting in a stochastic domain under partial observability. POMDPs …

Adaptive approach for the regulation of a mobile agent population in a distributed network

M Bakhouya, J Gaber - 2006 Fifth International Symposium on …, 2006 - ieeexplore.ieee.org
Mobile agent is a program that can migrate from a machine to another in a network and
perform tasks on machines that provide agent hosting capability. The agent can clone itself …