Two-dimensional antijamming mobile communication based on reinforcement learning
By using smart radio devices, a jammer can dynamically change its jamming policy based
on opposing security mechanisms; it can even induce the mobile device to enter a specific …
on opposing security mechanisms; it can even induce the mobile device to enter a specific …
Learning to repeat: Fine grained action repetition for deep reinforcement learning
Reinforcement Learning algorithms can learn complex behavioral patterns for sequential
decision making tasks wherein an agent interacts with an environment and acquires …
decision making tasks wherein an agent interacts with an environment and acquires …
A deep hierarchical reinforcement learning algorithm in partially observable Markov decision processes
In recent years, reinforcement learning (RL) has achieved remarkable success due to the
growing adoption of deep learning techniques and the rapid growth of computing power …
growing adoption of deep learning techniques and the rapid growth of computing power …
An analysis of frame-skipping in reinforcement learning
S Kalyanakrishnan, S Aravindan, V Bagdawat… - arXiv preprint arXiv …, 2021 - arxiv.org
In the practice of sequential decision making, agents are often designed to sense state at
regular intervals of $ d $ time steps, $ d> 1$, ignoring state information in between sensing …
regular intervals of $ d $ time steps, $ d> 1$, ignoring state information in between sensing …
Continuous control on time
The physical world evolves continuously in time. Most prior works on reinforcement learning
cast continuous-time environments into a discrete-time Markov Decision Process (MDP), by …
cast continuous-time environments into a discrete-time Markov Decision Process (MDP), by …
Addressing action oscillations through learning policy inertia
Deep reinforcement learning (DRL) algorithms have been demonstrated to be effective on a
wide range of challenging decision making and control tasks. However, these methods …
wide range of challenging decision making and control tasks. However, these methods …
Disentangled representations for sequence data using information bottleneck principle
We propose the factorizing variational autoencoder (FAVAE), a generative model for
learning dis-entangled representations from sequential data via the information bottleneck …
learning dis-entangled representations from sequential data via the information bottleneck …
Composing Synergistic Macro Actions for Reinforcement Learning Agents
Macro actions have been demonstrated to be beneficial for the learning processes of an
agent and have encouraged a variety of techniques to be developed for constructing more …
agent and have encouraged a variety of techniques to be developed for constructing more …
DRL: Deep Reinforcement Learning for Intelligent Robot Control--Concept, Literature, and Future
A Dargazany - arXiv preprint arXiv:2105.13806, 2021 - arxiv.org
Combination of machine learning (for generating machine intelligence), computer vision (for
better environment perception), and robotic systems (for controlled environment interaction) …
better environment perception), and robotic systems (for controlled environment interaction) …
Action abstractions for amortized sampling
As trajectories sampled by policies used by reinforcement learning (RL) and generative flow
networks (GFlowNets) grow longer, credit assignment and exploration become more …
networks (GFlowNets) grow longer, credit assignment and exploration become more …