Two-dimensional antijamming mobile communication based on reinforcement learning

L Xiao, D Jiang, D Xu, H Zhu, Y Zhang… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
By using smart radio devices, a jammer can dynamically change its jamming policy based
on opposing security mechanisms; it can even induce the mobile device to enter a specific …

Learning to repeat: Fine grained action repetition for deep reinforcement learning

S Sharma, A Srinivas, B Ravindran - arXiv preprint arXiv:1702.06054, 2017 - arxiv.org
Reinforcement Learning algorithms can learn complex behavioral patterns for sequential
decision making tasks wherein an agent interacts with an environment and acquires …

A deep hierarchical reinforcement learning algorithm in partially observable Markov decision processes

TP Le, NA Vien, TC Chung - Ieee Access, 2018 - ieeexplore.ieee.org
In recent years, reinforcement learning (RL) has achieved remarkable success due to the
growing adoption of deep learning techniques and the rapid growth of computing power …

An analysis of frame-skipping in reinforcement learning

S Kalyanakrishnan, S Aravindan, V Bagdawat… - arXiv preprint arXiv …, 2021 - arxiv.org
In the practice of sequential decision making, agents are often designed to sense state at
regular intervals of $ d $ time steps, $ d> 1$, ignoring state information in between sensing …

Continuous control on time

T Ni, E Jang - ICLR 2022 Workshop on Generalizable Policy …, 2022 - openreview.net
The physical world evolves continuously in time. Most prior works on reinforcement learning
cast continuous-time environments into a discrete-time Markov Decision Process (MDP), by …

Addressing action oscillations through learning policy inertia

C Chen, H Tang, J Hao, W Liu, Z Meng - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Deep reinforcement learning (DRL) algorithms have been demonstrated to be effective on a
wide range of challenging decision making and control tasks. However, these methods …

Disentangled representations for sequence data using information bottleneck principle

M Yamada, H Kim, K Miyoshi, T Iwata… - Asian Conference on …, 2020 - proceedings.mlr.press
We propose the factorizing variational autoencoder (FAVAE), a generative model for
learning dis-entangled representations from sequential data via the information bottleneck …

Composing Synergistic Macro Actions for Reinforcement Learning Agents

YM Chen, KY Chang, C Liu, TC Hsiao… - … on Neural Networks …, 2022 - ieeexplore.ieee.org
Macro actions have been demonstrated to be beneficial for the learning processes of an
agent and have encouraged a variety of techniques to be developed for constructing more …

DRL: Deep Reinforcement Learning for Intelligent Robot Control--Concept, Literature, and Future

A Dargazany - arXiv preprint arXiv:2105.13806, 2021 - arxiv.org
Combination of machine learning (for generating machine intelligence), computer vision (for
better environment perception), and robotic systems (for controlled environment interaction) …

Action abstractions for amortized sampling

O Boussif, LN Ezzine, JD Viviano, M Koziarski… - arXiv preprint arXiv …, 2024 - arxiv.org
As trajectories sampled by policies used by reinforcement learning (RL) and generative flow
networks (GFlowNets) grow longer, credit assignment and exploration become more …