Deep reinforcement learning with macro-actions

L Xiao, D Jiang, D Xu, H Zhu, Y Zhang… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org

By using smart radio devices, a jammer can dynamically change its jamming policy based
on opposing security mechanisms; it can even induce the mobile device to enter a specific …

被引用次数：129 相关文章所有 7 个版本

[PDF] arxiv.org

Learning to repeat: Fine grained action repetition for deep reinforcement learning

S Sharma, A Srinivas, B Ravindran - arXiv preprint arXiv:1702.06054, 2017 - arxiv.org

Reinforcement Learning algorithms can learn complex behavioral patterns for sequential
decision making tasks wherein an agent interacts with an environment and acquires …

被引用次数：123 相关文章所有 3 个版本

[PDF] ieee.org

A deep hierarchical reinforcement learning algorithm in partially observable Markov decision processes

TP Le, NA Vien, TC Chung - Ieee Access, 2018 - ieeexplore.ieee.org

In recent years, reinforcement learning (RL) has achieved remarkable success due to the
growing adoption of deep learning techniques and the rapid growth of computing power …

被引用次数：75 相关文章所有 11 个版本

[PDF] arxiv.org

An analysis of frame-skipping in reinforcement learning

S Kalyanakrishnan, S Aravindan, V Bagdawat… - arXiv preprint arXiv …, 2021 - arxiv.org

In the practice of sequential decision making, agents are often designed to sense state at
regular intervals of $ d $ time steps, $ d> 1$, ignoring state information in between sensing …

被引用次数：21 相关文章所有 3 个版本

[PDF] openreview.net

Continuous control on time

T Ni, E Jang - ICLR 2022 Workshop on Generalizable Policy …, 2022 - openreview.net

The physical world evolves continuously in time. Most prior works on reinforcement learning
cast continuous-time environments into a discrete-time Markov Decision Process (MDP), by …

被引用次数：7 相关文章

[PDF] aaai.org

Addressing action oscillations through learning policy inertia

C Chen, H Tang, J Hao, W Liu, Z Meng - Proceedings of the AAAI …, 2021 - ojs.aaai.org

Deep reinforcement learning (DRL) algorithms have been demonstrated to be effective on a
wide range of challenging decision making and control tasks. However, these methods …

被引用次数：13 相关文章所有 5 个版本

[PDF] mlr.press

Disentangled representations for sequence data using information bottleneck principle

M Yamada, H Kim, K Miyoshi, T Iwata… - Asian Conference on …, 2020 - proceedings.mlr.press

We propose the factorizing variational autoencoder (FAVAE), a generative model for
learning dis-entangled representations from sequential data via the information bottleneck …

被引用次数：14 相关文章所有 3 个版本

Composing Synergistic Macro Actions for Reinforcement Learning Agents

YM Chen, KY Chang, C Liu, TC Hsiao… - … on Neural Networks …, 2022 - ieeexplore.ieee.org

Macro actions have been demonstrated to be beneficial for the learning processes of an
agent and have encouraged a variety of techniques to be developed for constructing more …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

DRL: Deep Reinforcement Learning for Intelligent Robot Control--Concept, Literature, and Future

A Dargazany - arXiv preprint arXiv:2105.13806, 2021 - arxiv.org

Combination of machine learning (for generating machine intelligence), computer vision (for
better environment perception), and robotic systems (for controlled environment interaction) …

被引用次数：9 相关文章所有 3 个版本

[PDF] arxiv.org

Action abstractions for amortized sampling

O Boussif, LN Ezzine, JD Viviano, M Koziarski… - arXiv preprint arXiv …, 2024 - arxiv.org

As trajectories sampled by policies used by reinforcement learning (RL) and generative flow
networks (GFlowNets) grow longer, credit assignment and exploration become more …