CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community
Artificial Intelligence of Things (AIoT) is an emerging frontier based on the deep fusion of
Internet of Things (IoT) and Artificial Intelligence (AI) technologies. The fundamental goal of …
Internet of Things (IoT) and Artificial Intelligence (AI) technologies. The fundamental goal of …
Multi-agent Cooperation for Computing Power Scheduling in UAVs empowered Aerial Computing Systems
In the paradigm of ubiquitous edge computing, with those advantages, eg, high mobility, fast
response, flexibility and controllability, and low cost of use, Unmanned Aerial Vehicles …
response, flexibility and controllability, and low cost of use, Unmanned Aerial Vehicles …
AccDecoder: Accelerated decoding for neural-enhanced video analytics
The quality of the video stream is key to neural network-based video analytics. However, low-
quality video is inevitably collected by existing surveillance systems because of poor quality …
quality video is inevitably collected by existing surveillance systems because of poor quality …
基于多智能体强化学习的博弈综述
李艺春, 刘泽娇, 洪艺天, 王继超, 王健瑞, 李毅, 唐漾 - 自动化学报, 2024 - aas.net.cn
多智能体强化学习作为博弈论, 控制论和多智能体学习的交叉研究领域, 是多智能体系统研究中
的前沿方向, 赋予了智能体在动态多维的复杂环境中通过交互和决策完成多样化任务的能力 …
的前沿方向, 赋予了智能体在动态多维的复杂环境中通过交互和决策完成多样化任务的能力 …
Decomposing Temporal Equilibrium Strategy for Coordinated Distributed Multi-Agent Reinforcement Learning
The increasing demands for system complexity and robustness have prompted the
integration of temporal logic into Multi-Agent Reinforcement Learning (MARL) to address …
integration of temporal logic into Multi-Agent Reinforcement Learning (MARL) to address …
Accelerated Neural Enhancement for Video Analytics With Video Quality Adaptation
The quality of the video stream is the key to neural network-based video analytics. However,
low-quality video is inevitably collected by existing surveillance systems because of poor …
low-quality video is inevitably collected by existing surveillance systems because of poor …
[PDF][PDF] A Delay-Aware DRL-Based Environment for Cooperative Multi-UAV Systems in Multi-Purpose Scenarios.
We provide a customizable environment based on Deep Reinforcement Learning (DRL)
strategies for handling cooperative multi-UAV (Unmanned Aerial Vehicles) scenarios when …
strategies for handling cooperative multi-UAV (Unmanned Aerial Vehicles) scenarios when …
Learning to Communicate Strategically for Efficient Collective Intelligence
Learning to communicate (L2C) involves learning how, when, and with whom to
communicate to enhance cooperation among agents under limited bandwidth. However …
communicate to enhance cooperation among agents under limited bandwidth. However …
Resolving Action Delay: Multi-agent Reinforcement Learning Based on State Prediction
F Wang, H Zhang, Y Zhang - Chinese Intelligent Systems Conference, 2024 - Springer
Action delay is very common in practical applications and can significantly impact the
effectiveness of reinforcement learning, especially in multi-agent scenarios, due to the lag in …
effectiveness of reinforcement learning, especially in multi-agent scenarios, due to the lag in …
Centralized curiosity model-based synchronization control for robotics teleoperator under input saturation and time delay
F Wang, T Wang, J Luo, X Li, F Guo… - … Conference on Control …, 2024 - spiedigitallibrary.org
This paper investigates the state synchronization control problem of a teleoperation system
for actuators based on deep reinforcement learning, taking into account input saturation …
for actuators based on deep reinforcement learning, taking into account input saturation …