作者
Peter Gunnarson, Ioannis Mandralis, Guido Novati, Petros Koumoutsakos, John O Dabiri
发表日期
2021/12/8
期刊
Nature communications
卷号
12
期号
1
页码范围
7143
出版商
Nature Publishing Group UK
简介
Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques. Here, we apply a recently introduced Reinforcement Learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through unsteady two-dimensional flow fields. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer’s actions, and deploying Remember and Forget Experience Replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the sensed environmental cue. Surprisingly, a velocity sensing approach significantly outperformed a bio …
引用总数
学术搜索中的文章
P Gunnarson, I Mandralis, G Novati, P Koumoutsakos… - Nature communications, 2021