Michał Bortkiewicz
Verified email at pw.edu.pl
Title
Cited by
Year
Overestimation, overfitting, and plasticity in actor-critic: the bitter lesson of reinforcement learning
M Nauman, M Bortkiewicz, M Ostaszewski, P Miłoś, T Trzciński, M Cygan
arXiv preprint arXiv:2403.00514, 2024
Cited by 4 · 2024
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
M Wołczyk, B Cupiał, M Ostaszewski, M Bortkiewicz, M Zając, R Pascanu, ...
arXiv preprint arXiv:2402.02868, 2024
Cited by 3 · 2024
The effectiveness of world models for continual reinforcement learning
S Kessler, M Ostaszewski, MP Bortkiewicz, M Żarski, M Wolczyk, ...
Conference on Lifelong Learning Agents, 184-204, 2023
Cited by 2 · 2023
Subgoal Reachability in Goal Conditioned Hierarchical Reinforcement Learning
M Bortkiewicz, J Łyskawa, P Wawrzyński, M Ostaszewski, A Grudkowski, ...
16th International Conference on Agents and Artificial Intelligence, 2024
Cited by 1 · 2024
The Role of Forgetting in Fine-Tuning Reinforcement Learning Models
M Wolczyk, B Cupiał, M Ostaszewski, M Bortkiewicz, M Zając, R Pascanu, ...
Cited by 1 · 2023
Progressive Latent Replay for Efficient Generative Rehearsal
S Pawlak, F Szatkowski, M Bortkiewicz, J Dubiński, T Trzciński
International Conference on Neural Information Processing, 457-467, 2022
Cited by 1 · 2022
Accelerating Goal-Conditioned RL Algorithms and Research
M Bortkiewicz, W Pałucki, V Myers, T Dziarmaga, T Arczewski, Ł Kuciński, ...
arXiv preprint arXiv:2408.11052, 2024
2024
Emergency action termination for immediate reaction in hierarchical reinforcement learning
M Bortkiewicz, J Łyskawa, P Wawrzyński, M Ostaszewski, A Grudkowski, ...
arXiv preprint arXiv:2211.06351, 2022
2022
Multisensor data fusion from sensors with different sampling time
MP Bortkiewicz
Instytut Techniki Lotniczej i Mechaniki Stosowanej, 2019
2019
Articles 1–9