Overestimation, overfitting, and plasticity in actor-critic: the bitter lesson of reinforcement learning M Nauman, M Bortkiewicz, M Ostaszewski, P Miłoś, T Trzciński, M Cygan arXiv preprint arXiv:2403.00514, 2024 | 4 | 2024 |
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem M Wołczyk, B Cupiał, M Ostaszewski, M Bortkiewicz, M Zając, R Pascanu, ... arXiv preprint arXiv:2402.02868, 2024 | 3 | 2024 |
The effectiveness of world models for continual reinforcement learning S Kessler, M Ostaszewski, MP Bortkiewicz, M Żarski, M Wołczyk, ... Conference on Lifelong Learning Agents, 184-204, 2023 | 2 | 2023 |
Subgoal Reachability in Goal Conditioned Hierarchical Reinforcement Learning M Bortkiewicz, J Łyskawa, P Wawrzyński, M Ostaszewski, A Grudkowski, ... 16th International Conference on Agents and Artificial Intelligence, 2024 | 1 | 2024 |
The Role of Forgetting in Fine-Tuning Reinforcement Learning Models M Wołczyk, B Cupiał, M Ostaszewski, M Bortkiewicz, M Zając, R Pascanu, ... | 1 | 2023 |
Progressive Latent Replay for Efficient Generative Rehearsal S Pawlak, F Szatkowski, M Bortkiewicz, J Dubiński, T Trzciński International Conference on Neural Information Processing, 457-467, 2022 | 1 | 2022 |
Accelerating Goal-Conditioned RL Algorithms and Research M Bortkiewicz, W Pałucki, V Myers, T Dziarmaga, T Arczewski, Ł Kuciński, ... arXiv preprint arXiv:2408.11052, 2024 | | 2024 |
Emergency action termination for immediate reaction in hierarchical reinforcement learning M Bortkiewicz, J Łyskawa, P Wawrzyński, M Ostaszewski, A Grudkowski, ... arXiv preprint arXiv:2211.06351, 2022 | | 2022 |
Multisensor data fusion from sensors with different sampling time MP Bortkiewicz Instytut Techniki Lotniczej i Mechaniki Stosowanej, 2019 | | 2019 |