The primacy bias in deep reinforcement learning E Nikishin*, M Schwarzer*, P D’Oro*, PL Bacon, A Courville International conference on machine learning, 16828-16847, 2022 | 139 | 2022 |
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier P D'Oro*, M Schwarzer*, E Nikishin, PL Bacon, MG Bellemare, A Courville International Conference on Learning Representations (ICLR), 𝐍𝐨𝐭𝐚𝐛𝐥𝐞-𝐭𝐨𝐩-𝟓%, 2023 | 68 | 2023 |
Gradient-Aware Model-based Policy Search P D'Oro*, AM Metelli*, A Tirinzoni, M Papini, M Restelli The Thirty-Fourth AAAI Conference on Artificial Intelligence, 3801-3808, 2020 | 45 | 2020 |
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization P D'Oro, W Jaśkowski Advances in Neural Information Processing Systems 34, 2020 | 32 | 2020 |
Motif: Intrinsic Motivation From Artificial Intelligence Feedback M Klissarov*, P D’Oro*, S Sodhani, R Raileanu, PL Bacon, P Vincent, ... International Conference on Learning Representations (ICLR), 2024 | 30 | 2024 |
Adversarial framework for unsupervised learning of motion dynamics in videos C Spampinato, S Palazzo, P D’Oro, D Giordano, M Shah International Journal of Computer Vision, 1-20, 2019 | 27* | 2019 |
Policy Optimization as Online Learning with Mediator Feedback AM Metelli*, M Papini*, P D'Oro, M Restelli The Thirty-Fifth AAAI Conference on Artificial Intelligence, 8958-8966, 2021 | 13 | 2021 |
Group Anomaly Detection via Graph Autoencoders P D’Oro, E Nasca, J Masci, M Matteucci NeurIPS Graph Representation Learning Workshop, 2019 | 9 | 2019 |
Real-time Classification from Short Event-Camera Streams using Input-filtering Neural ODEs G Giannone, A Anoosheh, A Quaglino, P D'Oro, M Gallieri, J Masci NeurIPS workshop on Interpretable Inductive Biases and Physically Structured …, 2020 | 8 | 2020 |
SMfinder: Small Molecules Finder for Metabolomics and Lipidomics analysis G Martano, M Leone, P D'Oro, V Matafora, A Cattaneo, M Masseroli, ... Analytical Chemistry, 2020 | 6 | 2020 |
Long-Term Credit Assignment via Model-based Temporal Shortcuts M Ma, P D'Oro, Y Bengio, PL Bacon Deep RL Workshop NeurIPS, 2021 | 5 | 2021 |
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control N Rahn*, P D'Oro*, H Wiltzer, PL Bacon, MG Bellemare Advances in Neural Information Processing Systems 37, 2023 | 3 | 2023 |
Meta Dynamic Programming P D’Oro, PL Bacon NeurIPS Workshop on Metacognition in the Age of AI: Challenges and Opportunities, 2021 | 2 | 2021 |
Controlling Large Language Model Agents with Entropic Activation Steering N Rahn, P D'Oro, MG Bellemare arXiv preprint arXiv:2406.00244, 2024 | | 2024 |
Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons S Dufort-Labbé, P D'Oro, E Nikishin, R Pascanu, PL Bacon, A Baratin arXiv preprint arXiv:2403.07688, 2024 | | 2024 |
The Curse of Diversity in Ensemble-Based Exploration Z Lin, P D'Oro, E Nikishin, A Courville International Conference on Learning Representations (ICLR), 2024 | | 2024 |
Do Transformer World Models Give Better Policy Gradients? M Ma*, T Ni, C Gehring, P D'Oro*, PL Bacon ICLR Workshop on Generative Models for Decision Making, 𝐎𝐫𝐚𝐥, 2024 | | 2024 |
Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning Z Lin, P D'Oro, E Nikishin, A Courville Deep Reinforcement Learning Workshop @ NeurIPS, 2022 | | 2022 |
Beyond maximum likelihood model estimation in model-based policy search P D’Oro Politecnico di Milano Digital Archive, Italy, 2019 | | 2019 |