关注
Pierluca D'Oro
Pierluca D'Oro
Mila & Meta
在 mila.quebec 的电子邮件经过验证
标题
引用次数
引用次数
年份
The primacy bias in deep reinforcement learning
E Nikishin*, M Schwarzer*, P D’Oro*, PL Bacon, A Courville
International conference on machine learning, 16828-16847, 2022
1392022
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier
P D'Oro*, M Schwarzer*, E Nikishin, PL Bacon, MG Bellemare, A Courville
International Conference on Learning Representations (ICLR), 𝐍𝐨𝐭𝐚𝐛𝐥𝐞-𝐭𝐨𝐩-𝟓%, 2023
682023
Gradient-Aware Model-based Policy Search
P D'Oro*, AM Metelli*, A Tirinzoni, M Papini, M Restelli
The Thirty-Fourth AAAI Conference on Artificial Intelligence, 3801-3808, 2020
452020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P D'Oro, W Jaśkowski
Advances in Neural Information Processing Systems 34, 2020
322020
Motif: Intrinsic Motivation From Artificial Intelligence Feedback
M Klissarov*, P D’Oro*, S Sodhani, R Raileanu, PL Bacon, P Vincent, ...
International Conference on Learning Representations (ICLR), 2024
302024
Adversarial framework for unsupervised learning of motion dynamics in videos
C Spampinato, S Palazzo, P D’Oro, D Giordano, M Shah
International Journal of Computer Vision, 1-20, 2019
27*2019
Policy Optimization as Online Learning with Mediator Feedback
AM Metelli*, M Papini*, P D'Oro, M Restelli
The Thirty-Fifth AAAI Conference on Artificial Intelligence, 8958-8966, 2021
132021
Group Anomaly Detection via Graph Autoencoders
P D’Oro, E Nasca, J Masci, M Matteucci
NeurIPS Graph Representation Learning Workshop, 2019
92019
Real-time Classification from Short Event-Camera Streams using Input-filtering Neural ODEs
G Giannone, A Anoosheh, A Quaglino, P D'Oro, M Gallieri, J Masci
NeurIPS workshop on Interpretable Inductive Biases and Physically Structured …, 2020
82020
SMfinder: Small Molecules Finder for Metabolomics and Lipidomics analysis
G Martano, M Leone, P D'Oro, V Matafora, A Cattaneo, M Masseroli, ...
Analytical Chemistry, 2020
62020
Long-Term Credit Assignment via Model-based Temporal Shortcuts
M Ma, P D'Oro, Y Bengio, PL Bacon
Deep RL Workshop NeurIPS, 2021
52021
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
N Rahn*, P D'Oro*, H Wiltzer, PL Bacon, MG Bellemare
Advances in Neural Information Processing Systems 37, 2023
32023
Meta Dynamic Programming
P D’Oro, PL Bacon
NeurIPS Workshop on Metacognition in the Age of AI: Challenges and Opportunities, 2021
22021
Controlling Large Language Model Agents with Entropic Activation Steering
N Rahn, P D'Oro, MG Bellemare
arXiv preprint arXiv:2406.00244, 2024
2024
Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
S Dufort-Labbé, P D'Oro, E Nikishin, R Pascanu, PL Bacon, A Baratin
arXiv preprint arXiv:2403.07688, 2024
2024
The Curse of Diversity in Ensemble-Based Exploration
Z Lin, P D'Oro, E Nikishin, A Courville
International Conference on Learning Representations (ICLR), 2024
2024
Do Transformer World Models Give Better Policy Gradients?
M Ma*, T Ni, C Gehring, P D'Oro*, PL Bacon
ICLR Workshop on Generative Models for Decision Making, 𝐎𝐫𝐚𝐥, 2024
2024
Unleashing The Potential of Data Sharing in Ensemble Deep Reinforcement Learning
Z Lin, P D'Oro, E Nikishin, A Courville
Deep Reinforcement Learning Workshop @ NeurIPS, 2022
2022
Beyond maximum likelihood model estimation in model-based policy search
P D’Oro
Politecnico di Milano Digital Archive, Italy, 2019
2019
系统目前无法执行此操作,请稍后再试。
文章 1–19