Multimodal model-agnostic meta-learning via task-aware modulation R Vuorio, SH Sun, H Hu, JJ Lim Advances in neural information processing systems 32, 2019 | 277 | 2019 |
A survey of meta-reinforcement learning J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson arXiv preprint arXiv:2301.08028, 2023 | 152 | 2023 |
Deep reinforcement learning for multi-driver vehicle dispatching and repositioning problem J Holler, R Vuorio, Z Qin, X Tang, Y Jiao, T Jin, S Singh, C Wang, J Ye 2019 IEEE International Conference on Data Mining (ICDM), 1090-1095, 2019 | 132 | 2019 |
Meta continual learning R Vuorio, DY Cho, D Kim, J Kim arXiv preprint arXiv:1806.06928, 2018 | 36 | 2018 |
Toward multimodal model-agnostic meta-learning R Vuorio, SH Sun, H Hu, JJ Lim arXiv preprint arXiv:1812.07172, 2018 | 32 | 2018 |
Hypernetworks in meta-reinforcement learning J Beck, MT Jackson, R Vuorio, S Whiteson Conference on Robot Learning, 1478-1487, 2023 | 31 | 2023 |
On the practical consistency of meta-reinforcement learning algorithms Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson arXiv preprint arXiv:2112.00478, 2021 | 12 | 2021 |
Discovering general reinforcement learning algorithms with adversarial environment design MT Jackson, M Jiang, J Parker-Holder, R Vuorio, C Lu, G Farquhar, ... Advances in Neural Information Processing Systems 36, 79980-79998, 2023 | 9 | 2023 |
Adaptive pairwise weights for temporal credit assignment Z Zheng, R Vuorio, R Lewis, S Singh Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022 | 8* | 2022 |
Deconfounded imitation learning R Vuorio, P De Haan, J Brehmer, H Ackermann, D Dijkman, T Cohen Deep Reinforcement Learning Workshop NeurIPS 2022, 2022 | 6 | 2022 |
No DICE: An investigation of the bias-variance tradeoff in meta-gradients R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson Deep RL Workshop NeurIPS 2021, 2021 | 6 | 2021 |
Learning state representations from random deep action-conditional predictions Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh Advances in Neural Information Processing Systems 34, 23679-23691, 2021 | 6 | 2021 |
Recurrent hypernetworks are surprisingly strong in meta-RL J Beck, R Vuorio, Z Xiong, S Whiteson Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
SplAgger: Split Aggregation for Meta-Reinforcement Learning J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson arXiv preprint arXiv:2403.03020, 2024 | 1 | 2024 |
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson arXiv preprint arXiv:2402.06570, 2024 | 1 | 2024 |
Deconfounding Imitation Learning with Variational Inference R Vuorio, P De Haan, J Brehmer, H Ackermann, D Dijkman, T Cohen Transactions on Machine Learning Research, 2024 | 1 | 2024 |
IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving C Grislain, R Vuorio, C Lu, S Whiteson arXiv preprint arXiv:2411.04653, 2024 | | 2024 |
A Bayesian Solution To The Imitation Gap R Vuorio, M Fellows, C Lu, C Grislain, S Whiteson arXiv preprint arXiv:2407.00495, 2024 | | 2024 |
System and process for deconfounded imitation learning R Vuorio, DE Pim, JH Brehmer, H Ackermann, TS Cohen, DHF Dijkman US Patent App. 18/459,258, 2024 | | 2024 |
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar arXiv preprint arXiv:2209.11303, 2022 | | 2022 |