Multimodal model-agnostic meta-learning via task-aware modulation | R Vuorio, SH Sun, H Hu, JJ Lim | Advances in neural information processing systems 32, 2019 | 261 | 2019 |
Deep reinforcement learning for multi-driver vehicle dispatching and repositioning problem | J Holler, R Vuorio, Z Qin, X Tang, Y Jiao, T Jin, S Singh, C Wang, J Ye | 2019 IEEE International Conference on Data Mining (ICDM), 1090-1095, 2019 | 118 | 2019 |
A survey of meta-reinforcement learning | J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson | arXiv preprint arXiv:2301.08028, 2023 | 98 | 2023 |
Meta continual learning | R Vuorio, DY Cho, D Kim, J Kim | arXiv preprint arXiv:1806.06928, 2018 | 33 | 2018 |
Toward multimodal model-agnostic meta-learning | R Vuorio, SH Sun, H Hu, JJ Lim | arXiv preprint arXiv:1812.07172, 2018 | 30 | 2018 |
Hypernetworks in meta-reinforcement learning | J Beck, MT Jackson, R Vuorio, S Whiteson | Conference on Robot Learning, 1478-1487, 2023 | 24 | 2023 |
On the practical consistency of meta-reinforcement learning algorithms | Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson | arXiv preprint arXiv:2112.00478, 2021 | 9 | 2021 |
Adaptive pairwise weights for temporal credit assignment | Z Zheng, R Vuorio, R Lewis, S Singh | Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022 | 7* | 2022 |
Learning state representations from random deep action-conditional predictions | Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh | Advances in Neural Information Processing Systems 34, 23679-23691, 2021 | 6 | 2021 |
Deconfounded imitation learning | R Vuorio, J Brehmer, H Ackermann, D Dijkman, T Cohen, P de Haan | arXiv preprint arXiv:2211.02667, 2022 | 5 | 2022 |
No DICE: An investigation of the bias-variance tradeoff in meta-gradients | R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson | Deep RL Workshop NeurIPS 2021, 2021 | 5 | 2021 |
Discovering general reinforcement learning algorithms with adversarial environment design | MT Jackson, M Jiang, J Parker-Holder, R Vuorio, C Lu, G Farquhar, ... | Advances in Neural Information Processing Systems 36, 2024 | 4 | 2024 |
Recurrent hypernetworks are surprisingly strong in meta-RL | J Beck, R Vuorio, Z Xiong, S Whiteson | Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
System and process for deconfounded imitation learning | R Vuorio, DE Pim, JH Brehmer, H Ackermann, TS Cohen, DHF Dijkman | US Patent App. 18/459,258, 2024 | | 2024 |
SplAgger: Split Aggregation for Meta-Reinforcement Learning | J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson | arXiv preprint arXiv:2403.03020, 2024 | | 2024 |
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control | Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson | arXiv preprint arXiv:2402.06570, 2024 | | 2024 |
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients | R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar | arXiv preprint arXiv:2209.11303, 2022 | | 2022 |
Supplementary Material of Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation | R Vuorio, SH Sun, H Hu, JJ Lim, C Baselines | | | |
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials | J Beck, M Jackson, R Vuorio, S Whiteson | | | |
Model-Agnostic Meta-Learning for Multimodal Task Distributions | R Vuorio, SH Sun, H Hu, JJ Lim | | | |