Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... Nature 575 (7782), 350-354, 2019 | 4325 | 2019 |
Flamingo: a visual language model for few-shot learning JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... Advances in neural information processing systems 35, 23716-23736, 2022 | 2258 | 2022 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 843 | 2023 |
Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 827 | 2021 |
Improving language models by retrieving from trillions of tokens S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ... International conference on machine learning, 2206-2240, 2022 | 763 | 2022 |
Red teaming language models with language models E Perez, S Huang, F Song, T Cai, R Ring, J Aslanides, A Glaese, ... arXiv preprint arXiv:2202.03286, 2022 | 342 | 2022 |
The DeepMind JAX Ecosystem (2020) I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github.com/deepmind, 2020 | 96* | 2020 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 81 | 2024 |
The DeepMind JAX Ecosystem I Babuschkin, K Baumli, A Bell, S Bhupatiraju, J Bruce, P Buchlovsky, ... URL http://github. com/deepmind 24, 25, 2020 | 60 | 2020 |
Cyprien de Masson d’Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, HF Song, J Aslanides, ... Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William S. Isaac …, 2021 | 52 | 2021 |
StarCraft II Unplugged: Large Scale Offline Reinforcement Learning M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... | 16 | 2021 |
Replicating deepmind starcraft ii reinforcement learning benchmark with actor-critic methods R Ring undergraduate thesis, 2017 | 9* | 2017 |
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... arXiv preprint arXiv:2308.03526, 2023 | 6 | 2023 |
Language model for processing a multi-mode query input JB Alayrac, J Donahue, K Lenc, K Simonyan, MKC Reynolds, ... US Patent App. 18/141,337, 2023 | | 2023 |