Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 945 | 2021 |
Magnetic control of tokamak plasmas through deep reinforcement learning J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022 | 772 | 2022 |
Multimodal few-shot learning with frozen language models M Tsimpoukelli, JL Menick, S Cabi, SM Eslami, O Vinyals, F Hill Advances in Neural Information Processing Systems 34, 200-212, 2021 | 670 | 2021 |
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024 | 220 | 2024 |
Cyprien de Masson d’Autume JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... | 95 | 2021 |
Cyprien de Masson d’Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, HF Song, J Aslanides, ... Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William S. Isaac …, 2021 | 64 | 2021 |
Relaxation search: A simple way of managing optional clauses F Bacchus, J Davies, M Tsimpoukelli, G Katsirelos Proceedings of the AAAI Conference on Artificial Intelligence 28 (1), 2014 | 50 | 2014 |
Scaling Language Models: Methods JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... Analysis & Insights from Training Gopher. arXiv, 2021 | 26 | 2021 |
Scaling language models: Methods, analysis & insights from training gopher. arXiv 2021 JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 15 | 2021 |
Multimodal few-shot learning with frozen language models MR Tsimpoukelli, JL Menick, S Cabi, FG Hill, SM Eslami, O Vinyals US Patent App. 18/568,561, 2024 | | 2024 |