On the opportunities and risks of foundation models R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2021 | 3287 | 2021 |
The state of sparsity in deep neural networks T Gale, E Elsen, S Hooker arXiv preprint arXiv:1902.09574, 2019 | 733 | 2019 |
Rigging the lottery: Making all tickets winners U Evci, T Gale, J Menick, PS Castro, E Elsen International conference on machine learning, 2943-2952, 2020 | 533 | 2020 |
Sparse gpu kernels for deep learning T Gale, M Zaharia, C Young, E Elsen SC20: International Conference for High Performance Computing, Networking …, 2020 | 218 | 2020 |
Fast sparse convnets E Elsen, M Dukhan, T Gale, K Simonyan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 161 | 2020 |
On the opportunities and risks of foundation models. arXiv 2021 R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2023 | 70 | 2023 |
Megablocks: Efficient sparse training with mixture-of-experts T Gale, D Narayanan, C Young, M Zaharia Proceedings of Machine Learning and Systems 5, 288-304, 2023 | 37 | 2023 |
The state of sparsity in deep neural networks.(2019) T Gale, E Elsen, S Hooker arXiv preprint cs.LG/1902.09574, 2019 | 37 | 2019 |
The state of sparsity in deep neural networks. arXiv e-prints T Gale, E Elsen, S Hooker arXiv preprint arXiv:1902.09574, 2019 | 15 | 2019 |
Delineation of skin strata in reflectance confocal microscopy images with recurrent convolutional networks A Bozkurt, T Gale, K Kose, C Alessi-Fox, DH Brooks, M Rajadhyaksha, ... Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017 | 13 | 2017 |
Rigging the Lottery: Making All Tickets Winners. arXiv e-prints, art U Evci, T Gale, J Menick, PS Castro, E Elsen arXiv preprint arXiv:1911.11134, 2019 | 6 | 2019 |
JaxPruner: A concise library for sparsity research JH Lee, W Park, NE Mitchell, J Pilault, JSO Ceron, HB Kim, N Lee, ... Conference on Parsimony and Learning, 515-528, 2024 | 3 | 2024 |
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024 | 2 | 2024 |
dMath: Distributed Linear Algebra for DL S Eliuk, C Upright, H Vardhan, S Walsh, T Gale arXiv preprint arXiv:1611.07819, 2016 | 2 | 2016 |
Fast sparse neural networks EK Elsen, TJ Gale, M Dukhan US Patent App. 17/763,924, 2022 | 1 | 2022 |
Scorch: A Library for Sparse Deep Learning B Yan, AJ Root, T Gale, D Broman, F Kjolstad arXiv preprint arXiv:2405.16883, 2024 | | 2024 |
General-Purpose Systolic Array RC Young, T Gale, S Honnavara-Prasad, P Mantovani US Patent App. 18/376,494, 2024 | | 2024 |
General-purpose systolic array RC Young, T Gale, S Honnavara-Prasad, P Mantovani US Patent 11,829,321, 2023 | | 2023 |
Sparse matrix operations for deep learning EK Elsen, TJ Gale, RC Young US Patent App. 17/791,771, 2023 | | 2023 |
In situ sparse matrix expansion RC Young, TJ Gale US Patent App. 17/368,374, 2023 | | 2023 |