Scaling vision transformers to 22 billion parameters M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ... ICML 2023, 2023 | 387 | 2023 |
scikit-optimize/scikit-optimize: v0.5.2 T Head, M Kumar, L Gilles, I Shcherbatyi Zenodo, 2018 | 258* | 2018 |
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation M Kumar, M Babaeizadeh, D Erhan, C Finn, S Levine, L Dinh, D Kingma ICLR 2020, 2020 | 249* | 2020 |
Deep learning for twelve hour precipitation forecasts L Espeholt, S Agrawal, C Sønderby, M Kumar, J Heek, C Bromberg, ... Nature Communications 13 (1), 1-10, 2022 | 202 | 2022 |
Colorization Transformer M Kumar, D Weissenborn, N Kalchbrenner ICLR 2021, 2021 | 197 | 2021 |
Image Captioners Are Scalable Vision Learners Too M Tschannen, M Kumar, A Steiner, X Zhai, N Houlsby, L Beyer NeurIPS 2023, 2023 | 39 | 2023 |
Parallel architecture and hyperparameter search via successive halving and classification M Kumar, GE Dahl, V Vasudevan, M Norouzi arXiv preprint arXiv:1805.10255, 2018 | 37* | 2018 |
Do better ImageNet classifiers assess perceptual similarity better? M Kumar, N Houlsby, N Kalchbrenner, ED Cubuk TMLR 2022, 2022 | 31* | 2022 |
PaliGemma: A versatile 3B VLM for transfer L Beyer, A Steiner, AS Pinto, A Kolesnikov, X Wang, D Salz, M Neumann, ... arXiv preprint arXiv:2407.07726, 2024 | 17 | 2024 |
Dual PatchNorm M Kumar, M Dehghani, N Houlsby TMLR 2023, 2023 | 8 | 2023 |
Frozen Feature Augmentation for Few-Shot Image Classification A Bär, N Houlsby, M Dehghani, M Kumar CVPR 2024, 2024 | 3 | 2024 |
Semantica: An Adaptable Image-Conditioned Diffusion Model M Kumar, N Houlsby, E Hoogeboom arXiv preprint arXiv:2405.14857, 2024 | | 2024 |