Graph neural ordinary differential equations M Poli, S Massaroli, J Park, A Yamashita, H Asama, J Park Workshop on Deep Learning on Graphs: Methodologies and Applications (DLGMA’20), 2019 | 408 | 2019 |
Dissecting neural ODEs S Massaroli, M Poli, J Park, A Yamashita, H Asama Advances in Neural Information Processing Systems 33, 2020 | 190 | 2020 |
Hyena hierarchy: Towards larger convolutional language models M Poli, S Massaroli, E Nguyen, DY Fu, T Dao, S Baccus, Y Bengio, ... International Conference on Machine Learning, 28043-28078, 2023 | 167 | 2023 |
Hyenadna: Long-range genomic sequence modeling at single nucleotide resolution E Nguyen, M Poli, M Faizi, A Thomas, M Wornow, C Birch-Sykes, ... Advances in neural information processing systems 36, 2024 | 94 | 2024 |
Monarch: Expressive structured matrices for efficient and accurate training T Dao, B Chen, N Sohoni, A Desai, M Poli, J Grogan, A Liu, A Rao, ... arXiv preprint arXiv:2204.00595, 2022 | 70 | 2022 |
Hypersolvers: Toward fast continuous-depth models M Poli, S Massaroli, A Yamashita, H Asama, J Park Advances in Neural Information Processing Systems 33, 2020 | 54 | 2020 |
Which shortcut cues will dnns choose? a study from the parameter-space perspective L Scimeca, SJ Oh, S Chun, M Poli, S Yun International Conference on Learning Representations, ICLR 2022, 2021 | 44 | 2021 |
Stable neural flows S Massaroli, M Poli, M Bin, J Park, A Yamashita, H Asama arXiv preprint arXiv:2003.08063, 2020 | 44 | 2020 |
TorchDyn: Implicit models and neural numerical methods in PyTorch M Poli, S Massaroli, A Yamashita, H Asama, J Park, S Ermon | 39* | |
Effectively modeling time series with simple discrete state spaces M Zhang, KK Saab, M Poli, T Dao, K Goel, C Ré arXiv preprint arXiv:2303.09489, 2023 | 33 | 2023 |
Zoology: Measuring and improving recall in efficient language models S Arora, S Eyuboglu, A Timalsina, I Johnson, M Poli, J Zou, A Rudra, C Ré arXiv preprint arXiv:2312.04927, 2023 | 27 | 2023 |
Monarch mixer: A simple sub-quadratic gemm-based architecture D Fu, S Arora, J Grogan, I Johnson, ES Eyuboglu, A Thomas, B Spector, ... Advances in Neural Information Processing Systems 36, 2024 | 25 | 2024 |
Differentiable multiple shooting layers S Massaroli, M Poli, S Sonoda, T Suzuki, J Park, A Yamashita, H Asama Advances in Neural Information Processing Systems 34, 2021 | 20 | 2021 |
Deep latent state space models for time-series generation L Zhou, M Poli, W Xu, S Massaroli, S Ermon arXiv preprint arXiv:2212.12749, 2022 | 18 | 2022 |
Transform Once: Efficient operator learning in frequency domain M Poli, S Massaroli, F Berto, J Park, T Dao, C Re, S Ermon ICML 2022 2nd AI for Science Workshop, 2022 | 16 | 2022 |
Neural ordinary differential equations for intervention modeling D Gwak, G Sim, M Poli, S Massaroli, J Choo, E Choi arXiv preprint arXiv:2010.08304, 2020 | 14 | 2020 |
Port–hamiltonian approach to neural network training S Massaroli, M Poli, F Califano, A Faragasso, J Park, A Yamashita, ... 2019 IEEE 58th Conference on Decision and Control (CDC), 6799-6806, 2019 | 14 | 2019 |
Optimal energy shaping via neural approximators S Massaroli, M Poli, F Califano, J Park, A Yamashita, H Asama SIAM Journal on Applied Dynamical Systems (SIADS), 2021 | 13 | 2021 |
Sequence modeling and design from molecular to genome scale with Evo E Nguyen, M Poli, MG Durrant, AW Thomas, B Kang, J Sullivan, MY Ng, ... BioRxiv, 2024.02. 27.582234, 2024 | 11 | 2024 |
Laughing hyena distillery: Extracting compact recurrences from convolutions S Massaroli, M Poli, D Fu, H Kumbong, R Parnichkun, D Romero, ... Advances in Neural Information Processing Systems 36, 2024 | 11 | 2024 |