FlashAttention: Fast and memory-efficient exact attention with IO-awareness T Dao, D Fu, S Ermon, A Rudra, C Ré Advances in Neural Information Processing Systems 35, 16344-16359, 2022 | 957 | 2022 |
Mamba: Linear-time sequence modeling with selective state spaces A Gu, T Dao Conference on Language Modeling (COLM), 2024 | 554 | 2024 |
StarCoder: May the source be with you! R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ... Transactions on Machine Learning Research (TMLR), 2023 | 550* | 2023 |
FlashAttention-2: Faster attention with better parallelism and work partitioning T Dao International Conference on Learning Representations, 2024 | 304 | 2024 |
HiPPO: Recurrent memory with optimal polynomial projections A Gu, T Dao, S Ermon, A Rudra, C Ré Advances in Neural Information Processing Systems 33, 1474-1487, 2020 | 261 | 2020 |
Combining recurrent, convolutional, and continuous-time models with linear state space layers A Gu, I Johnson, K Goel, K Saab, T Dao, A Rudra, C Ré Advances in Neural Information Processing Systems 34, 572-585, 2021 | 251 | 2021 |
Hungry Hungry Hippos: Towards Language Modeling with State Space Models DY Fu, T Dao, KK Saab, AW Thomas, A Rudra, C Ré The Eleventh International Conference on Learning Representations, 2023 | 231 | 2023 |
A kernel theory of modern data augmentation T Dao, A Gu, A Ratner, V Smith, CD Sa, C Ré Proceedings of the 36th International Conference on Machine Learning (ICML), 2019 | 205 | 2019 |
Hyena Hierarchy: Towards Larger Convolutional Language Models M Poli, S Massaroli, E Nguyen, DY Fu, T Dao, S Baccus, Y Bengio, ... International Conference on Machine Learning, 2023 | 172 | 2023 |
Deja Vu: Contextual sparsity for efficient LLMs at inference time Z Liu, J Wang, T Dao, T Zhou, B Yuan, Z Song, A Shrivastava, C Zhang, ... International Conference on Machine Learning, 22137-22176, 2023 | 118 | 2023 |
S4ND: Modeling images and videos as multidimensional signals with state spaces E Nguyen, K Goel, A Gu, G Downs, P Shah, T Dao, S Baccus, C Ré Advances in Neural Information Processing Systems 35, 2846-2861, 2022 | 110 | 2022 |
Learning fast algorithms for linear transforms using butterfly factorizations T Dao, A Gu, M Eichhorn, A Rudra, C Ré International Conference on Machine Learning, 1517-1527, 2019 | 106 | 2019 |
Scatterbrain: Unifying sparse and low-rank attention B Chen, T Dao, E Winsor, Z Song, A Rudra, C Ré Advances in Neural Information Processing Systems 34, 17413-17426, 2021 | 98 | 2021 |
Monarch: Expressive structured matrices for efficient and accurate training T Dao, B Chen, NS Sohoni, A Desai, M Poli, J Grogan, A Liu, A Rao, ... International Conference on Machine Learning, 4690-4721, 2022 | 71 | 2022 |
MONGOOSE: A learnable LSH framework for efficient neural network training B Chen, Z Liu, B Peng, Z Xu, JL Li, T Dao, Z Song, A Shrivastava, C Ré International Conference on Learning Representations, 2021 | 71 | 2021 |
Pixelated Butterfly: Simple and efficient sparse training for neural network models T Dao, B Chen, K Liang, J Yang, Z Song, A Rudra, C Ré International Conference on Learning Representations, 2022 | 65 | 2022 |
Decentralized training of foundation models in heterogeneous environments B Yuan, Y He, J Davis, T Zhang, T Dao, B Chen, PS Liang, C Ré, C Zhang Advances in Neural Information Processing Systems 35, 25464-25477, 2022 | 62 | 2022 |
Gaussian quadrature for kernel features T Dao, CM De Sa, C Ré Advances in Neural Information Processing Systems 30, 2017 | 60 | 2017 |
StarCoder 2 and The Stack v2: The next generation A Lozhkov, R Li, LB Allal, F Cassano, J Lamy-Poirier, N Tazi, A Tang, ... arXiv preprint arXiv:2402.19173, 2024 | 53 | 2024 |
Learning compressed transforms with low displacement rank A Thomas, A Gu, T Dao, A Rudra, C Ré Advances in Neural Information Processing Systems 31, 2018 | 52 | 2018 |