Transformers implement functional gradient descent to learn non-linear functions in context X Cheng, Y Chen, S Sra arXiv preprint arXiv:2312.06528, 2023 | 31 | 2023 |
Accelerated dimension-independent adaptive Metropolis Y Chen, D Keyes, KJH Law, H Ltaief SIAM Journal on Scientific Computing 38 (5), S539-S565, 2016 | 27 | 2016 |
Atos: A task-parallel GPU scheduler for graph analytics Y Chen, B Brock, S Porumbescu, A Buluc, K Yelick, J Owens Proceedings of the 51st International Conference on Parallel Processing, 1-11, 2022 | 11 | 2022 |
Scalable irregular parallelism with GPUs: Getting CPUs out of the way Y Chen, B Brock, S Porumbescu, A Buluç, K Yelick, JD Owens SC22: International Conference for High Performance Computing, Networking …, 2022 | 9 | 2022 |
Performance trade-offs in GPU communication: A study of host and device-initiated approaches T Groves, B Brock, Y Chen, KZ Ibrahim, L Oliker, NJ Wright, S Williams, ... 2020 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2020 | 6 | 2020 |
Atos: A task-parallel GPU dynamic scheduling framework for dynamic irregular computations Y Chen, B Brock, S Porumbescu, A Buluç, K Yelick, JD Owens arXiv preprint arXiv:2112.00132, 2021 | 3 | 2021 |
RDMA vs. RPC for implementing distributed data structures BA Brock, Y Chen, J Yan, J Owens, A Buluç, K Yelick 2019 IEEE/ACM 9th Workshop on Irregular Applications: Architectures and …, 2019 | 3 | 2019 |