Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks C Mendis, A Renda, S Amarasinghe, M Carbin International Conference on Machine Learning, 4505-4515, 2019 | 172 | 2019 |
Making caches work for graph analytics Y Zhang, V Kiriansky, C Mendis, S Amarasinghe, M Zaharia 2017 IEEE International Conference on Big Data (Big Data), 293-302, 2017 | 150 | 2017 |
A learned performance model for tensor processing units S Kaufman, P Phothilimthana, Y Zhou, C Mendis, S Roy, A Sabne, ... Proceedings of Machine Learning and Systems 3, 387-400, 2021 | 72 | 2021 |
Helium: Lifting high-performance stencil kernels from stripped x86 binaries to Halide DSL code C Mendis, J Bosboom, K Wu, S Kamil, J Ragan-Kelley, S Paris, Q Zhao, ... Proceedings of the 36th ACM SIGPLAN Conference on Programming Language …, 2015 | 48 | 2015 |
Compiler auto-vectorization with imitation learning C Mendis, C Yang, Y Pu, DS Amarasinghe, M Carbin Advances in Neural Information Processing Systems 32, 2019 | 45 | 2019 |
Difftune: Optimizing cpu simulator parameters with learned differentiable surrogates A Renda, Y Chen, C Mendis, M Carbin 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 42 | 2020 |
goSLP: globally optimized superword level parallelism framework C Mendis, S Amarasinghe Proceedings of the ACM on Programming Languages 2 (OOPSLA), 110, 2018 | 42 | 2018 |
BHive: A benchmark suite and measurement framework for validating x86-64 basic block performance models Y Chen, A Brahmakshatriya, C Mendis, A Renda, E Atkinson, O Sýkora, ... 2019 IEEE International Symposium on Workload Characterization (IISWC), 167-177, 2019 | 41 | 2019 |
VeGen: a vectorizer generator for SIMD and beyond Y Chen, C Mendis, M Carbin, S Amarasinghe Proceedings of the 26th ACM International Conference on Architectural …, 2021 | 40 | 2021 |
Optimizing cache performance for graph analytics Y Zhang, V Kiriansky, C Mendis, M Zaharia, S Amarasinghe arXiv preprint arXiv:1608.01362, 8, 2016 | 22 | 2016 |
Parallelizing wfst speech decoders C Mendis, J Droppo, S Maleki, M Musuvathi, T Mytkowicz, G Zweig 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 19 | 2016 |
Revec: Program Rejuvenation through Revectorization C Mendis, A Jain, P Jain, S Amarasinghe 28th International Conference on Compiler Construction, 29-41, 2019 | 16 | 2019 |
GRANITE: A graph neural network model for basic block throughput estimation O Sýkora, PM Phothilimthana, C Mendis, A Yazdanbakhsh 2022 IEEE International Symposium on Workload Characterization (IISWC), 14-26, 2022 | 15 | 2022 |
WACO: learning workload-aware co-optimization of the format and schedule of a sparse tensor program J Won, C Mendis, JS Emer, S Amarasinghe Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 14 | 2023 |
Tpugraphs: A performance prediction dataset on large tensor computational graphs M Phothilimthana, S Abu-El-Haija, K Cao, B Fatemi, M Burrows, C Mendis, ... Advances in Neural Information Processing Systems 36, 2024 | 11 | 2024 |
All you need is superword-level parallelism: systematic control-flow vectorization with SLP Y Chen, C Mendis, S Amarasinghe Proceedings of the 43rd ACM SIGPLAN International Conference on Programming …, 2022 | 11 | 2022 |
Spade: A flexible and scalable accelerator for spmm and sddmm G Gerogiannis, S Yesil, D Lenadora, D Cao, C Mendis, J Torrellas Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 10 | 2023 |
Tgopt: Redundancy-aware optimizations for temporal graph attention networks Y Wang, C Mendis Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023 | 10 | 2023 |
Learning large graph property prediction via graph segment training K Cao, M Phothilimthana, S Abu-El-Haija, D Zelle, Y Zhou, C Mendis, ... Advances in Neural Information Processing Systems 36, 2024 | 7 | 2024 |
Unified Convolution Framework: A compiler-based approach to support sparse convolutions J Won, C Hong, C Mendis, J Emer, S Amarasinghe Proceedings of Machine Learning and Systems 5, 666-679, 2023 | 6 | 2023 |