A reconfigurable fabric for accelerating large-scale datacenter services A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ... ACM SIGARCH Computer Architecture News 42 (3), 13-24, 2014 | 1529 | 2014 |
Sage: Self-tuning approximation for graphics engines M Samadi, J Lee, DA Jamshidi, A Hormati, S Mahlke Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013 | 353 | 2013 |
A reconfigurable fabric for accelerating large-scale datacenter services A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ... IEEE Micro 35 (3), 10-22, 2015 | 168 | 2015 |
Veal: Virtualized execution accelerator for loops N Clark, A Hormati, S Mahlke ACM SIGARCH Computer Architecture News 36 (3), 389-400, 2008 | 146 | 2008 |
Flextream: Adaptive compilation of streaming applications for heterogeneous architectures AH Hormati, Y Choi, M Kudlur, R Rabbah, T Mudge, S Mahlke 2009 18th International Conference on Parallel Architectures and Compilation …, 2009 | 134 | 2009 |
Sponge: portable stream programming on graphics engines AH Hormati, M Samadi, M Woh, T Mudge, S Mahlke ACM SIGPLAN Notices 46 (3), 381-392, 2011 | 124 | 2011 |
Liquid metal: Object-oriented programming across the hardware/software boundary SS Huang, A Hormati, DF Bacon, R Rabbah ECOOP 2008–Object-Oriented Programming: 22nd European Conference Paphos …, 2008 | 124 | 2008 |
Optimus: Efficient realization of streaming applications on FPGAs A Hormati, M Kudlur, S Mahlke, D Bacon, R Rabbah Proceedings of the 2008 international conference on Compilers, architectures …, 2008 | 103 | 2008 |
Translation of SIMD instructions in a data processing system S Yehia, K Flautner, N Clark, A Hormati, S Mahlke US Patent 8,505,002, 2013 | 97 | 2013 |
Scalable subgraph mapping for acyclic computation accelerators N Clark, A Hormati, S Mahlke, S Yehia Proceedings of the 2006 international conference on Compilers, architecture …, 2006 | 80 | 2006 |
Liquid SIMD: Abstracting SIMD hardware using lightweight dynamic mapping N Clark, A Hormati, S Yehia, S Mahlke, K Flautner 2007 IEEE 13th International Symposium on High Performance Computer …, 2007 | 66 | 2007 |
A reconfigurable fabric for accelerating large-scale datacenter services A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ... Communications of the ACM 59 (11), 114-122, 2016 | 54 | 2016 |
Adaptive input-aware compilation for graphics engines M Samadi, A Hormati, M Mehrara, J Lee, S Mahlke Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language …, 2012 | 45 | 2012 |
Macross: Macro-simdization of streaming applications AH Hormati, Y Choi, M Woh, M Kudlur, R Rabbah, T Mudge, S Mahlke ACM SIGARCH computer architecture news 38 (1), 285-296, 2010 | 40 | 2010 |
Paragon: Collaborative speculative loop execution on gpu and cpu M Samadi, A Hormati, J Lee, S Mahlke Proceedings of the 5th Annual Workshop on General Purpose Processing with …, 2012 | 35 | 2012 |
Exploiting narrow accelerators with data-centric subgraph mapping A Hormati, N Clark, S Mahlke International Symposium on Code Generation and Optimization (CGO'07), 341-353, 2007 | 18 | 2007 |
Leveraging GPUs using cooperative loop speculation M Samadi, A Hormati, J Lee, S Mahlke ACM Transactions on Architecture and Code Optimization (TACO) 11 (1), 1-26, 2014 | 6 | 2014 |
Scaling performance via self-tuning approximation for graphics engines M Samadi, J Lee, DA Jamshidi, S Mahlke, A Hormati ACM Transactions on Computer Systems (TOCS) 32 (3), 1-29, 2014 | 5 | 2014 |
Scalable matrix factorization in a database AH Hormati, L Yin, UA Syed, M Deng US Patent 11,948,159, 2024 | 1 | 2024 |
Transformation for machine learning pre-processing J Wu, AH Hormati US Patent 11,928,559, 2024 | 1 | 2024 |