Cnvlutin: Ineffectual-neuron-free deep neural network computing J Albericio, P Judd, T Hetherington, T Aamodt, NE Jerger, A Moshovos ACM SIGARCH Computer Architecture News 44 (3), 1-13, 2016 | 892 | 2016 |
Demystifying GPU microarchitecture through microbenchmarking H Wong, MM Papadopoulou, M Sadooghi-Alvandi, A Moshovos 2010 IEEE International Symposium on Performance Analysis of Systems …, 2010 | 611 | 2010 |
Stripes: Bit-serial deep neural network computing P Judd, J Albericio, T Hetherington, TM Aamodt, A Moshovos 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 554 | 2016 |
CHIMAERA: A high-performance architecture with a tightly-coupled reconfigurable functional unit ZA Ye, A Moshovos, S Hauck, P Banerjee ACM SIGARCH computer architecture news 28 (2), 225-235, 2000 | 434 | 2000 |
Dependence based prefetching for linked data structures A Roth, A Moshovos, GS Sohi Proceedings of the eighth international conference on Architectural support …, 1998 | 428 | 1998 |
Dynamic speculation and synchronization of data dependences A Moshovos, SE Breach, TN Vijaykumar, GS Sohi Proceedings of the 24th annual international symposium on Computer …, 1997 | 385 | 1997 |
Spatial memory streaming S Somogyi, TF Wenisch, A Ailamaki, B Falsafi, A Moshovos ACM SIGARCH Computer Architecture News 34 (2), 252-263, 2006 | 364 | 2006 |
Low-leakage asymmetric-cell SRAM N Azizi, A Moshovos, FN Najm Proceedings of the 2002 international symposium on Low power electronics and …, 2002 | 279 | 2002 |
Bit-pragmatic deep neural network computing J Albericio, A Delmás, P Judd, S Sharify, G O'Leary, R Genov, ... Proceedings of the 50th annual IEEE/ACM international symposium on …, 2017 | 271 | 2017 |
JETTY: Filtering snoops for reduced energy consumption in SMP servers A Moshovos, G Memik, B Falsafi, A Choudhary Proceedings HPCA Seventh International Symposium on High-Performance …, 2001 | 249 | 2001 |
RegionScout: Exploiting coarse grain sharing in snoop-based coherence A Moshovos 32nd International Symposium on Computer Architecture (ISCA'05), 234-245, 2005 | 236 | 2005 |
Streamlining inter-operation memory communication via data dependence prediction A Moshovos, GS Sohi Proceedings of 30th Annual International Symposium on Microarchitecture, 235-245, 1997 | 213 | 1997 |
A tagless coherence directory J Zebchuk, V Srinivasan, MK Qureshi, A Moshovos Proceedings of the 42nd Annual IEEE/ACM International Symposium on …, 2009 | 201 | 2009 |
Doppelgänger: A cache for approximate computing JS Miguel, J Albericio, A Moshovos, NE Jerger Proceedings of the 48th international symposium on microarchitecture, 50-61, 2015 | 167 | 2015 |
Mechanisms for store-wait-free multiprocessors TF Wenisch, A Ailamaki, B Falsafi, A Moshovos Proceedings of the 34th annual international symposium on Computer …, 2007 | 157 | 2007 |
Gobo: Quantizing attention-based nlp models for low latency and energy efficient inference AH Zadeh, I Edo, OM Awad, A Moshovos 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 153 | 2020 |
Slice-processors: an implementation of operation-based prediction A Moshovos, DN Pnevmatikatos, A Baniasadi Proceedings of the 15th international conference on Supercomputing, 321-334, 2001 | 150 | 2001 |
Accurate and complexity-effective spatial pattern prediction CF Chen, SH Yang, B Falsafi, A Moshovos 10th International Symposium on High Performance Computer Architecture (HPCA …, 2004 | 149 | 2004 |
Table based data speculation circuit for parallel processing computer AI Moshovos, SE Breach, TN Vijaykumar, GS Sohi US Patent 5,781,752, 1998 | 135 | 1998 |
Reduced-precision strategies for bounded memory in deep neural nets P Judd, J Albericio, T Hetherington, T Aamodt, NE Jerger, R Urtasun, ... arXiv preprint arXiv:1511.05236, 2015 | 132 | 2015 |