Virtual machine showdown: Stack versus registers Y Shi, K Casey, MA Ertl, D Gregg ACM Transactions on Architecture and Code Optimization (TACO) 4 (4), 1-36, 2008 | 234 | 2008 |
Parallel multi channel convolution using general matrix multiplication A Vasudevan, A Anderson, D Gregg 2017 IEEE 28th international conference on application-specific systems …, 2017 | 185 | 2017 |
The structure and performance of efficient interpreters MA Ertl, D Gregg Journal of Instruction-Level Parallelism 5, 1-25, 2003 | 133 | 2003 |
vmgen—a generator of efficient virtual machine interpreters MA Ertl, D Gregg, A Krall, B Paysan Software: Practice and Experience 32 (3), 265-294, 2002 | 122 | 2002 |
Optimizing indirect branch prediction accuracy in virtual machine interpreters MA Ertl, D Gregg Proceedings of the ACM SIGPLAN 2003 conference on Programming language …, 2003 | 108 | 2003 |
The movidius myriad architecture's potential for scientific computing MH Ionica, D Gregg IEEE Micro 35 (1), 6-14, 2015 | 93 | 2015 |
The Behavior of Efficient Virtual Machine Interpreters on Modern Architectures MA Ertl, D Gregg Euro-Par 2001 Parallel Processing: 7th International Euro-Par Conference …, 2001 | 86 | 2001 |
The case for virtual register machines B Davis, A Beatty, K Casey, D Gregg, J Waldron Proceedings of the 2003 workshop on Interpreters, virtual machines and …, 2003 | 80 | 2003 |
Low-memory gemm-based convolution algorithms for deep neural networks A Anderson, A Vasudevan, C Keane, D Gregg arXiv preprint arXiv:1709.03395, 2017 | 72 | 2017 |
Optimizing indirect branch prediction accuracy in virtual machine interpreters K Casey, MA Ertl, D Gregg ACM Transactions on Programming Languages and Systems (TOPLAS) 29 (6), 37-es, 2007 | 53 | 2007 |
Multiple-valued logic buses for reducing bus energy in low-power systems E Özer, R Sendag, D Gregg IEE Proceedings-Computers and Digital Techniques 153 (4), 270-282, 2006 | 47 | 2006 |
Low complexity multiply accumulate unit for weight-sharing convolutional neural networks J Garland, D Gregg IEEE Computer Architecture Letters 16 (2), 132-135, 2017 | 42 | 2017 |
A practical solution for scripting language compilers P Biggar, E de Vries, D Gregg Proceedings of the 2009 ACM symposium on Applied computing, 1916-1923, 2009 | 41 | 2009 |
Low complexity multiply-accumulate units for convolutional neural networks with weight-sharing J Garland, D Gregg ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-24, 2018 | 38 | 2018 |
FPGA based sparse matrix vector multiplication using commodity dram memory D Gregg, C Mc Sweeney, C McElroy, F Connor, S McGettrick, D Moloney, ... 2007 International Conference on Field Programmable Logic and Applications …, 2007 | 36 | 2007 |
Optimal DNN primitive selection with partitioned boolean quadratic programming A Anderson, D Gregg Proceedings of the 2018 International Symposium on Code Generation and …, 2018 | 35 | 2018 |
Dynamic interpretation for dynamic scripting languages K Williams, J McCandless, D Gregg Proceedings of the 8th annual IEEE/ACM international symposium on Code …, 2010 | 35 | 2010 |
Stochastic bit-width approximation using extreme value theory for customizable processors E Özer, AP Nisbet, D Gregg International Conference on Compiler Construction, 250-264, 2004 | 35 | 2004 |
A stochastic bitwidth estimation technique for compact and low-power custom processors E Özer, AP Nisbet, D Gregg ACM Transactions on Embedded Computing Systems (TECS) 7 (3), 1-30, 2008 | 33 | 2008 |
Combining stack caching with dynamic superinstructions MA Ertl, D Gregg Proceedings of the 2004 workshop on Interpreters, Virtual Machines and …, 2004 | 33 | 2004 |