UGEMM: Unary computing architecture for GEMM applications
General matrix multiplication (GEMM) is universal in various applications, such as signal
processing, machine learning, and computer vision. Conventional GEMM hardware …
processing, machine learning, and computer vision. Conventional GEMM hardware …
A computational temporal logic for superconducting accelerators
Superconducting logic offers the potential to perform computation at tremendous speeds
and energy savings. However, a" semantic gap" lies between the level-driven logic that …
and energy savings. However, a" semantic gap" lies between the level-driven logic that …
Cambricon-u: A systolic random increment memory architecture for unary computing
Unary computing, whose arithmetics require only one logic gate, has enabled efficient DNN
processing, especially on strictly power-constrained devices. However, unary computing still …
processing, especially on strictly power-constrained devices. However, unary computing still …
Hierarchical, distributed and brain-inspired learning for internet of things systems
In this paper, we propose EdgeHD, a hierarchy-aware learning solution that performs online
training and inference in a highly distributed, cost-effective way. We use brain-inspired …
training and inference in a highly distributed, cost-effective way. We use brain-inspired …
X-TIME: An in-memory engine for accelerating machine learning on tabular data with CAMs
Structured, or tabular, data is the most common format in data science. While deep learning
models have proven formidable in learning from unstructured data such as images or …
models have proven formidable in learning from unstructured data such as images or …
Temporal computing with superconductors
Creating computing systems able to address our ever-increasing needs, especially as we
reach the end of CMOS transistor scaling, will require truly novel methods of computing …
reach the end of CMOS transistor scaling, will require truly novel methods of computing …
Temporal memory with magnetic racetracks
Race logic is a relative timing code that represents information in a wavefront of digital
edges on a set of wires in order to accelerate dynamic programming and machine learning …
edges on a set of wires in order to accelerate dynamic programming and machine learning …
Agile hardware development and instrumentation with PyRTL
Domain-specific architectures have emerged as a promising solution to meet growing
technology demands but with this comes an urgent need to improve hardware …
technology demands but with this comes an urgent need to improve hardware …
Energy Efficient Convolutions with Temporal Arithmetic
Convolution is an important operation at the heart of many applications, including image
processing, object detection, and neural networks. While data movement and coordination …
processing, object detection, and neural networks. While data movement and coordination …
Tnn7: A custom macro suite for implementing highly optimized designs of neuromorphic tnns
H Nair, P Vellaisamy, S Bhasuthkar… - 2022 IEEE Computer …, 2022 - ieeexplore.ieee.org
Temporal Neural Networks (TNNs), inspired from the mammalian neocortex, exhibit energy-
efficient online sensory processing capabilities. Recent works have proposed a microar …
efficient online sensory processing capabilities. Recent works have proposed a microar …