The efficiency spectrum of large language models: An algorithmic survey
The rapid growth of Large Language Models (LLMs) has been a driving force in
transforming various domains, reshaping the artificial general intelligence landscape …
Low Carbon Footprint Training for 1D-CNNs with Temporal Max-Pooling
A Durai Raju, K Wang - Proceedings of the 33rd ACM International …, 2024 - dl.acm.org
Training convolutional neural networks (CNNs) demands huge GPU memory consumption
and training time, leading to increased carbon emissions, and impacting sustainability. In …
Method and apparatus with neural network convolution operation
A processor-implemented neural network method includes: generating a first output line of
an output feature map by performing a convolution operation between a first input line group …
RSKCNN: introducing randomly-sparse-kernel CNN
Z Feng, Y He, Y Li - Fifth International Conference on …, 2024 - spiedigitallibrary.org
Convolutional neural networks (CNNs) perform excellently in many image processing and
computer vision tasks. However, their complex structure and the vast number of parameters …
Method for controlling target object, apparatus, device, and storage medium
Z Fangyun, F Yunfu, Z Chengjun, Q Zhou - US Patent 11,351,458, 2022 - Google Patents
Embodiments of this application disclose a method for controlling a target object. The
method includes receiving an object control instruction, and obtaining interaction frame data …
Broadcasting mode of planar engine for neural processor
CL Mills, KW Waters, KIM Youchang - US Patent 11,630,991, 2023 - Google Patents
G06F7/48—Methods or arrangements for performing computations using exclusively
denominational number representation, e.g. using binary, ternary, decimal representation …
Optimization for deconvolution
G Venkatesh - US Patent 11,222,092, 2022 - Google Patents
Disclosed herein includes a system, a method, and a device for improving computational
efficiency of deconvolution by reducing a number of dot products. In one aspect, an input …
Reordering of sparse data to induce spatial locality for n-dimensional sparse convolutional neural network processing
A Thyagharajan, P Laddha, O Omer… - US Patent App. 16 …, 2020 - Google Patents
Exemplary embodiments maintain spatial locality of the data being processed by a sparse
CNN. The spatial locality is maintained by reordering the data to preserve spatial locality …
Broadcasting mode of planar engine for neural processor
CL Mills, KW Waters, KIM Youchang - US Patent 12,124,943, 2024 - Google Patents
Embodiments relate to a neural processor that includes one or more neural engine circuits
and planar engine circuits. The neural engine circuits can perform convolution operations of …
Word based channels last ordering in memory
P Worfolk - US Patent 12,026,396, 2024 - Google Patents
A memory device includes a first word and a second word. The first word has a first subset of
a plurality of elements. The first subset of the plurality of elements each have a first set of …