The efficiency spectrum of large language models: An algorithmic survey

T Ding, T Chen, H Zhu, J Jiang, Y Zhong… - arXiv preprint arXiv …, 2023 - researchgate.net
The rapid growth of Large Language Models (LLMs) has been a driving force in
transforming various domains, reshaping the artificial general intelligence landscape …

Low Carbon Footprint Training for 1D-CNNs with Temporal Max-Pooling

A Durai Raju, K Wang - Proceedings of the 33rd ACM International …, 2024 - dl.acm.org
Training convolutional neural networks (CNNs) demands huge GPU memory consumption
and training time, leading to increased carbon emissions, and impacting sustainability. In …

Method and apparatus with neural network convolution operation

SON Jinwoo, C Son, J Yoo, LEE Seohyung… - US Patent App. 17 …, 2021 - Google Patents
A processor-implemented neural network method includes: generating a first output line of
an output feature map by performing a convolution operation between a first input line group …

RSKCNN: introducing randomly-sparse-kernel CNN

Z Feng, Y He, Y Li - Fifth International Conference on …, 2024 - spiedigitallibrary.org
Convolutional neural networks (CNNs) perform excellently in many image processing and
computer vision tasks. However, their complex structure and the vast number of parameters …

Method for controlling target object, apparatus, device, and storage medium

Z Fangyun, F Yunfu, Z Chengjun, Q Zhou - US Patent 11,351,458, 2022 - Google Patents
Embodiments of this application disclose a method for controlling a target object. The
method includes receiving an object control instruction, and obtaining interaction frame data …

Broadcasting mode of planar engine for neural processor

CL Mills, KW Waters, KIM Youchang - US Patent 11,630,991, 2023 - Google Patents
G06F7/48—Methods or arrangements for performing computations using exclusively
denominational number representation, e.g. using binary, ternary, decimal representation …

Optimization for deconvolution

G Venkatesh - US Patent 11,222,092, 2022 - Google Patents
Disclosed herein includes a system, a method, and a device for improving computational
efficiency of deconvolution by reducing a number of dot products. In one aspect, an input …

Reordering of sparse data to induce spatial locality for n-dimensional sparse convolutional neural network processing

A Thyagharajan, P Laddha, O Omer… - US Patent App. 16 …, 2020 - Google Patents
Exemplary embodiments maintain spatial locality of the data being processed by a sparse
CNN. The spatial locality is maintained by reordering the data to preserve spatial locality …

Broadcasting mode of planar engine for neural processor

CL Mills, KW Waters, KIM Youchang - US Patent 12,124,943, 2024 - Google Patents
Embodiments relate to a neural processor that includes one or more neural engine circuits
and planar engine circuits. The neural engine circuits can perform convolution operations of …

Word based channels last ordering in memory

P Worfolk - US Patent 12,026,396, 2024 - Google Patents
A memory device includes a first word and a second word. The first word has a first subset of
a plurality of elements. The first subset of the plurality of elements each have a first set of …