[PDF][PDF] The efficiency spectrum of large language models: An algorithmic survey
The rapid growth of Large Language Models (LLMs) has been a driving force in
transforming various domains, reshaping the artificial general intelligence landscape …
transforming various domains, reshaping the artificial general intelligence landscape …
Cdfi: Compression-driven network design for frame interpolation
DNN-based frame interpolation--that generates the intermediate frames given two
consecutive frames--typically relies on heavy model architectures with a huge number of …
consecutive frames--typically relies on heavy model architectures with a huge number of …
Otov2: Automatic, generic, user-friendly
The existing model compression methods via structured pruning typically require
complicated multi-stage procedures. Each individual stage necessitates numerous …
complicated multi-stage procedures. Each individual stage necessitates numerous …
Lorashear: Efficient large language model structured pruning and knowledge recovery
Large Language Models (LLMs) have transformed the landscape of artificial intelligence,
while their enormous size presents significant challenges in terms of computational costs …
while their enormous size presents significant challenges in terms of computational costs …
St-mfnet mini: Knowledge distillation-driven frame interpolation
Currently, one of the major challenges in deep learning-based video frame interpolation
(VFI) is the large model size and high computational complexity associated with many high …
(VFI) is the large model size and high computational complexity associated with many high …
An adaptive half-space projection method for stochastic optimization problems with group sparse regularization
Optimization problems with group sparse regularization are ubiquitous in various popular
downstream applications, such as feature selection and compression for Deep Neural …
downstream applications, such as feature selection and compression for Deep Neural …
Sparsity-guided network design for frame interpolation
DNN-based frame interpolation, which generates intermediate frames from two consecutive
frames, is often dependent on model architectures with a large number of features …
frames, is often dependent on model architectures with a large number of features …
Mapping yolov4-tiny on fpga-based dnn accelerator by using dynamic fixed-point method
P Li, C Che - 2021 12th International Symposium on Parallel …, 2021 - ieeexplore.ieee.org
In the past few decades, with the large-scale application of deep learning technology, the
neural network inference speed problem is becoming more and more severe, especially in …
neural network inference speed problem is becoming more and more severe, especially in …
Implicit compressibility of overparametrized neural networks trained with heavy-tailed SGD
Neural network compression has been an increasingly important subject, not only due to its
practical relevance, but also due to its theoretical implications, as there is an explicit …
practical relevance, but also due to its theoretical implications, as there is an explicit …
OTOv3: Automatic Architecture-Agnostic Neural Network Training and Compression from Structured Pruning to Erasing Operators
Compressing a predefined deep neural network (DNN) into a compact sub-network with
competitive performance is crucial in the efficient machine learning realm. This topic spans …
competitive performance is crucial in the efficient machine learning realm. This topic spans …