An information-theoretic perspective on variance-invariance-covariance regularization

R Shwartz-Ziv, R Balestriero, K Kawaguchi… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we provide an information-theoretic perspective on Variance-Invariance-
Covariance Regularization (VICReg) for self-supervised learning. To do so, we first …

An information theory perspective on variance-invariance-covariance regularization

R Shwartz-Ziv, R Balestriero… - Advances in …, 2023 - proceedings.neurips.cc
Variance-Invariance-Covariance Regularization (VICReg) is a self-supervised
learning (SSL) method that has shown promising results on a variety of tasks. However, the …

martFL: Enabling Utility-Driven Data Marketplace with a Robust and Verifiable Federated Learning Architecture

Q Li, Z Liu, Q Li, K Xu - Proceedings of the 2023 ACM SIGSAC …, 2023 - dl.acm.org
The development of machine learning models requires large amounts of training data. Data
marketplaces are critical platforms for trading high-quality, private-domain data that is not …

Fine-grained data distribution alignment for post-training quantization

Y Zhong, M Lin, M Chen, K Li, Y Shen, F Chao… - … on Computer Vision, 2022 - Springer
While post-training quantization owes its popularity largely to avoiding access to the
original complete training dataset, its poor performance also stems from scarce images …

VQ4DiT: Efficient post-training vector quantization for diffusion transformers

J Deng, S Li, Z Wang, H Gu, K Xu, K Huang - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion Transformer models (DiTs) have transitioned the network architecture from
traditional UNets to transformers, demonstrating exceptional capabilities in image …

Sub-8-bit quantization for on-device speech recognition: A regularization-free approach

K Zhen, M Radfar, H Nguyen, GP Strimel… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
For on-device automatic speech recognition (ASR), quantization-aware training (QAT) is
ubiquitous for achieving the trade-off between model predictive performance and efficiency …

YONO: Modeling multiple heterogeneous neural networks on microcontrollers

YD Kwon, J Chauhan, C Mascolo - 2022 21st ACM/IEEE …, 2022 - ieeexplore.ieee.org
Internet of Things (IoT) systems provide large amounts of data on all aspects of human
behavior. Machine learning techniques, especially deep neural networks (DNN), have …

Enabling on-device smartphone GPU based training: Lessons learned

A Das, YD Kwon, J Chauhan… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Deep Learning (DL) has shown impressive performance in many mobile applications. Most
existing works have focused on reducing the computational and resource overheads of …

A noise-driven heterogeneous stochastic computing multiplier for heuristic precision improvement in energy-efficient DNNs

J Wang, H Chen, D Wang, K Mei… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Stochastic computing (SC) has become a promising approximate computing solution owing to its
negligible resource occupancy and ultra-low energy consumption. As a potential …

GPTVQ: The blessing of dimensionality for LLM quantization

M van Baalen, A Kuzmin, M Nagel, P Couperus… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we show that the size-versus-accuracy trade-off of neural network quantization
can be significantly improved by increasing the quantization dimensionality. We propose the …