Computer Science > Machine Learning

arXiv:2309.10975 (cs)

[Submitted on 20 Sep 2023]

Title:SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization

View PDF

Abstract:Quantization is a widely used compression method that effectively reduces redundancies in over-parameterized neural networks. However, existing quantization techniques for deep neural networks often lack a comprehensive error analysis due to the presence of non-convex loss functions and nonlinear activations. In this paper, we propose a fast stochastic algorithm for quantizing the weights of fully trained neural networks. Our approach leverages a greedy path-following mechanism in combination with a stochastic quantizer. Its computational complexity scales only linearly with the number of weights in the network, thereby enabling the efficient quantization of large networks. Importantly, we establish, for the first time, full-network error bounds, under an infinite alphabet condition and minimal assumptions on the weights and input data. As an application of this result, we prove that when quantizing a multi-layer network having Gaussian weights, the relative square quantization error exhibits a linear decay as the degree of over-parametrization increases. Furthermore, we demonstrate that it is possible to achieve error bounds equivalent to those obtained in the infinite alphabet case, using on the order of a mere $\log\log N$ bits per weight, where $N$ represents the largest number of neurons in a layer.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2309.10975 [cs.LG]
	(or arXiv:2309.10975v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.10975

Submission history

From: Jinjie Zhang [view email]
[v1] Wed, 20 Sep 2023 00:35:16 UTC (445 KB)

Computer Science > Machine Learning

Title:SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators