Computer Science > Machine Learning

arXiv:2410.16151 (cs)

[Submitted on 21 Oct 2024]

Title: Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance

Authors: Mostafa Hussien, Mahmoud Afifi, Kim Khoa Nguyen, Mohamed Cheriet

Abstract: Recent advancements have scaled neural networks to unprecedented sizes, achieving remarkable performance across a wide range of tasks. However, deploying these large-scale models on resource-constrained devices poses significant challenges due to substantial storage and computational requirements. Neural network pruning has emerged as an effective technique to mitigate these limitations by reducing model size and complexity. In this paper, we introduce an intuitive and interpretable pruning method based on activation statistics, rooted in information theory and statistical analysis. Our approach leverages the statistical properties of neuron activations to identify and remove weights with minimal contributions to neuron outputs. Specifically, we build a distribution of weight contributions across the dataset and utilize its parameters to guide the pruning process. Furthermore, we propose a Pruning-aware Training strategy that incorporates an additional regularization term to enhance the effectiveness of our pruning method. Extensive experiments on multiple datasets and network architectures demonstrate that our method consistently outperforms several baseline and state-of-the-art pruning techniques.
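
The abstract describes scoring each weight by how much it contributes to its neuron's output across a dataset and removing the weights that contribute least. The abstract does not give the exact statistic, so the following NumPy sketch is only one plausible reading for a single fully connected layer. The relative-contribution definition, the use of the mean over samples, the per-neuron `keep_ratio`, and the function name `relative_contribution_mask` are all assumptions made for illustration, not the authors' method.

```python
import numpy as np

def relative_contribution_mask(W, X, keep_ratio=0.5, eps=1e-12):
    """Hypothetical sketch: keep the weights with the largest mean relative contribution.

    W: (n_in, n_out) weight matrix of a dense layer.
    X: (n_samples, n_in) inputs to that layer, collected over a dataset.
    Returns (mask, mean_rel), both of shape (n_in, n_out).
    """
    # Absolute contribution |x_i * w_ij| of every weight for every sample.
    contrib = np.abs(X[:, :, None] * W[None, :, :])     # (n_samples, n_in, n_out)
    # Normalize by the total absolute contribution flowing into each neuron.
    total = contrib.sum(axis=1, keepdims=True) + eps    # (n_samples, 1, n_out)
    rel = contrib / total
    # Summarize the per-sample distribution by its mean (assumed statistic).
    mean_rel = rel.mean(axis=0)                         # (n_in, n_out)
    # Keep the top `keep_ratio` fraction of incoming weights for each neuron.
    n_keep = max(1, int(round(keep_ratio * W.shape[0])))
    order = np.argsort(-mean_rel, axis=0)               # descending per column
    mask = np.zeros(W.shape, dtype=bool)
    mask[order[:n_keep], np.arange(W.shape[1])] = True
    return mask, mean_rel

# Toy usage: one neuron with four incoming weights and all-ones inputs.
W_toy = np.array([[10.0], [0.1], [5.0], [0.01]])
X_toy = np.ones((8, 4))
mask_toy, _ = relative_contribution_mask(W_toy, X_toy, keep_ratio=0.5)
W_pruned = W_toy * mask_toy  # the weights 10.0 and 5.0 are kept, the rest are zeroed
```

Normalizing per neuron makes the scores comparable across neurons with different activation scales, which is one way to interpret "relative importance"; a global threshold on the same scores would be an equally reasonable variant.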

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2410.16151 [cs.LG]
	(or arXiv:2410.16151v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.16151

Submission history

From: Mostafa Hussien
[v1] Mon, 21 Oct 2024 16:18:31 UTC (571 KB)
