Google Scholar

Ultimate tensorization: compressing convolutional and fc layers alike

T Garipov, D Podoprikhin, A Novikov…�- arXiv preprint arXiv�…, 2016 - arxiv.org

T Garipov, D Podoprikhin, A Novikov, D Vetrov

arXiv preprint arXiv:1611.03214, 2016•arxiv.org

Convolutional neural networks excel in image recognition tasks, but this comes at the cost of
high computational and memory complexity. To tackle this problem,[1] developed a tensor
factorization framework to compress fully-connected layers. In this paper, we focus on
compressing convolutional layers. We show that while the direct application of the tensor
framework [1] to the 4-dimensional kernel of convolution does compress the layer, we can
do better. We reshape the convolutional kernel into a tensor of higher order and factorize it�…

Convolutional neural networks excel in image recognition tasks, but this comes at the cost of high computational and memory complexity. To tackle this problem, [1] developed a tensor factorization framework to compress fully-connected layers. In this paper, we focus on compressing convolutional layers. We show that while the direct application of the tensor framework [1] to the 4-dimensional kernel of convolution does compress the layer, we can do better. We reshape the convolutional kernel into a tensor of higher order and factorize it. We combine the proposed approach with the previous work to compress both convolutional and fully-connected layers of a network and achieve 80x network compression rate with 1.1% accuracy drop on the CIFAR-10 dataset.

arxiv.org

Show moreShow less

Save Cite Cited by 234 Related articles All 3 versions View as HTML

Showing the best result for this search. See all results

Cite

Advanced search

Saved to My library

Ultimate tensorization: compressing convolutional and fc layers alike