Network Pruning

216 papers with code • 5 benchmarks • 5 datasets

Network Pruning is a popular approach to reduce a heavy network to obtain a light-weight form by removing redundancy in the heavy network. In this approach, a complex over-parameterized network is first trained, then pruned based on come criterions, and finally fine-tuned to achieve comparable performance with reduced parameters.

Source: Ensemble Knowledge Distillation for Learning Improved and Efficient Networks

Benchmarks

Add a Result

These leaderboards are used to track progress in Network Pruning

Dataset	Best Model	Compare
ImageNet	ResNet50-2.3 GFLOPs	See all
ImageNet - ResNet 50 - 90% sparsity	Feather	See all
CIFAR-100	Dense	See all
CIFAR-10	TAS-pruned ResNet-110	See all
MNIST	FFN-ShapleyPruned	See all

Libraries

Use these libraries to find Network Pruning models and implementations

JingtongSu/sanity-checking-pruning

4 papers

UCMerced-ML/LC-model-compression

3 papers

PaddlePaddle/PaddleOCR

2 papers

39,605

VainF/Torch-Pruning

2 papers

2,397

See all 7 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size

DeepScale/SqueezeNet • • 24 Feb 2016

(2) Smaller DNNs require less bandwidth to export a new model from the cloud to an autonomous car.

Paper
Code

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

google-research/lottery-ticket-hypothesis • • ICLR 2019

Based on these results, we articulate the "lottery ticket hypothesis:" dense, randomly-initialized, feed-forward networks contain subnetworks ("winning tickets") that - when trained in isolation - reach test accuracy comparable to the original network in a similar number of iterations.

Paper
Code

Pruning Filters for Efficient ConvNets

PaddlePaddle/PaddleOCR • • 31 Aug 2016

However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks.

Paper
Code

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

NervanaSystems/distiller • • 1 Oct 2015

To address this limitation, we introduce "deep compression", a three stage pipeline: pruning, trained quantization and Huffman coding, that work together to reduce the storage requirement of neural networks by 35x to 49x without affecting their accuracy.

Paper
Code

SNIP: Single-shot Network Pruning based on Connection Sensitivity

namhoonlee/snip-public • • ICLR 2019

To achieve this, we introduce a saliency criterion based on connection sensitivity that identifies structurally important connections in the network for the given task.

Paper
Code

Manifold Regularized Dynamic Network Pruning

mindspore-ai/models • • CVPR 2021

Then, the manifold relationship between instances and the pruned sub-networks will be aligned in the training procedure.

Paper
Code

PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning

arunmallya/packnet • • CVPR 2018

This paper presents a method for adding multiple tasks to a single deep neural network while avoiding catastrophic forgetting.

Paper
Code

Network Pruning via Transformable Architecture Search

D-X-Y/NAS-Projects • • NeurIPS 2019

The maximum probability for the size in each distribution serves as the width and depth of the pruned network, whose parameters are learned by knowledge transfer, e. g., knowledge distillation, from the original networks.

Paper
Code