Parameter Efficient Dynamic Convolution via Tensor Decomposition (BMVC 2021)

This repository contains the PyTorch implementation of the paper Parameter Efficient Dynamic Convolution via Tensor Decomposition.

Highlight

Dynamic convolution has demonstrated substantial performance improvements for convolutional neural networks. Previous aggregation based dynamic convolution methods are challenged by the parameter/memory inefficiency, and the learning difficulty due to the scalar type attention for aggregation. To rectify these limitations, we propose a parameter efficient dynamic convolution operator (dubbed as PEDConv) that learns to discriminatively perturb the spatial, input and output filters of a shared base convolution weight, through a tensor decomposition based input-dependent reparameterization. Our method considerably reduces the number of parameters compared to prior arts and limit the computational cost to maintain inference efficiency. Meanwhile, the proposed PEDConv significantly boosts the accuracy when substituting standard convolutions on a plethora of prevalent deep learning tasks at almost same computation cost as the static baselines.

This repo was built upon the DeepLearningExamples. Installation and setup of NVIDIA docker follows that repo.

Prepare the dataset

We use ImageNet-1K. Download the images.
Extract the training data:

mkdir train && mv ILSVRC2012_img_train.tar train/ && cd train
tar -xvf ILSVRC2012_img_train.tar && rm -f ILSVRC2012_img_train.tar
find . -name "*.tar" | while read NAME ; do mkdir -p "${NAME%.tar}"; tar -xvf "${NAME}" -C "${NAME%.tar}"; rm -f "${NAME}"; done
cd ..

Extract the validation data and move the images to subfolders:

mkdir val && mv ILSVRC2012_img_val.tar val/ && cd val && tar -xvf ILSVRC2012_img_val.tar
wget -qO- https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh | bash

Training

Example of applying PEDConv to ResNet-18 model on ImageNet-1K using 8 GPUs:

python ./multiproc.py --nproc_per_node 8 ./main.py /data/imagenet \
    --data-backend pytorch \
    --raport-file raport.json \
    -j8 -p 100 \
    --lr 0.512 \ 
    --optimizer-batch-size 512 
    --warmup 8 \
    --arch resnet18 \
    --dynamic \
    -c fanin \
    --label-smoothing 0.1 \
    --lr-schedule cosine \
    --mom 0.875 \
    --wd 3.0517578125e-05 \
    -b 64 \
    --epochs 250 \
    --mixup 0.2

Checkpoints

We provide the trained PEDConv-ResNet-18 by the above command:

Model	Top-1	Checkpoints
PEDConv-ResNet-18	74.1%	download

Evaluation

Example of evaluating the trained PEDConv-ResNet-18 on ImageNet-1K:

CUDA_VISIBLE_DEVICES=0 python ./main.py /data/imagenet \
    --data-backend pytorch \
    --arch resnet18 \
    --evaluate \
    --epochs 1 \
    --pretrained-weights ./logs/model_best.pth.tar \
    -b 100

Citation

If you find this repository helpful, please consider citing:

@article{hou2021parameter,
  title={Parameter Efficient Dynamic Convolution via Tensor Decomposition},
  author={Hou, Zejiang and Kung, Sun-Yuan},
  year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
image_classification		image_classification
.DS_Store		.DS_Store
Dockerfile		Dockerfile
LOC_synset_mapping.json		LOC_synset_mapping.json
README.md		README.md
checkpoint2model.py		checkpoint2model.py
classify.py		classify.py
main.py		main.py
multiproc.py		multiproc.py
pedconv.png		pedconv.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parameter Efficient Dynamic Convolution via Tensor Decomposition (BMVC 2021)

Highlight

Prepare the dataset

Training

Checkpoints

Evaluation

Citation

About

Releases

Packages

Languages

zejiangh/PEDConv

Folders and files

Latest commit

History

Repository files navigation

Parameter Efficient Dynamic Convolution via Tensor Decomposition (BMVC 2021)

Highlight

Prepare the dataset

Training

Checkpoints

Evaluation

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages