PyTorch implementation of the ICME 2021 paper "Model Compression via Collaborative Data-Free Knowledge Distillation for Edge Intelligence" (CDFKD).
Requirements:
- Python 3
- PyTorch
MNIST
Train some teacher networks:
```
python teacher-train.py --batch_size=128
```
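For orientation, below is a minimal sketch of what supervised teacher training typically looks like. The `TeacherNet` architecture, optimizer settings, and schedule are illustrative assumptions, not the exact code in `teacher-train.py`.

```python
# Hypothetical sketch of standard supervised teacher training (not the repo's exact code).
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

class TeacherNet(nn.Module):
    """Placeholder architecture; the repository defines its own teacher models."""
    def __init__(self, num_classes=10):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 32, 3, padding=1)
        self.conv2 = nn.Conv2d(32, 64, 3, padding=1)
        self.fc = nn.Linear(64 * 7 * 7, num_classes)

    def forward(self, x):
        x = F.max_pool2d(F.relu(self.conv1(x)), 2)
        x = F.max_pool2d(F.relu(self.conv2(x)), 2)
        return self.fc(x.flatten(1))

def train_teacher(epochs=10, batch_size=128, lr=0.01, device="cuda"):
    loader = DataLoader(
        datasets.MNIST("data", train=True, download=True,
                       transform=transforms.ToTensor()),
        batch_size=batch_size, shuffle=True)
    model = TeacherNet().to(device)
    opt = torch.optim.SGD(model.parameters(), lr=lr, momentum=0.9)
    for _ in range(epochs):
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            loss = F.cross_entropy(model(x), y)  # plain supervised objective
            opt.zero_grad()
            loss.backward()
            opt.step()
    return model
```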
Distill knowledge from the teachers above into a multi-header student network:
```
python CDFKD-train.py
```
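Judging from the hyperparameter names used later (`--lr_G`, `--lr_S`, `--latent_dim`, `--oh`, `--ie`, `--a`), training appears to alternate between a generator that synthesizes pseudo samples and a multi-header student that mimics the teacher ensemble on those samples. The sketch below illustrates one such step with a DAFL-style generator objective; the module names (`generator`, `teachers`, `student`) and the exact form of the loss terms are assumptions, not the repository's implementation.

```python
# Hypothetical sketch of one data-free distillation step (names and losses are assumptions).
import torch
import torch.nn.functional as F

def train_step(generator, teachers, student, opt_G, opt_S,
               batch_size=512, latent_dim=1000, oh=0.05, ie=5.0, a=0.01,
               device="cuda"):
    # Teachers are assumed frozen (eval mode, no optimizer attached to them).
    z = torch.randn(batch_size, latent_dim, device=device)
    fake = generator(z)                                    # synthesized pseudo samples

    # Teacher ensemble prediction: average logits over all teachers.
    t_logits = torch.stack([t(fake) for t in teachers]).mean(0)

    # --- Generator update (DAFL-style objective, inferred from the flag names) ---
    pseudo_labels = t_logits.argmax(1)
    loss_oh = F.cross_entropy(t_logits, pseudo_labels)     # one-hot loss (--oh)
    p = F.softmax(t_logits, dim=1).mean(0)
    loss_ie = (p * torch.log(p + 1e-6)).sum()              # information-entropy loss (--ie)
    loss_a = -t_logits.abs().mean()                        # activation term (--a); DAFL uses teacher
                                                           # feature maps, simplified here to logits
    loss_G = oh * loss_oh + ie * loss_ie + a * loss_a
    opt_G.zero_grad()
    loss_G.backward()
    opt_G.step()

    # --- Student update: every head mimics the teacher ensemble on detached samples ---
    head_logits = student(fake.detach())                   # assumed to return a list of per-head logits
    target = F.softmax(t_logits.detach(), dim=1)
    loss_S = sum(F.kl_div(F.log_softmax(h, dim=1), target, reduction="batchmean")
                 for h in head_logits)
    opt_S.zero_grad()
    loss_S.backward()
    opt_S.step()
    return loss_G.item(), loss_S.item()
```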
(Optional) Use additional data to train an attention vector for aggregating predictions from the student's heads:
```
python attention-train.py
```
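A plausible sketch of this step, assuming the student returns one prediction per head and the attention vector is a softmax-normalized weight per head trained on the small additional dataset; `train_attention` and its arguments are hypothetical names, not the script's actual interface.

```python
# Hypothetical sketch of learning an attention vector over the student's heads.
import torch
import torch.nn.functional as F

def train_attention(student, loader, num_heads, epochs=30, lr=0.01, device="cuda"):
    # One learnable weight per head; softmax keeps the combination a convex mixture.
    attn = torch.zeros(num_heads, device=device, requires_grad=True)
    opt = torch.optim.Adam([attn], lr=lr)
    student.eval()
    for _ in range(epochs):
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            with torch.no_grad():
                heads = torch.stack(student(x))       # (num_heads, batch, classes), assumption
            w = F.softmax(attn, dim=0).view(-1, 1, 1)
            fused = (w * heads).sum(0)                # attention-weighted ensemble logits
            loss = F.cross_entropy(fused, y)
            opt.zero_grad()
            loss.backward()
            opt.step()
    return F.softmax(attn, dim=0).detach()
```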
CIFAR10
```
python teacher-train.py --dataset=cifar10 --batch_size=256
python CDFKD-train.py --dataset=cifar10 --n_epochs=800 --batch_size=1024 --lr_G=0.001 --lr_S=0.1 --latent_dim=1000 --channels=3 --oh=0.05 --ie=5 --a=0.01
python attention-train.py --dataset=cifar10 --data_num=500 --n_epochs=30
```
Note: the large batch size is important for generating images that are uniformly distributed across classes; decreasing it may cause an accuracy drop.
Citation:
```
@inproceedings{CDFKD,
  title={Model Compression via Collaborative Data-Free Knowledge Distillation for Edge Intelligence},
  author={Hao, Zhiwei and Luo, Yong and Wang, Zhi and Hu, Han and An, Jianping},
  booktitle={2021 IEEE International Conference on Multimedia and Expo (ICME)},
  pages={1--6},
  year={2021},
  organization={IEEE}
}
```