
Comparing Rewinding and Fine-tuning in Neural Network Pruning

This is an anonymized public release of the code for Comparing Rewinding and Fine-tuning in Neural Network Pruning. This code was used to generate all of the data and plots in the paper.

The ResNet-20 implementation (using GPUs) can be found in gpu-src/official/resnet and is based on the implementation provided alongside TensorFlow. The VGG-16 implementation (using GPUs) can be found in gpu-src/official/vgg; it is a fork of the ResNet-20 implementation, modified into a VGG-16 with batch normalization and a single fully connected layer at the end. The ResNet-50 implementation (using TPUs) can be found in tpu-src/models/official/resnet and is based on the implementation provided alongside TensorFlow.

The rewinding and pruning code can be found in lottery/lottery/lottery.py and lottery/lottery/prune_functions.py.

Running the code

To set up the environment, build the Dockerfile provided in docker/_docker, which installs all dependencies. The Dockerfile expects the following environment variables:

  • ROOT_INSTALL_FILE=docker/root_setup.sh
  • PROJECT_NAME=lth
  • PROJECT_NAME_UPPERCASE=LTH
  • USER_INSTALL_FILE=docker/user_setup.sh
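A build might look like the following sketch. The image tag (`lth`) is illustrative, and it is an assumption that the Dockerfile consumes these variables as build arguments; adjust to match your checkout.

```shell
# Environment variables expected by the Dockerfile (values from the README).
export ROOT_INSTALL_FILE=docker/root_setup.sh
export PROJECT_NAME=lth
export PROJECT_NAME_UPPERCASE=LTH
export USER_INSTALL_FILE=docker/user_setup.sh

# Passing --build-arg with no value forwards the variable from the
# environment. The -f path and image tag are assumptions.
docker build \
  --build-arg ROOT_INSTALL_FILE \
  --build-arg PROJECT_NAME \
  --build-arg PROJECT_NAME_UPPERCASE \
  --build-arg USER_INSTALL_FILE \
  -f docker/_docker -t lth .
```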

Once the Docker environment is built and running, run docker/posthoc_setup.sh.

To train a network, cd to gpu-src or tpu-src, and train the model as specified in the original GPU or TPU codebase, with two added command line flags:

  • --lottery_results_dir ${RESULTS_DIR}, which specifies the directory to save rewinding checkpoints to, and
  • --lottery_checkpoint_iters 4701,11259,..., which is a comma-separated list of iterations to checkpoint so that they can be rewound to later.
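As a sketch, a GPU training run with these flags might look like the following. The entry point name and base-codebase flags are assumptions (consult the original TensorFlow codebase in gpu-src for the exact command); the paths are illustrative.

```shell
cd gpu-src
RESULTS_DIR=/path/to/rewind_checkpoints   # where rewinding checkpoints go

# Hypothetical entry point; use whatever the original codebase specifies.
python official/resnet/cifar10_main.py \
  --model_dir /path/to/model_dir \
  --lottery_results_dir ${RESULTS_DIR} \
  --lottery_checkpoint_iters 4701,11259
```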

Once a network is trained, it can be pruned and re-trained by running the network training code again, with a different model_dir (as defined by the original codebases) and with the following additional parameters:

  • --lottery_prune_at "${RESULTS_DIR}/checkpoint_iter_final", which says to prune the network using the weights from the end of training, and
  • --lottery_reset_to "${RESULTS_DIR}/checkpoint_iter_4701", which says to rewind the weights to their values at iteration 4701.
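Putting the two flags together, a prune-and-rewind retraining run could be sketched as follows (same caveats as above: the entry point and paths are assumptions, not the definitive command):

```shell
# Prune with the end-of-training weights, rewind to iteration 4701, retrain.
# Note the fresh --model_dir, distinct from the original training run's.
python official/resnet/cifar10_main.py \
  --model_dir /path/to/model_dir_pruned \
  --lottery_prune_at "${RESULTS_DIR}/checkpoint_iter_final" \
  --lottery_reset_to "${RESULTS_DIR}/checkpoint_iter_4701"
```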

To fine-tune rather than rewind, set --lottery_reset_to "${RESULTS_DIR}/checkpoint_iter_final" and add --lottery_force_learning_rate 0.001 to force the learning rate to the desired value for fine-tuning.
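For example, a fine-tuning run might look like this sketch (entry point and paths are again illustrative; it is an assumption that --lottery_prune_at is still supplied alongside the fine-tuning flags):

```shell
# Fine-tune: reset to the final weights rather than an earlier iteration,
# and pin the learning rate to the fine-tuning value.
python official/resnet/cifar10_main.py \
  --model_dir /path/to/model_dir_finetuned \
  --lottery_prune_at "${RESULTS_DIR}/checkpoint_iter_final" \
  --lottery_reset_to "${RESULTS_DIR}/checkpoint_iter_final" \
  --lottery_force_learning_rate 0.001
```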

Both --lottery_prune_at and --lottery_reset_to accept reinitialize as an argument to reinitialize the weights before pruning or re-training, which allows for generation of random pruning masks, or for running the reinitialization baselines.
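For instance, the reinitialization baseline could be invoked as in the sketch below, passing the literal string reinitialize in place of a checkpoint path (entry point and paths remain assumptions):

```shell
# Reinitialize the weights before re-training (reinitialization baseline);
# passing reinitialize to --lottery_prune_at instead would yield a random
# pruning mask.
python official/resnet/cifar10_main.py \
  --model_dir /path/to/model_dir_reinit \
  --lottery_prune_at "${RESULTS_DIR}/checkpoint_iter_final" \
  --lottery_reset_to reinitialize
```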

To view the final accuracy results, inspect the tfevent summary file produced by TensorFlow.
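One way to inspect the summaries is to point TensorBoard at the run's model directory (assuming TensorBoard is installed in the environment; the path is illustrative):

```shell
# Serves the tfevent summaries, including final accuracy, on a local port.
tensorboard --logdir /path/to/model_dir_pruned
```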
