Improved-Deep-Leakage-from-Gradients

The code for "Improved Deep Leakage from Gradients" (iDLG).

Abstract

It is widely believed that sharing gradients will not leak private training data in distributed learning systems such as Collaborative Learning and Federated Learning, etc. Recently, Zhu et al. [1] presented an approach which shows the possibility to obtain private training data from the publicly shared gradients. In their Deep Leakage from Gradient (DLG) method, they synthesize the dummy data and corresponding labels with the supervision of shared gradients. However, DLG has difficulty in convergence and discovering the ground-truth labels consistently. In this paper, we find that sharing gradients definitely leaks the ground-truth labels. We propose a simple but reliable approach to extract accurate data from the gradients. Particularly, our approach can certainly extract the ground-truth labels as opposed to DLG, hence we name it Improved DLG (iDLG). Our approach is valid for any differentiable model trained with cross-entropy loss over one-hot labels. We mathematically illustrate how our method can extract ground-truth labels from the gradients and empirically demonstrate the advantages over DLG.

Experiments

Dataset	DLG	iDLG
MNIST	89.9%	100.0%
CIFAR-100	83.3%	100.0%
LFW	79.1%	100.0%

Table 1: Accuracy of the extracted labels for DLG [1] and iDLG. Note that iDLG always extracts the correct label as opposed to DLG which extracts wrong labels frequently.

Cite

@article{zhao2020idlg,
  title={iDLG: Improved Deep Leakage from Gradients},
  author={Zhao, Bo and Mopuri, Konda Reddy and Bilen, Hakan},
  journal={arXiv preprint arXiv:2001.02610},
  year={2020}
}

Our further work

We further leveraged gradient matching to condense the large training set into small synthetic set for efficient deep learning - Dataset Condensation with Gradient Matching [PDF]. [Code]. ICLR 2021 Oral.
Our experiments show that we can condense large training sets into tiny synthetic ones and obtain good generalization ability when train arbitrary randomly initialized deep networks with them.
It's a promising solution to privacy protection and safe federated learning using the synthetic training set.

	MNIST	FashionMNIST	SVHN	CIFAR10
1 img/cls	91.7	70.5	31.2	28.3
10 img/cls	97.4	82.3	76.1	44.9
50 img/cls	98.8	83.6	82.3	53.9

Table 2: Testing accuracies (%) of ConvNets trained from scratch on 1, 10 or 50 synthetic image(s)/class.

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
Figure1.png		Figure1.png
Figure2.png		Figure2.png
README.md		README.md
iDLG.py		iDLG.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improved-Deep-Leakage-from-Gradients

Abstract

Experiments

Cite

Our further work

About

Releases

Packages

Languages

PatrickZH/Improved-Deep-Leakage-from-Gradients

Folders and files

Latest commit

History

Repository files navigation

Improved-Deep-Leakage-from-Gradients

Abstract

Experiments

Cite

Our further work

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages