VFLAIR

Basic Introduction

VFLAIR is a general, extensible and light-weight VFL framework that provides vanilar VFL training and evaluation process simulation alonging with several effective communication improvement methods as well as attack and defense evaluations considering data safety and privacy. Aside from NN serving as local models for VFL systems, tree-based VFL is also supported.

VFLAIR provides simulation of the vanilar VFL process containing forward local model prediction transmits, backward gradient transmits as well as local and global model updates.
FedBCD (paper) is provided for improving the effectiveness of VFL training process.
Four attack types are included in VFLAIR as examples for training-decoupled attack and training-time attack separately. In each attack type, multiple attack is available for use:
- Label Inference(LI)
  - Batch-level Label Inference (paper)/Direct Label Inference (paper)
  - Norm-based Scoring (NS) ([paper]([2102.08504] Label Leakage and Protection in Two-party Split Learning (arxiv.org)))/Direction-based Scoring (DS) ([paper]([2102.08504] Label Leakage and Protection in Two-party Split Learning (arxiv.org)))
  - Passive Model Completion (PMC) ([paper](Label Inference Attacks Against Vertical Federated Learning | USENIX))/Active Model Completion (AMC) ([paper](Label Inference Attacks Against Vertical Federated Learning | USENIX))
- Feature Reconstruction(FR)
  - Generative Regression Network (GRN)([paper]([2010.10152] Feature Inference Attack on Model Predictions in Vertical Federated Learning (arxiv.org)))
  - Training-based Back Mapping by model inversion (TBM)([paper]([2205.04007v1] ResSFL: A Resistance Transfer Framework for Defending Model Inversion Attack in Split Federated Learning (arxiv.org)))
- Targeted Backdoor(TB)
  - Label replacement Backdoor (paper)
- Non-Targeted Backdoor(NTB)
  - Noisy-Sample Backdoor (NSB)(paper)
  - Missing Feature (MF)([paper](Liu2021.pdf (neurips2021workshopfl.github.io)))
Several basic defense methods as well as emerging defense strategies are provided in VFLAIR and can be flexibly applied in VFL training and testing flow. Defense methods provided in VFLAIR is listed below. Detail information of these defenses are included in /src/configs/README.md.
- Differentail Privacy (Laplace-DP and Gaussian-DP) (paper)
- Gradient Sparsification (GS) (paper)
- Confusional AutoEncoder (CAE) & DiscreteSGD �enhanced CAEï¼ˆDCAE) (paper)
- Mutual Information regularization Defense(MID) (paper)
- GradPerturb(GPer) ([paper]([2203.02073] Differentially Private Label Protection in Split Learning (arxiv.org)))
- Distance Correlation(dCor) ([paper]([2203.01451] Label Leakage and Protection from Forward Embedding in Vertical Federated Learning (arxiv.org)))
Multiple datasets are provided along with VFLAIR.
Defense Capability Score ——a comprehensive metric for assessing defense ability is also introduced.
Tree-based VFL is also proved in the code with XGBoost and RandomForest supported. See ./src/configs/README_TREE.md for detailed description.

Code Structure

VFLAIR
├── src
│   ├── evaluates           
│   |   ├── attacks                    # Attack Simulator,Implementation of attacks
│   │   |   ├── ...                    # Multiple Attack Implementation
│   |   ├── defenses                   # Implementation of defenses
│   │   |   ├── Trained CAE momdels    # Trained encoder-decoder models for CAE and DCAE
│   │   |   ├── ...                    # Defense Implementation & Functions
│   |   ├── MainTaskVFL                # Pipeline for BasicVFL & VFL with LI/FR/NTB
│   |   ├── MainTaskVFLwithBackdoor    # Pipeline for VFL with TB     
│   |   ├── MainTaskTVFL               # Pipeline for Tree-based VFL
│   ├── load                           # Load Configurations into training pipeline
│   |   ├── LoadConfigs.py             # Load basic parameters   
│   |   ├── LoadDataset.py             # Load dataset and do data partition
│   |   ├── LoadModels.py              # Initialize models
│   |   ├── LoadParty.py               # Initialized parties with data and model
│   ├── configs                        # Customizable configurations    
│   |   ├── standard_configs           # Standard configurations for reference
│   │   │   ├── ...   
│   |   ├── README.md                  # Guidance for configuration files 
│   |   ├── README_TREE.md             # Guidance for testing tree-based VFL
│   ├── models                         # bottom models & global models     
│   |   ├── model_parameters           # Some pretrained models
│   │   ├── ...                        # Implemented bottome models & global models
│   ├── party                          # party simulators   
│   |   ├── ...
│   ├── dataset                        # Dataset preprocessing functions       
│   |   ├── ...
│   ├── utils                          # Basic functions and Customized functions for attack&defense
│   |   ├── ...
│   ├── exp_result                     # Store experiment results
│   |   ├── ...
│   ├── metrics                        # Benchmark and Defense Capability Score (DCS) definition
│   |   ├── ...
│   ├── main_separate.py               # Main VFL(launch this file for NN based VFL)  
│   ├── main_tree.py                   # Main Tree-based VFL(launch this file for tree-based VFL)  
├── usage_guidance                     # Detailed Usage  
│   ├── figures
│   |   ├── ...
│   ├── Add_New_Algorithm.md           # Guidance on how to add user defined attacks and defenses algorithms
│   ├── Dataset_Usage.md               # Guidance on how to achieve dataset for experiments
├── README.md

Quick Start

Zero. Environment Preparation

Use pip install -r requirements.txt to install all the necessary requirements.

One. Basic Benchmark Usage: A Quick Example

Customize your own configurations

Create a json file for your own evaluation configuration in /src/configs folder. Name it whatever you want, like my_configs.json.
/src/configs/basic_configs.json is a sample configuration file. You can copy it and modify the contents for your own purpose.
For detail information about configuration parameters, see /src/configs/README.md for detail information.

Use cd src and python main_separate.py --seed 0 --gpu 0 --configs <Your_Config_file_name> to start the evaluation process. A quick example can be launched by simplying using cd src and python main_separate.py (a vanilar VFL training and testing process is launched). For more detail descriptions, see Section Two.

Two. Advanced Usage: Implement Your Own Algorithm

How to add new attack/defense?
- usage_guidance/Add_New_Evaluation.md
Dataset Usage?
- usage_guidance/Dataset_Usage.md
How to write Configuration files and how to specify hyper-parameters for evaluation?
- src/config/README.md and src/config/README_TREE.md
What is Defense Capability Score (DCS)?
- Refer to src/metrics for details.

Contributing

We greatly appreciate any contribution to VFLAIR!

Please feel free to contact us if there's any problem with the code base!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VFLAIR

Basic Introduction

Code Structure

Quick Start

Zero. Environment Preparation

One. Basic Benchmark Usage: A Quick Example

Two. Advanced Usage: Implement Your Own Algorithm

Contributing

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 264 Commits
gen_figs		gen_figs
src		src
usage_guidance		usage_guidance
.gitignore		.gitignore
README.md		README.md
push.sh		push.sh
requirements.txt		requirements.txt

yooopan/FHE-VFLAIR

Folders and files

Latest commit

History

Repository files navigation

VFLAIR

Basic Introduction

Code Structure

Quick Start

Zero. Environment Preparation

One. Basic Benchmark Usage: A Quick Example

Two. Advanced Usage: Implement Your Own Algorithm

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages