Backdoor attack OrderBkd on LM

Existing attacks poison data samples by inserting tokens or paraphrasing sentences, which either alters the semantics of the original text or can be detected. Our main difference from previous work is that we use the repositioning of two words in a sentence as the trigger (https://arxiv.org/abs/2402.07689).
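To make the idea of a word-order trigger concrete, here is a minimal sketch in Python. It is not the code from this repository: the helpers `reposition_word` and `poison_sample` are hypothetical names, and the sketch simply moves one word to a new index, which changes its order relative to the other words while keeping the vocabulary of the sentence intact. How OrderBkd actually selects the word and its new position is described in the paper and is not reproduced here.

```python
def reposition_word(sentence: str, src: int, dst: int) -> str:
    """Move the word at index `src` so that it ends up at index `dst`.

    Hypothetical helper for illustration only; the sentence is split on
    whitespace, which is a simplifying assumption.
    """
    words = sentence.split()
    if not (0 <= src < len(words) and 0 <= dst < len(words)):
        raise IndexError("src and dst must be valid word indices")
    word = words.pop(src)
    words.insert(dst, word)
    return " ".join(words)


def poison_sample(sentence: str, src: int, dst: int, target_label: int):
    """Return a (text, label) pair carrying the word-order trigger.

    The label is replaced by the attacker's target label (assumed here
    to be an integer class id).
    """
    return reposition_word(sentence, src, dst), target_label


clean = "the film is really good"
poisoned, label = poison_sample(clean, src=3, dst=0, target_label=1)
print(poisoned)  # really the film is good
```

No token is inserted or removed, so the poisoned sentence contains exactly the same words as the clean one; only their order differs.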

Install packages

pdm venv create
pdm install

Available

Datasets: sst-2, ag, imbd
Models: bert, albert, roberta, lstm, distilbert

Run dataset and model poisoning

python3 attack_orderbkd.py --dataset sst-2 --model bert
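To run several dataset and model combinations in one go, a small driver script along these lines may help. It is only a sketch: `build_commands` and `run_all` are hypothetical helpers that are not part of this repository, and they assume nothing about `attack_orderbkd.py` beyond the `--dataset` and `--model` options shown above. By default the commands are printed rather than executed.

```python
import itertools
import shlex
import subprocess


def build_commands(datasets, models, script="attack_orderbkd.py"):
    """Build one command line per (dataset, model) pair."""
    return [
        ["python3", script, "--dataset", dataset, "--model", model]
        for dataset, model in itertools.product(datasets, models)
    ]


def run_all(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            # check=True stops the sweep at the first failing run
            subprocess.run(cmd, check=True)


commands = build_commands(["sst-2", "ag"], ["bert", "albert"])
run_all(commands)  # dry run: prints the four commands
```

Call `run_all(commands, dry_run=False)` from the repository root to actually launch the runs one after another.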

Our attack

Examples

Citation

If the code or method helps you in your research, please cite the following paper:

@misc{alekseevskaia2024orderbkdtextualbackdoorattack,
      title={OrderBkd: Textual backdoor attack through repositioning}, 
      author={Irina Alekseevskaia and Konstantin Arkhipenko},
      year={2024},
      eprint={2402.07689},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2402.07689}, 
}
