This is the supporting code for the paper Generating tabular datasets under differential privacy.
Please check out a quick overview on my blog.
TableDiffusion is a project focused on providing differentially-private generative models for sensitive tabular data. The goal is to enable the synthesis of data that maintains the statistical properties of the original dataset while ensuring the privacy of individuals' information.
The most notable model from this work is TableDiffusion
, the first differentially-private diffusion model for tabular data. See tablediffusion/models/table_diffusion.py
⚠️ Disclaimer: This codebase is intended for research purposes only and is not ready for production use. The current implementation may not preserve privacy guarantees due to seed and sampler settings that are not suitable for a production environment.
Truda, Gianluca. "Generating tabular datasets under differential privacy." arXiv preprint arXiv:2308.14784 (2023).
@article{truda2023generating,
title={Generating tabular datasets under differential privacy},
author={Truda, Gianluca},
journal={arXiv preprint arXiv:2308.14784},
year={2023}
}