PP-LDG: A Medical Privacy-Preserving Labeled Data Generation Framework

Haoxiang Yuan, Xiaochen Yuan, Xiuli Bi, Weisheng Li, Guoyin Wang, Bin Xiao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The rapid development of deep learning has led to an increasing demand for data. However, such data are scarce in many fields and often contain private and sensitive information, such as medical images. The fact that data owners are unwilling to share these high-privacy data further exacerbates the scarcity of data. A prevalent research direction is to combine differential privacy and image generation, by which we can obtain a large amount of synthetic data with a similar distribution to the original dataset, and that synthetic dataset does not compromise the privacy of the original dataset. However, these methods cannot provide labels for the generated data, limiting the subsequent uses of the data. To solve the problem, we propose a generic privacy-preserving labeled data generation framework named PP-LDG. It is the first data publishing framework to generate labeled data while not disclosing the privacy of original data. We theoretically demonstrate that our proposed framework can provide strict privacy guarantees with differential privacy and demonstrate the effectiveness of the synthetic dataset obtained from the framework under practical privacy budgets through extensive experiments.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Medical Artificial Intelligence, MedAI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages651-658
Number of pages8
ISBN (Electronic)9798350377613
DOIs
Publication statusPublished - 2024
Event2nd IEEE International Conference on Medical Artificial Intelligence, MedAI 2024 - Chongqing, China
Duration: 15 Nov 202417 Nov 2024

Publication series

NameProceedings - 2024 IEEE International Conference on Medical Artificial Intelligence, MedAI 2024

Conference

Conference2nd IEEE International Conference on Medical Artificial Intelligence, MedAI 2024
Country/TerritoryChina
CityChongqing
Period15/11/2417/11/24

Keywords

  • Dataset Synthesis
  • Differential Privacy
  • Medical Data
  • Security and Privacy

Fingerprint

Dive into the research topics of 'PP-LDG: A Medical Privacy-Preserving Labeled Data Generation Framework'. Together they form a unique fingerprint.

Cite this