Differential Correct Attribution Probability for Synthetic Data: An Exploration

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Synthetic data generation has been proposed as a flexible alternative to more traditional statistical disclosure control (SDC) methods for limiting disclosure risk. It is functionally distinct from standard SDC methods in that it breaks the link between the data subjects and the data, such that reidentification is no longer meaningful. Therefore, orthodox measures of disclosure risk assessment, which are based on reidentification, are not applicable. Research into developing disclosure risk assessment measures specifically for synthetic data has been relatively limited. In this paper, we develop a method called Differential Correct Attribution Probability (DCAP). Using DCAP, we explore the effect of multiple imputation on the disclosure risk of synthetic data.

Bibliographical metadata

Original language: English
Title of host publication: Privacy in Statistical Databases 2018
Publisher: Springer Nature
Publication status: Accepted/In press - 9 Jul 2018