Masking and re-identification methods for public-use microdata: Overview and research problems

WE Winkler - International Workshop on Privacy in Statistical …, 2004 - Springer
International Workshop on Privacy in Statistical Databases, 2004Springer
This paper provides an overview of methods of masking microdata so that the data can be
placed in public-use files. It divides the methods according to whether they have been
demonstrated to provide analytic properties or not. For those methods that have been shown
to provide one or two sets of analytic properties in the masked data, we indicate where the
data may have limitations for most analyses and how re-identification might or can be
performed. We cover several methods for producing synthetic data and possible …
Abstract
This paper provides an overview of methods of masking microdata so that the data can be placed in public-use files. It divides the methods according to whether they have been demonstrated to provide analytic properties or not. For those methods that have been shown to provide one or two sets of analytic properties in the masked data, we indicate where the data may have limitations for most analyses and how re-identification might or can be performed. We cover several methods for producing synthetic data and possible computational extensions for better automating the creation of the underlying statistical models. We finish by providing background on analysis-specific and general information-loss metrics to stimulate research.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果