Research report: DEIM-RR-13-001
Title
Improving the Utility of Differentially Private Data Releases via k-Anonymous Microaggregation
Author/s
Jordi Soria-Comas, Josep Domingo-Ferrer, David Sánchez, Sergio Martínez
Date
01-03-2013
Research team
CRISES - Criptografia i Secret Estadístic
Research report type
Recerca
Language
Anglès
Number of pages
12
Summary
A common view in some data anonymization literature is to
oppose the ?old? k-anonymity model to the ?new? differential
privacy model, which offers more robust privacy guarantees.
However, the utility of the masked results provided by
differential privacy is usually limited, due to the amount of
noise that needs to be added to the output, or because utility
can only be guaranteed for a restricted type of queries. This
is in contrast with the general-purpose anonymized data resulting
from k-anonymity mechanisms, which also focus on
preserving data utility. In this paper, we show that a synergy
between differential privacy and k-anonymity can be
found when the objective is to release anonymized data: k-
anonymity can help improving the utility of the differentially
private release. Specifically, we show that the amount of
noise required to fulfill e-differential privacy can be reduced
if noise is added to a k-anonymous version of the data set,
where k-anonymity is reached through a specially designed
microaggregation of all attributes. As a result of noise reduction,
the analytical utility of the anonymized output data
set is increased. The theoretical benefits of our proposal are
illustrated in a practical setting with an empirical evaluation
on a reference data set.
Keywords
Privacy-preserving data publishing, Differential privacy, k-Anonymity, Microaggregation, Data utility