Accés ràpid intranet

Més informació...

a a a
Inici

Research report: DEIM-RR-13-001


Title

Improving the Utility of Differentially Private Data Releases via k-Anonymous Microaggregation

Author/s

Jordi Soria-Comas, Josep Domingo-Ferrer, David Sánchez, Sergio Martínez

Date

01-03-2013

Research team

CRISES - Criptografia i Secret Estadístic

Research report type

Recerca

Language

Anglès

Number of pages

12

Summary

A common view in some data anonymization literature is to oppose the ?old? k-anonymity model to the ?new? differential privacy model, which offers more robust privacy guarantees. However, the utility of the masked results provided by differential privacy is usually limited, due to the amount of noise that needs to be added to the output, or because utility can only be guaranteed for a restricted type of queries. This is in contrast with the general-purpose anonymized data resulting from k-anonymity mechanisms, which also focus on preserving data utility. In this paper, we show that a synergy between differential privacy and k-anonymity can be found when the objective is to release anonymized data: k- anonymity can help improving the utility of the differentially private release. Specifically, we show that the amount of noise required to fulfill e-differential privacy can be reduced if noise is added to a k-anonymous version of the data set, where k-anonymity is reached through a specially designed microaggregation of all attributes. As a result of noise reduction, the analytical utility of the anonymized output data set is increased. The theoretical benefits of our proposal are illustrated in a practical setting with an empirical evaluation on a reference data set.

Keywords

Privacy-preserving data publishing, Differential privacy, k-Anonymity, Microaggregation, Data utility