Privacy-preserving data publishing: an overview

Privacy preservation has become a major issue in many data analysis applications. When a data set is released to other parties for data analysis, privacy-preserving techniques are often required to reduce the possibility of identifying sensitive information about individuals. For example, in medical...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Wong, Raymond Chi-Wing (VerfasserIn), Fu, Ada Wai-Chee (VerfasserIn)
Format: Elektronisch E-Book
Sprache:English
Veröffentlicht: [San Rafael, California] Morgan & Claypool Publishers [2010]
Schriftenreihe:Synthesis lectures on data management Lecture #3
Schlagworte:
Online-Zugang:Volltext
Zusammenfassung:Privacy preservation has become a major issue in many data analysis applications. When a data set is released to other parties for data analysis, privacy-preserving techniques are often required to reduce the possibility of identifying sensitive information about individuals. For example, in medical data, sensitive information can be the fact that a particular patient suffers from HIV. In spatial data, sensitive information can be a specific location of an individual. In web surfing data, the information that a user browses certain websites may be considered sensitive. Consider a dataset containing some sensitive information is to be released to the public. In order to protect sensitive information, the simplest solution is not to disclose the information. However, this would be an overkill since it will hinder the process of data analysis over the data from which we can find interesting patterns. Moreover, in some applications, the data must be disclosed under the government regulations. Alternatively, the data owner can first modify the data such that the modified data can guarantee privacy and, at the same time, the modified data retains sufficient utility and can be released to other parties safely. This process is usually called as privacy-preserving data publishing. In this monograph, we study how the data owner can modify the data and how the modified data can preserve privacy and protect sensitive information. Table of Contents: Introduction / Fundamental Concepts / One-Time Data Publishing / Multiple-Time Data Publishing / Graph Data / Other Data Types / Future Research Directions
Intro -- Introduction -- Data Publishing -- Significance -- Organization -- Fundamental Concepts -- Anonymization -- Information Loss Metric -- Privacy Models -- Other Privacy Models -- Conclusion -- One-Time Data Publishing -- Knowledge about Quasi-identifiers -- Knowledge about the Distribution of Sensitive Values -- Knowledge about the Linkage of Individuals to Sensitive Values -- Information That Some Individuals Do Not Have Some Sensitive Values -- Information That Some Individuals Have Some Sensitive Values -- Knowledge about the Relationship among Individuals -- Knowledge about Anonymization -- Knowledge Mined from the Microdata -- Knowledge Mined from the Published Data -- How To Use Published Data -- Aggregate Queries -- Information Loss -- Evaluation with Data Mining and Data Analysis Tools -- Querying over an Uncertain Database -- Conclusion -- Multiple-Time Data Publishing -- Individual-Based Correlation -- Data publishing from Static Microdata -- Data Publishing from Dynamic Microdata -- Sensitive Value-Based Correlation -- Protection for Permanent Sensitive Values -- Protection for Transient Sensitive Values -- Conclusion -- Graph Data -- Data Model -- Adversary Attacks -- Assumption of Adversary Knowledge -- Active Attacks -- Utility of the Published Data -- k-Anonymity -- Vertex Degree -- 1-Neighborhood -- Vertex Partitioning -- k-Automorphism -- Multiple Releases of Data Graphs -- Other Approaches -- Future Directions -- Other Data Types -- Spatial Data -- With Anonymizer -- Without Anonymizer -- Transactional Data -- Conclusion -- Future Research Directions -- One-Time Data Publishing -- Multiple-Time Data Publishing -- Publishing Graph Data -- Publishing Data of Other Forms -- Definition of Entropy l-Diversity and Recursive l-Diversity -- Bibliography -- Authors' Biographies
Beschreibung:1 Online-Ressource (138 Seiten)
ISBN:9781608452170
DOI:10.2200/S00237ED1V01Y201003DTM002