Formal Anonymity Models for Efficient Privacy-preserving Joins

Abstract

Organizations, such as federally-funded medical research centers, must share de-identified data on their consumers to publicly accessible repositories to adhere to regulatory requirements. Many repositories are managed by third-parties and it is often unknown if records received from disparate organizations correspond to the same individual. Failure to resolve this issue can lead to biased (e.g., double counting of identical records) and underpowered (e.g., unlinked records of different data types) investigations. in this paper, we present a secure multiparty computation protocol that enables record joins via consumers' encrypted identifiers. Our solution is more practical than prior secure join models in that data holders need to interact with the third party one time per data submission. Though technically feasible, the speed of the basic protocol scales quadratically with the number of records. Thus, we introduce an extended version of our protocol in which data holders append k-anonymous features of their consumers to their encrypted submissions. These features facilitate a more efficient join computation, while providing a formal guarantee that each record is linkable to no less than k individuals in the union of all organizations' consumers. Beyond a theoretical treatment of the problem, we provide an extensive experimental investigation with data derived from the US Census to illustrate the significant gains in efficiency such an approach can achieve. © 2009 Elsevier B.V. All rights reserved.

Recommended Citation

M. Kantarcioglu et al., "Formal Anonymity Models for Efficient Privacy-preserving Joins," Data and Knowledge Engineering, vol. 68, no. 11, pp. 1206 - 1223, Elsevier, Nov 2009.

The definitive version is available at https://doi.org/10.1016/j.datak.2009.06.011

Department(s)

Computer Science

Comments

National Science Foundation, Grant None

Keywords and Phrases

Anonymity; Data integration; Privacy; Security

International Standard Serial Number (ISSN)

0169-023X

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

01 Nov 2009

Computer Science Faculty Research & Creative Works

Formal Anonymity Models for Efficient Privacy-preserving Joins

Abstract

Recommended Citation

Department(s)

Comments

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations

Computer Science Faculty Research & Creative Works

Formal Anonymity Models for Efficient Privacy-preserving Joins

Author

Abstract

Recommended Citation

Department(s)

Comments

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Share

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations