Secure Multiset Intersection Cardinality and its Application to Jaccard Coefficient

Bharath K. Samanthula
Wei Jiang, Missouri University of Science and TechnologyFollow

Abstract

The Jaccard Coefficient, as an information similarity measure, has wide variety of applications, such as cluster analysis and image segmentation. Due to the concerns of personal privacy, the Jaccard Coefficient cannot be computed directly between two independently owned datasets. The problem, secure computation of the Jaccard Coefficient for multisets (SJCM), considers the situation where two parties want to securely compute the random shares of the Jaccard Coefficient between their multisets. During the process, the content of each party's multiset is not disclosed to the other party and also the value of Jaccard Coefficient should be hidden from both parties. Secure computation of multiset intersection cardinality is an important sub-problem of SJCM. Existing methods when applied to solve such a problem can lead to either insecure or inefficient solutions. Our work addresses this gap. We first present a basic SJCM protocol constructed using the existing secure dot product method as a sub-routine. Then, as a major contribution, we propose an approximated version of our basic protocol to improve efficiency without compromising accuracy much. We provide various experimental results to show that the proposed protocols are significantly more efficient than the existing techniques when the domain size is small using both simulated and real datasets.

Recommended Citation

B. K. Samanthula and W. Jiang, "Secure Multiset Intersection Cardinality and its Application to Jaccard Coefficient," IEEE Transactions on Dependable and Secure Computing, vol. 13, no. 5, pp. 591 - 604, article no. 7065287, Institute of Electrical and Electronics Engineers, Sep 2016.

The definitive version is available at https://doi.org/10.1109/TDSC.2015.2415482

Department(s)

Computer Science

Comments

National Science Foundation, Grant CNS-1011984

Keywords and Phrases

Garbled circuits; homomorphic encryption; Jaccard coefficient; multiset intersection; Security

International Standard Serial Number (ISSN)

1941-0018; 1545-5971

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

01 Sep 2016

Computer Science Faculty Research & Creative Works

Secure Multiset Intersection Cardinality and its Application to Jaccard Coefficient

Abstract

Recommended Citation

Department(s)

Comments

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Included in

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations

Computer Science Faculty Research & Creative Works

Secure Multiset Intersection Cardinality and its Application to Jaccard Coefficient

Author

Abstract

Recommended Citation

Department(s)

Comments

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Included in

Share

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations