N-gram based Secure Similar Document Detection
Abstract
Secure similar document detection (SSDD) plays an important role in many applications, such as justifying the need-to-know basis and facilitating communication between government agencies. The SSDD problem considers situations where Alice, who holds a query document, wants to find similar information in Bob's document collection. During this process, the content of the query document is not disclosed to Bob, and Bob's document collection is not disclosed to Alice. Existing SSDD protocols are developed under the vector space model, which has the advantage of identifying globally similar information. To effectively and securely detect similar documents with overlapping text fragments, this paper proposes a novel n-gram based SSDD protocol. © 2011 IFIP International Federation for Information Processing.
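The abstract does not describe the protocol itself, so the following is only a minimal plaintext sketch of the underlying idea of comparing documents by overlapping n-grams. It is not the authors' secure protocol: nothing here hides either party's data, and the choice of word-level trigrams and the Jaccard coefficient, along with the function names `word_ngrams`, `jaccard`, and `ngram_similarity`, are assumptions made for illustration.

```python
def word_ngrams(text, n=3):
    """Return the set of word-level n-grams in text (assumed tokenization: lowercase, whitespace split)."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(a, b):
    """Jaccard coefficient |a & b| / |a | b|; defined here as 0.0 when both sets are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def ngram_similarity(doc_a, doc_b, n=3):
    """Similarity of two documents based on shared n-grams (plaintext, no privacy protection)."""
    return jaccard(word_ngrams(doc_a, n), word_ngrams(doc_b, n))


if __name__ == "__main__":
    query = "the quick brown fox jumps"
    candidate = "the quick brown fox sleeps"
    print(ngram_similarity(query, candidate))
```

In an SSDD setting, a comparison of this kind would have to be computed without Alice or Bob revealing their n-gram sets to each other; how the paper achieves that is not stated in this record.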
Recommended Citation
W. Jiang and B. K. Samanthula, "N-gram based Secure Similar Document Detection," Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6818 LNCS, pp. 239-246, Springer, Jul 2011.
The definitive version is available at https://doi.org/10.1007/978-3-642-22348-8_19
Department(s)
Computer Science
Keywords and Phrases
n-gram; privacy; security
International Standard Book Number (ISBN)
978-3-642-22347-1
International Standard Serial Number (ISSN)
1611-3349; 0302-9743
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2024 Springer, All rights reserved.
Publication Date
18 Jul 2011