DiffXML: Change Detection in XML Data
Abstract
In this paper, we introduce a method to map XML files to the relational data model by parsing the XML files as DOM trees and storing value and path information for each node in relational tables. We present an algorithm called “DiffXML”, which uses SQL operations to detect changes between two versions of an XML file stored in a relational database. The stored value and path information is what the algorithm compares to detect differences. DiffXML finds newly inserted, deleted, and updated nodes, and also detects the move of a subtree from one place to another in the XML DOM tree. We analyze the performance of DiffXML against several current commercial and research-prototype XML change detection tools.
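The general idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' schema or algorithm: the table name `node`, its columns, the positional path format (for example `/catalog[1]/book[2]/title[1]`), and the helper names `store_version` and `diff_versions` are all assumptions made for illustration. The sketch uses Python's `xml.etree.ElementTree` rather than a full DOM parser, ignores attributes, and does not attempt subtree-move detection.

```python
import sqlite3
import xml.etree.ElementTree as ET
from collections import Counter


def store_version(conn, version, xml_text):
    """Flatten one XML document into (version, path, value) rows.

    Hypothetical helper: the path format and table layout are assumptions.
    """
    root = ET.fromstring(xml_text)

    def walk(elem, path):
        value = (elem.text or "").strip()
        conn.execute(
            "INSERT INTO node (version, path, value) VALUES (?, ?, ?)",
            (version, path, value),
        )
        seen = Counter()  # per-tag sibling position, so paths are unique
        for child in elem:
            seen[child.tag] += 1
            walk(child, f"{path}/{child.tag}[{seen[child.tag]}]")

    walk(root, f"/{root.tag}[1]")


def diff_versions(conn, old, new):
    """Compare two stored versions using plain SQL on path and value."""
    only_in = (
        "SELECT path FROM node WHERE version = ? AND path NOT IN "
        "(SELECT path FROM node WHERE version = ?) ORDER BY path"
    )
    inserted = [row[0] for row in conn.execute(only_in, (new, old))]
    deleted = [row[0] for row in conn.execute(only_in, (old, new))]
    updated = conn.execute(
        "SELECT o.path, o.value, n.value FROM node AS o "
        "JOIN node AS n ON o.path = n.path "
        "WHERE o.version = ? AND n.version = ? AND o.value <> n.value "
        "ORDER BY o.path",
        (old, new),
    ).fetchall()
    return inserted, deleted, updated


OLD = (
    "<catalog>"
    "<book><title>XML</title><price>10</price></book>"
    "<book><title>SQL</title><isbn>123</isbn></book>"
    "</catalog>"
)
NEW = (
    "<catalog>"
    "<book><title>XML</title><price>12</price></book>"
    "<book><title>SQL</title><year>2004</year></book>"
    "</catalog>"
)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE node (version TEXT, path TEXT, value TEXT)")
store_version(conn, "old", OLD)
store_version(conn, "new", NEW)

inserted, deleted, updated = diff_versions(conn, "old", "new")
print("inserted:", inserted)
print("deleted:", deleted)
print("updated:", updated)
```

In this sketch, a node counts as inserted or deleted when its path exists in only one version, and as updated when the same path carries a different value. Because paths are positional, inserting a sibling ahead of existing ones would shift their paths and be reported as a series of changes; handling that case, and recognizing moved subtrees, is where a real algorithm such as the one the paper describes would have to do more work.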
Recommended Citation
Y. Chen et al., "DiffXML: Change Detection in XML Data," Database Systems for Advanced Applications, Springer Verlag, Feb 2004.
Department(s)
Computer Science
Keywords and Phrases
DiffXML; XML; XML Change Detection
Document Type
Article - Conference proceedings
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2004 Springer Verlag, All rights reserved.
Publication Date
01 Feb 2004