DiffXML: Change Detection in XML Data

Abstract

In this paper, we introduce a method to map XML files to relational data model by parsing XML files as DOM trees and store value and path information for each node in relational tables. We present an algorithm called “DiffXML” which uses SQL operations to detect changes between two versions of XML file stored in a relational database. The value and path information for XML files are also used to detect differences. DiffXML finds new inserted, deleted and updated nodes, and also finds the move of a subtree from one place to the other in the XML DOM tree. We analyze the performance of DiffXML with some current commercial and research prototype XML change detection tools.

Department(s)

Computer Science

Keywords and Phrases

DiffXML; XML; XML Change Detection

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2004 Springer Verlag, All rights reserved.

Publication Date

01 Feb 2004

This document is currently not available here.

Share

 
COinS