Masters Theses

Keywords and Phrases

Semantic XML delta

Abstract

“The eXtensible Markup Language (XML) has become the industry standard for data and message passing due to its simplicity, platform independence, and ease of processing. XML technology is being incorporated into numerous applications and even into core components of personal computing systems and legacy systems. It is already a fact that the volume of data available in XML will eventually dominate over any other format. XML has become the data format of choice for fields as diverse as stock market trading and biology. The dramatic increase in XML data available on the Internet requires a monitoring system to keep track of vital changes taking place on the World Wide Web (WWW). In this paper we introduce a traditional, yet novel, approach of detecting changes in unordered XML documents stored in a relational database. Previous work in XML change detection were focused on detecting changes by constructing Document Object Model (DOM) trees from XML documents and comparing the two tree structures based on signatures. The tree comparison approach is not efficient in handling large XML documents due to the fact that the entire trees of both the documents have to be memory resident during the comparison process. We propose an efficient algorithm (XRel_Change_SQL) for detecting unordered changes between two XML documents using Structured Query Language (SQL) queries. XRel is used as the underlying model for storing XML documents in a relational database. We compare the efficiency and quality of our change detection algorithm with existing in-memory XML change detection tools like X-Diff, DeltaXML, and XANDY. We show that our approach is highly scalable and has comparable efficiency and better result quality than the above mentioned tools” --Abstract, page iv.

Advisor(s)

Madria, Sanjay Kumar

Committee Member(s)

McMillin, Bruce M.
Ramakrishnan, Sreeram

Department(s)

Computer Science

Degree Name

M.S. in Computer Science

Comments

Accompanying CD-ROM, available at Missouri S&T Library, contains XRel_Change_SQL source code, XRel implementation, and datasets.

Publisher

University of Missouri--Rolla

Publication Date

Fall 2005

Pagination

ix, 43 pages

Note about bibliography

Includes bibliographical references (pages 41-42).

Rights

© 2005 Sathyanarayanan Sundaram, All rights reserved.

Document Type

Thesis - Restricted Access

File Type

text

Language

English

Subject Headings

Database managementRelational databasesXML (Document markup language)SQL (Computer program language)

Thesis Number

T 8873

Print OCLC #

72523511

Share My Thesis If you are the author of this work and would like to grant permission to make it openly accessible to all, please click the button above.

Share

 
COinS