Missouri S&T Scholar's Mine Research RepositoryMissouri S&T Research
print 
Title: DTD-Diff: A change detection algorithm for DTDs
Author (s): Leonardi, Erwin
Hoai, Tran T.
Bhowmick, Sourav S.
Madria, Sanjay
Department/Lab Affiliations: Computer Science
Intelligent Systems Center
Keywords: Algorithm
Change detection
DTD
Performance
XML
Issue Date: 2007-05
Publisher: Elsevier
Citation: Leonardi, Erwin., Hoai, Tran T., Bhowmick, Sourav S., and Madria, Sanjay Kumar. "DTD-Diff: A Change Detection Algorithm for DTDs.", Data and Knowledge Engineering, vol. 61, no. 2, 2007.
Abstract: The DTD of a set of XML documents may change due to many reasons such as changes to the real-world events, changes to the user’s requirements, and mistakes in the initial design. In this paper, we present a novel algorithm called DTD-Diff to detect the changes to DTDs that defines the structure of a set of XML documents. Such change detection tool can be useful in several ways such as maintenance of XML documents, incremental maintenance of relational schema for storing XML data, and XML schema integration. We compare DTD-Diff with existing XML change detection approaches and show that converting DTD to XML schema (XSD) (which is in XML document format) and detecting the changes using existing XML change detection algorithms is not a feasible option. Our experimental results show that DTD-Diff is 5–325 times faster than X-Diff when it detects the changes to the XSD files. Compared to XyDiff, DTD-Diff is up to 38 times faster. We also study the result quality of detected deltas.
Type: Article - Journal
text
In Title: Data & Knowledge Engineering
Copyright Notice: Pre-print: author can archive with restrictions;Restriction: This does not include Cell Press; Post-print: author can archive;
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
FULL COPYRIGHT INFORMATION:
http://www.elsevier.com/wps/find/authorsview.authors/authorsrights
Publisher URL:
http://dx.doi.org/10.1016/j.datak.2006.06.003
Link to this page:
http://scholarsmine.mst.edu/post_prints/DTD-DiffAChangeDetectionAlgorithmforDTDs_09007dcc8050ebaf.html



titleDTD-Diff: A change detection algorithm for DTDs
contributor.authorLeonardi, Erwin
contributor.authorHoai, Tran T.
contributor.authorBhowmick, Sourav S.
contributor.authorMadria, Sanjay
contributor.deptlabComputer Science
contributor.deptlabIntelligent Systems Center
subjectAlgorithm
subjectChange detection
subjectDTD
subjectPerformance
subjectXML
date.issued2007-05
publisherElsevier
identifier.citationLeonardi, Erwin., Hoai, Tran T., Bhowmick, Sourav S., and Madria, Sanjay Kumar. "DTD-Diff: A Change Detection Algorithm for DTDs.", Data and Knowledge Engineering, vol. 61, no. 2, 2007.
identifier.pub.URI
http://dx.doi.org/10.1016/j.datak.2006.06.003
description.abstractThe DTD of a set of XML documents may change due to many reasons such as changes to the real-world events, changes to the user’s requirements, and mistakes in the initial design. In this paper, we present a novel algorithm called DTD-Diff to detect the changes to DTDs that defines the structure of a set of XML documents. Such change detection tool can be useful in several ways such as maintenance of XML documents, incremental maintenance of relational schema for storing XML data, and XML schema integration. We compare DTD-Diff with existing XML change detection approaches and show that converting DTD to XML schema (XSD) (which is in XML document format) and detecting the changes using existing XML change detection algorithms is not a feasible option. Our experimental results show that DTD-Diff is 5–325 times faster than X-Diff when it detects the changes to the XSD files. Compared to XyDiff, DTD-Diff is up to 38 times faster. We also study the result quality of detected deltas.
typeArticle - Journal
type.DCMITypetext
rightsPre-print: author can archive with restrictions;Restriction: This does not include Cell Press; Post-print: author can archive;
rightsThis material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
rights.URI
http://www.elsevier.com/wps/find/authorsview.authors/authorsrights
relation.isPartOfData & Knowledge Engineering
date.available2008-06-11T20:03:02Z
identifier.persist.URI
http://scholarsmine.mst.edu/post_prints/DTD-DiffAChangeDetectionAlgorithmforDTDs_09007dcc8050ebaf.html