Scholars' Mine
Missouri S&T
Research Repository
Curtis Laws Wilson Library
400 W. 14th Street
Rolla, MO 65409-0060
scholarsmine@mst.edu
| Title: | DTD-Diff: A change detection algorithm for DTDs |
| Author (s): | Leonardi, Erwin Hoai, Tran T. Bhowmick, Sourav S. Madria, Sanjay |
| Department/Lab Affiliations: | Computer Science Intelligent Systems Center |
| Keywords: | Algorithm Change detection DTD Performance XML |
| Issue Date: | 2007-05 |
| Publisher: | Elsevier |
| Citation: | Leonardi, Erwin., Hoai, Tran T., Bhowmick, Sourav S., and Madria, Sanjay Kumar. "DTD-Diff: A Change Detection Algorithm for DTDs.", Data and Knowledge Engineering, vol. 61, no. 2, 2007. |
| Abstract: | The DTD of a set of XML documents may change due to many reasons such as changes to the real-world events, changes to the user’s requirements, and mistakes in the initial design. In this paper, we present a novel algorithm called DTD-Diff to detect the changes to DTDs that defines the structure of a set of XML documents. Such change detection tool can be useful in several ways such as maintenance of XML documents, incremental maintenance of relational schema for storing XML data, and XML schema integration. We compare DTD-Diff with existing XML change detection approaches and show that converting DTD to XML schema (XSD) (which is in XML document format) and detecting the changes using existing XML change detection algorithms is not a feasible option. Our experimental results show that DTD-Diff is 5–325 times faster than X-Diff when it detects the changes to the XSD files. Compared to XyDiff, DTD-Diff is up to 38 times faster. We also study the result quality of detected deltas. |
| Type: | Article - Journal text |
| In Title: | Data & Knowledge Engineering |
| Copyright Notice: | Pre-print: author can archive with restrictions;Restriction: This does not include Cell Press; Post-print: author can archive; This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. FULL COPYRIGHT INFORMATION: |
| Publisher URL: | |
| Link to this page: |
| title | DTD-Diff: A change detection algorithm for DTDs |
| contributor.author | Leonardi, Erwin |
| contributor.author | Hoai, Tran T. |
| contributor.author | Bhowmick, Sourav S. |
| contributor.author | Madria, Sanjay |
| contributor.deptlab | Computer Science |
| contributor.deptlab | Intelligent Systems Center |
| subject | Algorithm |
| subject | Change detection |
| subject | DTD |
| subject | Performance |
| subject | XML |
| date.issued | 2007-05 |
| publisher | Elsevier |
| identifier.citation | Leonardi, Erwin., Hoai, Tran T., Bhowmick, Sourav S., and Madria, Sanjay Kumar. "DTD-Diff: A Change Detection Algorithm for DTDs.", Data and Knowledge Engineering, vol. 61, no. 2, 2007. |
| identifier.pub.URI | |
| description.abstract | The DTD of a set of XML documents may change due to many reasons such as changes to the real-world events, changes to the user’s requirements, and mistakes in the initial design. In this paper, we present a novel algorithm called DTD-Diff to detect the changes to DTDs that defines the structure of a set of XML documents. Such change detection tool can be useful in several ways such as maintenance of XML documents, incremental maintenance of relational schema for storing XML data, and XML schema integration. We compare DTD-Diff with existing XML change detection approaches and show that converting DTD to XML schema (XSD) (which is in XML document format) and detecting the changes using existing XML change detection algorithms is not a feasible option. Our experimental results show that DTD-Diff is 5–325 times faster than X-Diff when it detects the changes to the XSD files. Compared to XyDiff, DTD-Diff is up to 38 times faster. We also study the result quality of detected deltas. |
| type | Article - Journal |
| type.DCMIType | text |
| rights | Pre-print: author can archive with restrictions;Restriction: This does not include Cell Press; Post-print: author can archive; |
| rights | This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. |
| rights.URI | |
| relation.isPartOf | Data & Knowledge Engineering |
| date.available | 2008-06-11T20:03:02Z |
| identifier.persist.URI |