Scholars' Mine
Missouri S&T
Research Repository
Curtis Laws Wilson Library
400 W. 14th Street
Rolla, MO 65409-0060
scholarsmine@mst.edu
| Title: | An XML schema integration and query mechanism system |
| Author(s): | Madria, Sanjay; Passi, Kalpdrum; Bhowmick, Sourav |
| Department/Lab Affiliations: | Computer Science; Intelligent Systems Center |
| Keywords: | Integration; Query; Schema; XML |
| Issue Date: | 2008-05 |
| Publisher: | Elsevier |
| Citation: | Madria, Sanjay Kumar, Passi, Kalpdrum, and Bhowmick, Sourav. "An XML Schema Integration and Query Mechanism System." Data & Knowledge Engineering, vol. 65, no. 2, 2008. |
| Abstract: | The availability of large amounts of heterogeneous, distributed web data necessitates the integration of XML data from multiple XML sources. For example, many e-commerce companies offer similar products but use different XML Schemas, possibly with different ontologies. When two such companies merge, or cooperate to serve customers, they need an integrated schema and query mechanism so that their applications can interoperate. Applications such as comparison shopping need the illusion of a centralized, homogeneous information system. In this paper, we propose an XML Schema integration and querying methodology. We define an object-oriented data model called XSDM (XML Schema Data Model) and present a graphical representation of XML Schema for the purpose of schema integration. We use a three-layered architecture for XML Schema integration, consisting of pre-integration, comparison, and integration layers, which can conceptually be regarded as three phases of the integration process. During pre-integration, the schemas expressed in XML Schema notation are read and converted into XSDM notation. During the comparison phase, correspondences as well as conflicts between elements are identified. During the integration phase, conflict resolution, restructuring, and merging of the initial schemas take place to obtain the global schema. We define integration policies for integrating element definitions as well as their datatypes and attributes. The integrated global schema forms the basis for querying a set of local XML documents. We discuss various strategies for rewriting a global query over the global schema into sub-queries over the local schemas. The sub-queries over the local XML documents are validated by their respective local schemas. This requires the identification and use of mapping rules and relationships between the local schemas. |
| Type: | Article - Journal; text |
| In Title: | Data & Knowledge Engineering |
| Copyright Notice: | Pre-print: author can archive with restrictions; Restriction: This does not include Cell Press; Post-print: author can archive. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. |
| title | An XML schema integration and query mechanism system |
| contributor.author | Madria, Sanjay |
| contributor.author | Passi, Kalpdrum |
| contributor.author | Bhowmick, Sourav |
| contributor.deptlab | Computer Science |
| contributor.deptlab | Intelligent Systems Center |
| subject | Integration |
| subject | Query |
| subject | Schema |
| subject | XML |
| date.issued | 2008-05 |
| publisher | Elsevier |
| identifier.citation | Madria, Sanjay Kumar, Passi, Kalpdrum, and Bhowmick, Sourav. "An XML Schema Integration and Query Mechanism System." Data & Knowledge Engineering, vol. 65, no. 2, 2008. |
| type | Article - Journal |
| type.DCMIType | text |
| rights | Pre-print: author can archive with restrictions; Restriction: This does not include Cell Press; Post-print: author can archive. |
| rights | This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. |
| relation.isPartOf | Data & Knowledge Engineering |
| date.accessioned | 2008-09-19T20:37:40Z |
| date.available | 2008-06-09T22:05:01Z |