Anatomy of the Coupling Query in a Web Warehouse

Abstract

To populate a data warehouse specifically designed for Web data, i.e. web warehouse, it is imperative to harness relevant documents from the Web. In this paper, we describe a query mechanism called coupling query to glean relevant Web data in the context of our web warehousing system called Warehouse of Web Data (WHOWEDA). Coupling query may be used for querying both HTML and XML documents. Some of the important features of our query mechanism are ability to query metadata, content, internal and external (hyperlink) structure of Web documents based on partial knowledge, ability to express constraints on tag attributes and tagless segment of data, ability to express conjunctive as well as disjunctive query conditions compactly, ability to control execution of a web query and preservation of the topological structure of hyperlinked documents in the query results. We also discuss how to formulate query graphically and in textual form using coupling graph and coupling text, respectively. © 2002 Published by Elsevier Science B.V.

Recommended Citation

S. S. Bhowmick et al., "Anatomy of the Coupling Query in a Web Warehouse," Information and Software Technology, vol. 44, no. 9, pp. 513 - 539, Elsevier, Jun 2002.

The definitive version is available at https://doi.org/10.1016/S0950-5849(02)00051-4

Department(s)

Computer Science

Keywords and Phrases

Coupling graph; Coupling query; Coupling text; Web tuples; Web warehouse

International Standard Serial Number (ISSN)

0950-5849

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

Publication Date

15 Jun 2002

Computer Science Faculty Research & Creative Works

Anatomy of the Coupling Query in a Web Warehouse

Abstract

Recommended Citation

Department(s)

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Included in

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations

Computer Science Faculty Research & Creative Works

Anatomy of the Coupling Query in a Web Warehouse

Author

Abstract

Recommended Citation

Department(s)

Keywords and Phrases

International Standard Serial Number (ISSN)

Document Type

Document Version

File Type

Language(s)

Rights

Publication Date

Included in

Share

Search

Browse

Author Corner

Related Content

Useful Links

Article Locations