Abstract
We believe that managing Web data effectively requires a data warehouse of Web data, i.e. a Web warehouse. In this paper, we focus on how to represent and store relevant hyperlinked Web documents effectively in a Web warehouse called WHOWEDA (WareHouse of WEb DAta) for further querying and manipulation. We present a simple and general model for representing the metadata, structure and content of Web documents and hyperlinks in WHOWEDA. We discuss node and link objects, which represent Web documents and hyperlinks, respectively. These are first-class objects in our data model, called WHOM (WareHouse Object Model), which is designed to represent and manipulate Web data in the warehouse. An important feature of our model is that it represents metadata, content and structure as trees: node and link metadata trees, and node and link data trees.
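As a rough illustration of the idea summarized above, the following Python sketch models a node object and a link object, each carrying a metadata tree and a data tree. It is not the WHOM definition from the paper: the class names (`TreeNode`, `NodeObject`, `LinkObject`), the vertex labels (`url`, `title`, `source_url`, `target_url`, `label`) and the example URLs are all assumptions chosen for illustration only.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TreeNode:
    """One vertex of a metadata or data tree (hypothetical shape)."""
    label: str
    value: Optional[str] = None
    children: List["TreeNode"] = field(default_factory=list)

    def add(self, label, value=None):
        """Append a child vertex and return it so calls can be chained."""
        child = TreeNode(label, value)
        self.children.append(child)
        return child

    def find(self, label):
        """Depth-first search for the first vertex with the given label."""
        if self.label == label:
            return self
        for child in self.children:
            hit = child.find(label)
            if hit is not None:
                return hit
        return None


@dataclass
class NodeObject:
    """Stands in for a Web document: one metadata tree, one data tree."""
    metadata_tree: TreeNode
    data_tree: TreeNode


@dataclass
class LinkObject:
    """Stands in for a hyperlink: one metadata tree, one data tree."""
    metadata_tree: TreeNode
    data_tree: TreeNode


def build_example():
    """Build one document and one hyperlink with made-up values."""
    node_meta = TreeNode("node")
    node_meta.add("url", "http://example.org/index.html")
    node_meta.add("title", "Example page")
    node_data = TreeNode("html")
    body = node_data.add("body")
    body.add("p", "See the next page.")

    link_meta = TreeNode("link")
    link_meta.add("source_url", "http://example.org/index.html")
    link_meta.add("target_url", "http://example.org/next.html")
    link_data = TreeNode("a")
    link_data.add("label", "next page")
    return NodeObject(node_meta, node_data), LinkObject(link_meta, link_data)
```

Keeping metadata and content in separate trees means a query can inspect, say, a link's target without touching the anchor content; in this sketch `link.metadata_tree.find("target_url")` does exactly that.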
Recommended Citation
S. S. Bhowmick et al., "Representation of Web Data in a Web Warehouse," Computer Journal, vol. 46, no. 3, pp. 229-262, Oxford University Press; BCS, The Chartered Institute for IT, Jan 2003.
The definitive version is available at https://doi.org/10.1093/comjnl/46.3.229
Department(s)
Computer Science
International Standard Serial Number (ISSN)
0010-4620
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2024 Oxford University Press; BCS, The Chartered Institute for IT, All rights reserved.
Publication Date
01 Jan 2003