Abstract

We believe that, to manage Web data effectively, there is a need to build a data warehouse of Web data, i.e., a Web warehouse. In this paper, we focus on how to represent and store relevant hyperlinked Web documents effectively in a Web warehouse called WHOWEDA (WareHouse of WEb DAta) for further querying and manipulation. We present a simple and general model for representing the metadata, structure and content of Web documents and hyperlinks in WHOWEDA. We discuss node and link objects, which are used to represent Web documents and hyperlinks, respectively, in WHOWEDA. These objects are first-class objects in our data model, called WHOM (WareHouse Object Model), which is designed to represent and manipulate Web data in the warehouse. An important feature of our model is that it represents metadata, content and structure as trees, called node and link metadata trees and node and link data trees.

Department(s)

Computer Science

International Standard Serial Number (ISSN)

0010-4620

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2024 Oxford University Press; BCS, The Chartered Institute for IT, All rights reserved.

Publication Date

01 Jan 2003
