What Can a Web Bag Discover for You?
Abstract
Sets and bags are closely related structures. A bag is different from a set in that it is sensitive to the number of times an element occurs while a set is not. In this paper, we introduce the concept of web bag in the context of the web warehouse project called WHOWEDA (Warehouse of Web Data). Informally, a web bag is a web table which allows multiple occurrences of identical web tuples. We have used web bag to discover useful knowledge from a web table such as visible documents (or web sites), luminous documents and luminous paths. In this paper, we formally discuss the semantics and properties of web bags. We design formal algorithms for the construction of a web bag and its schema. In addition, we also provide formal algorithms for various types of knowledge discovery in a web warehouse using web bag and illustrate them with examples. © 2002 Elsevier Science B.V. All rights reserved.
Recommended Citation
S. S. Bhowmick et al., "What Can a Web Bag Discover for You?," Data and Knowledge Engineering, vol. 43, no. 1, pp. 79 - 119, Elsevier, Oct 2002.
The definitive version is available at https://doi.org/10.1016/S0169-023X(02)00123-4
Department(s)
Computer Science
Keywords and Phrases
Luminous documents; Luminous paths; Visible documents; Web bag; Web table; Web warehouse
International Standard Serial Number (ISSN)
0169-023X
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2024 Elsevier, All rights reserved.
Publication Date
01 Oct 2002