Detecting Spatial Outliers using Bipartite Outlier Detection Methods

Abstract

Outlier detection, as a data mining task, is to identify a small set of data that is considerably dissimilar or inconsistent with the remainder of the data. Spatial outliers are spatially referenced objects whose non-spatial attribute values are significantly different from that of their neighbors. Identification of spatial outliers can lead to the discovery of unexpected, interesting spatial patterns for further investigation. In this paper, bipartite methods are used to detect spatial outliers based on the concepts of spatial point estimation and spatial statistical theory. Two point estimation methods are introduced to estimate values of spatial points. The concept of Z-score is used to evaluate the deviation of ratios of estimated values vs. true values from average ratio in the study space. Two algorithms are proposed to identify spatial outliers using different methods. These algorithms are used in New Mexico Produced Water Chemistry Database (PWCD). Results show that outlier detection can aid in bad data checking and the analysis of produced water (in oil and gas production) related problems.

Meeting Name

International Conference on Information and Knowledge Engineering (2004: Jun. 21-24, Las Vegas, NV)

Department(s)

Geosciences and Geological and Petroleum Engineering

Keywords and Phrases

Bipartite Spatial Outlier Detection; Knowledge Discovery; Point Estimation; Spatial Data Mining; Z-Scores; Algorithms; Data Reduction; Estimation; Information Analysis; Relational Database Systems; Statistical Tests

International Standard Book Number (ISBN)

978-1932415278

Document Type

Article - Conference proceedings

Document Version

Citation

File Type

text

Language(s)

English

Publication Date

01 Jun 2004

This document is currently not available here.

Share

 
COinS