Abstract

In this article, we propose an adaptive procedure for testing for the effect of a factor of interest in the presence of one or more confounding variables in observational studies. It is especially relevant for applications where the factor of interest has a suspected causal relationship with a response. This procedure is not tied to linear modeling or normal distribution theory, and it offers a valuable alternative to traditional methods. It is suitable for applications where a factor of interest is categorical, and the response is continuous. Confounding variables may be continuous or categorical. The method is comprised of two basic steps that are performed in sequence. First, confounding variables alone (i.e., without the factor of interest) are used to group observations into subsets. These subsets have the property that, when restricted to a subset, there is little or no remaining variation in the response that is attributable to the confounding variables. We then test for the factor of interest within each subset of observations. We propose to implement the first step using a technique that is generally referred to as tree-structured regression. We use a nonparametric permutation procedure to carry out the second step. The proposed method is illustrated through an analysis of a U. S. Department of Agriculture (USDA) Forest Service data set and an air pollution data set. © 2002 American Statistical Association and the International Biometric Society.

Department(s)

Mathematics and Statistics

Keywords and Phrases

Classification and regression trees; Nonparametric; Permutation test; Tree

International Standard Serial Number (ISSN)

1085-7117

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2024 American Statistical Association; International Biometrics Society; Springer, All rights reserved.

Publication Date

01 Mar 2002

Share

 
COinS