Statistical Modelling

Protective estimator for linear regression with nonignorably missing Gaussian outcomes

Stuart R Lipsitz

Department of Biometry and Epidemiology, Medical University of South Carolina, Charleston, SC, USA, lipsitzs{at}musc.edu

Geert Molenberghs

Center for Statistics, Limburgs Universitair Centrum, Belgium

Garrett M Fitzmaurice

Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA

Joseph G Ibrahim

Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA, Division of Biostatistical Science, Dana-Farber Cancer Institute, Boston, MA, USA

We propose a method for estimating the regression parameters in a linear regression model for Gaussian data when the outcome variable is missing for some subjects and missingness is thought to be nonignorable. Throughout, we assume that missingness is restricted to the outcome variable and that the covariates are fully observed. Although maximum likelihood estimation of the regression parameters is possible once joint models for the outcome variable and the nonignorable missing data mechanism have been specified, these models are fundamentally nonidentifiable unless unverifiable modeling assumptions are imposed. In this paper, rather than explicitly modeling the nonignorable missingness mechanism, we consider the use of a ‘protective’ estimator of the regression parameters (Brown, 1990). To implement the proposed method, it is necessary to assume that the outcome variable and one of the covariates have an approximate bivariate normal distribution, conditional on the remaining covariates. In addition, it is assumed that the missing data mechanism is conditionally independent of this covariate, given the outcome variable and the remaining covariates; the latter is referred to as the ‘protective’ assumption. A method of moments approach is used to obtain the protective estimator of the regression parameters; the jackknife (Quenouille, 1956) is used to estimate the variance. The method is illustrated using data on the persistence of maternal smoking from the Six Cities Study of the health effects of air pollution (Ware et al., 1984). The results of a simulation study are presented that examine the magnitude of any finite sample bias.
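The moment-matching idea can be illustrated in the simplest setting: one covariate x, no remaining covariates, and an outcome y that is missing for some subjects. This is a minimal sketch under those assumptions, not the authors' implementation. Under the protective assumption, the regression of x on y is estimable from the complete cases, the mean and variance of x are estimable from all subjects, and bivariate normality lets the two be combined to recover the regression of y on x. The function names (`protective_estimate`, `jackknife_se`, `simulate`) and the simulated missingness model are hypothetical choices made for illustration.

```python
import math
import random
from statistics import fmean


def protective_estimate(x, y):
    """Return (intercept, slope) for the regression of y on x.

    x is fully observed; y[i] is None when the outcome is missing.
    Illustrative single-covariate case only.
    """
    # Complete cases: regress x on y. Under the protective assumption
    # this conditional model is not distorted by the missingness.
    cases = [(xi, yi) for xi, yi in zip(x, y) if yi is not None]
    xc = [c[0] for c in cases]
    yc = [c[1] for c in cases]
    mean_xc, mean_yc = fmean(xc), fmean(yc)
    syy = sum((yi - mean_yc) ** 2 for yi in yc)
    sxy = sum((xi - mean_xc) * (yi - mean_yc) for xi, yi in cases)
    b = sxy / syy
    a = mean_xc - b * mean_yc
    resid_var = fmean([(xi - a - b * yi) ** 2 for xi, yi in cases])

    # All subjects: marginal mean and variance of x.
    mean_x = fmean(x)
    var_x = fmean([(xi - mean_x) ** 2 for xi in x])

    # Moment equations: E[x] = a + b E[y], Var[x] = b^2 Var[y] + resid_var.
    mean_y = (mean_x - a) / b
    var_y = (var_x - resid_var) / b ** 2
    cov_xy = b * var_y

    slope = cov_xy / var_x
    return mean_y - slope * mean_x, slope


def jackknife_se(x, y, estimator=protective_estimate):
    """Delete-one jackknife standard errors for each estimated parameter."""
    n = len(x)
    reps = [estimator(x[:i] + x[i + 1:], y[:i] + y[i + 1:]) for i in range(n)]
    ses = []
    for k in range(len(reps[0])):
        vals = [r[k] for r in reps]
        centre = fmean(vals)
        ses.append(math.sqrt((n - 1) / n * sum((v - centre) ** 2 for v in vals)))
    return tuple(ses)


def simulate(n, seed=0, intercept=1.0, slope=2.0):
    """Simulated data in which missingness depends on y only (assumed model)."""
    rng = random.Random(seed)
    x, y = [], []
    for _ in range(n):
        xi = rng.gauss(0.0, 1.0)
        yi = intercept + slope * xi + rng.gauss(0.0, 1.0)
        p_obs = 1.0 / (1.0 + math.exp(0.5 * (yi - 1.0)))  # larger y, more missing
        x.append(xi)
        y.append(yi if rng.random() < p_obs else None)
    return x, y
```

Calling `protective_estimate(*simulate(5000))` should return values near the simulated intercept and slope, and `jackknife_se` applied to a smaller sample gives approximate standard errors. When no outcomes are missing, the sketch reduces algebraically to ordinary least squares.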

Key Words: EM-algorithm • method of moments • nonignorable missing data • ordinary least squares

Statistical Modelling, Vol. 4, No. 1, 3-17 (2004)
DOI: 10.1191/1471082X04st066oa

