Bootstrap model selection in generalized linear models

Wei Pan, Chap T. Le

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Model selection is a central component of data analysis. Though there are a variety of methods for likelihood-based estimation methods, there are relatively few for non-likelihood-based generalized linear models (GLMs), such as in the quasi-likelihood and generalized estimating equation (GEE) approaches. In this paper, we develop basic and bias-corrected bootstrap approaches to estimate the predictive mean squared error (PMSE) of a model and use the PMSE for model selection. Simulation studies show that the bias-corrected bootstrap estimate works well when quasi-likelihood or GEE is used to fit either overdispersed or correlated response GLMs. For correlated response data, when the marginal distribution assumption is (almost) correct, Akaike's information criterion (AIC) and Bayesian information criterion (BIC) calculated under the working independence model also perform well. For illustration, the methods are applied to data sets from evolutionary biology and teratology.

Original languageEnglish (US)
Pages (from-to)49-61
Number of pages13
JournalJournal of Agricultural, Biological, and Environmental Statistics
Volume6
Issue number1
DOIs
StatePublished - Mar 2001

Bibliographical note

Copyright:
Copyright 2005 Elsevier Science B.V., Amsterdam. All rights reserved.

Keywords

  • Akaike information criterion
  • Bayesian information criterion
  • Generalized estimating equations
  • Predictive mean squared error
  • Quasi-likelihood

Fingerprint

Dive into the research topics of 'Bootstrap model selection in generalized linear models'. Together they form a unique fingerprint.

Cite this