Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review

Mary Kathryn Cowles; Bradley P. Carlin

doi:10.1080/01621459.1996.10476956

Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review

Mary Kathryn Cowles, Bradley P. Carlin

Biostatistics

Research output: Contribution to journal › Article › peer-review

1438 Scopus citations

Abstract

A critical issue for users of Markov chain Monte Carlo (MCMC) methods in applications is how to determine when it is safe to stop sampling and use the samples to estimate characteristics of the distribution of interest. Research into methods of computing theoretical convergence bounds holds promise for the future but to date has yielded relatively little of practical use in applied work. Consequently, most MCMC users address the convergence problem by applying diagnostic tools to the output produced by running their samplers. After giving a brief overview of the area, we provide an expository review of 13 convergence diagnostics, describing the theoretical basis and practical implementation of each. We then compare their performance in two simple models and conclude that all of the methods can fail to detect the sorts of convergence failure that they were designed to identify. We thus recommend a combination of strategies aimed at evaluating and accelerating MCMC sampler convergence, including applying diagnostic procedures to a small number of parallel chains, monitoring autocorrelations and cross-correlations, and modifying parameterizations or sampling algorithms appropriately. We emphasize, however, that it is not possible to say with certainty that a finite sample from an MCMC algorithm is representative of an underlying stationary distribution.

Original language	English (US)
Pages (from-to)	883-904
Number of pages	22
Journal	Journal of the American Statistical Association
Volume	91
Issue number	434
DOIs	https://doi.org/10.1080/01621459.1996.10476956
State	Published - Jun 1 1996

Keywords

Autocorrelation
Gibbs sampler
Metropolis-Hastings algorithm

Access

10.1080/01621459.1996.10476956

OpenUrl availability

Full text

Cite this

@article{40742ac212e54a6f809c18e1b8e1cbe4,

title = "Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review",

abstract = "A critical issue for users of Markov chain Monte Carlo (MCMC) methods in applications is how to determine when it is safe to stop sampling and use the samples to estimate characteristics of the distribution of interest. Research into methods of computing theoretical convergence bounds holds promise for the future but to date has yielded relatively little of practical use in applied work. Consequently, most MCMC users address the convergence problem by applying diagnostic tools to the output produced by running their samplers. After giving a brief overview of the area, we provide an expository review of 13 convergence diagnostics, describing the theoretical basis and practical implementation of each. We then compare their performance in two simple models and conclude that all of the methods can fail to detect the sorts of convergence failure that they were designed to identify. We thus recommend a combination of strategies aimed at evaluating and accelerating MCMC sampler convergence, including applying diagnostic procedures to a small number of parallel chains, monitoring autocorrelations and cross-correlations, and modifying parameterizations or sampling algorithms appropriately. We emphasize, however, that it is not possible to say with certainty that a finite sample from an MCMC algorithm is representative of an underlying stationary distribution.",

keywords = "Autocorrelation, Gibbs sampler, Metropolis-Hastings algorithm",

author = "Cowles, {Mary Kathryn} and Carlin, {Bradley P.}",

year = "1996",

month = jun,

day = "1",

doi = "10.1080/01621459.1996.10476956",

language = "English (US)",

volume = "91",

pages = "883--904",

journal = "Journal of the American Statistical Association",

issn = "0162-1459",

publisher = "Taylor and Francis Ltd.",

number = "434",

}

TY - JOUR

T1 - Markov Chain Monte Carlo Convergence Diagnostics

T2 - A Comparative Review

AU - Cowles, Mary Kathryn

AU - Carlin, Bradley P.

PY - 1996/6/1

Y1 - 1996/6/1

N2 - A critical issue for users of Markov chain Monte Carlo (MCMC) methods in applications is how to determine when it is safe to stop sampling and use the samples to estimate characteristics of the distribution of interest. Research into methods of computing theoretical convergence bounds holds promise for the future but to date has yielded relatively little of practical use in applied work. Consequently, most MCMC users address the convergence problem by applying diagnostic tools to the output produced by running their samplers. After giving a brief overview of the area, we provide an expository review of 13 convergence diagnostics, describing the theoretical basis and practical implementation of each. We then compare their performance in two simple models and conclude that all of the methods can fail to detect the sorts of convergence failure that they were designed to identify. We thus recommend a combination of strategies aimed at evaluating and accelerating MCMC sampler convergence, including applying diagnostic procedures to a small number of parallel chains, monitoring autocorrelations and cross-correlations, and modifying parameterizations or sampling algorithms appropriately. We emphasize, however, that it is not possible to say with certainty that a finite sample from an MCMC algorithm is representative of an underlying stationary distribution.

AB - A critical issue for users of Markov chain Monte Carlo (MCMC) methods in applications is how to determine when it is safe to stop sampling and use the samples to estimate characteristics of the distribution of interest. Research into methods of computing theoretical convergence bounds holds promise for the future but to date has yielded relatively little of practical use in applied work. Consequently, most MCMC users address the convergence problem by applying diagnostic tools to the output produced by running their samplers. After giving a brief overview of the area, we provide an expository review of 13 convergence diagnostics, describing the theoretical basis and practical implementation of each. We then compare their performance in two simple models and conclude that all of the methods can fail to detect the sorts of convergence failure that they were designed to identify. We thus recommend a combination of strategies aimed at evaluating and accelerating MCMC sampler convergence, including applying diagnostic procedures to a small number of parallel chains, monitoring autocorrelations and cross-correlations, and modifying parameterizations or sampling algorithms appropriately. We emphasize, however, that it is not possible to say with certainty that a finite sample from an MCMC algorithm is representative of an underlying stationary distribution.

KW - Autocorrelation

KW - Gibbs sampler

KW - Metropolis-Hastings algorithm

UR - http://www.scopus.com/inward/record.url?scp=0030539336&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030539336&partnerID=8YFLogxK

U2 - 10.1080/01621459.1996.10476956

DO - 10.1080/01621459.1996.10476956

M3 - Article

AN - SCOPUS:0030539336

SN - 0162-1459

VL - 91

SP - 883

EP - 904

JO - Journal of the American Statistical Association

JF - Journal of the American Statistical Association

IS - 434

ER -

Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review

Abstract

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this