Statistical Power in Experimental Audit Studies: Cautions and Calculations for Matched Tests With Nominal Outcomes

Mike Vuolo; Christopher Uggen; Sarah Lageson

doi:10.1177/0049124115570066

Sociology (Twin Cities)

Research output: Contribution to journal › Article › peer-review

35 Scopus citations

Abstract

Given their capacity to identify causal relationships, experimental audit studies have grown increasingly popular in the social sciences. Typically, investigators send fictitious auditors who differ by a key factor (e.g., race) to particular experimental units (e.g., employers) and then compare treatment and control groups on a dichotomous outcome (e.g., hiring). In such scenarios, an important design consideration is the power to detect a difference of a given magnitude between the groups. But power calculations are not straightforward in standard matched tests for dichotomous outcomes. Given the paired nature of the data, the number of pairs in the concordant cells (when neither or both auditors receive a positive response) contributes to the power, which is lower as the sum of the discordant proportions approaches one. Because these quantities are difficult to determine a priori, researchers must exercise particular care in experimental design. Here we present sample size and power calculations for McNemar’s test using empirical data from an audit study on misdemeanor arrest records and employability. We then provide formulas and examples for cases involving more than two treatments (Cochran’s Q test) and nominal outcomes (Stuart–Maxwell test). We conclude with concrete recommendations concerning power and sample size for researchers designing and presenting matched audit studies.
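The dependence of power on the discordant proportions described in the abstract can be illustrated with a minimal sketch. It uses a common unconditional normal approximation for McNemar's test and is not necessarily the formula the article itself presents. The function names, the argument names `p10` and `p01` (the two discordant-cell proportions), and the example values are all hypothetical and chosen only for illustration.

```python
from math import ceil, sqrt
from statistics import NormalDist

_Z = NormalDist()  # standard normal distribution


def mcnemar_power(n_pairs, p10, p01, alpha=0.05):
    """Approximate two-sided power of McNemar's test for n_pairs matched pairs.

    p10 and p01 are assumed discordant-cell proportions (hypothetical names):
    the share of pairs in which only the first, or only the second, auditor
    receives a positive response.
    """
    psi = p10 + p01    # sum of the discordant proportions
    delta = p10 - p01  # difference the test is meant to detect
    if not 0 < psi <= 1 or psi - delta ** 2 <= 0:
        raise ValueError("discordant proportions are out of range")
    z_alpha = _Z.inv_cdf(1 - alpha / 2)
    # Normal approximation; the negligible opposite tail is ignored.
    return _Z.cdf(
        (abs(delta) * sqrt(n_pairs) - z_alpha * sqrt(psi)) / sqrt(psi - delta ** 2)
    )


def mcnemar_pairs_needed(p10, p01, alpha=0.05, power=0.80):
    """Smallest number of pairs reaching the target power (same approximation)."""
    psi = p10 + p01
    delta = p10 - p01
    if delta == 0:
        raise ValueError("no difference to detect when p10 equals p01")
    z_alpha = _Z.inv_cdf(1 - alpha / 2)
    z_beta = _Z.inv_cdf(power)
    root = z_alpha * sqrt(psi) + z_beta * sqrt(psi - delta ** 2)
    return ceil(root ** 2 / delta ** 2)


if __name__ == "__main__":
    # Same 10-point difference, but increasing sums of discordant proportions.
    for p10, p01 in [(0.15, 0.05), (0.25, 0.15), (0.45, 0.35)]:
        print(p10 + p01, mcnemar_pairs_needed(p10, p01),
              mcnemar_power(200, p10, p01))
```

Holding the difference between the discordant proportions fixed, the required number of pairs rises and the power at a given sample size falls as their sum grows. Under this approximation, that is the behavior the abstract warns about.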

Original language	English (US)
Pages (from-to)	260-303
Number of pages	44
Journal	Sociological Methods and Research
Volume	45
Issue number	2
DOIs	https://doi.org/10.1177/0049124115570066
State	Published - May 2016

Bibliographical note

Publisher Copyright:
© 2015, © The Author(s) 2015.

Keywords

Cochran’s Q test
McNemar’s test
Stuart–Maxwell test
audit studies
experiments
power
sample size

Cite this

@article{a1b3cc53fef94b7386a457a948ddf143,
  title = "Statistical Power in Experimental Audit Studies: Cautions and Calculations for Matched Tests With Nominal Outcomes",
  abstract = "Given their capacity to identify causal relationships, experimental audit studies have grown increasingly popular in the social sciences. Typically, investigators send fictitious auditors who differ by a key factor (e.g., race) to particular experimental units (e.g., employers) and then compare treatment and control groups on a dichotomous outcome (e.g., hiring). In such scenarios, an important design consideration is the power to detect a certain magnitude difference between the groups. But power calculations are not straightforward in standard matched tests for dichotomous outcomes. Given the paired nature of the data, the number of pairs in the concordant cells (when neither or both auditor receives a positive response) contributes to the power, which is lower as the sum of the discordant proportions approaches one. Because these quantities are difficult to determine a priori, researchers must exercise particular care in experimental design. We here present sample size and power calculations for McNemar{\textquoteright}s test using empirical data from an audit study on misdemeanor arrest records and employability. We then provide formulas and examples for cases involving more than two treatments (Cochran{\textquoteright}s Q test) and nominal outcomes (Stuart–Maxwell test). We conclude with concrete recommendations concerning power and sample size for researchers designing and presenting matched audit studies.",
  keywords = "Cochran{\textquoteright}s Q test, McNemar{\textquoteright}s test, Stuart–Maxwell test, audit studies, experiments, power, sample size",
  author = "Mike Vuolo and Christopher Uggen and Sarah Lageson",
  note = "Publisher Copyright: {\textcopyright} 2015, {\textcopyright} The Author(s) 2015.",
  year = "2016",
  month = may,
  doi = "10.1177/0049124115570066",
  language = "English (US)",
  volume = "45",
  pages = "260--303",
  journal = "Sociological Methods and Research",
  issn = "0049-1241",
  publisher = "SAGE Publications Inc.",
  number = "2",
}

TY  - JOUR
T1  - Statistical Power in Experimental Audit Studies
T2  - Cautions and Calculations for Matched Tests With Nominal Outcomes
AU  - Vuolo, Mike
AU  - Uggen, Christopher
AU  - Lageson, Sarah
PY  - 2016/5
Y1  - 2016/5
AB  - Given their capacity to identify causal relationships, experimental audit studies have grown increasingly popular in the social sciences. Typically, investigators send fictitious auditors who differ by a key factor (e.g., race) to particular experimental units (e.g., employers) and then compare treatment and control groups on a dichotomous outcome (e.g., hiring). In such scenarios, an important design consideration is the power to detect a certain magnitude difference between the groups. But power calculations are not straightforward in standard matched tests for dichotomous outcomes. Given the paired nature of the data, the number of pairs in the concordant cells (when neither or both auditor receives a positive response) contributes to the power, which is lower as the sum of the discordant proportions approaches one. Because these quantities are difficult to determine a priori, researchers must exercise particular care in experimental design. We here present sample size and power calculations for McNemar’s test using empirical data from an audit study on misdemeanor arrest records and employability. We then provide formulas and examples for cases involving more than two treatments (Cochran’s Q test) and nominal outcomes (Stuart–Maxwell test). We conclude with concrete recommendations concerning power and sample size for researchers designing and presenting matched audit studies.
KW  - Cochran’s Q test
KW  - McNemar’s test
KW  - Stuart–Maxwell test
KW  - audit studies
KW  - experiments
KW  - power
KW  - sample size
UR  - http://www.scopus.com/inward/record.url?scp=84963669697&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84963669697&partnerID=8YFLogxK
U2  - 10.1177/0049124115570066
DO  - 10.1177/0049124115570066
M3  - Article
AN  - SCOPUS:84963669697
SN  - 0049-1241
VL  - 45
SP  - 260
EP  - 303
JO  - Sociological Methods and Research
JF  - Sociological Methods and Research
IS  - 2
ER  -
