TY - JOUR
T1 - Cuing effect of 'all of the above' on the reliability and validity of multiple-choice test items
AU - Harasym, P. H.
AU - Leong, E. J.
AU - Violato, C.
AU - Brant, R.
AU - Lorscheider, F. L.
PY - 1998
Y1 - 1998
N2 - It is generally acknowledged that alternatives such as none of the above and all of the above should be used sparingly in multiple-choice (MC) items. But the effect that all of the above has on the reliability and validity of an MC item is unclear. This study compared the results of a single-response (SR(a)) item format that included all of the above as the correct response to a multiple-response (MR) item format that required examinees to select all of the available alternatives for a correct response. A crossover design was used to compare the effect of formats on student performance while item content, scoring method, and student ability levels remained constant. Results indicated that the SR(a) format greatly distorted examinee performance by elevating their scores because examinees who recognized two or more alternatives as being correct were cued to select all of the above. In addition, the SR(a) format significantly reduced the reliability and concurrent validity of examinee scores. In summary, the MR format was found to be superior. Based upon new empirical evidence, this study recommends that whenever an educator wishes to evaluate student understanding of an issue that has multiple facets, the SR(a) format should be avoided and the MR format should be used instead.
AB - It is generally acknowledged that alternatives such as none of the above and all of the above should be used sparingly in multiple-choice (MC) items. But the effect that all of the above has on the reliability and validity of an MC item is unclear. This study compared the results of a single-response (SR(a)) item format that included all of the above as the correct response to a multiple-response (MR) item format that required examinees to select all of the available alternatives for a correct response. A crossover design was used to compare the effect of formats on student performance while item content, scoring method, and student ability levels remained constant. Results indicated that the SR(a) format greatly distorted examinee performance by elevating their scores because examinees who recognized two or more alternatives as being correct were cued to select all of the above. In addition, the SR(a) format significantly reduced the reliability and concurrent validity of examinee scores. In summary, the MR format was found to be superior. Based upon new empirical evidence, this study recommends that whenever an educator wishes to evaluate student understanding of an issue that has multiple facets, the SR(a) format should be avoided and the MR format should be used instead.
UR - http://www.scopus.com/inward/record.url?scp=0031915431&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0031915431&partnerID=8YFLogxK
U2 - 10.1177/016327879802100106
DO - 10.1177/016327879802100106
M3 - Article
C2 - 10183336
AN - SCOPUS:0031915431
SN - 0163-2787
VL - 21
SP - 120
EP - 133
JO - Evaluation and the Health Professions
JF - Evaluation and the Health Professions
IS - 1
ER -