Lung cancer: Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening

David S. Gierada; Thomas K. Pilgram; Melissa Ford; Richard M. Fagerstrom; Timothy R. Church; Hrudaya Nath; Kavita Garg; Diane C. Strollo

doi:10.1148/radiol.2461062097

Lung cancer: Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening

David S. Gierada, Thomas K. Pilgram, Melissa Ford, Richard M. Fagerstrom, Timothy R. Church, Hrudaya Nath, Kavita Garg, Diane C. Strollo

Environmental Health Sciences

Research output: Contribution to journal › Article › peer-review

99 Scopus citations

Abstract

Purpose: To evaluate agreement among radiologists on the interpretation of pulmonary findings at low-dose computed tomographic (CT) screening examinations for lung cancer. Materials and Methods: Institutional review board approval and informed consent were obtained. HIPAA guidelines were followed. Sixteen radiologists from the 10 National Lung Screening Trial screening centers of the National Cancer Institute's Lung Screening Study network reviewed image subsets from 135 baseline low-dose screening CT examinations in 135 trial participants (89 men, 46 women; mean age, 62.7 years ± 5.4 [standard deviation]). Interpretations were classified into one of four of the following categories: non-calcified nodule 4 mm or larger in greatest transverse dimension (positive screening result); noncalcified nodule smaller than 4 mm in greatest transverse dimension (negative screening result); calcified, benign nodule (negative screening result); or no nodule (negative screening result). A recommendation for follow-up evaluation was obtained for each case. Interobserver agreement was evaluated by using the multirater κ statistic and by using response frequencies and descriptive statistics. Results: Multirater κ values ranged from 0.58 (for agreement among all four classifications; 95% confidence interval: 0.55, 0.61) to 0.64 (for agreement on classification as a positive or negative screening result; 95% confidence interval: 0.62, 0.65). The average percentage of reader pairs in agreement on the screening result per case (percentage agreement) was 82%. There was wide variation in the total number of abnormalities detected and classified as pulmonary nodules, with differences of up to more than twofold among radiologists. For cases classified as positive, multirater κ for follow-up recommendations was 0.35. Conclusion: Interobserver agreement was moderate to substantial; potential for considerable improvement exists.

Original language	English (US)
Pages (from-to)	265-272
Number of pages	8
Journal	Radiology
Volume	246
Issue number	1
DOIs	https://doi.org/10.1148/radiol.2461062097
State	Published - Jan 2008

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access

10.1148/radiol.2461062097

OpenUrl availability

Full text

Cite this

@article{b04f509ffc9b497b89d10cd57f62ad2b,

title = "Lung cancer: Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening",

abstract = "Purpose: To evaluate agreement among radiologists on the interpretation of pulmonary findings at low-dose computed tomographic (CT) screening examinations for lung cancer. Materials and Methods: Institutional review board approval and informed consent were obtained. HIPAA guidelines were followed. Sixteen radiologists from the 10 National Lung Screening Trial screening centers of the National Cancer Institute's Lung Screening Study network reviewed image subsets from 135 baseline low-dose screening CT examinations in 135 trial participants (89 men, 46 women; mean age, 62.7 years ± 5.4 [standard deviation]). Interpretations were classified into one of four of the following categories: non-calcified nodule 4 mm or larger in greatest transverse dimension (positive screening result); noncalcified nodule smaller than 4 mm in greatest transverse dimension (negative screening result); calcified, benign nodule (negative screening result); or no nodule (negative screening result). A recommendation for follow-up evaluation was obtained for each case. Interobserver agreement was evaluated by using the multirater κ statistic and by using response frequencies and descriptive statistics. Results: Multirater κ values ranged from 0.58 (for agreement among all four classifications; 95% confidence interval: 0.55, 0.61) to 0.64 (for agreement on classification as a positive or negative screening result; 95% confidence interval: 0.62, 0.65). The average percentage of reader pairs in agreement on the screening result per case (percentage agreement) was 82%. There was wide variation in the total number of abnormalities detected and classified as pulmonary nodules, with differences of up to more than twofold among radiologists. For cases classified as positive, multirater κ for follow-up recommendations was 0.35. Conclusion: Interobserver agreement was moderate to substantial; potential for considerable improvement exists.",

author = "Gierada, {David S.} and Pilgram, {Thomas K.} and Melissa Ford and Fagerstrom, {Richard M.} and Church, {Timothy R.} and Hrudaya Nath and Kavita Garg and Strollo, {Diane C.}",

year = "2008",

month = jan,

doi = "10.1148/radiol.2461062097",

language = "English (US)",

volume = "246",

pages = "265--272",

journal = "Radiology",

issn = "0033-8419",

publisher = "Radiological Society of North America Inc.",

number = "1",

}

TY - JOUR

T1 - Lung cancer

T2 - Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening

AU - Gierada, David S.

AU - Pilgram, Thomas K.

AU - Ford, Melissa

AU - Fagerstrom, Richard M.

AU - Church, Timothy R.

AU - Nath, Hrudaya

AU - Garg, Kavita

AU - Strollo, Diane C.

PY - 2008/1

Y1 - 2008/1

N2 - Purpose: To evaluate agreement among radiologists on the interpretation of pulmonary findings at low-dose computed tomographic (CT) screening examinations for lung cancer. Materials and Methods: Institutional review board approval and informed consent were obtained. HIPAA guidelines were followed. Sixteen radiologists from the 10 National Lung Screening Trial screening centers of the National Cancer Institute's Lung Screening Study network reviewed image subsets from 135 baseline low-dose screening CT examinations in 135 trial participants (89 men, 46 women; mean age, 62.7 years ± 5.4 [standard deviation]). Interpretations were classified into one of four of the following categories: non-calcified nodule 4 mm or larger in greatest transverse dimension (positive screening result); noncalcified nodule smaller than 4 mm in greatest transverse dimension (negative screening result); calcified, benign nodule (negative screening result); or no nodule (negative screening result). A recommendation for follow-up evaluation was obtained for each case. Interobserver agreement was evaluated by using the multirater κ statistic and by using response frequencies and descriptive statistics. Results: Multirater κ values ranged from 0.58 (for agreement among all four classifications; 95% confidence interval: 0.55, 0.61) to 0.64 (for agreement on classification as a positive or negative screening result; 95% confidence interval: 0.62, 0.65). The average percentage of reader pairs in agreement on the screening result per case (percentage agreement) was 82%. There was wide variation in the total number of abnormalities detected and classified as pulmonary nodules, with differences of up to more than twofold among radiologists. For cases classified as positive, multirater κ for follow-up recommendations was 0.35. Conclusion: Interobserver agreement was moderate to substantial; potential for considerable improvement exists.

AB - Purpose: To evaluate agreement among radiologists on the interpretation of pulmonary findings at low-dose computed tomographic (CT) screening examinations for lung cancer. Materials and Methods: Institutional review board approval and informed consent were obtained. HIPAA guidelines were followed. Sixteen radiologists from the 10 National Lung Screening Trial screening centers of the National Cancer Institute's Lung Screening Study network reviewed image subsets from 135 baseline low-dose screening CT examinations in 135 trial participants (89 men, 46 women; mean age, 62.7 years ± 5.4 [standard deviation]). Interpretations were classified into one of four of the following categories: non-calcified nodule 4 mm or larger in greatest transverse dimension (positive screening result); noncalcified nodule smaller than 4 mm in greatest transverse dimension (negative screening result); calcified, benign nodule (negative screening result); or no nodule (negative screening result). A recommendation for follow-up evaluation was obtained for each case. Interobserver agreement was evaluated by using the multirater κ statistic and by using response frequencies and descriptive statistics. Results: Multirater κ values ranged from 0.58 (for agreement among all four classifications; 95% confidence interval: 0.55, 0.61) to 0.64 (for agreement on classification as a positive or negative screening result; 95% confidence interval: 0.62, 0.65). The average percentage of reader pairs in agreement on the screening result per case (percentage agreement) was 82%. There was wide variation in the total number of abnormalities detected and classified as pulmonary nodules, with differences of up to more than twofold among radiologists. For cases classified as positive, multirater κ for follow-up recommendations was 0.35. Conclusion: Interobserver agreement was moderate to substantial; potential for considerable improvement exists.

UR - http://www.scopus.com/inward/record.url?scp=37349024293&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=37349024293&partnerID=8YFLogxK

U2 - 10.1148/radiol.2461062097

DO - 10.1148/radiol.2461062097

M3 - Article

C2 - 18024436

AN - SCOPUS:37349024293

SN - 0033-8419

VL - 246

SP - 265

EP - 272

JO - Radiology

JF - Radiology

IS - 1

ER -

Lung cancer: Interobserver agreement on interpretation of pulmonary findings at low-dose CT screening

Abstract

UN SDGs

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this