Assessing Methods for Assigning SNPs to Genes in Gene-Based Tests of Association Using Common Variants

Ashley Petersen; Carolina Alvarez; Scott DeClaire; Nathan L. Tintle

doi:10.1371/journal.pone.0062161

Assessing Methods for Assigning SNPs to Genes in Gene-Based Tests of Association Using Common Variants

Ashley Petersen, Carolina Alvarez, Scott DeClaire, Nathan L. Tintle

Biostatistics

Research output: Contribution to journal › Article › peer-review

34 Scopus citations

Abstract

Gene-based tests of association are frequently applied to common SNPs (MAF>5%) as an alternative to single-marker tests. In this analysis we conduct a variety of simulation studies applied to five popular gene-based tests investigating general trends related to their performance in realistic situations. In particular, we focus on the impact of non-causal SNPs and a variety of LD structures on the behavior of these tests. Ultimately, we find that non-causal SNPs can significantly impact the power of all gene-based tests. On average, we find that the "noise" from 6-12 non-causal SNPs will cancel out the "signal" of one causal SNP across five popular gene-based tests. Furthermore, we find complex and differing behavior of the methods in the presence of LD within and between non-causal and causal SNPs. Ultimately, better approaches for a priori prioritization of potentially causal SNPs (e.g., predicting functionality of non-synonymous SNPs), application of these methods to sequenced or fully imputed datasets, and limited use of window-based methods for assigning inter-genic SNPs to genes will improve power. However, significant power loss from non-causal SNPs may remain unless alternative statistical approaches robust to the inclusion of non-causal SNPs are developed.

Original language	English (US)
Article number	e62161
Journal	PloS one
Volume	8
Issue number	5
DOIs	https://doi.org/10.1371/journal.pone.0062161
State	Published - May 31 2013

Access

10.1371/journal.pone.0062161

OpenUrl availability

Full text

Cite this

@article{444a24084a7446e987c27c2e2b8798dd,

title = "Assessing Methods for Assigning SNPs to Genes in Gene-Based Tests of Association Using Common Variants",

abstract = "Gene-based tests of association are frequently applied to common SNPs (MAF>5%) as an alternative to single-marker tests. In this analysis we conduct a variety of simulation studies applied to five popular gene-based tests investigating general trends related to their performance in realistic situations. In particular, we focus on the impact of non-causal SNPs and a variety of LD structures on the behavior of these tests. Ultimately, we find that non-causal SNPs can significantly impact the power of all gene-based tests. On average, we find that the {"}noise{"} from 6-12 non-causal SNPs will cancel out the {"}signal{"} of one causal SNP across five popular gene-based tests. Furthermore, we find complex and differing behavior of the methods in the presence of LD within and between non-causal and causal SNPs. Ultimately, better approaches for a priori prioritization of potentially causal SNPs (e.g., predicting functionality of non-synonymous SNPs), application of these methods to sequenced or fully imputed datasets, and limited use of window-based methods for assigning inter-genic SNPs to genes will improve power. However, significant power loss from non-causal SNPs may remain unless alternative statistical approaches robust to the inclusion of non-causal SNPs are developed.",

author = "Ashley Petersen and Carolina Alvarez and Scott DeClaire and Tintle, {Nathan L.}",

year = "2013",

month = may,

day = "31",

doi = "10.1371/journal.pone.0062161",

language = "English (US)",

volume = "8",

journal = "PloS one",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "5",

}

TY - JOUR

T1 - Assessing Methods for Assigning SNPs to Genes in Gene-Based Tests of Association Using Common Variants

AU - Petersen, Ashley

AU - Alvarez, Carolina

AU - DeClaire, Scott

AU - Tintle, Nathan L.

PY - 2013/5/31

Y1 - 2013/5/31

N2 - Gene-based tests of association are frequently applied to common SNPs (MAF>5%) as an alternative to single-marker tests. In this analysis we conduct a variety of simulation studies applied to five popular gene-based tests investigating general trends related to their performance in realistic situations. In particular, we focus on the impact of non-causal SNPs and a variety of LD structures on the behavior of these tests. Ultimately, we find that non-causal SNPs can significantly impact the power of all gene-based tests. On average, we find that the "noise" from 6-12 non-causal SNPs will cancel out the "signal" of one causal SNP across five popular gene-based tests. Furthermore, we find complex and differing behavior of the methods in the presence of LD within and between non-causal and causal SNPs. Ultimately, better approaches for a priori prioritization of potentially causal SNPs (e.g., predicting functionality of non-synonymous SNPs), application of these methods to sequenced or fully imputed datasets, and limited use of window-based methods for assigning inter-genic SNPs to genes will improve power. However, significant power loss from non-causal SNPs may remain unless alternative statistical approaches robust to the inclusion of non-causal SNPs are developed.

AB - Gene-based tests of association are frequently applied to common SNPs (MAF>5%) as an alternative to single-marker tests. In this analysis we conduct a variety of simulation studies applied to five popular gene-based tests investigating general trends related to their performance in realistic situations. In particular, we focus on the impact of non-causal SNPs and a variety of LD structures on the behavior of these tests. Ultimately, we find that non-causal SNPs can significantly impact the power of all gene-based tests. On average, we find that the "noise" from 6-12 non-causal SNPs will cancel out the "signal" of one causal SNP across five popular gene-based tests. Furthermore, we find complex and differing behavior of the methods in the presence of LD within and between non-causal and causal SNPs. Ultimately, better approaches for a priori prioritization of potentially causal SNPs (e.g., predicting functionality of non-synonymous SNPs), application of these methods to sequenced or fully imputed datasets, and limited use of window-based methods for assigning inter-genic SNPs to genes will improve power. However, significant power loss from non-causal SNPs may remain unless alternative statistical approaches robust to the inclusion of non-causal SNPs are developed.

UR - http://www.scopus.com/inward/record.url?scp=84878609902&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878609902&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0062161

DO - 10.1371/journal.pone.0062161

M3 - Article

C2 - 23741293

AN - SCOPUS:84878609902

SN - 1932-6203

VL - 8

JO - PloS one

JF - PloS one

IS - 5

M1 - e62161

ER -

Assessing Methods for Assigning SNPs to Genes in Gene-Based Tests of Association Using Common Variants

Abstract

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this