TY - JOUR
T1 - Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits.
AU - Haplotype Reference Consortium
AU - Evans, Luke M.
AU - Tahmasbi, Rasool
AU - Vrieze, Scott I.
AU - Abecasis, Gonçalo R.
AU - Das, Sayantan
AU - Bjelland, Doug W.
AU - deCandia, Teresa R.
AU - Goddard, Michael E.
AU - Neale, Benjamin M.
AU - Yang, Jian
AU - Visscher, Peter M.
AU - Keller, Matthew C.
AU - McCarthy, Shane
AU - Das, Sayantan
AU - Kretzschmar, Warren
AU - Delaneau, Olivier
AU - Wood, Andrew R.
AU - Teumer, Alexander
AU - Min Kang, Hyun
AU - Fuchsberger, Christian
AU - Danecek, Petr
AU - Sharp, Kevin
AU - Luo, Yang
AU - Sidore, Carlo
AU - Kwong, Alan
AU - Timpson, Nicholas
AU - Koskinen, Seppo
AU - Vrieze, Scott
AU - Scott, Laura J.
AU - Zhang, He
AU - Mahajan, Anubha
AU - Veldink, Jan
AU - Peters, Ulrike
AU - Pato, Carlos
AU - van Duijn, Cornelia M.
AU - Gillies, Christopher E.
AU - Gandin, Ilaria
AU - Mezzavilla, Massimo
AU - Gilly, Arthur
AU - Cocca, Massimiliano
AU - Traglia, Michela
AU - Angius, Andrea
AU - Barrett, Jeffrey C.
AU - Boomsma, Dorrett
AU - Branham, Kari
AU - Breen, Gerome
AU - Brummett, Chad M.
AU - Busonero, Fabio
AU - McGue, Matt
AU - Iacono, William
N1 - Publisher Copyright:
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2017/3/9
Y1 - 2017/3/9
N2 - Heritability, h2, is a foundational concept in genetics, critical to understanding the genetic basis of complex traits. Recentlyadeveloped methods that estimate heritability from genotyped SNPs, h2SNP, explain substantially more genetic variance than genomeawide significant loci, but less than classical estimates from twins and families. However, h2SNP estimates have yet to be comprehensively compared under a range of genetic architectures, making it difficult to draw conclusions from sometimes conflicting published estimates. Here, we used thousands of real whole genome sequences to simulate realistic phenotypes under a variety of genetic architectures, including those from very rare causal variants. We compared the performance of ten methods across different types of genotypic data (commercial SNP array positions, whole genome sequence variants, and imputed variants) and under differing causal variant frequencies, levels of stratification, and relatedness thresholds. These results provide guidance in interpreting past results and choosing optimal approaches for future studies. We then chose two methods (GREMLaMS and GREMLaLDMS) that best estimated overall h2SNP and the causal variant frequency spectra to six phenotypes in the UK Biobank using imputed genomeawide variants. Our results suggest that as imputation reference panels become larger and more diverse, estimates of the frequency distribution of causal variants will become increasingly unbiased and the vast majority of trait narrowasense heritability will be accounted for.
AB - Heritability, h2, is a foundational concept in genetics, critical to understanding the genetic basis of complex traits. Recentlyadeveloped methods that estimate heritability from genotyped SNPs, h2SNP, explain substantially more genetic variance than genomeawide significant loci, but less than classical estimates from twins and families. However, h2SNP estimates have yet to be comprehensively compared under a range of genetic architectures, making it difficult to draw conclusions from sometimes conflicting published estimates. Here, we used thousands of real whole genome sequences to simulate realistic phenotypes under a variety of genetic architectures, including those from very rare causal variants. We compared the performance of ten methods across different types of genotypic data (commercial SNP array positions, whole genome sequence variants, and imputed variants) and under differing causal variant frequencies, levels of stratification, and relatedness thresholds. These results provide guidance in interpreting past results and choosing optimal approaches for future studies. We then chose two methods (GREMLaMS and GREMLaLDMS) that best estimated overall h2SNP and the causal variant frequency spectra to six phenotypes in the UK Biobank using imputed genomeawide variants. Our results suggest that as imputation reference panels become larger and more diverse, estimates of the frequency distribution of causal variants will become increasingly unbiased and the vast majority of trait narrowasense heritability will be accounted for.
KW - Genetic architecturee
KW - GREML
KW - He complex traite
KW - Heritability
UR - http://www.scopus.com/inward/record.url?scp=85095659166&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095659166&partnerID=8YFLogxK
U2 - 10.1101/115527
DO - 10.1101/115527
M3 - Article
AN - SCOPUS:85095659166
JO - Journal of Fluid Mechanics
JF - Journal of Fluid Mechanics
SN - 0022-1120
ER -