Comparative genomics approaches accurately predict deleterious variants in plants

Thomas Kono; Li Lei; Ching Hua Shih; Paul J. Hoffman; Peter L Morrell; Justin C. Fay

doi:10.1534/g3.118.200563

Research output: Contribution to journal › Article › peer-review

29 Scopus citations

Abstract

Recent advances in genome resequencing have led to increased interest in prediction of the functional consequences of genetic variants. Variants at phylogenetically conserved sites are of particular interest, because they are more likely than variants at phylogenetically variable sites to have deleterious effects on fitness and contribute to phenotypic variation. Numerous comparative genomic approaches have been developed to predict deleterious variants, but the approaches are nearly always assessed based on their ability to identify known disease-causing mutations in humans. Determining the accuracy of deleterious variant predictions in nonhuman species is important to understanding evolution, domestication, and potentially to improving crop quality and yield. To examine our ability to predict deleterious variants in plants we generated a curated database of 2,910 Arabidopsis thaliana mutants with known phenotypes. We evaluated seven approaches and found that while all performed well, their relative ranking differed from prior benchmarks in humans. We conclude that deleterious mutations can be reliably predicted in A. thaliana and likely other plant species, but that the relative performance of various approaches does not necessarily translate from one species to another.

Original language	English (US)
Pages (from-to)	3321-3329
Number of pages	9
Journal	G3: Genes, Genomes, Genetics
Volume	8
Issue number	10
DOIs	https://doi.org/10.1534/g3.118.200563
State	Published - Oct 1 2018

Bibliographical note

Funding Information:
We thank members of the Morrell Lab for discussion and software testing. We also would like to thank Drs. Danelle Seymour and Karl Schmid for helpful comments on an earlier version of the manuscript. Hardware and software support were provided by the University of Minnesota Supercomputing Institute. This work was supported by the US National Science Foundation Plant Genome Program grant (DBI-1339393 to JCF and PLM), the US Department of Agriculture Biotechnology Risk Assessment Research Grants Program (BRAG) (USDA BRAG 2015-06504 to PLM), and a University of Minnesota Doctoral Dissertation Fellowship (to TJYK).

Publisher Copyright:
Copyright © 2018 Reid et al.

Keywords

Deleterious
Genome
Mutations
Phenotypes
Training set

Cite this

@article{27f093b174b24140800a6a42aa9d8ee9,

title = "Comparative genomics approaches accurately predict deleterious variants in plants",

abstract = "Recent advances in genome resequencing have led to increased interest in prediction of the functional consequences of genetic variants. Variants at phylogenetically conserved sites are of particular interest, because they are more likely than variants at phylogenetically variable sites to have deleterious effects on fitness and contribute to phenotypic variation. Numerous comparative genomic approaches have been developed to predict deleterious variants, but the approaches are nearly always assessed based on their ability to identify known disease-causing mutations in humans. Determining the accuracy of deleterious variant predictions in nonhuman species is important to understanding evolution, domestication, and potentially to improving crop quality and yield. To examine our ability to predict deleterious variants in plants we generated a curated database of 2,910 Arabidopsis thaliana mutants with known phenotypes. We evaluated seven approaches and found that while all performed well, their relative ranking differed from prior benchmarks in humans. We conclude that deleterious mutations can be reliably predicted in A. thaliana and likely other plant species, but that the relative performance of various approaches does not necessarily translate from one species to another.",

keywords = "Deleterious, Genome, Mutations, Phenotypes, Training set",

author = "Kono, Thomas and Lei, Li and Shih, {Ching Hua} and Hoffman, {Paul J.} and Morrell, {Peter L} and Fay, {Justin C.}",

note = "Funding Information: We thank members of the Morrell Lab for discussion and software testing. We also would like to thank Drs. Danelle Seymour and Karl Schmid for helpful comments on an earlier version of the manuscript. Hardware and software support were provided by the University of Minnesota Supercomputing Institute. This work was supported by the US National Science Foundation Plant Genome Program grant (DBI-1339393 to JCF and PLM), the US Department of Agriculture Biotechnology Risk Assessment Research Grants Program (BRAG) (USDA BRAG 2015-06504 to PLM), and a University of Minnesota Doctoral Dissertation Fellowship (to TJYK). Publisher Copyright: Copyright {\textcopyright} 2018 Reid et al.",

year = "2018",

month = oct,

day = "1",

doi = "10.1534/g3.118.200563",

language = "English (US)",

volume = "8",

pages = "3321--3329",

journal = "G3: Genes, Genomes, Genetics",

issn = "2160-1836",

publisher = "Genetics Society of America",

number = "10",

}

TY - JOUR

T1 - Comparative genomics approaches accurately predict deleterious variants in plants

AU - Kono, Thomas

AU - Lei, Li

AU - Shih, Ching Hua

AU - Hoffman, Paul J.

AU - Morrell, Peter L

AU - Fay, Justin C.

N1 - Funding Information: We thank members of the Morrell Lab for discussion and software testing. We also would like to thank Drs. Danelle Seymour and Karl Schmid for helpful comments on an earlier version of the manuscript. Hardware and software support were provided by the University of Minnesota Supercomputing Institute. This work was supported by the US National Science Foundation Plant Genome Program grant (DBI-1339393 to JCF and PLM), the US Department of Agriculture Biotechnology Risk Assessment Research Grants Program (BRAG) (USDA BRAG 2015-06504 to PLM), and a University of Minnesota Doctoral Dissertation Fellowship (to TJYK). Publisher Copyright: Copyright © 2018 Reid et al.

PY - 2018/10/1

Y1 - 2018/10/1

N2 - Recent advances in genome resequencing have led to increased interest in prediction of the functional consequences of genetic variants. Variants at phylogenetically conserved sites are of particular interest, because they are more likely than variants at phylogenetically variable sites to have deleterious effects on fitness and contribute to phenotypic variation. Numerous comparative genomic approaches have been developed to predict deleterious variants, but the approaches are nearly always assessed based on their ability to identify known disease-causing mutations in humans. Determining the accuracy of deleterious variant predictions in nonhuman species is important to understanding evolution, domestication, and potentially to improving crop quality and yield. To examine our ability to predict deleterious variants in plants we generated a curated database of 2,910 Arabidopsis thaliana mutants with known phenotypes. We evaluated seven approaches and found that while all performed well, their relative ranking differed from prior benchmarks in humans. We conclude that deleterious mutations can be reliably predicted in A. thaliana and likely other plant species, but that the relative performance of various approaches does not necessarily translate from one species to another.

AB - Recent advances in genome resequencing have led to increased interest in prediction of the functional consequences of genetic variants. Variants at phylogenetically conserved sites are of particular interest, because they are more likely than variants at phylogenetically variable sites to have deleterious effects on fitness and contribute to phenotypic variation. Numerous comparative genomic approaches have been developed to predict deleterious variants, but the approaches are nearly always assessed based on their ability to identify known disease-causing mutations in humans. Determining the accuracy of deleterious variant predictions in nonhuman species is important to understanding evolution, domestication, and potentially to improving crop quality and yield. To examine our ability to predict deleterious variants in plants we generated a curated database of 2,910 Arabidopsis thaliana mutants with known phenotypes. We evaluated seven approaches and found that while all performed well, their relative ranking differed from prior benchmarks in humans. We conclude that deleterious mutations can be reliably predicted in A. thaliana and likely other plant species, but that the relative performance of various approaches does not necessarily translate from one species to another.

KW - Deleterious

KW - Genome

KW - Mutations

KW - Phenotypes

KW - Training set

UR - http://www.scopus.com/inward/record.url?scp=85054465518&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85054465518&partnerID=8YFLogxK

U2 - 10.1534/g3.118.200563

DO - 10.1534/g3.118.200563

M3 - Article

C2 - 30139765

AN - SCOPUS:85054465518

SN - 2160-1836

VL - 8

SP - 3321

EP - 3329

JO - G3: Genes, Genomes, Genetics

JF - G3: Genes, Genomes, Genetics

IS - 10

ER -
