Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced-Alphabet Libraries

Matilda S. Newton; Dana J. Morrone; Kun Hwa Lee; Burckhard Seelig

doi:10.1002/cbic.201800668

Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced-Alphabet Libraries

Matilda S. Newton, Dana J. Morrone, Kun Hwa Lee, Burckhard Seelig

Research output: Contribution to journal › Article › peer-review

10 Scopus citations

Abstract

The universal genetic code of 20 amino acids is the product of evolution. It is believed that earlier versions of the code had fewer residues. Many theories for the order in which amino acids were integrated into the code have been proposed, considering factors ranging from prebiotic chemistry to codon capture. Several meta-analyses combined these theories to yield a feasible consensus chronology of the genetic code's evolution, but there is a dearth of experimental data to test the hypothesised order. We used combinatorial chemistry to synthesise libraries of random polypeptides that were based on different subsets of the 20 standard amino acids, thus representing different stages of a plausible history of the alphabet. Four libraries were comprised of the five, nine, and 16 most ancient amino acids, and all 20 extant residues for a direct side-by-side comparison. We characterised numerous variants from each library for their solubility and propensity to form secondary, tertiary or quaternary structures. Proteins from the two most ancient libraries were more likely to be soluble than those from the extant library. Several individual protein variants exhibited inducible protein folding and other traits typical of intrinsically disordered proteins. From these libraries, we can infer how primordial protein structure and function might have evolved with the genetic code.

Original language	English (US)
Pages (from-to)	846-856
Number of pages	11
Journal	ChemBioChem
Volume	20
Issue number	6
DOIs	https://doi.org/10.1002/cbic.201800668
State	Published - Mar 15 2019

Bibliographical note

Funding Information:
We thank Dr. Maureen Quin for assistance with the western blots, and Prof. Romas Kazlauskas and Fredarla Miller for their comments on the manuscript. This work was funded in part by grants from the US National Aeronautics and Space Administration (NASA) Agreement (NNX14AK29G), the Simons Foundation (340762), the Minnesota Medical Foundation (4036–9663-10), the University of Minnesota Biocatalysis Initiative, and the Office of the VP of Research at the University of Minnesota (Grant-in-Aid).

Publisher Copyright:
© 2019 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim

Keywords

genetic code
origin of proteins
primordial peptides
protein libraries

Access

10.1002/cbic.201800668

OpenUrl availability

Full text

Cite this

@article{e11049fab603409789220e8479f90847,

title = "Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced-Alphabet Libraries",

abstract = "The universal genetic code of 20 amino acids is the product of evolution. It is believed that earlier versions of the code had fewer residues. Many theories for the order in which amino acids were integrated into the code have been proposed, considering factors ranging from prebiotic chemistry to codon capture. Several meta-analyses combined these theories to yield a feasible consensus chronology of the genetic code's evolution, but there is a dearth of experimental data to test the hypothesised order. We used combinatorial chemistry to synthesise libraries of random polypeptides that were based on different subsets of the 20 standard amino acids, thus representing different stages of a plausible history of the alphabet. Four libraries were comprised of the five, nine, and 16 most ancient amino acids, and all 20 extant residues for a direct side-by-side comparison. We characterised numerous variants from each library for their solubility and propensity to form secondary, tertiary or quaternary structures. Proteins from the two most ancient libraries were more likely to be soluble than those from the extant library. Several individual protein variants exhibited inducible protein folding and other traits typical of intrinsically disordered proteins. From these libraries, we can infer how primordial protein structure and function might have evolved with the genetic code.",

keywords = "genetic code, origin of proteins, primordial peptides, protein libraries",

author = "Newton, {Matilda S.} and Morrone, {Dana J.} and Lee, {Kun Hwa} and Burckhard Seelig",

note = "Funding Information: We thank Dr. Maureen Quin for assistance with the western blots, and Prof. Romas Kazlauskas and Fredarla Miller for their comments on the manuscript. This work was funded in part by grants from the US National Aeronautics and Space Administration (NASA) Agreement (NNX14AK29G), the Simons Foundation (340762), the Minnesota Medical Foundation (4036–9663-10), the University of Minnesota Biocatalysis Initiative, and the Office of the VP of Research at the University of Minnesota (Grant-in-Aid). Publisher Copyright: {\textcopyright} 2019 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim",

year = "2019",

month = mar,

day = "15",

doi = "10.1002/cbic.201800668",

language = "English (US)",

volume = "20",

pages = "846--856",

journal = "ChemBioChem",

issn = "1439-4227",

publisher = "Wiley-VCH Verlag",

number = "6",

}

TY - JOUR

T1 - Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced-Alphabet Libraries

AU - Newton, Matilda S.

AU - Morrone, Dana J.

AU - Lee, Kun Hwa

AU - Seelig, Burckhard

N1 - Funding Information: We thank Dr. Maureen Quin for assistance with the western blots, and Prof. Romas Kazlauskas and Fredarla Miller for their comments on the manuscript. This work was funded in part by grants from the US National Aeronautics and Space Administration (NASA) Agreement (NNX14AK29G), the Simons Foundation (340762), the Minnesota Medical Foundation (4036–9663-10), the University of Minnesota Biocatalysis Initiative, and the Office of the VP of Research at the University of Minnesota (Grant-in-Aid). Publisher Copyright: © 2019 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim

PY - 2019/3/15

Y1 - 2019/3/15

N2 - The universal genetic code of 20 amino acids is the product of evolution. It is believed that earlier versions of the code had fewer residues. Many theories for the order in which amino acids were integrated into the code have been proposed, considering factors ranging from prebiotic chemistry to codon capture. Several meta-analyses combined these theories to yield a feasible consensus chronology of the genetic code's evolution, but there is a dearth of experimental data to test the hypothesised order. We used combinatorial chemistry to synthesise libraries of random polypeptides that were based on different subsets of the 20 standard amino acids, thus representing different stages of a plausible history of the alphabet. Four libraries were comprised of the five, nine, and 16 most ancient amino acids, and all 20 extant residues for a direct side-by-side comparison. We characterised numerous variants from each library for their solubility and propensity to form secondary, tertiary or quaternary structures. Proteins from the two most ancient libraries were more likely to be soluble than those from the extant library. Several individual protein variants exhibited inducible protein folding and other traits typical of intrinsically disordered proteins. From these libraries, we can infer how primordial protein structure and function might have evolved with the genetic code.

AB - The universal genetic code of 20 amino acids is the product of evolution. It is believed that earlier versions of the code had fewer residues. Many theories for the order in which amino acids were integrated into the code have been proposed, considering factors ranging from prebiotic chemistry to codon capture. Several meta-analyses combined these theories to yield a feasible consensus chronology of the genetic code's evolution, but there is a dearth of experimental data to test the hypothesised order. We used combinatorial chemistry to synthesise libraries of random polypeptides that were based on different subsets of the 20 standard amino acids, thus representing different stages of a plausible history of the alphabet. Four libraries were comprised of the five, nine, and 16 most ancient amino acids, and all 20 extant residues for a direct side-by-side comparison. We characterised numerous variants from each library for their solubility and propensity to form secondary, tertiary or quaternary structures. Proteins from the two most ancient libraries were more likely to be soluble than those from the extant library. Several individual protein variants exhibited inducible protein folding and other traits typical of intrinsically disordered proteins. From these libraries, we can infer how primordial protein structure and function might have evolved with the genetic code.

KW - genetic code

KW - origin of proteins

KW - primordial peptides

KW - protein libraries

UR - http://www.scopus.com/inward/record.url?scp=85061585120&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85061585120&partnerID=8YFLogxK

U2 - 10.1002/cbic.201800668

DO - 10.1002/cbic.201800668

M3 - Article

C2 - 30511381

AN - SCOPUS:85061585120

SN - 1439-4227

VL - 20

SP - 846

EP - 856

JO - ChemBioChem

JF - ChemBioChem

IS - 6

ER -

Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced-Alphabet Libraries

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this