Protein-Sol: A web tool for predicting protein solubility from sequence

Max Hebditch, M. Alejandro Carballo-Amador, Spyros Charonis, Robin Curtis, Jim Warwicker

Research output: Contribution to journal › Article › peer-review

62 Scopus citations

Abstract

Motivation: Protein solubility is an important property in industrial and therapeutic applications. Prediction is a challenge, despite a growing understanding of the relevant physicochemical properties.

Results: Protein-Sol is a web server for predicting protein solubility. Using available data for Escherichia coli protein solubility in a cell-free expression system, 35 sequence-based properties are calculated. Feature weights are determined from separation of low and high solubility subsets. The model returns a predicted solubility and an indication of the features which deviate most from average values. Two other properties are profiled in windowed calculation along the sequence: fold propensity and net segment charge. The utility of these additional features is demonstrated with the example of thioredoxin.
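To illustrate what a windowed net segment charge profile might look like, here is a minimal Python sketch. It is not the Protein-Sol implementation: the function name `windowed_net_charge`, the default window of 21 residues, and the simple charge table (K and R as +1, D and E as -1, histidine treated as neutral) are all assumptions made for the example.

```python
# Illustrative residue charges near neutral pH (assumption, not from the paper):
# lysine and arginine count as +1, aspartate and glutamate as -1, all others as 0.
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def windowed_net_charge(sequence, window=21):
    """Return the net charge of every full-length window along the sequence.

    The window size is a hypothetical default. Sequences shorter than the
    window yield an empty profile.
    """
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        return []
    charges = [CHARGE.get(residue, 0) for residue in seq]
    # Sum the first window, then slide: add the entering residue's charge
    # and subtract the charge of the residue that leaves the window.
    total = sum(charges[:window])
    profile = [total]
    for i in range(window, len(charges)):
        total += charges[i] - charges[i - window]
        profile.append(total)
    return profile
```

Each entry in the returned list corresponds to one window position, so stretches of strongly positive or negative values mark charged segments of the sequence.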

Original language: English (US)
Pages (from-to): 3098-3100
Number of pages: 3
Journal: Bioinformatics
Volume: 33
Issue number: 19
State: Published - Oct 1 2017

Bibliographical note

Funding Information:
This work was supported by the Biotechnology and Biological Sciences Research Council (BBSRC) via a BioProNet award (BB/L013770/1), and a PhD studentship to MH (BB/J014478/1). SC was supported by a PhD studentship (EP/K50332/1) from the Engineering and Physical Sciences Research Council (EPSRC).
