[HTML][HTML] PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites

J Song, H Tan, AJ Perry, T Akutsu, GI Webb… - PloS one, 2012 - journals.plos.org
PloS one, 2012journals.plos.org
The ability to catalytically cleave protein substrates after synthesis is fundamental for all
forms of life. Accordingly, site-specific proteolysis is one of the most important post-
translational modifications. The key to understanding the physiological role of a protease is
to identify its natural substrate (s). Knowledge of the substrate specificity of a protease can
dramatically improve our ability to predict its target protein substrates, but this information
must be utilized in an effective manner in order to efficiently identify protein substrates by in …
The ability to catalytically cleave protein substrates after synthesis is fundamental for all forms of life. Accordingly, site-specific proteolysis is one of the most important post-translational modifications. The key to understanding the physiological role of a protease is to identify its natural substrate(s). Knowledge of the substrate specificity of a protease can dramatically improve our ability to predict its target protein substrates, but this information must be utilized in an effective manner in order to efficiently identify protein substrates by in silico approaches. To address this problem, we present PROSPER, an integrated feature-based server for in silico identification of protease substrates and their cleavage sites for twenty-four different proteases. PROSPER utilizes established specificity information for these proteases (derived from the MEROPS database) with a machine learning approach to predict protease cleavage sites by using different, but complementary sequence and structure characteristics. Features used by PROSPER include local amino acid sequence profile, predicted secondary structure, solvent accessibility and predicted native disorder. Thus, for proteases with known amino acid specificity, PROSPER provides a convenient, pre-prepared tool for use in identifying protein substrates for the enzymes. Systematic prediction analysis for the twenty-four proteases thus far included in the database revealed that the features we have included in the tool strongly improve performance in terms of cleavage site prediction, as evidenced by their contribution to performance improvement in terms of identifying known cleavage sites in substrates for these enzymes. In comparison with two state-of-the-art prediction tools, PoPS and SitePrediction, PROSPER achieves greater accuracy and coverage. To our knowledge, PROSPER is the first comprehensive server capable of predicting cleavage sites of multiple proteases within a single substrate sequence using machine learning techniques. It is freely available at http://lightning.med.monash.edu.au/PROSPER/.
PLOS