ISSPred

From DrugPedia: A Wikipedia for Drug discovery

(Difference between revisions)
Jump to: navigation, search
Line 1: Line 1:
==ISSPred==
==ISSPred==
-
[[Image:comp_DNA.jpg|frame]]
+
 
-
ISSPred is a prediction server that predicts Intein Post-translational modification (Protein Splicing) in proteins.It is SVM based method that exploits defferent features of protein such as amino acid composition, dipeptide composition. it has three models of prediction; Prediction of Intein Domains, prediction of intein containing proteins and Prediction of Intein's N and C terminal Splice sites.
+
ISSPred is a prediction server that predicts Intein Post-translational modification (Protein Splicing) in proteins.It is SVM based method that exploits different features of protein such as amino acid composition, dipeptide composition. it has three models of prediction; Prediction of Intein Domains, prediction of intein containing proteins and Prediction of Intein's N and C terminal Splice sites.
 +
 
 +
==Availability of ISSPred Webserver:==
 +
This server is hosted at [http://www.imtech.res.in/raghava/ Bioinformatics Center, IMTECH, Chandigarh] and is available at [http://www.imtech.res.in/raghava/isspred ISSPred]
 +
 
 +
==Algorithm behind TBpred==
 +
===Dataset===
 +
 
 +
Intein data was obtained from Inbase database. It represents intein sequences with corresponding N and C Terminal Splice Sites, covering all 3 kingdoms of life i.e. Archea (147), prokaryotes (201) and Eukaryotes (89).
 +
 
 +
From this they selected a total of 69 experimentally proved Inteins as annotated by Inbase Database. For Intein prediction they took these 69 as positive dataset and 600 protein sequences (randomly selected from three species as Archaeoglobus fulgidus (Archea) , Neisseria meningitidis (Prokaryote), Drosophila melanogaster (Eukaryote) due to absence of a single intein sequence in them as reported by InBase) as negative dataset.
 +
 
 +
For Intein's Splice Site prediction they collected N and C terminal splice site of 16 amino acid length as positive and 16 amino acid motifs obtained from the corresponding protein sequences by sliding window method considered as negative dataset.All protein sequences were obtained from Swiss-Prot Database.
 +
 
 +
===Support Vector Machine & Evaluation Procedures===
 +
 
 +
Support vector machines (SVMs) are a set of related supervised learning methods used for classification and regression.Machine learning tools have been proved useful and successful in identification of molecular patterns. Previously concept of SVM has been successfully utilized in the protein structure prediction, B-cell , T-cell epitope prediction, identification of the MHC binding peptides, sub cellular localization etc. In the present study, a freely downloadable package of SVM ie SVM light has been used to exploit different sequence features like Amino acid, Dipeptide composition and Binary patterns. For evaluation of prediction they used threshold dependent measures like Sensitivity and Specificity.
 +
<table width="100%" >
 +
<tr align="center"><td>[[Image:sensitivity.gif|frame]]</td><td>[[Image:specificity.gif|frame]]</td><td>[[Image:Image002.gif|frame]]</td><td>[[Image:MCC.gif|frame]]</td></tr>
 +
</table>
 +
 
 +
==Useful Links to various PTM related resources==
 +
 
 +
 
 +
<table border="1" width="70%" cellpadding="5" align="center">
 +
<tr>
 +
<td align="center"><font size="+1">[http://dbptm.mbc.nctu.edu.tw/ dbPTM]</font></td><td>Useful information repository of protein post-translational modification (PTMs).</td>
 +
</tr>
 +
<tr>
 +
<td align="center"><font size="+1">[http://prometheus.brc.mcw.edu/promost/ ProMoST]</font></td><td>Calculate the effect of single or multiple posttranslational modifications (PTMs) on protein isoelectric point (pI) and molecular weight and displays the calculated patterns as two-dimensional (2D) gel images.</td>
 +
 
 +
</tr>
 +
<tr>
 +
<td align="center"><font size="+1">[http://phospho.elm.eu.org/ PhosphoELM]</font></td><td>A Database of S/T/Y Phosphorylation sites</td>
 +
</tr>
 +
<tr>
 +
<td align="center"><font size="+1">[http://expasy.org/tools/#ptm Expasy Proteomics Tools]</font></td><td>Links of Database and prediction servers for different types of PTM at Expasy</td>
 +
</tr>
 +
</table>

Revision as of 06:55, 31 August 2008

Contents

ISSPred

ISSPred is a prediction server that predicts Intein Post-translational modification (Protein Splicing) in proteins.It is SVM based method that exploits different features of protein such as amino acid composition, dipeptide composition. it has three models of prediction; Prediction of Intein Domains, prediction of intein containing proteins and Prediction of Intein's N and C terminal Splice sites.

Availability of ISSPred Webserver:

This server is hosted at Bioinformatics Center, IMTECH, Chandigarh and is available at ISSPred

Algorithm behind TBpred

Dataset

Intein data was obtained from Inbase database. It represents intein sequences with corresponding N and C Terminal Splice Sites, covering all 3 kingdoms of life i.e. Archea (147), prokaryotes (201) and Eukaryotes (89).

From this they selected a total of 69 experimentally proved Inteins as annotated by Inbase Database. For Intein prediction they took these 69 as positive dataset and 600 protein sequences (randomly selected from three species as Archaeoglobus fulgidus (Archea) , Neisseria meningitidis (Prokaryote), Drosophila melanogaster (Eukaryote) due to absence of a single intein sequence in them as reported by InBase) as negative dataset.

For Intein's Splice Site prediction they collected N and C terminal splice site of 16 amino acid length as positive and 16 amino acid motifs obtained from the corresponding protein sequences by sliding window method considered as negative dataset.All protein sequences were obtained from Swiss-Prot Database.

Support Vector Machine & Evaluation Procedures

Support vector machines (SVMs) are a set of related supervised learning methods used for classification and regression.Machine learning tools have been proved useful and successful in identification of molecular patterns. Previously concept of SVM has been successfully utilized in the protein structure prediction, B-cell , T-cell epitope prediction, identification of the MHC binding peptides, sub cellular localization etc. In the present study, a freely downloadable package of SVM ie SVM light has been used to exploit different sequence features like Amino acid, Dipeptide composition and Binary patterns. For evaluation of prediction they used threshold dependent measures like Sensitivity and Specificity.

Useful Links to various PTM related resources

dbPTMUseful information repository of protein post-translational modification (PTMs).
ProMoSTCalculate the effect of single or multiple posttranslational modifications (PTMs) on protein isoelectric point (pI) and molecular weight and displays the calculated patterns as two-dimensional (2D) gel images.
PhosphoELMA Database of S/T/Y Phosphorylation sites
Expasy Proteomics ToolsLinks of Database and prediction servers for different types of PTM at Expasy