Lin, Shoukai and Song, Qi and Tao, Huan and Wang, Wei and Wan, Weifeng and Huang, Jian and Xu, Chaoqun and Chebii, Vivien and Kitony, Justine and Que, Shufu and Harrison, Andrew and He, Huaqin (2015) Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites. Scientific Reports, 5 (1). 11940-. DOI https://doi.org/10.1038/srep11940
Lin, Shoukai and Song, Qi and Tao, Huan and Wang, Wei and Wan, Weifeng and Huang, Jian and Xu, Chaoqun and Chebii, Vivien and Kitony, Justine and Que, Shufu and Harrison, Andrew and He, Huaqin (2015) Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites. Scientific Reports, 5 (1). 11940-. DOI https://doi.org/10.1038/srep11940
Lin, Shoukai and Song, Qi and Tao, Huan and Wang, Wei and Wan, Weifeng and Huang, Jian and Xu, Chaoqun and Chebii, Vivien and Kitony, Justine and Que, Shufu and Harrison, Andrew and He, Huaqin (2015) Rice_Phospho 1.0: a new rice-specific SVM predictor for protein phosphorylation sites. Scientific Reports, 5 (1). 11940-. DOI https://doi.org/10.1038/srep11940
Abstract
<jats:title>Abstract</jats:title><jats:p>Experimentally-determined or computationally-predicted protein phosphorylation sites for distinctive species are becoming increasingly common. In this paper, we compare the predictive performance of a novel classification algorithm with different encoding schemes to develop a rice-specific protein phosphorylation site predictor. Our results imply that the combination of Amino acid occurrence Frequency with Composition of K-Spaced Amino Acid Pairs (AF-CKSAAP) provides the best description of relevant sequence features that surround a phosphorylation site. A support vector machine (SVM) using AF-CKSAAP achieves the best performance in classifying rice protein phophorylation sites when compared to the other algorithms. We have used SVM with AF-CKSAAP to construct a rice-specific protein phosphorylation sites predictor, Rice_Phospho 1.0 (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="http://bioinformatics.fafu.edu.cn/rice_phospho1.0">http://bioinformatics.fafu.edu.cn/rice_phospho1.0</jats:ext-link>). We measure the Accuracy (ACC) and Matthews Correlation Coefficient (MCC) of Rice_Phospho 1.0 to be 82.0% and 0.64, significantly higher than those measures for other predictors such as Scansite, Musite, PlantPhos and PhosphoRice. Rice_Phospho 1.0 also successfully predicted the experimentally identified phosphorylation sites in LOC_Os03g51600.1, a protein sequence which did not appear in the training dataset. In summary, Rice_phospho 1.0 outputs reliable predictions of protein phosphorylation sites in rice and will serve as a useful tool to the community.</jats:p>
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Plant Proteins; Area Under Curve; ROC Curve; Phosphorylation; Algorithms; Internet; User-Computer Interface; Oryza; Support Vector Machine |
Subjects: | Q Science > QH Natural history > QH301 Biology |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Mathematics, Statistics and Actuarial Science, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 23 Jul 2015 23:48 |
Last Modified: | 04 Dec 2024 06:16 |
URI: | http://repository.essex.ac.uk/id/eprint/14440 |
Available files
Filename: srep11940.pdf
Licence: Creative Commons: Attribution 3.0