Abstract
β-Turn is a secondary protein structure type that plays an important role in protein configuration and function. Here, we introduced an approach of β-turn prediction that used the support vector machine (SVM) algorithm combined with predicted secondary structure information. The secondary structure information was obtained by using E-SSpred, a new secondary protein structure prediction method. A 7-fold cross validation based on the benchmark dataset of 426 non-homologous protein chains was used to evaluate the performance of our method. The prediction results broke the 80% Q total barrier and achieved Q total = 80.9%, MCC = 0.44, and Q predicted higher 0.9% when compared with the best method. The results in our research are coincident with the conclusion that β-turn prediction accuracy can be improved by inclusion of secondary structure information.
Similar content being viewed by others
Abbreviations
- SVM:
-
Support vector machine
- PSIPRED:
-
Position specific iterated prediction
- PSI-BLAST:
-
Position specific iterated-Basic local alignment search tool
- PROMOTIF:
-
A program to identify and analyze structural motifs in protein
- AA:
-
Amino acid
- PSSM:
-
Position-specific scoring matrices
- SSE:
-
Secondary structure elements
- H:
-
Helix
- E:
-
Strand
- C:
-
Coil
- nr:
-
Non-redundant
- BTPRED:
-
Beta-turns prediction
- RBF:
-
Radial basis function
- MOLEBRNN:
-
Prediction of beta-turns and beta-turn types by a novel bidirectional Elman-type recurrent neural network with multiple output layers
- BTSVM:
-
Prediction and analysis of beta-turns in proteins by support vector machine
- COUDES:
-
Chercher Ou` Une De′viation Existe Suˆ rement
- KNN:
-
K-nearest neighbor algorithm
- TC:
-
Tertiary contact
References
Aurora R, Srinivasan R, Rose GD (1994) Science 264:1126–1130
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Nucleic Acids Res 25:3389–3402
Blader M, Zhang XJ, Matthews BW (1993) Science 260:1637–1640
Chou KC (2000) Anal Biochem 286:1–16
Chou PY, Fasman GD (1974) Biochemistry 13:211–222
Cai YD, Liu XJ, Xu XB, Chou KC (2002) J Pept Sci 8:297–301
Cuff JA, Barton GJ (1999) Proteins Struct Funct Bioinform 34:508–519
Chang CC, Lin CJ LIBSVM: a library for support vector machines. Available: http://www.csie.ntu.edu.tw/~cjlin/libsvm
Deleo JM (1993) Receiver operating characteristic laboratory (ROCLAB): Software for developing decision strategies that account for uncertainty. In: Proceedings of the Second International Symposium on Uncertainty Modelling and Analysis. IEEE, Computer Society Press, College Park, pp. 318–25
Duan MJ, Huang M, Ma C, Li L, Zhou YH (2008) Protein Sci 17(9):1505–1512
Feng XY, Wang QQ, Zhang J, Nie FS, Li ML (2007) Vib Spectrosc 44(2):243–247
Fang YP, Guo YZ, Feng XY, Li ML (2008) Amino Acids 34(1):103–109
Fuchs PF, Alix AJ (2005) Proteins Struct Funct Bioinform 59:828–839
Guo J, Chen H, Sun ZR, Lin YL (2004) Proteins Struct Funct Bioinform 54:738–743
Guo YZ, Li ML, Lu MC, Wen ZN, Huang ZT (2006) Proteins Struct Funct Bioinform 65:55–60
Guo YZ, Yu LZ, Wen ZN, Li ML (2008) Nucleic Acids Res 36(9):3025–3030
Hutchinson EG, Thornton JM (1994) Protein Sci 3:2207–2216
Hu X, Li QJ (2008) Comput Chem 29(12):1867–1875
Hua SJ, Sun ZR (2001) J Mol Biol 308:397–407
Hutchinson EG, Thornton JM (1996) Science 5:212–220
Jones DT (1999) J Mol Biol 292:195–202
Kabsch W, Sander C (1983) Biopolymers 22:2577–2637
Kaur H, Raghava GP (2002) Bioinformatics 18:1508–1514
Kaur H, Raghava GP (2003) Protein Sci 12:627–634
Kaur H, Raghava GP (2004) Bioinformatics 20:2751–2758
Kim S (2004) Bioinformatics 20:40–44
Kim H, Park H (2003) Protein Eng 16:553–560
Kirschner A, Frishman D (2008) Gene 422(1–2):22–29
Lewis PN, Momany FA, Scheraga HA (1973) Biochem Biophys Acta 303:211–229
McGregor MJ, Flores TP, Sternberg MJE (1989) Protein Eng 2:521–526
Pham TH, Satou K, Ho TB (2003) Genome Inform 14:196–205
Presta LG, Rose GD (1988) Science 240:1632–1641
Padmanabhan S, Marquesee S, Ridgeway T, Laue TM, Baldwin RL (1990) Nature 344:268–270
Qin SB, He Y, Pan XM (2005) Proteins Struct Funct Bioinform 61:473–480
Richardson JS (1981) Adv Protein Chem 34:167–339
Rose GD, Gierasch L, Smith JA (1985) Adv Protein Chem 37:1–109
Rost B, Sander C (1994) Proteins Struct Funct Bioinform 19:55–72
Rost B, Sander C (1993) J Mol Biol 232:584–599
Richardson JS, Richardson DC (1988) Science 240:1648–1652
Rost B, Sander C, Schneider R (1994) J Mol Biol 235:13–26
Russell RB, Barton GJ (1993) J Mol Biol 234:951–957
Shepherd AJ, Gorse D, Thornton JM (1999) Protein Sci 8:1045–1055
Takano K, Yamagata Y, Yutani K (2000) Biochemistry 39:8655–8665
Tan FY, Feng XY, Fang Z, Li ML (2007) Amino Acids 33(4):669–675
Wilmot CM, Thornton JM (1990) Protein Eng 3:479–493
Yoon S, Welsh WJ (2004) Protein Sci 13:2149–2160
Zhang CT, Chou KC (1997) Biopolymers 41:673–702
Zhang Q, Yoon S, Welsh WJ (2005) Bioinformatics 21:2370–2374
Zheng C, Kurgan L (2008) BMC Bioinformatics 9:430
Acknowledgments
The work was funded by the National Natural Science Foundation of China (No. 20775052). The authors would like to express their cordial thanks to the unknown reviewers for providing comments on the manuscript.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, L., Fang, Y., Li, M. et al. Prediction of Beta-Turn in Protein Using E-SSpred and Support Vector Machine. Protein J 28, 175–181 (2009). https://doi.org/10.1007/s10930-009-9181-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10930-009-9181-4