ScienceDirect® Home Skip Main Navigation Links
You have guest access to ScienceDirect. Find out more.
 
Home
Browse
My Settings
Alerts
Help
 Quick Search
 Search tips (Opens new window)
    Clear all fields    
advertisementadvertisement
Computational Biology and Chemistry
Volume 28, Issues 5-6, December 2004, Pages 367-374
 
Font Size: Decrease Font Size  Increase Font Size
 Abstract - selected
Article
Purchase PDF (263 K)

 
 
 
Related Articles in ScienceDirect
View More Related Articles
 
View Record in Scopus
 
doi:10.1016/j.compbiolchem.2004.09.006    How to Cite or Link Using DOI (Opens New Window)
Copyright © 2004 Elsevier Ltd All rights reserved.

Comparing two K-category assignments by a K-category correlation coefficient

J. GorodkinCorresponding Author Contact Information, E-mail The Corresponding Author

Center for Bioinformatics and Division of Genetics, IBHV, The Royal Veterinary and Agricultural University, Grønnegårdsvej 3, DK-1870 Frederiksberg C, Denmark

Received 1 September 2004; 
revised 16 September 2004; 
accepted 16 September 2004. 
Available online 18 November 2004.

Purchase the full-text article



References and further reading may be available for this article. To view references and further reading you must purchase this article.

Abstract

Predicted assignments of biological sequences are often evaluated by Matthews correlation coefficient. However, Matthews correlation coefficient applies only to cases where the assignments belong to two categories, and cases with more than two categories are often artificially forced into two categories by considering what belongs and what does not belong to one of the categories, leading to the loss of information. Here, an extended correlation coefficient that applies to K-categories is proposed, and this measure is shown to be highly applicable for evaluating prediction of RNA secondary structure in cases where some predicted pairs go into the category “unknown” due to lack of reliability in predicted pairs or unpaired residues. Hence, predicting base pairs of RNA secondary structure can be a three-category problem. The measure is further shown to be well in agreement with existing performance measures used for ranking protein secondary structure predictions. Server and software is available at http://rk.kvl.dk/

Keywords: Matthews correlation coefficient; RNA secondary structure; Protein secondary structure

Article Outline

1. Introduction
2. The K-category correlation coefficient
2.1. RK
2.2. Relation to linear least square fitting
2.3. The discrete case
3. Material and methods
3.1. 5 HIV-1 RNA sequences
3.2. Protein sequence data: EVA prediction on secondary structure
4. Results
4.1. Application to RNA secondary prediction
4.2. Application to protein secondary structure prediction
5. Discussion
Acknowledgements
References




 
Home
Browse
My Settings
Alerts
Help
Elsevier.com (Opens new window)
About ScienceDirect  |  Contact Us  |  Information for Advertisers  |  Terms & Conditions  |  Privacy Policy
Copyright © 2008 Elsevier B.V. All rights reserved. ScienceDirect® is a registered trademark of Elsevier B.V.