doi:10.1016/j.compbiolchem.2004.09.006
Copyright © 2004 Elsevier Ltd All rights reserved.
Comparing two K-category assignments by a K-category correlation coefficient
J. Gorodkin
, 
Center for Bioinformatics and Division of Genetics, IBHV, The Royal Veterinary and Agricultural University, Grønnegårdsvej 3, DK-1870 Frederiksberg C, Denmark
Received 1 September 2004;
revised 16 September 2004;
accepted 16 September 2004.
Available online 18 November 2004.
References and further reading may be available for this article. To view references and further reading you must
purchase this article.
Abstract
Predicted assignments of biological sequences are often evaluated by Matthews correlation coefficient. However, Matthews correlation coefficient applies only to cases where the assignments belong to two categories, and cases with more than two categories are often artificially forced into two categories by considering what belongs and what does not belong to one of the categories, leading to the loss of information. Here, an extended correlation coefficient that applies to K-categories is proposed, and this measure is shown to be highly applicable for evaluating prediction of RNA secondary structure in cases where some predicted pairs go into the category “unknown” due to lack of reliability in predicted pairs or unpaired residues. Hence, predicting base pairs of RNA secondary structure can be a three-category problem. The measure is further shown to be well in agreement with existing performance measures used for ranking protein secondary structure predictions. Server and software is available at http://rk.kvl.dk/
Keywords: Matthews correlation coefficient; RNA secondary structure; Protein secondary structure
Fig. 2. RNA secondary structure prediction comparisons. Each curve represents different k-values. In the previous study (Knudsen et al., 2004), k-values of up about 2 indicated useful predictions. Using R3 to measure the comparisons for three categories, this limit might be pushed towards larger k s. When the cutoffs become larger than 0.4, the R3 drops drastically off in agreement with the observation in Knudsen et al. (2004), where the base pair content drops off, also at the threshold cutoff of 0.4.
Fig. 3. RNA secondary structure prediction comparisons between original prediction and prediction based on randomized parameters, corresponding to various k-values. Comparisons are made for the initial as well as the cleaned alignments for the cutoffs (A) 0.05 and (B) 0.3. Three categories (K=3) were used and only predicted pairs (non-pairs, respectively) with probability higher than 0.05 (0.3, respectively) were accepted to belong to the category pair (non-pair, respectively), and otherwise were assigned to the “unknown” category. Similar plots are obtained for other cutoff values than 0.05 (0.3, respectively). The higher the k is, the more the randomization, and the worse the prediction becomes (compared to the original prediction). However, in agreement with our expectations, the performance on the cleaned alignment is consistently better, and the generalized correlation coefficient demonstrates this (when k=1, there is no randomization).
Table 1.
The secondary structure prediction methods listed in EVA for different data sets (one to six; August 2003)

The main performance evaluations from EVA are listed along with R3. The SOV, segment overlap and the three-state prediction accuracy Q3 measures are indicated along with the overall ranking (rank) by EVA (taking the error signals into account). As few of the downloaded data records were problematic to process due to their inconsistent formats, a revised Q3 was computed. (For example, some predicted assignments did not have the same length as the dssp assignment line, and the entire entry was therefore discarded.) This is largely in agreement with the original indicated Q3 values. The “SOV” and “rank” values are the original values. Computing error signals similar to those in EVA (Koh et al., 2003) for R3 still gives a ranking in complete agreement with the one listed here (data not shown). For each set, the ranking is according to Q3 values. The protein prediction methods are: phd (Rost, 1996), samt99_sec (Karplus et al., 1998), jpred (Cuff and Barton, 1999), psipred (Jones, 1999), prof_king (Ouali and King, 1999), apssp2 (Raghava, 2000), prospect (Xu and Xu, 2000), phdpsi (Przybylski and Rost, 2001), sspro2 (Pollastri et al., 2002), profsec (Rost, 2003).