Abstract
In recent years, ontologies have been extensively used in many biological fields to support a variety of applications. A well known example is Gene Ontology (GO) that organizes a vocabulary of terms about gene products and functions. GO offers an effective support for evaluating the similarity between two genes by measuring the distance of their respective GO terms. The advent of high-throughput technologies and the consequent production of lists of genes associated with specific conditions is stressing the need of recognizing groups of genes which cooperate within a specific biological event. This paper compares six popular similarity measures on GO in order to evaluate their effectiveness in discovering functionally coherent genes from an assigned list of genes. The aim is to discover which measure performs best. We also investigate about the potential of GO in evaluating the similarity of a set of genes according to its cardinality and the characteristics of the similarity measures. Experiments take into consideration: (a) 84 groups of genes sharing similar molecular functions through the production of enzymes within the human organism; (b) 150 groups of randomly selected genes. The paper demonstrates the efficient support of GO in detecting functionally related groups of genes, despite the GO’s hierarchical structure limits the representation of richer forms of knowledge.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Consortium GO: Gene Ontology annotations and resources. Nucleic Acids Res. 41, D530–D535 (2013)
Lord, P.W., Stevens, R.D., Brass, A., Goble, C.A.: Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation. Bioinformatics 19(10), 1275–1283 (2003)
Lee, W.N., Shah, N., Sundlass, K., Musen, M.: Comparison of Ontology-based Semantic-Similarity Measures. AMIA Annu Symp Proceedings 2008, V2008:384-388
Mazandu, G.K., Mulder, N.J.: Information content-based gene ontology semantic similarity approaches: toward a unified framework theory. Biomed Res. Int. 2013, Article ID 292063 (2013)
Guzzi, P.H., Mina, M., Guerra, C., Cannataro, M.: Semantic similarity analysis of protein data: assessment with biological features and issues. Briefings Bioinf. 13(5), 569–585 (2012)
Peng, J., Wang, Y., Chen, J.: Towards integrative gene functional similarity measurement. BMC Bioinformatics 15(Suppl 2), S5 (2014)
Richards, A.J., Muller, B., Shotwell, M., Cowart, L.A., Rohrer, B., Lu, X.: Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph. Bioinformatics 26(12), i79–i87 (2010)
Kandula, S., Zeng-Treitler, Q.: Exploring relations among semantic groups: a comparison of concept co-occurrence in biomedical sources. Stud. Health Technol. Inform. 160, 995–999 (2010)
Teng, Z., Guo, M., Liu, X., Dai, Q., Wang, C., Xuan, P.: Measuring gene functional similarity based on group-wise comparison of GO terms. Bioinformatics 29, 1424–1432 (2013)
Soldatos, T.G., Perdigão, N., Brown, N.P., Sabir, K.S., O’Donoghue, S.I.: How to learn about gene function: text-mining or ontologies? Methods 74, 3–15 (2015)
Camon, E., Magrane, M., Barrell, D., Lee, V., Dimmer, E., Maslen, J., Binns, D., Harte, N., Lopez, R., Apweiler, R.: The Gene Ontology annotation (GOA) database: sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Res. 32, D262–D266 (2004)
Pesquita, C., Faria, D., Falcao, A., Lord, P., Couto, F.: Semantic similarity in biomedical ontologies. PLoS Comput. Biol. 5(7), e1000443 (2009)
Pesquita, C., Faria, D., Bastos, H., Ferreira, A., Falcao, A., Couto, F.: Metrics for GO based protein semantic similarity: a systematic evaluation. BMC Bioinformatics 9(Suppl. 5), S4 (2008)
Resnik, P.: Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999)
Lin, D.: An information-theoretic definition of similarity. In: Proceedings of the 15th International Conference on Machine Learning, pp. 296–304. Morgan Kaufmann, San Francisco (1998)
Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceedings of International Conference Research on Computational Linguistics, Taiwan, pp. 9008–9022 (1997)
Schlicker, A., Domingues, F., Rahnenfuhrer, J., Lengauer, T.: A new measure for functional similarity of gene products based on Gene Ontology. BMC Bioinformatics 7, 302 (2006)
Yu, H., Jansen, R., Stolovitzky, G., Gerstein, M.: Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Bioinformatics 23(16), 2163–2173 (2007)
Wang, J., Du, Z., Payattakool, R., Yu, P., Chen, C.: A new method to measure the semantic similarity of GO terms. Bioinformatics 23(10), 1274–1281 (2007)
Peng, J., Li, H., Jiang, Q., Wang, Y., Chen, J.: An integrative approach for measuring semantic similarities using gene ontology. BMC Syst. Biol. 8(Suppl 5), S8 (2014)
Peng, J., Uygun, S., Kim, T., Wang, Y., Rhee, S.Y., Chen, J.: Measuring semantic similarities by combining gene ontology annotations and gene co-function networks. BMC Bioinformatics 16, 44 (2015)
Yang, H., et al.: Improving GO semantic similarity measures using downward random walks. Bioinformatics 28, 1383–1389 (2012)
Pedersen, T., Pakhomov, S.V.S., Patwardhan, S., Chute, C.G.: Measures of semantic similarity and relatedness in the biomedical domain. J. Biomed. Inform. 40(3), 288–299 (2007)
Dessì, N., Pascariello, E., Pes, B.: Integrating ontological information about genes. In: 2014 IEEE 23rd International WETICE Conference, pp. 417–422. IEEE (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Dessì, N., Pes, B. (2016). The Effectiveness of Gene Ontology in Assessing Functionally Coherent Groups of Genes: A Case Study. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-42007-3_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42006-6
Online ISBN: 978-3-319-42007-3
eBook Packages: Computer ScienceComputer Science (R0)