Skip to main content

The Effectiveness of Gene Ontology in Assessing Functionally Coherent Groups of Genes: A Case Study

  • Conference paper
  • First Online:
Trends in Applied Knowledge-Based Systems and Data Science (IEA/AIE 2016)

Abstract

In recent years, ontologies have been extensively used in many biological fields to support a variety of applications. A well known example is Gene Ontology (GO) that organizes a vocabulary of terms about gene products and functions. GO offers an effective support for evaluating the similarity between two genes by measuring the distance of their respective GO terms. The advent of high-throughput technologies and the consequent production of lists of genes associated with specific conditions is stressing the need of recognizing groups of genes which cooperate within a specific biological event. This paper compares six popular similarity measures on GO in order to evaluate their effectiveness in discovering functionally coherent genes from an assigned list of genes. The aim is to discover which measure performs best. We also investigate about the potential of GO in evaluating the similarity of a set of genes according to its cardinality and the characteristics of the similarity measures. Experiments take into consideration: (a) 84 groups of genes sharing similar molecular functions through the production of enzymes within the human organism; (b) 150 groups of randomly selected genes. The paper demonstrates the efficient support of GO in detecting functionally related groups of genes, despite the GO’s hierarchical structure limits the representation of richer forms of knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Consortium GO: Gene Ontology annotations and resources. Nucleic Acids Res. 41, D530–D535 (2013)

    Article  Google Scholar 

  2. Lord, P.W., Stevens, R.D., Brass, A., Goble, C.A.: Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation. Bioinformatics 19(10), 1275–1283 (2003)

    Article  Google Scholar 

  3. Lee, W.N., Shah, N., Sundlass, K., Musen, M.: Comparison of Ontology-based Semantic-Similarity Measures. AMIA Annu Symp Proceedings 2008, V2008:384-388

    Google Scholar 

  4. Mazandu, G.K., Mulder, N.J.: Information content-based gene ontology semantic similarity approaches: toward a unified framework theory. Biomed Res. Int. 2013, Article ID 292063 (2013)

    Google Scholar 

  5. Guzzi, P.H., Mina, M., Guerra, C., Cannataro, M.: Semantic similarity analysis of protein data: assessment with biological features and issues. Briefings Bioinf. 13(5), 569–585 (2012)

    Article  Google Scholar 

  6. Peng, J., Wang, Y., Chen, J.: Towards integrative gene functional similarity measurement. BMC Bioinformatics 15(Suppl 2), S5 (2014)

    Article  Google Scholar 

  7. Richards, A.J., Muller, B., Shotwell, M., Cowart, L.A., Rohrer, B., Lu, X.: Assessing the functional coherence of gene sets with metrics based on the Gene Ontology graph. Bioinformatics 26(12), i79–i87 (2010)

    Article  Google Scholar 

  8. Kandula, S., Zeng-Treitler, Q.: Exploring relations among semantic groups: a comparison of concept co-occurrence in biomedical sources. Stud. Health Technol. Inform. 160, 995–999 (2010)

    Google Scholar 

  9. Teng, Z., Guo, M., Liu, X., Dai, Q., Wang, C., Xuan, P.: Measuring gene functional similarity based on group-wise comparison of GO terms. Bioinformatics 29, 1424–1432 (2013)

    Article  Google Scholar 

  10. Soldatos, T.G., Perdigão, N., Brown, N.P., Sabir, K.S., O’Donoghue, S.I.: How to learn about gene function: text-mining or ontologies? Methods 74, 3–15 (2015)

    Article  Google Scholar 

  11. Camon, E., Magrane, M., Barrell, D., Lee, V., Dimmer, E., Maslen, J., Binns, D., Harte, N., Lopez, R., Apweiler, R.: The Gene Ontology annotation (GOA) database: sharing knowledge in Uniprot with Gene Ontology. Nucleic Acids Res. 32, D262–D266 (2004)

    Article  Google Scholar 

  12. Pesquita, C., Faria, D., Falcao, A., Lord, P., Couto, F.: Semantic similarity in biomedical ontologies. PLoS Comput. Biol. 5(7), e1000443 (2009)

    Article  MathSciNet  Google Scholar 

  13. Pesquita, C., Faria, D., Bastos, H., Ferreira, A., Falcao, A., Couto, F.: Metrics for GO based protein semantic similarity: a systematic evaluation. BMC Bioinformatics 9(Suppl. 5), S4 (2008)

    Article  Google Scholar 

  14. Resnik, P.: Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999)

    MATH  Google Scholar 

  15. Lin, D.: An information-theoretic definition of similarity. In: Proceedings of the 15th International Conference on Machine Learning, pp. 296–304. Morgan Kaufmann, San Francisco (1998)

    Google Scholar 

  16. Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceedings of International Conference Research on Computational Linguistics, Taiwan, pp. 9008–9022 (1997)

    Google Scholar 

  17. Schlicker, A., Domingues, F., Rahnenfuhrer, J., Lengauer, T.: A new measure for functional similarity of gene products based on Gene Ontology. BMC Bioinformatics 7, 302 (2006)

    Article  Google Scholar 

  18. Yu, H., Jansen, R., Stolovitzky, G., Gerstein, M.: Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Bioinformatics 23(16), 2163–2173 (2007)

    Article  Google Scholar 

  19. Wang, J., Du, Z., Payattakool, R., Yu, P., Chen, C.: A new method to measure the semantic similarity of GO terms. Bioinformatics 23(10), 1274–1281 (2007)

    Article  Google Scholar 

  20. http://humancyc.org

  21. Peng, J., Li, H., Jiang, Q., Wang, Y., Chen, J.: An integrative approach for measuring semantic similarities using gene ontology. BMC Syst. Biol. 8(Suppl 5), S8 (2014)

    Article  Google Scholar 

  22. Peng, J., Uygun, S., Kim, T., Wang, Y., Rhee, S.Y., Chen, J.: Measuring semantic similarities by combining gene ontology annotations and gene co-function networks. BMC Bioinformatics 16, 44 (2015)

    Article  Google Scholar 

  23. Yang, H., et al.: Improving GO semantic similarity measures using downward random walks. Bioinformatics 28, 1383–1389 (2012)

    Article  Google Scholar 

  24. Pedersen, T., Pakhomov, S.V.S., Patwardhan, S., Chute, C.G.: Measures of semantic similarity and relatedness in the biomedical domain. J. Biomed. Inform. 40(3), 288–299 (2007)

    Article  Google Scholar 

  25. Dessì, N., Pascariello, E., Pes, B.: Integrating ontological information about genes. In: 2014 IEEE 23rd International WETICE Conference, pp. 417–422. IEEE (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Barbara Pes .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Dessì, N., Pes, B. (2016). The Effectiveness of Gene Ontology in Assessing Functionally Coherent Groups of Genes: A Case Study. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42007-3_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42006-6

  • Online ISBN: 978-3-319-42007-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics