Abstract
The growing available genomic and proteomic information gives new opportunities for novel research approaches and biomedical discoveries through effective data management and analysis support. Integration and comprehensive evaluation of available controlled data can highlight information patterns leading to unveil new biomedical knowledge. For this purpose, the University Politecnico di Milano, is developing a software framework to create and maintain a Genomic and Proteomic Data Warehouse (GPDW) that integrates information from many data sources on the basis of a conceptual data model that relates molecular entities and biomedical features.
Here we illustrate and discuss the extension of framework for integrating biomolecular interaction data in the GPDW. The comprehensive and mining of the reliable interaction data together with the other biomolecular information in the GPDW constitutes a powerful computational support for novel biomedical knowledge discoveries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ceol, A., Chatr Aryamontri, A., Licata, L., Peluso, D., Briganti, L., Perfetto, L., Castagnoli, L., Cesareni, G.: MINT, the molecular interaction database: 2009 update. Nucleic Acids Res. 38(Database issue), D532–D539 (2009)
Aranda, B., Achuthan, P., Alam-Faruque, Y., Armean, I., Bridge, A., Derow, C., Feuermann, M., Ghanbarian, A.T., Kerrien, S., Khadake, J., et al.: The IntAct molecular interaction database in 2010. Nucleic Acids Res. 38, D525–D531 (2010)
Jayapandian, M., Chapman, A., Tarcea, V.G., Yu, C., Elkiss, A., Ianni, A., Liu, B., Nandi, A., Santos, C., Andrews, P., et al.: Michigan Molecular Interactions (MiMI): putting the jigsaw puzzle together. Nucleic Acids Res. 35, 566–571 (2007)
Kerrien, S., Orchard, S., Montecchi-Palazzi, L., Aranda, B., Quinn, A.F., Vinod, N., Bader, G.D., Xenarios, I., Wojcik, J., Sherman, D., et al.: Broadening the horizonlevel 2.5 of the HUPO-PSI format for molecular interactions. BMC Biol. 5, 44 (2007)
Orchard, S., Kerrien, S., Jones, P., Ceol, A., Chatr-Aryamontri, A., Salwinski, L., Nerothin, J., Hermjakob, H.: Submit your interaction data the IMEx way: a step by step guide to trouble-free deposition. Proteomics 7(suppl. 1), 28–34 (2007)
Kulikova, T., Akhtar, R., Aldebert, P., Althorpe, N., Andersson, M., Baldwin, A., Bates, K., Bhattacharyya, S., Bower, L., Browne, P., et al.: EMBL Nucleotide Sequence Database in 2006. Nucleic Acids Res. 35, D16–D20 (2007)
Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Wheeler, D.L.: GenBank. Nucleic Acids Res. 36, 25–30 (2008)
Sugawara, H., Ogasawara, O., Okubo, K., Gojobori, T., Tateno, Y.: DDBJ with new system and face. Nucleic Acids Res. 36, D22–D24 (2008)
Kasprzyk, A., Keefe, D., Smedley, D., London, D., Spooner, W., Melsopp, C., et al.: EnsMart: A Generic System for Fast and Flexible Access to Biological Data. Genome Res. 14(1), 160–169 (2004)
Lee, T.J., Pouliot, Y., Wagner, V., Gupta, P., Stringer-Calvert, D.W., Tenenbaum, J.D., Karp, P.D.: BioWarehouse: A Bioinformatics Database Warehouse Toolkit. BMC Bioinformatics 7(170), 1–14 (2006)
Masseroli, M., Martucci, D., Pinciroli, F.: GFINDer: Genome Function INtegrated Discoverer through dynamic annotation, statistical analysis, and mining. Nucleic Acids Res. 32(suppl. 2), W293–W300 (2004)
Masseroli, M., Galati, O., Pinciroli, F.: GFINDer: genetic disease and phenotype location statistical analysis and mining of dynamically annotated gene lists. Nucleic Acids Res. 33(suppl. 2), W717–W723 (2005)
Batini, C., Scannapieco, M.: Data Quality: Concepts, Methodologies and Techniques. Springer (2006)
Batini, C., Cappiello, C., Francalanci, C., Maurino, A.: Methodologies for Data Quality Assessment and Improvement. ACM Comput. Surv. 41(3), 16, 1–52 (2009)
Madnick, S.E., Wang, R.Y., Lee, Y.W., Zhu, H.: Overview and Framework for Data and Information Quality Research. ACM J. Data Inform. Quality 1(1), 2, 1–22 (2009)
Ghisalberti, G., Masseroli, M., Tettamanti, L.: Quality Controls in Integrative Approaches to Detect Errors and Inconsistencies in Biological Databases. J. Integr. Bioinform. 7(3), 2010–2119 (2010)
Hubbard, T.J., Aken, B.L., Ayling, S., Ballester, B., Beal, K., Bragin, E., Brent, S., Chen, Y., Clapham, P., Clarke, L., et al.: Ensembl 2009. Nucleic Acids Res. 37(Database issue), 690–697 (2009)
Pruitt, K.D., Tatusova, T., Maglott, D.R.: NCBI reference sequences (RefSeq): a Curated Non-Redundant Sequence Database of Genomes, Transcripts and Proteins. Nucleic Acids Res. 35(Database issue), D61–D65 (2007)
UniProt Consortium. The Universal Protein Resource (UniProt) 2009. Nucleic Acids Res. 37(Database issue), D169–D174 (2009)
Hermjakob, H., Montecchi-Palazzi, L., Bader, G., Wojcik, J., Salwinski, L., Ceol, A., Moore, S., Orchard, S., Sarkans, U., et al.: The HUPO PSI’s molecular interaction format–a community standard for the representation of protein interaction data. Nature Biotechnology 22(2), 177–183 (2004)
Gasteiger, E., Gattiker, A., Hoogland, C., Ivanyi, I., Appel, R.D., Bairoch, A.: ExPASy: the proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 1, 31(13), 3784–3788 (2003)
Kanehisa, M., Goto, S.: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28(1), 27–30 (2000)
Amberger, J., Bocchini, C.A., Scott, A.F., Hamosh, A.: McKusick’s Online Mendelian Inheritance in Man (OMIM). Nucleic Acids Res. 37(Database issue), 793–796 (2009)
Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., et al.: Gene Ontology: Tool for the Unification of Biology. Nat. Genet. 25(1), 25–29 (2000)
Matthews, L., Gopinath, G., Gillespie, M., Caudy, M., Croft, D., de Bono, B., et al.: Reactome Knowledgebase of Human Biological Pathways and Processes. Nucleic Acids Res. 37(Database issue), D619–D622 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Canakoglu, A., Ghisalberti, G., Masseroli, M. (2012). Integration of Biomolecular Interaction Data in a Genomic and Proteomic Data Warehouse to Support Biomedical Knowledge Discovery. In: Biganzoli, E., Vellido, A., Ambrogi, F., Tagliaferri, R. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2011. Lecture Notes in Computer Science(), vol 7548. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35686-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-35686-5_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35685-8
Online ISBN: 978-3-642-35686-5
eBook Packages: Computer ScienceComputer Science (R0)