Abstract
In the current understanding, translation of genomic sequences into proteins is the most important path for realization of genome information. In exercising their intended function, proteins work together through various forms of direct (physical) or indirect interaction mechanisms. For a variety of basic functions, many proteins form a large complex representing a molecular machine or a macromolecular super-structural building block. After several high-throughput techniques for detection of protein–protein interactions had matured, protein interaction data became available in a large scale and curated databases for protein–protein interactions (PPIs) are a new necessity for efficient research. Here, their scope, annotation quality, and retrieval tools are reviewed. In addition, attention is paid to portals that provide unified access to a variety of such databases with added annotation value.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Uetz, P., Giot, L., Cagney, G., Mansfield, T. A., Judson, R. S., Knight, J. R., Lockshon, D., Narayan, V., Srinivasan, M., Pochart, P., et al. (2000) A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403, 623–627.
Yu, H., Braun, P., Yildirim, M. A., Lemmens, I., Venkatesan, K., Sahalie, J., Hirozane-Kishikawa, T., Gebreab, F., Li, N., Simonis, N., et al. (2008) High-quality binary protein interaction map of the yeast interactome network. Science 322, 104–110.
Hughes, T. R., Marton, M. J., Jones, A. R., Roberts, C. J., Stoughton, R., Armour, C. D., Bennett, H. A., Coffey, E., Dai, H., He, Y. D., et al. (2000) Functional discovery via a compendium of expression profiles. Cell 102, 109–126.
Cho, R. J., Campbell, M. J., Winzeler, E. A., Steinmetz, L., Conway, A., Wodicka, L., Wolfsberg, T. G., Gabrielian, A. E., Landsman, D., Lockhart, D. J., et al. (1998) A genome-wide transcriptional analysis of the mitotic cell cycle. Mol Cell 2, 65–73.
Tong, A. H., Evangelista, M., Parsons, A. B., Xu, H., Bader, G. D., Page, N., Robinson, M., Raghibizadeh, S., Hogue, C. W., Bussey, H., et al. (2001) Systematic genetic analysis with ordered arrays of yeast deletion mutants. Science 294, 2364–2368.
Marcotte, E. M., Pellegrini, M., Ng, H. L., Rice, D. W., Yeates, T. O., Eisenberg, D. (1999) Detecting protein function and protein-protein interactions from genome sequences. Science 285, 751–753.
Date, S. V., Marcotte, E. M. (2003) Discovery of uncharacterized cellular systems by genome-wide analysis of functional linkages. Nat Biotechnol 21, 1055–1062.
Enright, A. J., Iliopoulos, I., Kyrpides, N. C., Ouzounis, C. A. (1999) Protein interaction maps for complete genomes based on gene fusion events. Nature 402, 86–90.
Kamburov, A., Goldovsky, L., Freilich, S., Kapazoglou, A., Kunin, V., Enright, A. J., Tsaftaris, A., Ouzounis, C. A. (2007) Denoising inferred functional association networks obtained by gene fusion analysis. BMC Genomics 8, 460.
Dandekar, T., Snel, B., Huynen, M., Bork, P. (1998) Conservation of gene order: a fingerprint of proteins that physically interact. Trends Biochem Sci 23, 324–328.
Overbeek, R., Fonstein, M., D’Souza, M., Pusch, G. D., Maltsev, N. (1999) The use of gene clusters to infer functional coupling. Proc Natl Acad Sci USA 96, 2896–2901.
Overbeek, R., Fonstein, M., D’Souza, M., Pusch, G. D., Maltsev, N. (1999) Use of contiguity on the chromosome to predict functional coupling. In Silico Biol 1, 93–108.
Korbel, J. O., Jensen, L. J., von, M. C., Bork, P. (2004) Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs. Nat Biotechnol 22, 911–917.
Makarova, K. S., Koonin, E. V. (2003) Filling a gap in the central metabolism of archaea: prediction of a novel aconitase by comparative-genomic analysis. FEMS Microbiol Lett 227, 17–23.
Pellegrini, M., Marcotte, E. M., Thompson, M. J., Eisenberg, D. Yeates, T. O. (1999) Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc Natl Acad Sci USA 96, 4285–4288.
Sato, T., Yamanishi, Y., Kanehisa, M., Toh, H. (2005) The inference of protein-protein interactions by co-evolutionary analysis is improved by excluding the information about the phylogenetic relationships. Bioinformatics. 21, 3482–3489.
Sato, T., Yamanishi, Y., Horimoto, K., Kanehisa, M., Toh, H. (2006) Partial correlation coefficient between distance matrices as a new indicator of protein-protein interactions. Bioinformatics 22, 2488–2492.
Morett, E., Korbel, J. O., Rajan, E., Saab-Rincon, G., Olvera, L., Olvera, M., Schmidt, S., Snel, B., Bork, P. (2003) Systematic discovery of analogous enzymes in thiamin biosynthesis. Nat Biotechnol 21, 790–795.
Bader, G. D., Betel, D., Hogue, C. W. (2003) BIND: the Biomolecular Interaction Network Database. Nucleic Acids Res 31, 248–250.
Bader, G. D. and Hogue, C. W. (2000) BIND – a data specification for storing and describing biomolecular interactions, molecular complexes and pathways. Bioinformatics 16, 465–477.
Fraser, H. B., Plotkin, J. B. (2007) Using protein complexes to predict phenotypic effects of gene mutation. Genome Biol 8, R252.
Xenarios, I., Salwinski, L., Duan, X. J., Higney, P., Kim, S. M., Eisenberg, D. (2002) DIP, the Database of Interacting Proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Res 30, 303–305.
Zanzoni, A., Montecchi-Palazzi, L., Quondam, M., Ausiello, G., Helmer-Citterich, M., Cesareni, G. (2002) MINT: a Molecular INTeraction database. FEBS Lett 513, 135–140.
Kerrien, S., am-Faruque, Y., Aranda, B., Bancarz, I., Bridge, A., Derow, C., Dimmer, E., Feuermann, M., Friedrichsen, A., Huntley, R., et al. (2007) IntAct – open source resource for molecular interaction data. Nucleic Acids Res 35, D561–D565.
McDowall, M. D., Scott, M. S., Barton, G. J. (2009) PIPs: human protein-protein interaction prediction database. Nucleic Acids Res 37, D651–D656.
Brown, K. R., Jurisica, I. (2005) Online predicted human interaction database. Bioinformatics 21, 2076–2082.
Persico, M., Ceol, A., Gavrila, C., Hoffmann, R., Florio, A., Cesareni, G. (2005) HomoMINT: an inferred human network based on orthology mapping of protein interactions discovered in model organisms. BMC Bioinformatics 6(Suppl 4), S21.
Jensen, L. J., Kuhn, M., Stark, M., Chaffron, S., Creevey, C., Muller, J., Doerks, T., Julien, P., Roth, A., Simonovic, M., et al. (2009) STRING 8 – a global view on proteins and their functional interactions in 630 organisms. Nucleic Acids Res 37, D412–D416.
von Mering, C., Huynen, M., Jaeggi, D., Schmidt, S., Bork, P., Snel, B. (2003) STRING: a database of predicted functional associations between proteins. Nucleic Acids Res 31, 258–261.
Mathivanan, S., Periaswamy, B., Gandhi, T. K., Kandasamy, K., Suresh, S., Mohmood, R., Ramachandra, Y. L., Pandey, A. (2006) An evaluation of human protein-protein interaction data in the public domain. BMC Bioinformatics 7(Suppl 5), S19.
Noirot, P., Noirot-Gros, M. F. (2004) Protein interaction networks in bacteria. Curr Opin Microbiol 7, 505–512.
Su, C., Peregrin-Alvarez, J. M., Butland, G., Phanse, S., Fong, V., Emili, A., Parkinson, J. (2008) Bacteriome.org – an integrated protein interaction database for E. coli. Nucleic Acids Res 36, D632–D636.
Bader, G. D., Cary, M. P., Sander, C. (2006) Pathguide: a pathway resource list. Nucleic Acids Res 34, D504–D506.
Graeber, T. G., Eisenberg, D. (2001) Bioinformatic identification of potential autocrine signaling loops in cancers from gene expression profiles. Nat Genet 29, 295–300.
Hermjakob, H., Montecchi-Palazzi, L., Bader, G., Wojcik, J., Salwinski, L., Ceol, A., Moore, S., Orchard, S., Sarkans, U., von Mering, C., et al. (2004) The HUPO PSI’s molecular interaction format – a community standard for the representation of protein interaction data. Nat Biotechnol 22, 177–183.
Kerrien, S., Orchard, S., Montecchi-Palazzi, L., Aranda, B., Quinn, A. F., Vinod, N., Bader, G. D., Xenarios, I., Wojcik, J., Sherman, D., et al. (2007) Broadening the horizon – level 2.5 of the HUPO-PSI format for molecular interactions. BMC Biol 5, 44.
Stromback, L., Lambrix, P. (2005) Representations of molecular pathways: an evaluation of SBML, PSI MI and BioPAX. Bioinformatics 21, 4401–4407.
Breitkreutz, B. J., Stark, C., Tyers, M. (2003) Osprey: a network visualization system. Genome Biol 4, R22.
Chiang, T., Li, N., Orchard, S., Kerrien, S., Hermjakob, H., Gentleman, R., Huber, W. (2008) Rintact: enabling computational analysis of molecular interaction data from the IntAct repository. Bioinformatics 24, 1100–1101.
Lomax, J. (2005) Get ready to GO! A biologist’s guide to the Gene Ontology. Brief Bioinformatics 6, 298–304.
Hunter, S., Apweiler, R., Attwood, T. K., Bairoch, A., Bateman, A., Binns, D., Bork, P., Das, U., Daugherty, L., Duquenne, L., et al. (2009) InterPro: the integrative protein signature database. Nucleic Acids Res 37, D211–D215.
Breitkreutz, B. J., Stark, C., Reguly, T., Boucher, L., Breitkreutz, A., Livstone, M., Oughtred, R., Lackner, D. H., Bahler, J., Wood, V., et al. (2008) The BioGRID Interaction Database: 2008 update. Nucleic Acids Res 36, D637–D640.
Stark, C., Breitkreutz, B. J., Reguly, T., Boucher, L., Breitkreutz, A., Tyers, M. (2006) BioGRID: a general repository for interaction datasets. Nucleic Acids Res 34, D535–D539.
Keshava Prasad, T. S., Goel, R., Kandasamy, K., Keerthikumar, S., Kumar, S., Mathivanan, S., Telikicherla, D., Raju, R., Shafreen, B., Venugopal, A., et al. (2009) Human Protein Reference Database – 2009 update. Nucleic Acids Res 37, D767–D772.
Guldener, U., Munsterkotter, M., Oesterheld, M., Pagel, P., Ruepp, A., Mewes, H. W. and Stumpflen, V. (2006) MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res 34, D436–D441.
Guldener, U., Munsterkotter, M., Kastenmuller, G., Strack, N., van Helden, J., Lemer, C., Richelles, J., Wodak, S. J., Garcia-Martenez, J., Perez-Ortin, J. E., et al. (2005) CYGD: the Comprehensive Yeast Genome Database. Nucleic Acids Res 33, D364–D368.
Wuchty, S. (2004) Evolution and topology in the yeast protein interaction network. Genome Res 14, 1310–1314.
von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S. G., Fields, S., Bork, P. (2002) Comparative assessment of large-scale data sets of protein-protein interactions. Nature 417, 399–403.
Jansen, R., Yu, H., Greenbaum, D., Kluger, Y., Krogan, N. J., Chung, S., Emili, A., Snyder, M., Greenblatt, J. F., Gerstein, M. (2003) A Bayesian networks approach for predicting protein-protein interactions from genomic data. Science 302, 449–453.
Snel, B., Lehmann, G., Bork, P., Huynen, M. A. (2000) STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene. Nucleic Acids Res 28, 3442–3444.
von Mering, C., Jensen, L. J., Kuhn, M., Chaffron, S., Doerks, T., Kruger, B., Snel, B., Bork, P. (2007) STRING 7 – recent developments in the integration and prediction of protein interactions. Nucleic Acids Res 35, D358–D362.
Chaurasia, G., Malhotra, S., Russ, J., Schnoegl, S., Hanig, C., Wanker, E. E., Futschik, M. E. (2009) UniHI 4: new tools for query, analysis and visualization of the human protein-protein interactome. Nucleic Acids Res 37, D657–D660.
Okuda, S., Yamada, T., Hamajima, M., Itoh, M., Katayama, T., Bork, P., Goto, S., Kanehisa, M. (2008) KEGG Atlas mapping for global analysis of metabolic pathways. Nucleic Acids Res 36, W423–W426, PMID: 18077471.
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., Amin, N., Schwikowski, B., Ideker, T. (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13, 2498–2504.
Jiang, K., Nash, C. (2006) Application of XML database technology to biological pathway datasets. Conference proceedings : Annual International Conference of the IEEE Engineering in Medicine and Biology Society IEEE Engineering in Medicine and Biology Society Conference 1, 4217–4220.
Cerami, E. G., Bader, G. D., Gross, B. E., Sander, C. (2006) cPath: open source software for collecting, storing, and querying biological pathways. BMC Bioinformatics 7, 497.
Hart, G. T., Ramani, A. K., Marcotte, E. M. (2006) How complete are current yeast and human protein-interaction networks? Genome Biol 7, 120.
Chiang, T., Scholtens, D., Sarkar, D., Gentleman, R., Huber, W. (2007) Coverage and error models of protein-protein interaction data by directed graph analysis. Genome Biol 8, R186.
Gentleman, R., Huber, W. (2007) Making the most of high-throughput protein-interaction data. Genome Biol 8, 112.
Thorne, T., Stumpf, M. P. (2007) Generating confidence intervals on biological networks. BMC Bioinformatics 8, 467.
Gavin, A. C., Aloy, P., Grandi, P., Krause, R., Boesche, M., Marzioch, M., Rau, C., Jensen, L. J., Bastuck, S., Dumpelfeld, B., et al. (2006) Proteome survey reveals modularity of the yeast cell machinery. Nature 440, 631–636.
Schwikowski, B., Uetz, P., Fields, S. (2000) A network of protein-protein interactions in yeast. Nat Biotechnol 18, 1257–1261.
Jensen, L. J., Jensen, T. S., de, L. U., Brunak, S., Bork, P. (2006) Co-evolution of transcriptional and post-translational cell-cycle regulation. Nature 443, 594–597.
Jensen, L. J., de, L. U., Jensen, T. S., Brunak, S., Bork, P. (2008) Circular reasoning rather than cyclic expression. Genome Biol 9, 403.
Nikolsky, Y., Ekins, S., Nikolskaya, T., Bugrim, A. (2005) A novel method for generation of signature networks as biomarkers from complex high throughput data. Toxicol Lett 158, 20–29.
Nikolsky, Y., Nikolskaya, T., Bugrim, A. (2005) Biological networks and analysis of experimental data in drug discovery. Drug Discov Today 10, 653–662.
Nikolsky, Y., Sviridov, E., Yao, J., Dosymbekov, D., Ustyansky, V., Kaznacheev, V., Dezso, Z., Mulvey, L., Macconaill, L. E., Winckler, W., et al. (2008) Genome-wide functional synergy between amplified and mutated genes in human breast cancer. Cancer Res 68, 9532–9540.
van Noort, V., Snel, B., Huynen, M. A. (2007) Exploration of the omics evidence landscape: adding qualitative labels to predicted protein-protein interactions. Genome Biol 8, R197, PMID: 17880677.
Pagel, P., Kovac, S., Oesterheld, M., Brauner, B., Dunger-Kaltenbach, I., Frishman, G., Montrone, C., Mark, P., Stumpflen, V., Mewes, H. W., et al. (2005) The MIPS mammalian protein-protein interaction database. Bioinformatics 21, 832–834.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Humana Press, a part of Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Ooi, H.S., Schneider, G., Chan, YL., Lim, TT., Eisenhaber, B., Eisenhaber, F. (2010). Databases of Protein–Protein Interactions and Complexes. In: Carugo, O., Eisenhaber, F. (eds) Data Mining Techniques for the Life Sciences. Methods in Molecular Biology, vol 609. Humana Press. https://doi.org/10.1007/978-1-60327-241-4_9
Download citation
DOI: https://doi.org/10.1007/978-1-60327-241-4_9
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-60327-240-7
Online ISBN: 978-1-60327-241-4
eBook Packages: Springer Protocols