Databases of Protein–Protein Interactions and Complexes

Data Mining Techniques for the Life Sciences

Abstract

In the current understanding, translation of genomic sequences into proteins is the most important path by which genome information is realized. To carry out their functions, proteins work together through various direct (physical) or indirect interaction mechanisms. For a variety of basic functions, many proteins assemble into a large complex that acts as a molecular machine or as a macromolecular super-structural building block. Once several high-throughput techniques for detecting protein–protein interactions (PPIs) had matured, interaction data became available on a large scale, and curated PPI databases have become a necessity for efficient research. Here, the scope, annotation quality, and retrieval tools of these databases are reviewed. In addition, attention is paid to portals that provide unified access to a variety of such databases with added annotation value.

Copyright information

© 2010 Humana Press, a part of Springer Science+Business Media, LLC

About this protocol

Cite this protocol

Ooi, H.S., Schneider, G., Chan, YL., Lim, TT., Eisenhaber, B., Eisenhaber, F. (2010). Databases of Protein–Protein Interactions and Complexes. In: Carugo, O., Eisenhaber, F. (eds) Data Mining Techniques for the Life Sciences. Methods in Molecular Biology, vol 609. Humana Press. https://doi.org/10.1007/978-1-60327-241-4_9

  • DOI: https://doi.org/10.1007/978-1-60327-241-4_9

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-60327-240-7

  • Online ISBN: 978-1-60327-241-4

  • eBook Packages: Springer Protocols
