skip to main content
research-article

Constrained Community-Based Gene Regulatory Network Inference

Published:17 February 2015Publication History
Skip Abstract Section

Abstract

The problem of gene regulatory network inference is a major concern of systems biology. In recent years, a novel methodology has gained momentum, called community network approach. Community networks integrate predictions from individual methods in a “metapredictor,” in order to compose the advantages of different methods and soften individual limitations. This article proposes a novel methodology to integrate prediction ensembles using constraint programming, a declarative modeling and problem solving paradigm. Constraint programming naturally allows the modeling of dependencies among components of the problem as constraints, facilitating the integration and use of different forms of knowledge. The new paradigm, referred to as constrained community network, uses constraints to capture properties of the regulatory networks (e.g., topological properties) and to guide the integration of knowledge derived from different families of network predictions. The article experimentally shows the potential of this approach: The addition of biological constraints can offer significant improvements in prediction accuracy.

Skip Supplemental Material Section

Supplemental Material

References

  1. D. Allocco, I. Kohane, and A. Butte. 2004. Quantifying the relationship between co-expression, co-regulation and gene function. BMC Bioinf. 5, 1 (2004), 18+.Google ScholarGoogle Scholar
  2. U. Alon. 2007. Network motifs: Theory and experimental approaches. Nat. Rev. Genet. 8, 6 (2007), 450--461.Google ScholarGoogle ScholarCross RefCross Ref
  3. G. Altay, M. Asim, F. Markowetz, and D. E. Neal. 2011. Differential C3NET reveals disease networks of direct physical interactions. BMC Bioinf. 12 (2011), 296.Google ScholarGoogle ScholarCross RefCross Ref
  4. G. Altay and F. E. Streib. 2010. Inferring the conservative causal core of gene regulatory networks. BMC Syst. Biol. 4, 1 (2010), 132+.Google ScholarGoogle Scholar
  5. K. Apt. 2009. Principles of Constraint Programming. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Balaji, M. M. Babu, L. M. Iyer, N. M. Luscombe, and L. Aravind. 2006. Comprehensive analysis of combinatorial regulation using the transcriptional regulatory network of yeast. J. Mol. Biol. 360, 1 (2006), 213--227.Google ScholarGoogle ScholarCross RefCross Ref
  7. P. Baldi, S. Brunak, Y. Chauvin, C. A. Andersen, and H. Nielsen. 2000. Assessing the accuracy of prediction algorithms for classification: An overview. Bioinformatics 16, 5 (2000), 412--424.Google ScholarGoogle ScholarCross RefCross Ref
  8. M. Bansal, V. Belcastro, A. Ambesi-Impiombato, and D. Di Bernardo. 2007. How to infer gene networks from expression profiles. Mol. Syst. Biol. 3 (2007), 78.Google ScholarGoogle ScholarCross RefCross Ref
  9. A. Bauer-Mehren, L. I. Furlong, and F. Sanz. 2009. Pathway databases and tools for their exploitation: Benefits, current limitations and challenges. Mol. Syst. Biol. 5, 1 (2009).Google ScholarGoogle Scholar
  10. C. M. Bishop and N. M. Nasrabadi. 2006. Pattern Recognition and Mmachine Learning. Vol. 1. Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. R. Bonneau et al. 2006. The inferelator: An algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo. Genome Biol. 7, 5 (2006), R36.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. C. Borda. 1971. Memoire sur les elections au scrutin.Google ScholarGoogle Scholar
  13. L. Bortolussi and A. Policriti. 2008. Modeling biological systems in stochastic concurrent constraint programming. Constraints 13, 1--2 (2008). Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. L. Breiman, J. Friedman, C. J. Stone, and R. A. Olshen. 1984. Classification and Regression Trees. Chapman & Hall, New York, NY.Google ScholarGoogle Scholar
  15. F. Corblin, E. Fanchon, and L. Trilling. 2010. Applications of a formal approach to decipher discrete genetic networks. BMC Bioinf. 11 (2010), 385.Google ScholarGoogle ScholarCross RefCross Ref
  16. F. Corblin, S. Tripodi, E. Fanchon, D. Ropers, and L. Trilling. 2009. A declarative constraint-based method for analyzing discrete genetic regulatory networks. Biosystems 98, 2 (2009), 91--104.Google ScholarGoogle ScholarCross RefCross Ref
  17. H. De Jong. 2002. Modeling and simulation of genetic regulatory systems: A literature review. J. Comput. Biol. 9, 1 (2002), 67--103.Google ScholarGoogle ScholarCross RefCross Ref
  18. R. de Matos Simoes and F. Emmert-Streib. 2012. Bagging statistical network inference from large-scale gene expression data. PloS ONE 7, 3 (2012), e33624+.Google ScholarGoogle Scholar
  19. P. D’Haeseleer, X. Wen, S. Fuhrman, and R. Somogyi. 1999. Linear modeling of mRNA expression levels during CNS development and injury. In Pacific Symposium on Biocomputing. 41--52.Google ScholarGoogle Scholar
  20. F. Eduati, J. De Las Rivas, G. Di Camillo, G. Toffolo, and J. Saez-Rodriguez. 2012. Integrating literature-constrained and data-driven inference of signalling networks. Bioinformatics 28, 18 (2012), 2311--2317. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani. 2004. Least angle regression. Ann. Statist 32, 2 (2004), 407--499.Google ScholarGoogle ScholarCross RefCross Ref
  22. F. Fages, G. Batt, E. D. Maria, D. Jovanovska, A. Rizk, and S. Soliman. 2010. Computational systems biology in BIOCHAM. ERCIM News 2010, 82, 36.Google ScholarGoogle Scholar
  23. J. J. Faith, B. Hayete, J. T. Thaden, I. Mogno, J. Wierzbowski, G. Cottarel, S. Kasif, J. J. Collins, and T. S. Gardner. 2007. Large-scale mapping and validation of escherichia coli transcriptional regulation from a compendium of expression profiles. PLoS Biol. 5, 1 (2007), e8.Google ScholarGoogle ScholarCross RefCross Ref
  24. F. Fioretto and E. Pontelli. 2013. Constraint programming in community-based gene regulatory network inference. In CMSB 2013 (LNBI), Vol. 8130. Springer-Verlag, 135--149.Google ScholarGoogle Scholar
  25. S. Gama-Castro, V. Jiménez-Jacinto, M. Peralta-Gil, A. Santos-Zavaleta, M.I. Peñaloza-Spinola, B. Contreras-Moreira, J. Segura-Salazar, L. Muñiz-Rascado, I. Martínez-Flores, and H. Salgado. 2008. RegulonDB: Gene regulation model of Escherichia coli K-12 beyond transcription, active annotated promoters and Textpresso navigation. Nucleic Acids Res. 36, D120--D124.Google ScholarGoogle ScholarCross RefCross Ref
  26. M. Gebser, T. Schaub, S. Thiele, and P. Veber. 2008. Detecting inconsistencies in large biological networks with answer set programming. In Logic Programming, LNCS, Vol. 5366. Springer, Berlin, 130--144. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. A. Greenfield, A. Madar, H. Ostrer, and R. Bonneau. 2010. DREAM4: Combining genetic and dynamic information to identify biological networks and dynamical models. PLoS ONE 5, 10 (2010), e13397.Google ScholarGoogle ScholarCross RefCross Ref
  28. N. Guelzim, S. Bottani, P. Bourgine, and F. Képès. 2002. Topological and causal structure of the yeast transcriptional regulatory network. Nat. Genet. 31, 1 (April 2002), 60--63.Google ScholarGoogle ScholarCross RefCross Ref
  29. M. A. Harris and Gene Ontology Consortium. 2004. The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 32, Database issue (2004), D258--D261.Google ScholarGoogle Scholar
  30. T. Hastie, R. Tibshirani, and J. Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference and Prediction (2nd. ed.). Springer, New York, NY.Google ScholarGoogle Scholar
  31. S. Kim, S. Imoto, and S. Miyano. 2003. Dynamic Bayesian network and nonparametric regression for nonlinear modeling of gene networks from time series gene expression data. Biosystems 104--113.Google ScholarGoogle Scholar
  32. S. K. Kummerfeld and S. A. Teichmann. 2006. DBD: A transcription factor prediction database. Nucleic Acids Res. 34, Suppl. 1 (2006), D74--D81.Google ScholarGoogle ScholarCross RefCross Ref
  33. L. P. Lim, N. C. Lau, P. Garrett-Engele, A. Grimson, J. M. Schelter, J. Castle, D. P. Bartel, P. S. Linsley, and J. M. Johnson. 2005. Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs. Nature 433, 7027 (2005), 769--773.Google ScholarGoogle ScholarCross RefCross Ref
  34. P. B. Madhamshettiwar, S. R. Maetschke, M. J. Davis, A. Reverter, and M. A. Ragan. 2012. Gene regulatory network inference: Evaluation and application to ovarian cancer allows the prioritization of drug targets. Genome Med. 4, 5 (1 May 2012), 41+.Google ScholarGoogle Scholar
  35. D. Marbach, J. C. Costello, R. Küffner, N. M. Vega, R. J. Prill, D. M. Camacho, and the DREAM5 Consortium. 2012. Wisdom of crowds for robust gene network inference. Nat. Meth. 9, 8 (2012), 796--804.Google ScholarGoogle ScholarCross RefCross Ref
  36. A. A. Margolin, I. Nemenman, K. Basso, C. Wiggins, G. Stolovitzky, R. D. Favera, and A. Califano. 2006. ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinf. 7, Suppl. 1 (2006).Google ScholarGoogle Scholar
  37. N. Meinshausen and P. Bühlmann. 2010. Stability selection. J. R. Stat. Soc.: Ser. B (Statistical Methodology) 72, 4 (Sept. 2010), 417--473.Google ScholarGoogle ScholarCross RefCross Ref
  38. P. E. Meyer, K. Kontos, F. Lafitte, and G. Bontempi. 2007. Information-theoretic inference of large transcriptional regulatory networks. EURASIP J. Bioinf. Sys. Biol. (2007). DOI:10.1155/2007/79879 Google ScholarGoogle ScholarCross RefCross Ref
  39. M. P. Perrone. 1993. Improving Regression Estimation: Averaging Methods for Variance rReduction with Extensions to General Convex Measure Optimization. Ph.D. dissertation. Brown University, Providence, RI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. R. J. Prill, D. Marbach, J. Saez-Rodriguez, P. K. Sorger, L. G. Alexopoulos, X. Xue, N. D. Clarke, G. Altan-Bonnet, and G. Stolovitzky. 2010. Towards a rigorous assessment of systems biology models: The DREAM3 challenges. PLoS ONE 5, 2 (2010), e9202.Google ScholarGoogle ScholarCross RefCross Ref
  41. T. Reguly, A. Breitkreutz, L. Boucher, B. Breitkreutz, G. C. Hon, C. L. Myers, A. Parsons, H. Friesen, R. Oughtred, and A. Tong. 2006. Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae. J. Biol. 5, 4 (08 June 2006), 11+.Google ScholarGoogle ScholarCross RefCross Ref
  42. M. Renda and U. Straccia. 2003. Web metasearch: Rank vs. score based rank aggregation methods. In Proceedings of the 2003 ACM Symposium on Applied Computing. ACM, New York, NY, 841--846. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. T. Schaffter, D. Marbach, and D. Floreano. 2011. GeneNetWeaver: In silico benchmark generation and performance profiling of network inference methods. Bioinformatics 27, 16 (June 2011), 2263--2270. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. C. Schulte and P. J. Stuckey. 2008. Efficient constraint propagation engines. ACM Trans. Prog. Lang. Syst. 31, 1. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. S. S. Shen-Orr, R. Milo, S. Mangan, and U. Alon. 2002. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 1 (2002), 1061--4036.Google ScholarGoogle ScholarCross RefCross Ref
  46. A. Sîrbu, H. J. Ruskin, and M. Crane. 2012. Integrating heterogeneous gene expression data for gene regulatory network modelling. Theor. Biosci. 131, 2 (2012), 95--102.Google ScholarGoogle ScholarCross RefCross Ref
  47. T. Soh and K. Inoue. 2010. Identifying necessary reactions in metabolic pathways by minimal model generation. In ECAI 2010. IOS Press, Amsterdam, The Netherlands, 277--282. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. M. J. Song, C. K. Lewis, E. R. Lance, E. J. Chesler, R. K. Yordanova, M. A. Langston, K. H. Lodowski, and S. E. Bergeson. 2009. Reconstructing generalized logical networks of transcriptional regulation in mouse brain from temporal gene expression data. EURASIP J. Bioinf. Syst. Biol. (2009), 5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. N. Sun and H. Zhao. 2009. Reconstructing transcriptional regulatory networks through genomics data. Stat. Meth. Med. Res. 18, 6 (2009), 595--617.Google ScholarGoogle ScholarCross RefCross Ref
  50. R. Thomas. 1973. Boolean formalization of genetic control circuits. J. Theor. Biol. 42, 3, 563--585.Google ScholarGoogle ScholarCross RefCross Ref
  51. S. Videla, C. Guziolowski, F. Eduati, S. Thiele, N. Grabe, J. Saez-Rodriguez, and A. Siegel. 2012. Revisiting the training of logic models of protein signaling networks with ASP. In Comput. Meth. Syst. Biol. LNCS, Vol. 7605, 342--361. Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. X. Zhou, X. Wang, and E. Dougherty. 2006. Genomic Networks: Statistical Inference from Microarray Data. John Wiley & Sons.Google ScholarGoogle Scholar

Index Terms

  1. Constrained Community-Based Gene Regulatory Network Inference

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Modeling and Computer Simulation
      ACM Transactions on Modeling and Computer Simulation  Volume 25, Issue 2
      Special Issue on Computational Methods in Systems Biology
      April 2015
      161 pages
      ISSN:1049-3301
      EISSN:1558-1195
      DOI:10.1145/2737798
      Issue’s Table of Contents

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 February 2015
      • Accepted: 1 October 2014
      • Revised: 1 August 2014
      • Received: 1 January 2014
      Published in tomacs Volume 25, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader