Integrative systems biology: an attempt to describe a simple weed

https://doi.org/10.1016/j.pbi.2012.01.004Get rights and content

Genome-scale studies hold great promise for revealing novel plant biology. Because of the complexity of these techniques, numerous considerations need to be made before embarking on a study. Here we focus on the Arabidopsis model system because of the wealth of available genome-scale data. Many approaches are available that provide genome-scale information regarding the state of a given organism (e.g. genomics, epigenomics, transcriptomics, proteomics, metabolomics interactomics, ionomics, phenomics, etc.). Integration of all of these types of data will be necessary for a comprehensive description of Arabidopsis. In this review we propose that ‘triangulation’ among transcriptomics, proteomics and metabolomics is a meaningful approach for beginning this integrative analysis and uncovering a systems level perspective of Arabidopsis biology.

Highlights

► Genome-scale studies are now regularly performed in Arabidopsis thaliana. ► Transcriptomics, proteomics and metabolomics provide a wealth of descriptive information. ► Integration of all genome-scale datasets will enable a systems-level understanding of Arabidopsis.

Introduction

The completion of the Arabidopsis genome sequence facilitated extraordinary progress toward understanding plant biology. In particular, complete genomic sequence data drove the development of genome-wide transcriptional approaches, such as microarrays. Genome-scale studies (hereafter -omics) include, but are not limited to, analysis of RNAs, proteins, and metabolites. Significant progress has been made annotating and determining the function of many Arabidopsis genes. However, a plant is not just the sum of its genes, but a complex system where gene product interactions result in emergent properties. Therefore, with the ultimate intention of studying the biology of the whole organism, it is important to frame the next long-term goals for plant scientists.

We propose that a key long-term goal is the integration of different genome-scale approaches. The first steps in this direction have already occurred although the tools for the integration, visualization, and modeling of -omics data are still at a relatively early stage [1•, 2]. In this review, we focus on gene expression, protein and metabolite profiling data, briefly introducing each of the individual approaches, and then highlighting recent efforts to integrate these -omics (Figure 1).

Section snippets

Omics technology development: transcriptomics, proteomics and metabolomics

Transcriptomics: The completion of the Arabidopsis genome sequence has facilitated whole genome transcriptomic (gene expression) studies. The development of microarray technology enabled the simultaneous examination of thousands of genes; thus providing a comprehensive view of gene activity. Microarray gene expression data now cover organs, tissues, cell-types and developmental events, as well as responses to a variety of environmental perturbations [3, 4, 5, 6••, 7, 8]. For profiling

Data integration from molecular information: the gene–protein–metabolite relationship

Transcriptomics–proteomics: As gene expression profiling and proteomic methods improve, data can be combined to achieve a better understanding of Arabidopsis as a system. One important aspect of future investigation will involve the identification of variably spliced transcripts and the discrete proteins they encode. Before next generation sequencing analysis, estimates of alternatively spliced genes based on EST analysis ranged from 22 to 30% [44, 45, 46, 47]. More recent information using

Conclusion and future prospectives

As the studies that are highlighted here demonstrate, the integration of multiple genomic-scale studies can reveal novel biology. A comprehensive systems-level understanding of Arabidopsis will require -omics methods to be integrated and combined [80, 81]. In the near future, each of these -omics approaches will be used in an integrative fashion to inform and validate the findings of other genome-scale projects. Proteomics has been used to predict metabolic activity in the roots and shoots of

Conflict of interests

The authors declare that they have no competing interests.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgments

We are grateful to the members of the Benfey lab for helpful comments. Work in our lab on this area is funded by grants from the NIH (R01 GM-43778 and P50-GM081883) and the NSF (IOS-1021619). LML is funded by a fellowship from the Jane Coffin Childs Fund for Medical Research.

References (82)

  • S.V. Petersson et al.

    An auxin gradient and maximum in the Arabidopsis root apex shown by high-resolution cell-specific analysis of IAA distribution and synthesis

    Plant Cell

    (2009)
  • M.Y. Hirai et al.

    Global expression profiling of sulfur-starved Arabidopsis by DNA macroarray reveals the role of O-acetyl-l-serine as a general regulator of gene expression in response to sulfur nutrition

    Plant J

    (2003)
  • K. Saito et al.

    Decoding genes with coexpression networks and metabolomics  ‘majority report by precogs’

    Trends Plant Sci

    (2008)
  • L.C. Boavida et al.

    Whole genome analysis of gene expression reveals coordinated activation of signaling and metabolic pathways during pollen-pistil interactions in Arabidopsis

    Plant Physiol

    (2011)
  • S.M. Brady et al.

    Web-queryable large-scale data sets for hypothesis generation in plant biology

    Plant Cell

    (2009)
  • S. Ruffel et al.

    A systems view of responses to nutritional cues in Arabidopsis: toward a paradigm shift for predictive network modeling

    Plant Physiol

    (2010)
  • K. Birnbaum et al.

    A gene expression map of the Arabidopsis root

    Science

    (2003)
  • S.M. Brady et al.

    A high-resolution root spatiotemporal map reveals dominant expression patterns

    Science

    (2007)
  • J.R. Dinneny et al.

    Cell identity mediates the response of Arabidopsis roots to abiotic stress

    Science

    (2008)
  • A.S. Iyer-Pascuzzi et al.

    Cell identity regulators link development and stress responses in the Arabidopsis root

    Dev Cell

    (2011)
  • R.K. Yadav et al.

    Gene expression map of the Arabidopsis shoot apical meristem stem cell niche

    Proc Natl Acad Sci USA

    (2009)
  • R. Lister et al.

    Finding the fifth base: genome-wide sequencing of cytosine methylation

    Genome Res

    (2009)
  • O. Morozova et al.

    Applications of next-generation sequencing technologies in functional genomics

    Genomics

    (2008)
  • A.P. Weber et al.

    Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing

    Plant Physiol

    (2007)
  • W.B. Barbazuk et al.

    SNP discovery via 454 transcriptome sequencing

    Plant J

    (2007)
  • S. Filichkin et al.

    Genome-wide mapping of alternative splicing in Arabidopsis thaliana

    Genome Res

    (2010)
  • R. Lister et al.

    Highly integrated single-base resolution maps of the epigenome in Arabidopsis

    Cell

    (2008)
  • N.W. Breakfield et al.

    High-resolution experimental and computational profiling of tissue-specific known and novel miRNAs in Arabidopsis

    Genome Res

    (2012)
  • M. Beck et al.

    Comprehensive proteomics

    Curr Opin Biotechnol

    (2011)
  • N.L. Taylor et al.

    The Arabidopsis thaliana 2-D gel mitochondrial proteome: refining the value of reference maps for assessing protein abundance, contaminants and post-translational modifications

    Proteomics

    (2011)
  • J.V. Jorrín-Novo et al.

    Plant proteomics update (2007–2008): second-generation proteomic techniques, an appropriate experimental design, and data analysis to fulfill MIAPE standards, increase plant proteome coverage and expand biological knowledge

    J Proteomics

    (2009)
  • R. Aebersold et al.

    Mass spectrometry-based proteomics

    Nature

    (2003)
  • C.H. Ahrens et al.

    Generating and navigating proteome maps using mass spectrometry

    Nat Rev Mol Cell Biol

    (2010)
  • S. Baginsky et al.

    Analysis of shotgun proteomics and RNA profiling data from Arabidopsis thaliana chloroplasts

    J Proteome Res

    (2005)
  • J.M. Palma et al.

    Proteome of plant peroxisomes: new perspectives on the role of these organelles in cell biology

    Proteomics

    (2009)
  • S. Wienkoop et al.

    Arabidopsis thaliana as a model organism for plant proteome research

    J Proteomics

    (2010)
  • A.I.M. Consortium

    Evidence for network evolution in an Arabidopsis interactome map

    Science

    (2011)
  • M.S. Mukhtar et al.

    Independently evolved virulence effectors converge onto hubs in a plant immune system network

    Science

    (2011)
  • R.C. De Vos et al.

    Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry

    Nat Protoc

    (2007)
  • J.L. Ward et al.

    Recent applications of NMR spectroscopy in plant metabolomics

    FEBS J

    (2007)
  • K. Dettmer et al.

    Mass spectrometry-based metabolomics

    Mass Spectrom Rev

    (2007)
  • Cited by (33)

    • Systems-integration of plant metabolism: Means, motive and opportunity

      2013, Current Opinion in Plant Biology
      Citation Excerpt :

      Advances in technology to measure transcripts [1,2,3•,4•], proteins [4•,5•] and metabolites [6,7•,8•] are generating daunting amounts of data that have to be integrated and linked to phenotypes in a predictive manner [9,10].

    • Data mining methods for omics and knowledge of crude medicinal plants toward big data biology

      2013, Computational and Structural Biotechnology Journal
      Citation Excerpt :

      The KNApSAcK Core DB was utilized in very diverged purposes of metabolomics studies including identification of metabolites (‘Exp’ in Table 1), construction of integrated databases (‘DB’), bioinformatics and systems biology (‘Bioinfo’), and cited in at least 110 papers listed in Table 1, that is, in 29 papers in the period of 2006–2008, 25 papers in the period of 2009, 20 papers in 2010, 18 papers in 2011, 18 papers in 2012–2013. In addition, it was applied in diverged species from bacteria to plants and animals, in total 28 species, that is, Angelica acutiloba [74], Arabidopsis lyrata ssp. petraea [56], Arabidopsis thaliana[25, 30, 33, 35, 37, 46, 47, 62, 70, 86, 99, 103, 104, 108, 109, 121, 122], Atriplex halimus [127], Bacillus subtilis [113], Brassica oleraceae var capitata [60], Brufelsia calycina [81], Capsicum sp. [123], Citrus sinensis [131], Curcuma longa [77], Ephedra sp. [67], Escherichia coli [51], Fragaria × ananassa [40, 43, 44], Fragaria vesca [105], Glycine max [53], Glycyrrhiza uralensis [94], Hordeum vulgare [80, 102], Homo sapiens [63, 101], Jatropha curcas [124, 125], Malx × domestica [126], Ophiorrhiza pumila [117], Oryza sativa [49, 61], Papaver somniferum [42], Rattus norvegicus [39, 97], Rizotania solani [79], Solanum lycopersicum [45, 48], Solanum tuberosum [98] and Zea mays [120]. In the period of 2006–2008, many review papers [‘Review’ in Table 1] focused on metabolomics platforms integrated by mass-spectrometry and metabolite databases including KNApSAcK Core [29, 31, 34, 38, 42, 49, 52] and on linking chemistry with biology [24], and on metabolome researches targeting the model plant Arabidopsis thaliana [30, 33, 35, 37].

    • Genetic analysis of metabolome-phenotype interactions: From model to crop species

      2013, Trends in Genetics
      Citation Excerpt :

      Another influential study further demonstrated the value of this approach by discovering key regulators of aliphatic glucosinolate biosynthesis using an entirely ‘omics’-based elucidation [53]. A substantial amount of systems-biology approaches have been applied in Arabidopsis, owing to the wealth of available data for this species [54], but significant amounts of data are also available for crop species amenable to similar analyses. The current status of omics tools available for the top 20 crop species illustrates the large effort that has been put towards implementing molecular approaches in crop breeding [15].

    View all citing articles on Scopus
    1

    These authors contributed equally to this work.

    View full text