Integrative systems biology: an attempt to describe a simple weed
Highlights
► Genome-scale studies are now regularly performed in Arabidopsis thaliana. ► Transcriptomics, proteomics and metabolomics provide a wealth of descriptive information. ► Integration of all genome-scale datasets will enable a systems-level understanding of Arabidopsis.
Introduction
The completion of the Arabidopsis genome sequence facilitated extraordinary progress toward understanding plant biology. In particular, complete genomic sequence data drove the development of genome-wide transcriptional approaches, such as microarrays. Genome-scale studies (hereafter -omics) include, but are not limited to, analysis of RNAs, proteins, and metabolites. Significant progress has been made annotating and determining the function of many Arabidopsis genes. However, a plant is not just the sum of its genes, but a complex system where gene product interactions result in emergent properties. Therefore, with the ultimate intention of studying the biology of the whole organism, it is important to frame the next long-term goals for plant scientists.
We propose that a key long-term goal is the integration of different genome-scale approaches. The first steps in this direction have already occurred although the tools for the integration, visualization, and modeling of -omics data are still at a relatively early stage [1•, 2]. In this review, we focus on gene expression, protein and metabolite profiling data, briefly introducing each of the individual approaches, and then highlighting recent efforts to integrate these -omics (Figure 1).
Section snippets
Omics technology development: transcriptomics, proteomics and metabolomics
Transcriptomics: The completion of the Arabidopsis genome sequence has facilitated whole genome transcriptomic (gene expression) studies. The development of microarray technology enabled the simultaneous examination of thousands of genes; thus providing a comprehensive view of gene activity. Microarray gene expression data now cover organs, tissues, cell-types and developmental events, as well as responses to a variety of environmental perturbations [3, 4, 5, 6••, 7, 8]. For profiling
Data integration from molecular information: the gene–protein–metabolite relationship
Transcriptomics–proteomics: As gene expression profiling and proteomic methods improve, data can be combined to achieve a better understanding of Arabidopsis as a system. One important aspect of future investigation will involve the identification of variably spliced transcripts and the discrete proteins they encode. Before next generation sequencing analysis, estimates of alternatively spliced genes based on EST analysis ranged from 22 to 30% [44, 45, 46, 47]. More recent information using
Conclusion and future prospectives
As the studies that are highlighted here demonstrate, the integration of multiple genomic-scale studies can reveal novel biology. A comprehensive systems-level understanding of Arabidopsis will require -omics methods to be integrated and combined [80, 81]. In the near future, each of these -omics approaches will be used in an integrative fashion to inform and validate the findings of other genome-scale projects. Proteomics has been used to predict metabolic activity in the roots and shoots of
Conflict of interests
The authors declare that they have no competing interests.
References and recommended reading
Papers of particular interest, published within the period of review, have been highlighted as:
• of special interest
•• of outstanding interest
Acknowledgments
We are grateful to the members of the Benfey lab for helpful comments. Work in our lab on this area is funded by grants from the NIH (R01 GM-43778 and P50-GM081883) and the NSF (IOS-1021619). LML is funded by a fellowship from the Jane Coffin Childs Fund for Medical Research.
References (82)
- et al.
A gene expression map of Arabidopsis thaliana development
Nat Genet
(2005) - et al.
RNA-Seq: a revolutionary tool for transcriptomics
Nat Rev Genet
(2009) - et al.
Genome-scale proteomics reveals Arabidopsis thaliana gene models and proteome dynamics
Science
(2008) - et al.
Enhanced analytical power of SDS-PAGE using machine learning algorithms
Proteomics
(2008) - et al.
Advancements in plant proteomics using quantitative mass spectrometry
J Proteomics
(2009) - et al.
KNApSAcK: a comprehensive species-metabolite relationship database
Plant Metabol
(2006) - et al.
Gas chromatography mass spectrometry-based metabolite profiling in plants
Nat Protoc
(2006) - et al.
Chloroplast 2010: a database for large-scale phenotypic screening of Arabidopsis mutants
Plant Physiol
(2011) - et al.
Proteome mapping of mature pollen of Arabidopsis thaliana
Proteomics
(2005) - et al.
Proteogenomics to discover the full coding content of genomes: a computational perspective
J Proteomics
(2010)
An auxin gradient and maximum in the Arabidopsis root apex shown by high-resolution cell-specific analysis of IAA distribution and synthesis
Plant Cell
Global expression profiling of sulfur-starved Arabidopsis by DNA macroarray reveals the role of O-acetyl-l-serine as a general regulator of gene expression in response to sulfur nutrition
Plant J
Decoding genes with coexpression networks and metabolomics — ‘majority report by precogs’
Trends Plant Sci
Whole genome analysis of gene expression reveals coordinated activation of signaling and metabolic pathways during pollen-pistil interactions in Arabidopsis
Plant Physiol
Web-queryable large-scale data sets for hypothesis generation in plant biology
Plant Cell
A systems view of responses to nutritional cues in Arabidopsis: toward a paradigm shift for predictive network modeling
Plant Physiol
A gene expression map of the Arabidopsis root
Science
A high-resolution root spatiotemporal map reveals dominant expression patterns
Science
Cell identity mediates the response of Arabidopsis roots to abiotic stress
Science
Cell identity regulators link development and stress responses in the Arabidopsis root
Dev Cell
Gene expression map of the Arabidopsis shoot apical meristem stem cell niche
Proc Natl Acad Sci USA
Finding the fifth base: genome-wide sequencing of cytosine methylation
Genome Res
Applications of next-generation sequencing technologies in functional genomics
Genomics
Sampling the Arabidopsis transcriptome with massively parallel pyrosequencing
Plant Physiol
SNP discovery via 454 transcriptome sequencing
Plant J
Genome-wide mapping of alternative splicing in Arabidopsis thaliana
Genome Res
Highly integrated single-base resolution maps of the epigenome in Arabidopsis
Cell
High-resolution experimental and computational profiling of tissue-specific known and novel miRNAs in Arabidopsis
Genome Res
Comprehensive proteomics
Curr Opin Biotechnol
The Arabidopsis thaliana 2-D gel mitochondrial proteome: refining the value of reference maps for assessing protein abundance, contaminants and post-translational modifications
Proteomics
Plant proteomics update (2007–2008): second-generation proteomic techniques, an appropriate experimental design, and data analysis to fulfill MIAPE standards, increase plant proteome coverage and expand biological knowledge
J Proteomics
Mass spectrometry-based proteomics
Nature
Generating and navigating proteome maps using mass spectrometry
Nat Rev Mol Cell Biol
Analysis of shotgun proteomics and RNA profiling data from Arabidopsis thaliana chloroplasts
J Proteome Res
Proteome of plant peroxisomes: new perspectives on the role of these organelles in cell biology
Proteomics
Arabidopsis thaliana as a model organism for plant proteome research
J Proteomics
Evidence for network evolution in an Arabidopsis interactome map
Science
Independently evolved virulence effectors converge onto hubs in a plant immune system network
Science
Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry
Nat Protoc
Recent applications of NMR spectroscopy in plant metabolomics
FEBS J
Mass spectrometry-based metabolomics
Mass Spectrom Rev
Cited by (33)
Integrated transcriptome and proteome analysis of near-isogenic line provides insights on regulatory function of Pup1 QTL in rice under phosphorus-starvation stress
2024, Environmental and Experimental BotanyQuantifying the impact of dynamic plant-environment interactions on metabolic regulation
2023, Journal of Plant PhysiologySystems-integration of plant metabolism: Means, motive and opportunity
2013, Current Opinion in Plant BiologyCitation Excerpt :Advances in technology to measure transcripts [1,2,3•,4•], proteins [4•,5•] and metabolites [6,7•,8•] are generating daunting amounts of data that have to be integrated and linked to phenotypes in a predictive manner [9,10].
Data mining methods for omics and knowledge of crude medicinal plants toward big data biology
2013, Computational and Structural Biotechnology JournalCitation Excerpt :The KNApSAcK Core DB was utilized in very diverged purposes of metabolomics studies including identification of metabolites (‘Exp’ in Table 1), construction of integrated databases (‘DB’), bioinformatics and systems biology (‘Bioinfo’), and cited in at least 110 papers listed in Table 1, that is, in 29 papers in the period of 2006–2008, 25 papers in the period of 2009, 20 papers in 2010, 18 papers in 2011, 18 papers in 2012–2013. In addition, it was applied in diverged species from bacteria to plants and animals, in total 28 species, that is, Angelica acutiloba [74], Arabidopsis lyrata ssp. petraea [56], Arabidopsis thaliana[25, 30, 33, 35, 37, 46, 47, 62, 70, 86, 99, 103, 104, 108, 109, 121, 122], Atriplex halimus [127], Bacillus subtilis [113], Brassica oleraceae var capitata [60], Brufelsia calycina [81], Capsicum sp. [123], Citrus sinensis [131], Curcuma longa [77], Ephedra sp. [67], Escherichia coli [51], Fragaria × ananassa [40, 43, 44], Fragaria vesca [105], Glycine max [53], Glycyrrhiza uralensis [94], Hordeum vulgare [80, 102], Homo sapiens [63, 101], Jatropha curcas [124, 125], Malx × domestica [126], Ophiorrhiza pumila [117], Oryza sativa [49, 61], Papaver somniferum [42], Rattus norvegicus [39, 97], Rizotania solani [79], Solanum lycopersicum [45, 48], Solanum tuberosum [98] and Zea mays [120]. In the period of 2006–2008, many review papers [‘Review’ in Table 1] focused on metabolomics platforms integrated by mass-spectrometry and metabolite databases including KNApSAcK Core [29, 31, 34, 38, 42, 49, 52] and on linking chemistry with biology [24], and on metabolome researches targeting the model plant Arabidopsis thaliana [30, 33, 35, 37].
Genetic analysis of metabolome-phenotype interactions: From model to crop species
2013, Trends in GeneticsCitation Excerpt :Another influential study further demonstrated the value of this approach by discovering key regulators of aliphatic glucosinolate biosynthesis using an entirely ‘omics’-based elucidation [53]. A substantial amount of systems-biology approaches have been applied in Arabidopsis, owing to the wealth of available data for this species [54], but significant amounts of data are also available for crop species amenable to similar analyses. The current status of omics tools available for the top 20 crop species illustrates the large effort that has been put towards implementing molecular approaches in crop breeding [15].
- 1
These authors contributed equally to this work.