Means to optimize protein expression in transgenic plants

https://doi.org/10.1016/j.copbio.2014.11.011Get rights and content

Highlights

  • Multifactorial study designs result in improved prediction power.

  • More focus should be placed on directional targeting to cell compartments.

  • Previously neglected inside and outside factors are emerging.

  • More emphasis on RNA structure to influence protein expression is in order.

The biotechnological production of proteins is currently achieved via expression systems derived from different lineages. In the past years transgenic plants have proven to be able to compete with bacteria or mammalian cell systems. Gene engineering approaches exist to raise yields by controlling mandatory processes in the course of biopharmaceutical protein production. Here we review and discuss the current status and recent improvements of parameters influencing recombinant protein production in transgenic plants. In particular, this review focuses on the so-called inside (mRNA sequence and structure) and outside factors (host and production system/conditions), which are adjustable and allow to optimize protein production via gene engineering.

Introduction

The potential of transgenic plants to act as alternatives to classical/standard expression systems for protein production has led to an increasing interest in this field of biotechnology. Several studies (reviewed in [1]) have already highlighted that transgenic plants can produce low-cost biomass and yield high amounts of recombinant proteins. Moreover, they can modify pharmaceutically important proteins in a post-transcriptional manner and some of them can be grown like crops [2]. In 2012 the U.S. Food and Drug Administration (FDA) approved the first plant cell-expressed therapeutic drug in a transgenic carrot cell culture system. The yield of the biotechnologically relevant protein is dependent on the used host system, the transformation method, the targeted cellular compartment, the realized expression level in the host system and by the accumulation status of the protein [1].

Adjustable parameters for gene engineering can be grouped into inside features (IF; mRNA and protein primary structure and features) and outside features (OF; host system, vector system, culture conditions). IFs and OFs are closely connected and impact each other. For most research projects individual parameter sets are evaluated and improved on a per case basis, therefore a bullet-proof correlation of gene features and their predicted impact on the amount of end-product is still lacking. In the same context it is still under debate exactly which impact the changes in codon usage can have on the final protein abundance. In some cases it apparently does not increase gene expression at all and in other cases up to 1000-fold [1•, 3•, 4, 5]. It is also debated how codon usage effects other features of the underlying transcript like for example RNA folding, that might result in changes of protein expression level [6].

As reviewed in [3], maximizing heterologous protein expression is a multidimensional optimization problem. To benefit from plant-specific features, one needs to decode the complex nature of multiple factors influencing and controlling the outcome of eukaryotic expression systems. By using new technologies, knowledge gaps are being closed by monitoring the necessary parameters [5]. Recently a study [7] based on a hypothesis about selection constraints and codon bias [8] could link the IF codon bias and OFs like ribosomal dependent translation, tRNA abundance and host evolution in prokaryotes. The question remains whether these findings also apply for eukaryotic expression systems [9].

Next to the already mentioned IF codon bias a diverse set of IF and OF parameters can influence protein expression and should be considered for optimization as highlighted in Figure 1. In this review we give an overview about recent improvements which have been made in optimization, correlation and prediction of these IFs and OFs which altogether influence protein expression in transgenic plants. We will further clarify the importance of experimental design and highlight how new resources and design-rules can be adapted for genetic engineering.

Section snippets

Improving inside features (IFs)

In the process of developing gene engineering, codon bias is an intensively studied parameter influencing protein expression and hence impacts industrial production systems [10]. Next to a dependence on the OF tRNA abundance and the IF codon bias even altering synonymous sites on the mRNA level (maintaining the original protein sequence) can lead to tremendous effects on transcription level. This can be due to changed folding capacities of the mRNA or by introducing or removing miRNA target

Nucleotide frequencies of mRNA (codon usage and GC content)

One important step in introducing a transgene for protein production comprises altering the codon usage to match the host genome (either be the nuclear genome or the genome of the targeted semi-autonomous organelle). Codon usage data for some plants can be found in the ‘codon usage database’ (http://www.kazusa.or.jp/codon/ [12]). There are several metrics that can be applied to measure codon usage. In [13] a whole chapter deals with different classification categories, evaluates selected

Importance of untranslated regions (UTRs)

Recently our knowledge of sequences upstream of the translation start site and correlation of this upstream stretch with translation efficiency in plants was expanded. The 21 bp upstream region, highly conserved in plants, was shown to account for translational variation up to 200 fold in Arabidopsis thaliana, opening new possibilities for improving 5′ UTRs for gene engineering [24]. As reviewed in [25], plant viruses harbor so-called 3′ CAP-independent translational enhancers which can overcome

Improving outside features (OFs)

As recently reviewed [32••], modifying the host system can be the method of choice to overcome endogenous mechanisms like the unfolded protein response, which can influence recombinant protein production in plants. Host modification also helps to conquer other host specific limiting OFs such as altering glycosylation capabilities, reducing the production of proteases and increasing the content of chaperones. The unfolded protein response and its underlying mechanisms are conserved among

Expression cassette repertoire

Expressing a transgene in a host plant requires expression cassettes that produce high amounts of protein. Several systems are used in different host plants and an overview of plant hosts and expression cassettes is provided by [40].

Different plant viral vector systems are used for transient expression of transgenes; see [41] for a summary of the currently used vector systems. A newer trend is using viral vector systems to also down-regulate host transcripts via virus induced gene silencing

Host features

Akin to mammalian expression systems, plant systems benefit from their ability to perform post-translational modifications. The introduced modifications, for example glycosylation patterns, can have severe effects on the structural and functional properties of the recombinant proteins [3]. Several plant systems can now be used to produce human-like glycosylation patterns in biopharmaceuticals and may be a good alternative to mammalian expression hosts; however, only a few plant systems have

Culture conditions

The plant status, including time point during development and interaction with the environment, can feedback into the expression status, highlighting the importance of culture conditions as an OF. Individual culture condition factors in plant cell culture or conventional crop systems which directly or indirectly influence protein production are diverse and previous work has covered them comprehensively [69, 70]. A recent review summarizes the existing modeling approaches for plant cultures and

Conclusions

The host spectrum for genetic engineering expands and new species are introduced to offer advantages as compared to the existing expression systems. As an example, seaweed was recently introduced as a novel expression system [74], potentially unlocking vast underwater habitats. However, systematic analysis of new host systems are required for improving recombinant protein production and to cope with the needs of molecular farming in terms of for example eliminating non-human glycosylation

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgement

We thank Linley Jesson for critical reading of the manuscript.

References (76)

  • E.L. Decker et al.

    Glycoprotein production in moss bioreactors

    Plant Cell Rep

    (2012)
  • S.E. Abdel-Ghany et al.

    Metabolic engineering of Arabidopsis for butanetriol production using bacterial genes

    Metab Eng

    (2013)
  • S.A. Wilson et al.

    Recent advances towards development and commercialization of plant cell culture processes for the synthesis of biomolecules

    Plant Biotechnol J

    (2012)
  • M. Skarjinskaia et al.

    Hairy roots as a vaccine production and delivery system

    Adv Biochem Eng Biotechnol

    (2013)
  • R. Fischer et al.

    Commercial aspects of pharmaceutical protein production in plants

    Curr Pharm Des

    (2013)
  • C. Gustafsson et al.

    Engineering genes for predictable protein expression

    Protein Expr Purif

    (2012)
  • R.C. Hunt et al.

    Exposing synonymous mutations

    Trends Genet

    (2014)
  • Y. Mao et al.

    Universally increased mRNA stability downstream of the translation initiation site in eukaryotes and prokaryotes

    Gene

    (2013)
  • A. Porceddu et al.

    The signatures of selection for translational accuracy in plant genes

    Genome Biol Evol

    (2013)
  • C. Elena et al.

    Expression of codon optimized genes in microbial systems: current industrial applications and perspectives

    Front Microbiol

    (2014)
  • F. Li et al.

    Regulatory impact of RNA secondary structure across the Arabidopsis transcriptome

    Plant Cell

    (2012)
  • Y. Nakamura et al.

    Codon usage tabulated from international DNA sequence databases: status for the year 2000

    Nucleic Acids Res

    (2000)
  • G. Cannarozzi et al.

    Codon Evolution: Mechanisms and Models

    (2012)
  • L. Serres-Giardi et al.

    Patterns and evolution of nucleotide landscapes in seed plants

    Plant Cell

    (2012)
  • F.F. Chevance et al.

    The effects of codon context on in vivo translation speed

    PLoS Genet

    (2014)
  • G. Sablok et al.

    Combinational effect of mutational bias and translational selection for translation efficiency in tomato (Solanum lycopersicum) cv. Micro-Tom

    Genomics

    (2013)
  • W. Qian et al.

    Balanced codon usage optimizes eukaryotic translational efficiency

    PLoS Genet

    (2012)
  • S. Camiolo et al.

    The relation of codon bias to tissue-specific gene expression in Arabidopsis thaliana

    Genetics

    (2012)
  • T. Tuller et al.

    An evolutionarily conserved mechanism for controlling the efficiency of protein translation

    Cell

    (2010)
  • C. Berens et al.

    Riboswitch engineering – making the all-important second and third steps

    Curr Opin Biotechnol

    (2015)
  • A.E. Simon et al.

    3′ cap-independent translation enhancers of plant viruses

    Annu Rev Microbiol

    (2013)
  • Y.A. Meshcheriakova et al.

    Fine-tuning levels of heterologous gene expression in plants by orthogonal variation of the untranslated regions of a nonreplicating transient expression system

    Plant Biotechnol J

    (2014)
  • A.M. Inka Borchers et al.

    Increased accumulation and stability of rotavirus VP6 protein in tobacco chloroplasts following changes to the 5′ untranslated region and the 5′ end of the coding region

    Plant Biotechnol J

    (2012)
  • A.B. Rose et al.

    Promoter-proximal introns in Arabidopsis thaliana are enriched in dispersed signals that elevate gene expression

    Plant Cell

    (2008)
  • A.B. Rose et al.

    Evidence for a DNA-based mechanism of intron-mediated enhancement

    Front Plant Sci

    (2011)
  • T. Akua et al.

    The Arabidopsis thaliana MHX gene includes an intronic element that boosts translation when localized in a 5′ UTR intron

    J Exp Bot

    (2013)
  • S. Emami et al.

    The effects of a stimulating intron on the expression of heterologous genes in Arabidopsis thaliana

    Plant Biotechnol J

    (2013)
  • D.R. Thomas et al.

    The effect of the unfolded protein response on the production of recombinant proteins in plants

    Plant Cell Rep

    (2014)
  • Cited by (25)

    • Optimizing protein expression in heterologous system: Strategies and tools

      2021, Meta Gene
      Citation Excerpt :

      Prokaryotic genes do not have any intron and mostly are AT rich, while eukaryotic protein coding genes possess introns and are mostly GC rich. Optimal expression of a transgene in a heterologous system depends on its structural features vis-à-vis the structural features of genes of the recipient host (Ullrich et al., 2015; Parret et al., 2016). For a heterologous gene to be efficiently expressed, it should be transcribed and translated efficiently by the host cellular machinery.

    • Production of pharmaceutical active recombinant globular adiponectin as a secretory protein in Withania Somnifera hairy root culture

      2020, Journal of Biotechnology
      Citation Excerpt :

      Data were analyzed using one- or two-way ANOVA for evaluating the significant differences between more than two means and more than two independent variables effects, respectively and Duncan’s Multiple Range Test (DMRT) was used to compare means at p<0.05. To improve the gene expression in plants, critical parameters, such as adjustment of codon usage bias, GC content, type and location of restriction enzymes recognition sites, and cis-acting elements, as well as the elimination of repeated sequences should be considered closely (Ullrich et al., 2015). Among different statistical methods, the codon adaptation index (CAI) and frequency of optimal codons (FOP) have been applied to analyze codon usage bias for W. somnifera (Cannarozzi and Schneider, 2012).

    • New biorefineries and sustainable agriculture: Increased food, biofuels, and ecosystem security

      2015, Renewable and Sustainable Energy Reviews
      Citation Excerpt :

      Furthermore, to decrease biomass recalcitrance, the discovery of genetic variants in native populations of bioenergy plants and direct manipulation of biosynthesis pathways have produced less recalcitrant feedstocks with favorable properties for biomass pretreatment and downstream conversion [157–159]. Also, to decrease protein production costs, plants can be modified for low-cost production of recombinant proteins [160–162]. At the same time, the most important R&D directions of new biorefineries focus on biomass saccharification, bioconversions of advanced biofuels limited to a few (e.g., butanol and hydrogen), and new applications of lignocellulose-derived products.

    • Plants as Bioreactors: An Overview

      2023, Plants as Bioreactors: An Overview
    View all citing articles on Scopus
    View full text