Elsevier

Gene

Volume 768, 5 February 2021, 145272
Gene

Research paper
Selection and validation of reference genes for RT-qPCR normalization in different tissues of milk thistle (Silybum marianum, Gaert.)

https://doi.org/10.1016/j.gene.2020.145272Get rights and content

Highlights

  • S. marianum is an officinal plant important for silymarin and for other productions.

  • Ten reference genes in six tissues were evaluated for gene expression studies.

  • MON1 is the most stable gene in all datasets according to RefFinder.

  • In the fruits MON1, FAD2-2, ACT and NDHF are the top ranked reference genes.

  • Analysis of CHS1, PRX1 and ACP1 confirmed the importance accurate RGs selection.

Abstract

Quantitative reverse transcription PCR is a sensitive technique for evaluating transcriptional profiles in different experimental datasets. To obtain a reliable quantification of the transcripts level, data normalization with stable reference genes is required. Stable reference genes are identified after analysis of their transcripts profile in every new experiment and species of interest. In Silybum marianum, a widely cultivated officinal plant, only few gene expression studies exist, and reference genes for RT-qPCR studies in the diverse plant tissues have never been investigated before. In this work, the expression stability of 10 candidate reference genes was evaluated in leaves, roots, stems and fruits of S. marianum grown under physiological environmental condition. The stability values for each candidate reference gene were calculated by four canonical statistical algorithms GeNorm, NormFinder, Bestkeeper and ΔCt method in different subsets of samples, then they were ranked with RefFinder from the most to the least suitable for normalization.

Best combinations of reference genes are finally proposed for different experimental data sets, including all tissues, vegetative, and reproductive tissues separately. Three target genes putatively involved in important biosynthetic pathway leading to key metabolites in the fruits of milk thistle, such as silymarin and fatty acids, were analyzed with the chosen panels of reference genes, in comparison to the ones used in previous papers.

To the best of our knowledge, this is the first report on a reliable and systematic identification and validation of the reference genes for RT-qPCR normalization to study gene expression in S. marianum.

Introduction

Milk thistle (Silybum marianum Gaertn.) is an important officinal plant worldwide. It has been used as a medicinal herb for more than 2000 years (Morazzoni and Bombardelli, 1995) and recently studies addressing the use of S. marianum in the treatment of several pathologies have increased (Camini and Costa, 2020). Despite that, the species has not been completely domesticated (Martinelli, 2019). Its health-promoting properties are attributable to the presence in the fruit integument of a group of flavonolignans called silymarin, which has been widely investigated for the treatment of human liver diseases (Abenavoli et al., 2011) and for many other effects such as cancer prevention, antioxidant and neuroprotective activities (Kren and Walterova, 2005, Lama et al., 2019, Lani et al., 2015, Li et al., 2010). Due to its characteristics, milk thistle is one of the 10 bestselling herbal products in the US (Smith et al., 2017) and silymarin has been used as bioactive component of drugs for almost 50 years (Albrecht et al., 1992).

Apart from silymarin, milk thistle fruits (cypselae – fruits containing a single seed) have a high oil content (20–30%) (Martinelli et al., 2016). Prior silymarin extraction it is necessary to remove oil from the fruits, making oil an important byproduct of silymarin production. Moreover, its high fruit and biomass yield under low input conditions makes this species an interesting multipurpose crop for marginal Mediterranean environments (Afshar et al., 2014, Domínguez et al., 2017, Sulas et al., 2008).

Only in 2019, and limited to the PubMed database, more than 300 studies have been published dealing with S. marianum medicinal properties, suggesting that milk thistle is a species of interest in the current scientific scenario worldwide. Despite that, as for most non-model crop, only limited molecular studies exist and specific analytical protocols for functional genomic and transcriptomic studies are not available yet. In these species, very few genes involved in specific biosynthetic pathways have already been identified and their transcriptional patterns studied. However, despite next generation sequencing technology has enabled a quick and cost-effective analysis of whole genomes and transcriptomes, S. marianum genomic information is still limited and only a representative first draft whole genome sequence (Accession number: ASM154182v1) and transcriptome (Accession number: PRJNA298444) has been recently released (Lv et al., 2017).

The discovery of genes involved in silymarin biosynthesis and accumulation in fruits integument during development (El-Garhy et al., 2016, Elsayed et al., 2019, Lv et al., 2017, Sanjari et al., 2015, Soliman et al., 2018, Torres and Corchete, 2016) or in the synthesis of other valuable constituents of the fruits (Alemardan et al., 2013) are interesting research topics in S. marianum. The identification of key genes involved in fruit shattering is another milestone toward the genetic improvement of this crop (Martinelli, 2019). Moreover, studies regarding leaves variegation recently provided new information in this species about physiological adaptation to heat, UV radiation or defence from herbivores but, again, the involvement of specific genes is not known yet (Shelef et al., 2019). Finally, molecular studies of roots would be useful for a better comprehension of this tissue in order to improve the silymarin production by hairy root cultures (Hasanloo et al., 2008, Rahnama et al., 2013).

Quantitative real-time PCR (RT-qPCR) is an efficient strategy to estimate transcripts expression levels and the accuracy of its results is largely due to the expression stability of the reference genes. Accurate normalization of transcriptional data using selected reference controls enables the comparison of mRNA concentrations across samples (Bustin et al., 2009). Many studies have reported significant variations in the expression of reference genes in different experimental conditions and plant tissues. Therefore, the identification of reference genes with stable expression across samples, and their validation in every new experiment, cell type, tissue, species is a pre-requisite for a reliable transcriptional analysis (Barbierato et al., 2017). As reported in the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guide, normalization using a single reference gene is no more acceptable and “the optimal number and choice of reference genes must be experimentally determined, and the method reported” (Bustin et al., 2009).

Genes involved in the maintenance of basic cellular functions, named housekeeping genes, are good candidates to be used for normalization as they are expected to be stably expressed in the cells (Wu et al., 2016). Tubulin alpha (TUBα), Tap42-interacting protein of 41 kDa (TIP41), NADH dehydrogenase (NADH), 18S rRNA (18S), Actin (ACT), ubiquitin conjugating enzyme E2 (UBC), monensin sensivity1 (MON1) and protein phosphatase 2A (PP2A) genes are among the most common reference genes in plants (Bustin, 2002, Czechowski et al., 2005, Expósito-Rodríguez et al., 2008, Goidin et al., 2001, Kim et al., 2003). Nevertheless, it has been reported that these genes are expressed diversely according to genotypes, tissues, stress conditions or developmental stages (Dheda et al., 2004, Garrido et al., 2020, Gutierrez et al., 2008, Nikalje et al., 2018, Reddy et al., 2013, Sinha et al., 2015). Therefore, the assessment of their expression reliability under certain conditions and between different tissues is highly recommended to ensure the accuracy of results.

In this study, 10 reference genes have been analyzed with the aim to provide an appropriate selection of reference genes for RT-qPCR data normalization in S. marianum. The stability of the selected candidates was assessed in different physiological tissues that can be site of expression of interesting target candidate genes, such as fruits at different developmental stages but also leaves, stems and roots. Moreover, the selected reference genes were validated by analyzing the transcriptional profiles of target genes involved in silymarin and fatty acids biosynthesis. To the best of our knowledge, the current study is the first report on a systematic analysis for selection of reference genes in various milk thistle tissues and development stages and will boost genomic and functional studies in this species.

Section snippets

Plant materials

The S. marianum accession n° RCAT 057475 originated from Germany and was obtained from the Institute for Agrobotany (Nébih, Tápiószele, Hungary). Seeds were sown in autumn in the experimental field of Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CREA) in Bologna, North Italy (N:44°34′30.8″; E:11°09′55.6″). Fruits were harvested from 3 plants (biological replicates) at each BBCH stage 71, 75 and 79 (Martinelli et al., 2015), for a total of 9 samples. Stem sampling

Primers specificity and transcriptional profile of candidate reference genes

A total of 10 candidate reference genes were studied by RT-qPCR in order to identify the most stable ones to be used for data normalization in gene expression analysis in S. marianum. Out of these, two sequences were already available in the nucleotide database of NCBI (NADH dehydrogenase subunit F, ID: KC589999.1 and 18S, ID: AJ831537.1) while the remaining ones were retrieved from the S. marianum whole genome sequence, by sequence similarity search with known genes from other plant species.

Discussion

Several techniques can be used to investigate gene expression at transcriptional level. RT-qPCR is one of the most utilized, as it allows to reliably quantify transcripts abundance of target genes in a precise and reproducible way (Bustin et al., 2009). However, because of its sensitivity, it is highly subjected to technical variations that can severely affect the data if a correct normalization is not performed. The selection of suitable reference genes, whose transcription is not influenced

Conclusions

The present study focused on S. marianum, an officinal plant widely cultivated for the production of silymarin. In order to improve genetic and molecular knowledge on this species and on its peculiar metabolic pathway, a pilot study was conducted to identify for the first time reference genes sets for accurate normalization of the transcript levels by RT-qPCR analyses. The expression stability of 10 candidate genes was evaluated in six different tissues, integrating the results with five

CRediT authorship contribution statement

Fulvio Flavia: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Writing - original draft, Visualization. Martinelli Tommaso: Resources, Project administration. Paris Roberta: Conceptualization, Methodology, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

Authors thank Dr. Laura Bassolino of Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria – Centro di ricerca Cerealicoltura e Colture Industriali for her valuable advise and for reading the manuscript.

References (73)

  • M. Torres et al.

    Gene expression and flavonolignan production in fruits and cell cultures of Silybum marianum

    J. Plant Physiol.

    (2016)
  • L. Abenavoli et al.

    Milk Thistle in liver diseases: Past, present, future.

    Phytotherapy Res.

    (2011)
  • M. Albrecht et al.

    Therapy of toxic liver pathologies with Legalon

    Zeitschrift fur Klinische Medizin

    (1992)
  • A. Alemardan et al.

    Breeding objectives and selection criteria for milk thistle [silybum marianum (L.) gaertn.] improvement

    Notulae Botanicae Horti Agrobotanici Cluj-Napoca

    (2013)
  • C.L. Andersen et al.

    Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets

    Cancer Res.

    (2004)
  • J. Andrzejewska et al.

    Silybum marianum: Non-medical exploitation of the species

    Ann. Appl. Biol.

    (2015)
  • V. Barbierato et al.

    A spiking strategy facilitates housekeeping selection for RT-qPCR analysis under different biotic stresses in eggplant

    Protoplasma

    (2017)
  • S.A. Bustin

    Quantification of mRNA using real-time reverse transcription PCR (RT-PCR): Trends and problems

    J. Mol. Endocrinol.

    (2002)
  • S.A. Bustin et al.

    The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments

    Clinical Chem.

    (2009)
  • F.C. Camini et al.

    Silymarin: Not just another antioxidant

    J. Basic Clin. Physiol. Pharmacol.

    (2020)
  • S. Cao et al.

    A large and functionally diverse family of Fad2 genes in safflower (Carthamus tinctorius L.)

    BMC Plant Biol

    (2013)
  • D.L. Court et al.

    RNase III: Genetics and Function; Structure and Mechanism

    Annu. Rev. Genet.

    (2013)
  • Y. Cui et al.

    Monensin Sensitivity1 (MON1)/calcium caffeine zinc sensitivity1 (CCZ1)-mediated Rab7 activation regulates tapetal programmed cell death and pollen development

    Plant Physiol.

    (2017)
  • T. Czechowski et al.

    Genome-wide identification and testing of superior reference genes for transcript normalization in arabidopsis

    Plant Physiol.

    (2005)
  • T.T.H. Dao et al.

    Chalcone synthase and its functions in plant resistance

    Phytochem. Rev.

    (2011)
  • K. Dheda et al.

    Validation of housekeeping genes for normalizing RNA expression in real-time PCR

    Biotechniques

    (2004)
  • A.I. Elsayed et al.

    The impact of drought stress on antioxidant responses and accumulation of flavonolignans in milk thistle (Silybum marianum (l.) gaertn)

    Plants

    (2019)
  • M. Expósito-Rodríguez et al.

    Selection of internal control genes for quantitative real-time RT-PCR studies during tomato development process

    BMC Plant Biol.

    (2008)
  • C. Fan et al.

    Selection of reference genes for quantitative real-time PCR in bamboo (Phyllostachys edulis)

    PLoS ONE

    (2013)
  • J. Garrido et al.

    Identification and validation of reference genes for RT-qPCR normalization in wheat meiosis

    Sci. Rep.

    (2020)
  • R. Gopalam et al.

    Selection and validation of appropriate reference genes for quantitative real-time PCR analysis in Salvia hispanica

    PloS one

    (2017)
  • L. Gutierrez et al.

    The lack of a systematic validation of reference genes: A serious pitfall undervalued in reverse transcription-polymerase chain reaction (RT-PCR) analysis in plants

    Plant Biotechnol. J.

    (2008)
  • T. Hasanloo et al.

    Elicitor-enhanced Silymarin production in hairy root cultures of Silybum marianum L

    Gaertn. Planta Medica

    (2008)
  • J. Hellemans et al.

    qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data

    Genome Biol.

    (2008)
  • A.V. Jacobsen et al.

    Reference gene selection for qPCR Is dependent on cell type rather than treatment in colonic and vaginal human epithelial cell lines

    PLoS ONE

    (2014)
  • N. Jafari et al.

    Expression levels of microRNA machinery components Drosha, dicer and DGCR8 in human (AGS, HepG2, and KEYSE-30) cancer cell lines

    Int. J. Clin. Exp. Med.

    (2013)
  • Cited by (8)

    • Silybum marianum chemotype differentiation is genetically determined by factors involved in silydianin biosynthesis

      2023, Journal of Applied Research on Medicinal and Aromatic Plants
      Citation Excerpt :

      Relative target genes expression was calculated using the 2-ΔΔCt method, adjusted for the efficiency of PCR amplification of each gene according to Pfaffl (2001). Normalisation was done with four reference genes (Vandesompele et al., 2002) chosen after Fulvio et al. (2021). G20 was used as control.

    • A single point mutation in a member of FAD2 multigene family resulted in the generation of a high oleic line of Silybum marianum (L.) Gaertn.

      2022, Industrial Crops and Products
      Citation Excerpt :

      Relative gene expression levels were calculated using the ‘Standard Curve Method’ (Larionov et al., 2005) and presented as the mean ± standard error of the mean (SE) of three biological replicates. Internal control genes used for data normalization were selected based on Fulvio et al. (2021). Genotypes G20 and G62 were crossed according to Martinelli (2018).

    View all citing articles on Scopus
    View full text