Research paperSelection and validation of reference genes for RT-qPCR normalization in different tissues of milk thistle (Silybum marianum, Gaert.)
Graphical abstract
Introduction
Milk thistle (Silybum marianum Gaertn.) is an important officinal plant worldwide. It has been used as a medicinal herb for more than 2000 years (Morazzoni and Bombardelli, 1995) and recently studies addressing the use of S. marianum in the treatment of several pathologies have increased (Camini and Costa, 2020). Despite that, the species has not been completely domesticated (Martinelli, 2019). Its health-promoting properties are attributable to the presence in the fruit integument of a group of flavonolignans called silymarin, which has been widely investigated for the treatment of human liver diseases (Abenavoli et al., 2011) and for many other effects such as cancer prevention, antioxidant and neuroprotective activities (Kren and Walterova, 2005, Lama et al., 2019, Lani et al., 2015, Li et al., 2010). Due to its characteristics, milk thistle is one of the 10 bestselling herbal products in the US (Smith et al., 2017) and silymarin has been used as bioactive component of drugs for almost 50 years (Albrecht et al., 1992).
Apart from silymarin, milk thistle fruits (cypselae – fruits containing a single seed) have a high oil content (20–30%) (Martinelli et al., 2016). Prior silymarin extraction it is necessary to remove oil from the fruits, making oil an important byproduct of silymarin production. Moreover, its high fruit and biomass yield under low input conditions makes this species an interesting multipurpose crop for marginal Mediterranean environments (Afshar et al., 2014, Domínguez et al., 2017, Sulas et al., 2008).
Only in 2019, and limited to the PubMed database, more than 300 studies have been published dealing with S. marianum medicinal properties, suggesting that milk thistle is a species of interest in the current scientific scenario worldwide. Despite that, as for most non-model crop, only limited molecular studies exist and specific analytical protocols for functional genomic and transcriptomic studies are not available yet. In these species, very few genes involved in specific biosynthetic pathways have already been identified and their transcriptional patterns studied. However, despite next generation sequencing technology has enabled a quick and cost-effective analysis of whole genomes and transcriptomes, S. marianum genomic information is still limited and only a representative first draft whole genome sequence (Accession number: ASM154182v1) and transcriptome (Accession number: PRJNA298444) has been recently released (Lv et al., 2017).
The discovery of genes involved in silymarin biosynthesis and accumulation in fruits integument during development (El-Garhy et al., 2016, Elsayed et al., 2019, Lv et al., 2017, Sanjari et al., 2015, Soliman et al., 2018, Torres and Corchete, 2016) or in the synthesis of other valuable constituents of the fruits (Alemardan et al., 2013) are interesting research topics in S. marianum. The identification of key genes involved in fruit shattering is another milestone toward the genetic improvement of this crop (Martinelli, 2019). Moreover, studies regarding leaves variegation recently provided new information in this species about physiological adaptation to heat, UV radiation or defence from herbivores but, again, the involvement of specific genes is not known yet (Shelef et al., 2019). Finally, molecular studies of roots would be useful for a better comprehension of this tissue in order to improve the silymarin production by hairy root cultures (Hasanloo et al., 2008, Rahnama et al., 2013).
Quantitative real-time PCR (RT-qPCR) is an efficient strategy to estimate transcripts expression levels and the accuracy of its results is largely due to the expression stability of the reference genes. Accurate normalization of transcriptional data using selected reference controls enables the comparison of mRNA concentrations across samples (Bustin et al., 2009). Many studies have reported significant variations in the expression of reference genes in different experimental conditions and plant tissues. Therefore, the identification of reference genes with stable expression across samples, and their validation in every new experiment, cell type, tissue, species is a pre-requisite for a reliable transcriptional analysis (Barbierato et al., 2017). As reported in the Minimum Information for Publication of Quantitative Real-Time PCR Experiments (MIQE) guide, normalization using a single reference gene is no more acceptable and “the optimal number and choice of reference genes must be experimentally determined, and the method reported” (Bustin et al., 2009).
Genes involved in the maintenance of basic cellular functions, named housekeeping genes, are good candidates to be used for normalization as they are expected to be stably expressed in the cells (Wu et al., 2016). Tubulin alpha (TUBα), Tap42-interacting protein of 41 kDa (TIP41), NADH dehydrogenase (NADH), 18S rRNA (18S), Actin (ACT), ubiquitin conjugating enzyme E2 (UBC), monensin sensivity1 (MON1) and protein phosphatase 2A (PP2A) genes are among the most common reference genes in plants (Bustin, 2002, Czechowski et al., 2005, Expósito-Rodríguez et al., 2008, Goidin et al., 2001, Kim et al., 2003). Nevertheless, it has been reported that these genes are expressed diversely according to genotypes, tissues, stress conditions or developmental stages (Dheda et al., 2004, Garrido et al., 2020, Gutierrez et al., 2008, Nikalje et al., 2018, Reddy et al., 2013, Sinha et al., 2015). Therefore, the assessment of their expression reliability under certain conditions and between different tissues is highly recommended to ensure the accuracy of results.
In this study, 10 reference genes have been analyzed with the aim to provide an appropriate selection of reference genes for RT-qPCR data normalization in S. marianum. The stability of the selected candidates was assessed in different physiological tissues that can be site of expression of interesting target candidate genes, such as fruits at different developmental stages but also leaves, stems and roots. Moreover, the selected reference genes were validated by analyzing the transcriptional profiles of target genes involved in silymarin and fatty acids biosynthesis. To the best of our knowledge, the current study is the first report on a systematic analysis for selection of reference genes in various milk thistle tissues and development stages and will boost genomic and functional studies in this species.
Section snippets
Plant materials
The S. marianum accession n° RCAT 057475 originated from Germany and was obtained from the Institute for Agrobotany (Nébih, Tápiószele, Hungary). Seeds were sown in autumn in the experimental field of Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria (CREA) in Bologna, North Italy (N:44°34′30.8″; E:11°09′55.6″). Fruits were harvested from 3 plants (biological replicates) at each BBCH stage 71, 75 and 79 (Martinelli et al., 2015), for a total of 9 samples. Stem sampling
Primers specificity and transcriptional profile of candidate reference genes
A total of 10 candidate reference genes were studied by RT-qPCR in order to identify the most stable ones to be used for data normalization in gene expression analysis in S. marianum. Out of these, two sequences were already available in the nucleotide database of NCBI (NADH dehydrogenase subunit F, ID: KC589999.1 and 18S, ID: AJ831537.1) while the remaining ones were retrieved from the S. marianum whole genome sequence, by sequence similarity search with known genes from other plant species.
Discussion
Several techniques can be used to investigate gene expression at transcriptional level. RT-qPCR is one of the most utilized, as it allows to reliably quantify transcripts abundance of target genes in a precise and reproducible way (Bustin et al., 2009). However, because of its sensitivity, it is highly subjected to technical variations that can severely affect the data if a correct normalization is not performed. The selection of suitable reference genes, whose transcription is not influenced
Conclusions
The present study focused on S. marianum, an officinal plant widely cultivated for the production of silymarin. In order to improve genetic and molecular knowledge on this species and on its peculiar metabolic pathway, a pilot study was conducted to identify for the first time reference genes sets for accurate normalization of the transcript levels by RT-qPCR analyses. The expression stability of 10 candidate genes was evaluated in six different tissues, integrating the results with five
CRediT authorship contribution statement
Fulvio Flavia: Conceptualization, Methodology, Validation, Formal analysis, Investigation, Writing - original draft, Visualization. Martinelli Tommaso: Resources, Project administration. Paris Roberta: Conceptualization, Methodology, Supervision.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
Authors thank Dr. Laura Bassolino of Consiglio per la ricerca in agricoltura e l’analisi dell’economia agraria – Centro di ricerca Cerealicoltura e Colture Industriali for her valuable advise and for reading the manuscript.
References (73)
- et al.
Interactive effect of deficit irrigation and soil organic amendments on seed yield and flavonolignan production of milk thistle (Silybum marianum L. Gaertn.)
Ind. Crops Prod.
(2014) - et al.
Novel energy crops for Mediterranean contaminated lands: Valorization of Dittrichia viscosa and Silybum marianum biomass by pyrolysis
Chemosphere
(2017) - et al.
Silybin content and overexpression of chalcone synthase genes in Silybum marianum L. plants under abiotic elicitation
Plant Physiol. Biochem.
(2016) - et al.
Ribosomal 18S RNA prevails over glyceraldehyde-3-phosphate dehydrogenase and β-actin genes as internal standard for quantitative comparison of mRNA levels in invasive and noninvasive human melanoma cell subpopulations
Anal. Biochem.
(2001) - et al.
A MicroRNA Targeting Dicer for Metastasis Control
Cell
(2010) - et al.
Arabidopsis mutants deficient in polyunsaturated fatty acid synthesis: biochemical and genetic characterization of a plant oleoyl-phosphatidylcholine desaturase
J. Biol. Chem.
(1992) - et al.
Identification and validation of reference genes for quantitative real-time PCR under salt stress in a halophyte, Sesuvium portulacastrum
Plant Gene
(2018) - et al.
Plants have isoforms for acyl carrier protein that are expressed differently in different tissues
J. Biol. Chem.
(1985) - et al.
Molecular cloning and functional expression of the gene for a cotton Δ-12 fatty acid desaturase (FAD2)
Biochimica et Biophysica Acta (BBA) - Gene Structure and Expression
(2001) - et al.
Light response and potential interacting proteins of a grape flavonoid 3′-hydroxylase gene promoter
Plant Physiol. Biochem.
(2015)
Gene expression and flavonolignan production in fruits and cell cultures of Silybum marianum
J. Plant Physiol.
Milk Thistle in liver diseases: Past, present, future.
Phytotherapy Res.
Therapy of toxic liver pathologies with Legalon
Zeitschrift fur Klinische Medizin
Breeding objectives and selection criteria for milk thistle [silybum marianum (L.) gaertn.] improvement
Notulae Botanicae Horti Agrobotanici Cluj-Napoca
Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets
Cancer Res.
Silybum marianum: Non-medical exploitation of the species
Ann. Appl. Biol.
A spiking strategy facilitates housekeeping selection for RT-qPCR analysis under different biotic stresses in eggplant
Protoplasma
Quantification of mRNA using real-time reverse transcription PCR (RT-PCR): Trends and problems
J. Mol. Endocrinol.
The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments
Clinical Chem.
Silymarin: Not just another antioxidant
J. Basic Clin. Physiol. Pharmacol.
A large and functionally diverse family of Fad2 genes in safflower (Carthamus tinctorius L.)
BMC Plant Biol
RNase III: Genetics and Function; Structure and Mechanism
Annu. Rev. Genet.
Monensin Sensitivity1 (MON1)/calcium caffeine zinc sensitivity1 (CCZ1)-mediated Rab7 activation regulates tapetal programmed cell death and pollen development
Plant Physiol.
Genome-wide identification and testing of superior reference genes for transcript normalization in arabidopsis
Plant Physiol.
Chalcone synthase and its functions in plant resistance
Phytochem. Rev.
Validation of housekeeping genes for normalizing RNA expression in real-time PCR
Biotechniques
The impact of drought stress on antioxidant responses and accumulation of flavonolignans in milk thistle (Silybum marianum (l.) gaertn)
Plants
Selection of internal control genes for quantitative real-time RT-PCR studies during tomato development process
BMC Plant Biol.
Selection of reference genes for quantitative real-time PCR in bamboo (Phyllostachys edulis)
PLoS ONE
Identification and validation of reference genes for RT-qPCR normalization in wheat meiosis
Sci. Rep.
Selection and validation of appropriate reference genes for quantitative real-time PCR analysis in Salvia hispanica
PloS one
The lack of a systematic validation of reference genes: A serious pitfall undervalued in reverse transcription-polymerase chain reaction (RT-PCR) analysis in plants
Plant Biotechnol. J.
Elicitor-enhanced Silymarin production in hairy root cultures of Silybum marianum L
Gaertn. Planta Medica
qBase relative quantification framework and software for management and automated analysis of real-time quantitative PCR data
Genome Biol.
Reference gene selection for qPCR Is dependent on cell type rather than treatment in colonic and vaginal human epithelial cell lines
PLoS ONE
Expression levels of microRNA machinery components Drosha, dicer and DGCR8 in human (AGS, HepG2, and KEYSE-30) cancer cell lines
Int. J. Clin. Exp. Med.
Cited by (8)
Silybum marianum chemotype differentiation is genetically determined by factors involved in silydianin biosynthesis
2023, Journal of Applied Research on Medicinal and Aromatic PlantsCitation Excerpt :Relative target genes expression was calculated using the 2-ΔΔCt method, adjusted for the efficiency of PCR amplification of each gene according to Pfaffl (2001). Normalisation was done with four reference genes (Vandesompele et al., 2002) chosen after Fulvio et al. (2021). G20 was used as control.
A single point mutation in a member of FAD2 multigene family resulted in the generation of a high oleic line of Silybum marianum (L.) Gaertn.
2022, Industrial Crops and ProductsCitation Excerpt :Relative gene expression levels were calculated using the ‘Standard Curve Method’ (Larionov et al., 2005) and presented as the mean ± standard error of the mean (SE) of three biological replicates. Internal control genes used for data normalization were selected based on Fulvio et al. (2021). Genotypes G20 and G62 were crossed according to Martinelli (2018).
Microbial degradation of the benzimidazole fungicide carbendazim by Bacillus velezensis HY-3479
2023, International MicrobiologySelection and Validation of Reference Genes for RT-qPCR in Phalaenopsistype Dendrobium Hybrid
2022, Acta Horticulturae Sinica