Identification of loci controlling timing of stem elongation in red clover using genotyping by sequencing of pooled phenotypic extremes

Ergon, Åshild; Milvang, Øystein W.; Skøt, Leif; Ruttink, Tom

doi:10.1007/s00438-022-01942-x

Original Article
Open access
Published: 24 August 2022

Volume 297, pages 1587–1600, (2022)

Molecular Genetics and Genomics

Åshild Ergon ORCID: orcid.org/0000-0003-1275-0450¹,
Øystein W. Milvang¹,
Leif Skøt² &
Tom Ruttink³

Abstract

Main conclusion

Through selective genotyping of pooled phenotypic extremes, we identified a number of loci and candidate genes putatively controlling timing of stem elongation in red clover.

Abstract

We have identified candidate genes controlling the timing of stem elongation prior to flowering in red clover (Trifolium pratense L.). This trait is of ecological and agronomic significance, as it affects fitness, competitivity, climate adaptation, forage and seed yield, and forage quality. We genotyped replicate pools of phenotypically extreme individuals (early and late-elongating) within cultivar Lea using genotyping-by-sequencing in pools (pool-GBS). After calling and filtering SNPs and GBS locus haplotype polymorphisms, we estimated allele frequencies and searched for markers with significantly different allele frequencies in the two phenotypic groups using BayeScan, an F_ST-based test utilizing replicate pools, and a test based on error variance of replicate pools. Of the three methods, BayeScan was the least stringent, and the error variance-based test the most stringent. Fifteen significant markers were identified in common by all three tests. The candidate genes flanking the markers include genes with potential roles in the vernalization, autonomous, and photoperiod regulation of floral transition, hormonal regulation of stem elongation, and cell growth. These results provide a first insight into the potential genes and mechanisms controlling transition to stem elongation in a perennial legume, which lays a foundation for further functional studies of the genetic determinants regulating this important trait.

Introduction

The timing of sexual reproductive development is an important agronomic attribute of all agricultural crops, and a trait with implications for fitness in natural populations. Proper timing in relation to the environmental conditions, in particular temperature and photoperiod, ensures a large and timely harvest of seed crops, but is also important in grassland species, in which the above-ground biomass is grazed or harvested for use as animal feed. For example, during the first stage of reproductive development of red clover (Trifolium pratense L.), the above-ground biomass increases fast. The timing is therefore decisive for the seasonal pattern of productivity, and in addition, it affects the competitive interactions with weeds and companion species. The timing also affects forage quality, as an increasing proportion of stem tissue reduces the digestibility. In spite of its importance in relation both to plant breeding and to fitness in natural ecosystems, there is very limited information available on the genetic control of the timing of transition to reproductive development in the large genus Trifolium. We therefore set out to identify genetic markers, chromosomal regions and candidates for genes and processes controlling timing of reproductive development in red clover, a perennial legume used extensively in production of silage and hay in temperate regions.

In its vegetative stage, red clover grows as a rosette consisting of many branches with non-elongating stems. The first visible sign of transition from vegetative to reproductive development is the initiation of stem elongation and appearance of internodes on some of the branches. This occurs in response to long photoperiods and increasing temperatures. There is little requirement for vernalization in red clover, but in some populations, the proportion of flowering plants increases after cold treatment (Fejer 1960; Van Dobben 1964). Under Nordic conditions, red clover of Nordic origin remains in a vegetative state throughout the year of sowing, while populations originating from further south can enter reproductive development. The reason for this difference is not known, but it might be due to a combination of juvenility, a facultative vernalization requirement and a very long photoperiod requirement in the Nordic populations. The transition from vegetative to reproductive development of the shoot apical meristem (SAM) and the onset of stem elongation are two components of sexual reproduction in rosette-forming dicots. Floral transition and stem elongation are usually coordinated processes, but stem elongation can be induced independently from floral transition (see examples in McKim 2020). Also, the two processes occur in different tissues in the shoot apex and may have slightly different responses to environmental variables (including red clover; Ergon et al. 2016).

Trifolium belongs to the vicioid/galegoid clade of the legume subfamily Papilionoideae, that also comprises the model species pea (Pisum sativum) and Medicago truncatula (Wojciechowski et al. 2004). Using sequences known from Arabidopsis thaliana, Hecht et al. (2005), Jung et al. (2012) and Kim et al. (2013) screened sequence databases for flowering-related genes of M. truncatula, Lotus corniculatus and Glycine max. They found that the majority of genes known from A. thaliana were present as orthologs in the legume genomes; some of the gene families appeared to have undergone differential expansion. Orthologs of some central genes (CONSTANS (CO), FLOWERING LOCUS C (FLC) and FRIGIDA (FRI)) appeared to be missing, while orthologs of genes with similarity to CO and FRI were present. Putterill et al. (2013), Liew et al. (2014) and Weller and Ortega (2015) have reviewed what is known about the genetic control of flowering time in the legume family, which currently relates mainly to the photoperiod and vernalization/autonomous pathways. Compared to floral transition, the knowledge on the genetic control of stem elongation in rosette-forming dicots in general is limited (reviewed by Serrano-Mislata and Sablowski 2018; McKim 2019, 2020).

Red clover is an outbreeding species with a gametophytic self-incompatibility system (Taylor 1982). Thus, there is a considerable amount of genetic variation within cultivars, which are usually synthetic populations with a large number of parents. The genome size is estimated to be 420 Mb (Sato et al. 2005). Sequences of the red clover genome have been published by Ištvánek et al. (2014) and De Vega et al. (2015), but only the latter represents a draft genome at pseudo-chromosome level, covering 309 Mb of the genome (of which 164 Mb are placed on chromosomes). In a classical linkage map QTL study of red clover, Herrmann et al. (2006) identified eight QTLs for flowering time spread across all chromosomes except chromosome 1. In a recent study of a collection of 70 European and Asian natural populations and five modern cultivars by Jones et al. (2020), a genome-wide association study of flowering time identified one significant marker in a gene with homology to VEG2, a gene associated with flowering and inflorescence development in pea.

Our approach to advance our understanding of the timing of reproductive development in red clover was to identify individuals with extremely early or late stem elongation within one variety, sequence these in separate pools, identify polymorphisms, and search for genomic regions with significantly different allele frequency estimates in the two phenotypic groups (similar to a bulked segregant analysis). Sequencing pools of DNA from many individuals using reduced representation genotyping, e.g., genotyping-by-sequencing (GBS, Elshire et al. 2011) has emerged as an efficient means of estimating allele frequencies of a large number of loci in different types of (sub)populations (Dorant et al. 2019; Ergon et al. 2019). This is particularly useful for outbreeding species like red clover, where, particularly in a breeding context, it often is more relevant to characterize the genome at population or family level than on individual level. Sequencing of pools of individuals rather than separate individuals is a cost-effective way of obtaining accurate allele frequency estimates for a large number of markers and populations (Gautier et al. 2013; Byrne et al. 2013). In addition to single-nucleotide polymorphisms (SNPs), we utilized additional sequence polymorphisms that can be detected as variation in start and end points of the GBS read mapping positions (SMAPs, Stack Mapping Anchor Point polymorphisms), as well as short haplotypes defined by SNPs and SMAPs within reads (Schaumont et al. 2022). Single SNPs can be shared by several haplotype variants, and therefore be associated with more than one allele of nearby candidate genes. In such cases, allele frequencies may be aggregated at the SNP level, thus masking associations with the phenotype, whereas associations may be revealed by multi-allelic haplotype information (Rafalski 2002; Hamblin and Jannink 2011). 
Following SNP and haplotype calling, we employed three different methods to identify loci with significantly different allele frequencies in the two phenotypic groups: BayeScan (Foll and Gaggiotti 2008), as well as two methods that utilize replicates of pools (Kawaguchi et al. 2018; Ergon et al. 2019), for stringent identification of genomic regions involved in regulation of the timing of reproductive development in red clover.

Materials and methods

Phenotyping and genotyping

Seeds of red clover (Trifolium pratense L., cultivar ‘Lea’ from Graminor AS) were sown in a greenhouse in September 2015 and grown at approximately 16 °C with a 20 h photoperiod (natural light supplied with 90 µmol m⁻² s⁻¹ PAR (HPQ/HTI-P lamps)). The number of days from sowing until the first elongating internode was 2 cm long (days to elongation, DTE) was recorded for each plant. The 52 earliest and 52 latest individuals (excluding non-elongating individuals) were selected for genotyping by pool-GBS. Pools were created as follows: DNA was extracted from leaf tissue of each individual with the DNeasy 96 Plant kit (Qiagen). The 52 individuals in each of the two phenotypic groups (early, late) were randomly divided into three subgroups of 17–18 individuals and equal amounts of DNA from each individual in each subgroup were combined in a pool, creating a total of six DNA pools. Each DNA pool was distributed into 15–16 wells per 96-well plate to create replicate GBS libraries. One plate each was used for PstI and ApeKI single-digest GBS library preparation according to Elshire et al. (2011) and single-end sequenced on an Illumina HiSeq2000 instrument by Cornell University, Biotechnology Resource Center.
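The random division of each phenotypic group into three replicate pools can be sketched as follows. This is a minimal illustration and not the authors' procedure: the function name `split_into_pools`, the plant identifiers and the fixed seed are all assumptions made here for reproducibility.

```python
import random

def split_into_pools(individuals, n_pools=3, seed=0):
    """Randomly partition individuals into n_pools subgroups of near-equal size."""
    shuffled = list(individuals)
    random.Random(seed).shuffle(shuffled)  # seeded only so the sketch is reproducible
    # Taking every n-th individual gives subgroup sizes that differ by at most one.
    return [shuffled[i::n_pools] for i in range(n_pools)]

# Hypothetical identifiers for the 52 earliest-elongating plants.
early = [f"early_{i:02d}" for i in range(52)]
pools = split_into_pools(early)
print(sorted(len(p) for p in pools))
```

With 52 individuals and three pools, the subgroups contain 17, 17 and 18 plants, matching the pool sizes described above.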

After demultiplexing, barcodes and 5′ restriction site remnants were removed, and reads were trimmed to a maximum length of 74 bp (ApeKI) or 86 bp (PstI). Reads were aligned with BWA-MEM (Li 2013) to the red clover reference genome sequence v2.1 (De Vega et al. 2015), in which 164 Mb ungapped sequence length out of a total estimated genome size of 420 Mb has been allocated to chromosomes. Alignments were sorted, indexed and filtered on mapping quality 20 (q20) with SAMtools 1.10 (Li et al. 2009). BAM files were converted to mpileup files with SAMtools and filtered to retain only genome positions with a minimum read depth of 30, thus joining the neighboring GBS stacks and excluding the part of the genome without coverage. Mpileup files were used to calculate Watterson’s theta estimator with NPStat v0.99 (Ferretti et al. 2013) with minor allele count equal to one read (MAC1), window size equal to 10,000 bp, haploid sample size of 36, and maximum coverage equal to 8000. Per pool, a single genome-wide theta value was calculated as the mean across all windows (about 500 windows per sample). The NPStat-derived theta values per pool-GBS sample were used as diversity prior for the Bayesian SNP calling algorithm implemented in SNAPE-pooled (Raineri et al. 2012) to identify significant SNPs in each pooled sample. SNAPE-pooled was run with settings: -priortype = informative, -fold = folded, -nchr = 36 for consistency with NPStat.

We used a custom Python script to apply filters on the SNAPE-pooled reference allele frequency (RAF) data. Filters i–iv were applied in the following order and per sample: (i) SNP positions were deleted if the reference allele was not A, C, G, or T; (ii) SNP frequencies were set to missing data per sample when the two observed alleles were both different from the reference allele, or when the sum of the reference and the alternative allele read counts was lower than 30; (iii) using the Bayesian estimates of the probability of allele presence provided by SNAPE-pooled, we set the alternative allele frequency (and allele counts) to 0 if p(freq_alt ≠ 0) < 0.95 and the reference allele frequency (and allele counts) to 0 if p(freq_ref ≠ 0) < 0.95; and (iv) we filtered out loci that had low coverage (below the minimal read depth of 27) after read counts were removed by filter iii. Next, we integrated all SNP frequency data into one matrix with six samples and all polymorphic loci, and applied filters v–vi: (v) we discarded SNP positions with more than two remaining alleles across the six samples per GBS library type; (vi) we retained only markers with an RAF between 0.05 and 0.95 (informative loci) in at least one sample, and frequency data in all six samples.
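The depth and probability filters described above (the read depth part of filter ii, filters iii and iv, and filter vi) can be sketched as below. This is not the authors' script: the function names and argument layout are hypothetical, the allele identity checks of filters i, ii and v are omitted, and treating the 0.05 and 0.95 bounds as inclusive is an assumption.

```python
def filter_sample_call(ref_count, alt_count, p_ref_present, p_alt_present,
                       min_depth_initial=30, min_depth_final=27, min_prob=0.95):
    """Return the reference allele frequency (RAF) of one SNP in one pool, or None if missing."""
    # Filter ii (depth part): too few reads across the two alleles.
    if ref_count + alt_count < min_depth_initial:
        return None
    # Filter iii: zero the counts of an allele whose presence is not well supported.
    if p_alt_present < min_prob:
        alt_count = 0
    if p_ref_present < min_prob:
        ref_count = 0
    # Filter iv: re-check the depth that remains after filter iii.
    depth = ref_count + alt_count
    if depth < min_depth_final:
        return None
    return ref_count / depth


def keep_marker(rafs, low=0.05, high=0.95):
    """Filter vi: data in every pool, and an informative RAF in at least one pool."""
    if any(r is None for r in rafs):
        return False
    return any(low <= r <= high for r in rafs)


# Hypothetical read counts for one SNP in one pool.
print(filter_sample_call(40, 10, 1.0, 1.0))
print(keep_marker([1.0, 1.0, 1.0, 1.0, 1.0, 0.8]))
```

Returning `None` for missing data keeps the per-sample step separate from the matrix-level step, which mirrors the order in which the filters are described.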

Next, haplotypes, of which there may be more than two variants per GBS locus, were called and their relative frequencies estimated using the SMAP package (Schaumont et al. 2022). The SMAP package is available on GitLab (at https://gitlab.com/truttink/smap/) and a detailed description of the working procedure and guidelines is available online in the User Manual (https://ngs-smap.readthedocs.io/en/dev/home.html). In short, module SMAP delineate defines GBS loci by locating the outer positions of ‘stacked’ read mappings (called Stack Mapping Anchor Points (SMAPs)). SMAP delineate thus selects regions of the genome consistently covered by read mapping across the sample set, while simultaneously delineating start and end points for read-backed haplotyping of the SNPs identified with SNAPE-pooled (see above). The SMAP package further exploits polymorphisms in read mapping positions as additional information to define haplotype strings. A set of indexed BAM files with mapped reads, a custom BED file with GBS locus positions, and a VCF file with SNP positions then serve as input for the module SMAP haplotype-sites. SMAP haplotype-sites evaluates the read-reference alignment at each polymorphic position within a locus and creates a short haplotype string per read that combines the call of neighboring polymorphisms (SNPs and SMAPs) across the genome region covered by the GBS locus. It then counts the read depth per unique haplotype per locus, integrates all haplotype counts across all loci and samples, quantifies the relative abundance of haplotypes per locus and finally outputs the haplotype frequency table. Here, a GBS locus that contains multiple haplotype alleles is referred to as a haplotype polymorphism (HTP).
SMAP delineate was run with parameters: mapping_orientation stranded, min_mapping_quality 30, min_cluster_length 50, max_cluster_length 130, min_stack_depth 5, max_stack_depth 1500, min_cluster_depth 30, max_cluster_depth 1500, max_stack_number 20, min_stack_depth_fraction 5, completeness 0, max_smap_number 20. A VCF file with SNP positions was used for haplotyping after filtering SNPs with data in at least one pool, only 2 alleles across all DNA-pool samples with data, and an RAF SNP frequency between 0.05 and 0.95 in at least one of the pools (see above). SMAP haplotype-sites was run with parameters: mapping_orientation stranded, partial include, mapping_quality 30, min_read_count 30, no-indels, min_haplotype_frequency 5 (meaning MAF > 0.05 in at least one of the DNA pools). After SMAP haplotype-sites, only loci with at least 30 reads (in total across all haplotype variants) in each of the six pools were retained, because downstream analyses required a complete genotype call matrix.
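The counting step at the end of read-backed haplotyping (per-read haplotype strings tallied into relative frequencies, with a minimum read count per locus) can be illustrated with a toy sketch. It does not call or reproduce the SMAP package: `haplotype_frequencies` is a hypothetical helper, and the two-character strings are placeholder haplotype labels rather than actual SMAP output.

```python
from collections import Counter

def haplotype_frequencies(read_haplotypes, min_read_count=30):
    """Relative frequency of each haplotype string at one locus in one pool.

    Returns None when the locus has fewer than min_read_count reads in total.
    """
    counts = Counter(read_haplotypes)
    total = sum(counts.values())
    if total < min_read_count:
        return None
    return {hap: n / total for hap, n in counts.items()}


# Hypothetical locus with two polymorphic sites, one haplotype string per read.
reads = ["00"] * 20 + ["01"] * 15 + ["10"] * 5
print(haplotype_frequencies(reads))
```

A locus with three or more distinct strings, as in this example, is multi-allelic, which is the situation in which haplotypes can carry information that a single SNP frequency would average out.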

To visualize differentiation between the six pooled samples, a principal component analysis (PCA) was performed in The Unscrambler X v.10.3 (Camo Software, Norway), using minor allele frequencies (MAF) of all identified SNPs with a known chromosomal location.

Identification of loci with significantly different allele frequencies in early versus late elongating groups

For each GBS dataset (PstI and ApeKI), both SNPs and HTPs with significantly different allele frequencies in early and in late elongating pools were identified using three different methods: (i) an F_ST-based test utilizing replicate pools (method 1, adapted from Ergon et al. 2019); (ii) a test based on error variance of replicate pools (method 2, adapted from Kawaguchi et al. 2018); and (iii) BayeScan v2.1 (Foll and Gaggiotti 2008). A previous study showed that the first two principal components in a PCA of SNP data for 86 individuals derived from the same seed lot of cultivar ‘Lea’ as studied here explained only a few percent of the genetic variation (De Vega et al. 2015), suggesting that there is very little genetic structure and linkage disequilibrium. Hence, we did not consider it necessary to take population structure into consideration in the current analyses.

With method 1, for each SNP or HTP, allele frequencies in each of the six pools were compared with the average allele frequency in the three pools of the contrasting phenotype. Pairwise F_ST values, \(F_{ST} = \frac{\overline{q^{2}} - \overline{q}^{2}}{\overline{q}\left(1 - \overline{q}\right)}\), were calculated, where \(\overline{q}\) and \(\overline{q^{2}}\) represent the weighted average of the allele frequencies and of the squared allele frequencies in the pairwise comparison, respectively (when q was zero in both, F_ST was set to zero). This resulted in a total of six F_ST values for each SNP and HTP variant. A Chi-square test was used to identify significant F_ST values at different P-levels, using the test statistic X² = 2NF_ST, where 2N is the sum of genotyped gametes in the two populations (Hedrick 2011). Only SNPs and HTPs with a significant F_ST in all six comparisons, and a consistently higher allele frequency in early pools relative to the average of the late pools and vice versa, were regarded as significant at the given P-level. Corresponding estimates of the false discovery rate (FDR) were calculated for different P-levels as \(\left(\frac{l*P^{3}}{d}\right)^{2}\), where l is the number of SNPs or haplotypes tested and d is the number of SNPs or haplotypes identified as significant, and a P-level corresponding to an FDR of 0.05 was chosen. The formula for calculating FDR is modified compared to the standard \(\frac{l*P}{d}\), because of the requirement of significance in all three replicate pools, and in both directions.
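A single pairwise comparison of method 1 can be sketched as follows. This is a sketch under stated assumptions, not the authors' code: the weights are taken to be the number of gametes in each group, the Chi-square test is assumed to have one degree of freedom (a biallelic marker in two groups), and all function names are hypothetical.

```python
import math

def pairwise_fst(q1, q2, w1, w2):
    """F_ST from weighted means of q and q^2 in a two-group comparison."""
    total = w1 + w2
    q_bar = (w1 * q1 + w2 * q2) / total
    q2_bar = (w1 * q1 ** 2 + w2 * q2 ** 2) / total
    denom = q_bar * (1 - q_bar)
    if denom == 0:
        # The allele is absent (or fixed) in both groups, so F_ST is set to zero.
        return 0.0
    # Guard against tiny negative values caused by floating-point rounding.
    return max(0.0, (q2_bar - q_bar ** 2) / denom)


def chi2_sf_df1(x):
    """Upper-tail probability of a Chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2))


def fst_test(q_pool, q_contrast_mean, gametes_pool, gametes_contrast):
    """Return (F_ST, P) using the test statistic X^2 = 2N * F_ST."""
    fst = pairwise_fst(q_pool, q_contrast_mean, gametes_pool, gametes_contrast)
    x2 = (gametes_pool + gametes_contrast) * fst
    return fst, chi2_sf_df1(x2)


def fdr_method1(n_tested, p_level, n_significant):
    """FDR estimate in the modified form given in the text: ((l * P^3) / d)^2."""
    return (n_tested * p_level ** 3 / n_significant) ** 2


# Hypothetical frequencies: one early pool of 18 plants (36 gametes) compared
# with the mean of the three late pools (104 gametes).
fst, p_value = fst_test(0.20, 0.60, 36, 104)
print(fst, p_value)
```

In the procedure described above, a marker would count as significant only if all six such comparisons pass at the chosen P-level and the direction of the difference is consistent.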

With method 2, a Chi-square test was used to identify SNPs and HTP variants with significantly different average allele frequency in early versus late elongating pools at different P-levels, using the test statistic Z² = \(\frac{\left(\overline{q}_{L} - \overline{q}_{E}\right)^{2}}{V1_{L} + V1_{E} + V2_{L} + V2_{E}}\), where \(\overline{q}\) = the weighted average allele frequency across three replicate pools, V1 = \(\frac{\overline{q}\left(1 - \overline{q}\right)}{2N_{ind}}\), the variance due to sampling of the N_ind = 52 individuals in each phenotypic class, and V2 = \(\frac{\sigma_{rep}^{2}}{N_{rep}}\), the experimental variance between the N_rep = 3 replicate pools in each phenotypic class. Subscripts L and E refer to the late and early phenotypic classes, respectively. Here, V1 considers the variance due to the sampling of 52 individuals, and V2 considers the subsampling of 17 or 18 individuals out of 52, and all other experimental variance between (technical) replicates. Corresponding estimates of the false discovery rate (FDR) were calculated for different P-levels as \(\frac{l*P}{d}\), where l is the number of SNPs or haplotypes tested and d is the number of SNPs or haplotypes identified as significant, and a P-level corresponding to an FDR of 0.05 was chosen.
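The method 2 statistic can be sketched as below. Several choices are assumptions made for illustration and are not stated in the text: the replicate frequencies are weighted by pool size, the replicate variance is the unbiased sample variance of the three pool frequencies, and Z² is compared to a Chi-square distribution with one degree of freedom. Function names are hypothetical.

```python
import math
import statistics

def class_stats(freqs, pool_sizes, n_ind=52):
    """Weighted mean frequency, V1 and V2 for one phenotypic class."""
    q_bar = sum(f * n for f, n in zip(freqs, pool_sizes)) / sum(pool_sizes)
    v1 = q_bar * (1 - q_bar) / (2 * n_ind)           # sampling of individuals
    v2 = statistics.variance(freqs) / len(freqs)     # variance between replicate pools
    return q_bar, v1, v2


def z2_test(early_freqs, late_freqs, early_sizes, late_sizes):
    """Return (Z^2, P) for the early versus late difference in mean allele frequency."""
    q_e, v1_e, v2_e = class_stats(early_freqs, early_sizes)
    q_l, v1_l, v2_l = class_stats(late_freqs, late_sizes)
    denom = v1_l + v1_e + v2_l + v2_e
    if denom == 0:
        # Fixed in both classes with no replicate variance: no test is possible
        # unless the classes are fixed for different alleles.
        return (math.inf, 0.0) if q_l != q_e else (0.0, 1.0)
    z2 = (q_l - q_e) ** 2 / denom
    return z2, math.erfc(math.sqrt(z2 / 2))


# Hypothetical replicate frequencies for one marker.
z2, p_value = z2_test([0.2, 0.2, 0.2], [0.8, 0.8, 0.8], (18, 17, 17), (18, 17, 17))
print(z2, p_value)
```

Because V2 grows with disagreement between replicate pools, a marker with noisy replicates needs a larger frequency difference to reach the same P-level.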

For the analysis with BayeScan, allele frequencies were converted into absolute allele numbers, using the number of haploid genomes that had been sequenced in each pool as the total number of alleles (34 for pools of 17 individuals, 36 for pools of 18 individuals). The allele numbers from the three replicate pools of each phenotypic group were summed, resulting in one value per marker (SNP or HTP) for both the early and the late group, which were analyzed using standard input parameters. BayeScan uses logistic regression to decompose locus-population F_ST values into population-specific and locus-specific components. Loci with different allele frequencies in the compared populations are identified as those where the locus-specific component is necessary to explain the observed variation.
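The conversion from pool allele frequencies to summed allele numbers can be sketched as follows. Rounding each pool to the nearest whole allele is an assumption (the text does not state how fractional counts were handled), and `pooled_allele_counts` is a hypothetical name; writing the BayeScan input file itself is not shown.

```python
def pooled_allele_counts(ref_freqs, n_individuals):
    """Sum reference and alternative allele numbers over replicate pools.

    Each pool of n diploid individuals contributes 2n haploid genomes.
    """
    ref_total = 0
    gametes_total = 0
    for q, n in zip(ref_freqs, n_individuals):
        n_gametes = 2 * n
        ref_total += round(q * n_gametes)  # assumption: round to the nearest whole allele
        gametes_total += n_gametes
    return ref_total, gametes_total - ref_total


# Hypothetical reference allele frequencies in the three early pools.
print(pooled_allele_counts([0.25, 0.5, 1.0], [18, 17, 17]))
```

For pools of 18, 17 and 17 individuals the totals always sum to 104 alleles per phenotypic group, which is the sample size BayeScan would see for each of the two groups.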

SNPs and HTPs identified by all three methods were retained for further description by searching for potential candidate genes in the 50 kb upstream and downstream genome region flanking each significant locus. Sequences and gene annotations were retrieved from the red clover genome sequence (Tp2.0, De Vega et al. 2015), using the Legume Information System (LIS, legumeinfo.org). The closest homologues in A. thaliana and M. truncatula were identified using coding sequences of the genes (CDS) as query in blastn searches (blast.ncbi.nlm.nih.gov).
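The search for candidate genes in the 50 kb flanking regions amounts to an interval overlap query, which can be sketched as below. The gene names and coordinates are invented for illustration, and counting a gene as a candidate when any part of it overlaps the window is an assumption.

```python
def genes_near_locus(genes, chrom, position, flank=50_000):
    """Names of genes overlapping the window [position - flank, position + flank]."""
    window_start = position - flank
    window_end = position + flank
    hits = []
    for name, gene_chrom, start, end in genes:
        # Two intervals overlap when each one starts before the other ends.
        if gene_chrom == chrom and start <= window_end and end >= window_start:
            hits.append(name)
    return hits


# Hypothetical annotation records: (name, chromosome, start, end).
annotation = [
    ("geneA", "chr1", 1_100_000, 1_105_000),
    ("geneB", "chr1", 1_120_000, 1_125_000),
    ("geneC", "chr1", 1_205_000, 1_215_000),
    ("geneD", "chr2", 1_150_000, 1_155_000),
]
print(genes_near_locus(annotation, "chr1", 1_160_000))
```

A linear scan is enough here because only a small number of significant loci are queried; a sorted index would matter only for genome-wide queries.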

Results

Phenotyping and genotyping

The number of days to stem elongation (DTE) varied from 23 to 94 (Fig. 1). Of the 672 plants tested, 146 plants did not elongate during the 94-day course of the experiment and were excluded. Fifty-two of the earliest elongating individuals (average DTE of 37.7, range 23–43, top 10%) were randomly assigned to one of three replicate early pools, and similarly, 52 of the latest elongating individuals (average DTE of 80.5, range 71–94, bottom 10%) were randomly assigned to one of three replicate late pools.

The six pools were genotyped by pool-GBS and a total of 229 M and 327 M high-quality reads were obtained for the PstI and ApeKI libraries, respectively. After SNP calling and further filtering, we obtained 12,074 and 91,136 SNPs with read depth > 30 in each of the six pools and MAF > 0.05 in at least one of the six pools from the PstI and ApeKI libraries, respectively (Supplementary File 1). Of these, 66,458 SNPs (64%) had a known chromosomal location, the rest were located on unplaced scaffolds. Haplotype calling and filtering (read depth > 30 in each sample, and haplotype MAF > 0.05 in at least one pool) resulted in 4653 HTPs with 2–13 haplotype variants per HTP locus (3.6 on average) based on the PstI libraries, and 41,073 HTPs with 2–20 haplotype variants per HTP locus (3.6 on average) based on the ApeKI libraries. Of these, 30,215 (66%) had a known chromosomal location. For the combined PstI and ApeKI libraries, this equals an average density of one HTP with known chromosomal location per 5.4 kb of the chromosome-anchored part of the reference genome sequence.
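The proportions and the marker density quoted above follow from the reported counts, as this short check shows. It uses only numbers given in the text, with 164 Mb taken as the chromosome-anchored length of the reference genome; the variable names are arbitrary.

```python
anchored_bp = 164_000_000            # chromosome-anchored part of the reference genome

snps_total = 12_074 + 91_136         # PstI + ApeKI SNPs
htps_total = 4_653 + 41_073          # PstI + ApeKI HTPs
snps_anchored = 66_458
htps_anchored = 30_215

snp_share = snps_anchored / snps_total
htp_share = htps_anchored / htps_total
kb_per_htp = anchored_bp / htps_anchored / 1000

print(f"SNPs with known chromosomal location: {snp_share:.0%}")
print(f"HTPs with known chromosomal location: {htp_share:.0%}")
print(f"Average spacing of anchored HTPs: {kb_per_htp:.1f} kb")
```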

A PCA of MAF data for the 66,458 chromosome-anchored SNPs showed that there was a large amount of random variation in allele frequencies among the replicate pools. However, the first principal component, explaining 22% of the variation, separated the three early pools from the three late pools (Fig. 2), indicating that variation between replicate pools is smaller than variation between early and late pools.

Identification of loci with significantly different allele frequencies in early versus late elongating groups

BayeScan detected a much higher number of markers with a significantly different allele frequency in the early elongating group versus the late elongating group than the two tests utilizing variation between replicates to evaluate significance. Among the latter, the F_ST-based method (method 1) detected a higher number than the method based on error variance (method 2) (Supplementary File 2). Across marker types, we detected a total of 20 loci that were significant according to all three methods. Of these, 15 had a known chromosomal location; 10 were detected based on the PstI data set and 5 on the ApeKI data set (Table 1). Ten of the 15 loci were found only in the SNP analyses, three only in the haplotype analyses, while only two loci were found both among SNPs and haplotypes. The 15 loci were distributed across all chromosomes except chromosome 2 and chromosome 5, with 1–6 loci per chromosome (Fig. 3). The maximum calculated F_ST value of markers within significant chromosomal loci ranged from 0.08 to 0.68.

Table 1 Loci with significantly different marker allele frequencies in early and late phenotypic groups of red clover, cultivar ‘Lea’, according to all of three different tests (BayeScan and two tests utilizing replicate pools) at a false discovery rate of 0.05

The 199 genes present in the ± 50 kb regions flanking the 15 loci are reported in Supplementary File 3; these are candidate genes that may contribute to the differentiation between early and late elongating groups. Within 13 of these regions, we identified a number of genes that, based on their functional annotation and similarity to genes with known functions in A. thaliana or M. truncatula, have potential roles in the vernalization, autonomous and photoperiod regulation of floral transition, as well as hormonal and cellular signaling of stem elongation and cell growth (Table 2 and Supplementary File 4).

Table 2 Selected candidate genes present in the ± 50 kb regions flanking markers with significantly different allele frequencies in early versus late elongating pools of red clover

Discussion

Marker density and linkage disequilibrium

Identification of genomic regions and candidates for genes that control the transition to reproductive development in red clover using GBS relies on the density of polymorphic read loci, and the linkage disequilibrium (LD) between these loci and the allelic variation that controls the phenotypic trait. If LD decays at short distances, the required marker density is high. The level of LD in a synthetic variety depends on the number of parents and the relatedness between them (i.e., the level of genetic diversity) as well as the number of generations after the original crossing between these parents (Rafalski 2002; Auzanneau et al. 2007). ‘Lea’ is a synthetic variety made from 33 parental genotypes (15 individuals of the Norwegian variety ‘Nordi’, 7 individuals from the Swedish variety/landrace ‘Bjursele’, and 11 individuals from a Norwegian synthetic population (LGRk 8801−2× = Syn 1/88 2×, Vestad 1990); pers. comm. Petter Marum, Graminor AS, Norway). Commercial seed was used in the present study; the exact number of generations is not known, but is likely to be around 4, and therefore the chromosomal segments inherited from the 33 parents are likely to be rather long. However, with this many diploid parents, there are theoretically up to 66 different haplotypes present per chromosomal segment. De Vega et al. (2015) characterized the same seed lot as the one we used in the present study and found that the average LD (scaled from 0 to 1) at distances of 100 kb ranged between 0.19 and 0.25 for the different chromosomes. At 500 kb, LD had decayed completely to background levels (0.02–0.05). In the present study, the average distance on the reference genome between polymorphic GBS loci was 5.4 kb, which appears to be sufficient to detect a large proportion of loci underlying the phenotypic differentiation, provided that the effect on the phenotype and the power of the statistical tests are strong enough.

Sequence variation in genes in the proximity of markers with significantly different allele frequency in early versus late pools potentially contributes to phenotypic variation in earliness of stem elongation. Based on a previous study of the LD decay in the same population (De Vega et al. 2015), we considered any gene within 50 kb flanking a significant marker as a candidate gene. Based on currently available knowledge about regulation of stem elongation and the transition to reproductive development in plants in general, we here discuss the possible role of the apparently most relevant candidate genes.

Regulation of transition to reproductive development

In a previous study of vernalized plants of three Norwegian cultivars, including Lea, which we used here, we found that the effect of ambient temperature on number of days to stem elongation leveled off between 10 and 14 °C (at both 16 and 20 h photoperiod; Ergon et al. 2016). DTE decreased as photoperiod increased from 16 to 20 h (at temperatures between 14 and 18 °C). In the current study, plants were grown in a greenhouse at 16 °C, with a 20 h photoperiod, and although these plants were not vernalized, we regard it likely that the ambient temperature requirement was saturated, while the photoperiod requirement might not have been. Although red clover generally has very little requirement for vernalization, some Norwegian material responds to vernalization by flowering earlier (Van Dobben 1964). Hence, we consider that the variation in timing of stem elongation that we observed within cultivar Lea in the present study is likely reflecting variation in photoperiod requirement, vernalization requirement and/or juvenility, and thus that the identified candidate genes flanking the significant markers may be involved in corresponding pathways controlling reproductive development in red clover.

Several of the identified candidate genes have potential roles in the autonomous or vernalization pathway regulating transition from vegetative to reproductive development (Table 2). A vernalization response has evolved independently many times and regulatory pathways differ among different angiosperm lineages (Bouché et al. 2017). A central gene in the vernalization response of A. thaliana is FLOWERING LOCUS C (FLC). FRIGIDA (FRI) is an activator of FLC expression, while several genes in the autonomous and vernalization pathways are repressors of FLC. FRI overrides the effect of components of the autonomous pathway and thereby creates a requirement for vernalization, and vernalization overrides this effect of FRI. Orthologs of FRI and FLC do not appear to be present in the legumes studied so far (Kim et al. 2013), but some FRI-like genes are identified. One of the candidate genes we identified, Tripr.gene5544, at locus 1_1.16, is an ortholog of M. truncatula Medtr1g103710, encoding an FRI-like protein and located in a syntenic region of chromosome 1. Tripr.gene5544 has almost 100% identity at mRNA level with an FRI-like gene in M. sativa, FRI-L, which was identified by Chao et al. (2013). Expression of MsFRI-L in transgenic A. thaliana plants resulted in late flowering phenotypes. In addition, transcript profiling of floral regulatory genes in these transgenic plants showed enhanced expression of the flowering repressor FLC and decreased expression of FT, suggesting that MsFRI-L delays flowering time by regulating gene expression (Chao et al. 2013). This supports a role of Tripr.gene5544 in control of transition to reproductive development in red clover. However, as FLC appears not to be present in legumes, the molecular function of both MsFRI-L and Tripr.gene5544 remains unknown.

A candidate at locus 4_17.77 is similar to ANTAGONIST OF LIKE HETEROCHROMATIN PROTEIN 1 (ALP1), another gene that regulates FLC in A. thaliana. ALP1 is a PIF/Harbinger class transposase that has acquired a novel function in epigenetic gene regulation in the plant kingdom (Liang et al. 2015). ALP1 inhibits polycomb group (PcG)-mediated FLC silencing in A. thaliana by blocking the interaction of the core REDUCED VERNALIZATION RESPONSE 2-Polycomb repressive complex 2 (VRN2-PRC2) with some of its accessory components. PRC2 takes part in the epigenetic regulation not only of FLC but of a range of genes, e.g., the central flowering signal gene FT (Jiang et al. 2008), meaning that ALP1 could affect the timing of transition to reproduction through genes other than FLC.

Two of the loci that we identified (1_10.40 and 1_24.28) harbor a total of five genes belonging to the REM family of B3 DNA-binding domain proteins. This family comprises 45 members in A. thaliana (Romanel et al. 2009), some of which are known to be involved in the vernalization pathway, e.g., VERNALIZATION1 (VRN1). Several other REM genes are involved in inflorescence meristem identity or the development of flowers (Mantegazza et al. 2014). The REM family has undergone rapid divergence in plants (Romanel et al. 2009). For example, Verma and Bhatia (2019) identified 19 REM genes in Cicer arietinum, for which they found only three homologs in M. truncatula and none in Glycine max or A. thaliana. Hence, it is difficult to identify orthologs and to hypothesize about the possible specific roles of the REM genes that we identified near significant markers.

In addition to the flowering-related transcription factors mentioned above, we found several genes belonging to families that are more generally known to be involved in transcriptional or post-transcriptional gene regulation. At least five of the identified loci (1_0.76, 1_7.60, 1_12.58, 3_30.48 and 7_14.52) contained genes encoding proteins with putative roles in RNA processing, ribosome biogenesis and transcriptional or post-transcriptional gene silencing (two RNA-DIRECTED DNA METHYLATION (RDM) genes, a Dicer-like (DCL) gene, two DExH-box ATP-dependent RNA helicases and a ribosome production factor (RPF)). RDM and DCL genes are involved in the RNA-directed DNA methylation (RdDM) pathway of transcriptional gene silencing (TGS) (Rowley et al. 2011; Matzke and Mosher 2014; Borges and Martienssen 2015). RdDM is the major small interfering RNA (siRNA)-mediated epigenetic silencing mechanism in plants, and it has a range of biological roles (Matzke and Mosher 2014). Some RDM proteins and DExH RNA helicases can regulate flowering time through RNA silencing of genes in the autonomous pathway (Herr et al. 2006; Greenberg et al. 2011), and some RPF genes can affect flowering time or stem length (Weis et al. 2015; Maekawa et al. 2018; Choi et al. 2020).

Plants have a large number of receptor-like kinases, some of which have been shown to play developmental roles (Wang et al. 2008). One of the identified loci (6_21.88) includes two genes with some similarity to the cell wall-associated receptor-like kinase FERONIA (FER), which takes part in the photoperiodic, vernalization and autonomous signaling pathways of floral transition and in a range of other processes in A. thaliana (Wang et al. 2020a; Solis-Miranda et al. 2020). FER can also regulate cell elongation or cell wall formation (Galindo-Trigo et al. 2016; Li et al. 2017). FER belongs to the CrRLK1L subfamily of receptor-like kinases, which has 36 members in M. truncatula, but the function of the individual genes is not known (Solis-Miranda et al. 2020).

In some of the regions flanking significant markers, we found genes related to hormonal signaling. Active forms of gibberellic acid (GA) stimulate stem elongation in legumes like pea and M. truncatula (Reinecke et al. 2013; Jaudal et al. 2018), but are thought to be of lesser importance for floral transition in legumes than in A. thaliana and were found to either not affect or delay flowering in pea (Weller et al. 1997; Reinecke et al. 2013; Liew et al. 2014). Both GA and auxin have regulatory functions in cell division and subsequent cell elongation in the shoot apex and the developing stem (Serrano-Mislata and Sablowski 2018; McKim 2019, 2020). In red clover, buds on axillary branches give rise to elongating flowering stems, whereas the main shoot does not. Hence, apical dominance resulting from auxin repression of axillary bud outgrowth, via strigolactone and cytokinin (Barbier et al. 2019), must be broken.

The promoting effect of GA on both flowering and stem elongation in A. thaliana is mainly accomplished through its interaction with DELLA proteins (Hedden and Sponsel 2015; Bao et al. 2020). When bound to GA, GID1 proteins bind DELLA proteins and target them for destruction by the 26S proteasome. This removes the suppressive action of DELLA on a range of factors promoting growth and development. In addition to this proteolytic mechanism, GID1 proteins are involved in interactions between GA and DELLA that affect growth and development through non-proteolytic mechanisms (Hauvermale et al. 2014). GID1 and DELLA are also present and interact in M. truncatula (Jiao et al. 2020). One of the candidates that we identified was a gene similar to M. truncatula GIBBERELLIN-INSENSITIVE 1C (GID1C) (Wang et al. 2020b), located in locus 1_12.58. A gene similar to the INDETERMINATE DOMAIN (IDD) group of the C2H2 zinc finger protein family was located in locus 7_14.52. There are at least 19 IDD-like genes in M. truncatula (Jiao et al. 2020). IDD proteins can function in floral transition and a variety of other processes; some can compete in binding to DELLA proteins and thereby regulate growth and development, while others can regulate auxin synthesis and transport (Kumar et al. 2019). An OVATE FAMILY PROTEIN (OFP) gene was located in locus 1_1.16. OFPs are transcriptional repressors that regulate multiple aspects of plant growth and development, likely by interacting with different types of transcription factors and/or by directly regulating the expression of target genes such as Gibberellin 20 oxidase (GA20ox) (Wang et al. 2016). Some OFPs delay flowering and inhibit stem growth (Wang et al. 2016; Zhang et al. 2018). Finally, we found two cyclin-dependent kinase (CdK) inhibitors (locus 3_30.48), which can bind to CdKs and thus control cell cycle progression in interaction with abscisic acid, cytokinin and GA (Francis and Sorrell 2001).

AUXIN RESPONSE FACTORs (ARFs), of which we found one in locus 7_14.52, are transcription factors mediating auxin-induced gene expression (Die et al. 2018; Gao et al. 2019; Gomes and Scortecci 2021). One effect of auxin is its stimulation of cell elongation through transcriptional changes leading to increased cell wall acidification and extensibility (Arsuffi and Braybrook 2018; Majda and Robert 2018; Wang et al. 2020b). This is achieved partly through the upregulation of cell wall-modifying enzymes such as pectate lyase (locus 1_0.76), which is regulated by auxin, degrades pectin and thus contributes to increased cell wall extensibility. Candidate genes with roles in cell wall formation were also found. Both COBRA (COB) (locus 6_21.88) and FASCICLIN-LIKE ARABINOGALACTAN PROTEINs (FLA) (locus 7_14.52) are glycosylphosphatidylinositol (GPI)-anchored proteins (Roudier et al. 2005; Schultz et al. 2002; Huang et al. 2013; He et al. 2019). COB proteins are thought to control anisotropic cell expansion through their involvement in the orientation of cellulose microfibrils transversely to the axis of elongation (Roudier et al. 2005), while FLA proteins have roles in, for example, secondary cell wall formation. Some of our significant markers are located near genes involved in providing the building blocks for the synthesis of cellulose, hemicellulose and pectin in cell walls (a beta-fructofuranosidase, locus 1_10.40, and a UDP-arabinopyranose mutase, locus 6_21.88). Sucrose, transported from photosynthetic or storage tissues through the phloem, can be cleaved by either sucrose synthases or invertases, and the latter have been shown to be responsible for making hexoses available for cellulose synthesis, as invertase mutants, but not sucrose synthase mutants, show strongly reduced growth in both Lotus japonicus and A. thaliana (Welham et al. 2009; Barratt et al. 2009). L-arabinose, found in pectins and hemicelluloses, is derived from cell wall UDP-arabinofuranose, which is converted from cytosolic UDP-arabinopyranose by UDP-arabinopyranose mutases (Saqib et al. 2019).

Both signaling and cell wall formation depend on the trafficking of signaling molecules and of compounds used as building blocks through the cellular membrane systems. Locus 6_10.41 harbors six copies of PLEIOTROPIC DRUG RESISTANCE PROTEIN 1 (PDR1), which in petunia has been shown to be responsible for short-distance transport of strigolactone (Shiratake et al. 2019). In locus 7_14.52, we found a Sec14-like phosphatidylinositol transfer protein (SEC14L-PITP). The 35 SEC14L-PITPs in A. thaliana are associated with membrane systems and transfer different phospholipids (e.g., phosphatidylinositol) between membranes to stimulate signaling pathways leading to development and stress responses (Tejos et al. 2018; Zhou et al. 2019; Montag et al. 2020). Some SEC14L-PITPs are regulated by auxin (Tejos et al. 2018). SORTING NEXIN 1 (SNX1) (locus 1_10.40) is associated with a sorting endosome in A. thaliana and is thought to play a role in cellular trafficking (Jaillais et al. 2008). Movement of membrane-bound vesicles in the cell is guided by the cytoskeleton, consisting of microtubules and actin filaments. Among the identified candidate genes are ACTIN-RELATED PROTEIN 3/DISTORTED 1 (ARP3/DIS1), an ACTIN DEPOLYMERIZATION FACTOR (ADF) and a FORMIN-like gene (loci 3_5.42, 3_8.10 and 4_17.77, respectively). These proteins play key roles in the remodeling and function of actin (Kandasamy et al. 2004; Staiger and Blanchoin 2006; Nan et al. 2017). ARP3/DIS1 has also been shown to play a role in PIN-mediated polar auxin transport in A. thaliana root cells (Zou et al. 2016).

Finally, several other genes in the significant loci encode proteins with regulatory functions that may be involved in developmental processes, e.g., F-box/LRR proteins, protein phosphatases, ribosomal and RNA-binding proteins, zinc finger proteins, pentatricopeptide proteins, ubiquitins, transmembrane proteins, WD40-repeat proteins, calcium-binding proteins, nodulin-like proteins, myb transcription factors, and methyltransferases.

Conclusions

Performing GBS on replicate pools of phenotypic extremes in a population allowed us to identify genetic markers with significantly different allele frequencies in the two phenotypic groups, in this case red clover with early and late stem elongation. SNPs and GBS locus haplotypes were complementary, so that using both allowed us to identify a higher number of significant loci. Given the low LD in the studied population, candidates for genes controlling the trait can be assumed to be relatively close to the marker. Within ± 50 kb regions flanking significant markers, we found genes with potential roles in the vernalization, autonomous, and photoperiod regulation of floral transition, hormonal regulation of stem elongation and cell growth. These results provide a first insight into the potential genes and mechanisms controlling transition to stem elongation in a forage legume.
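The ±50 kb flanking-window search for candidate genes can be illustrated with a minimal sketch. It assumes that marker positions and gene coordinates are available as simple records on the same reference assembly; the record types, identifiers and coordinates below are hypothetical and are not taken from this study, and the overlap rule (any gene that overlaps the window counts) is an assumption rather than the procedure documented here.

```python
from dataclasses import dataclass

FLANK_BP = 50_000  # +/- 50 kb window around a significant marker


@dataclass(frozen=True)
class Gene:
    gene_id: str
    chrom: str
    start: int  # 1-based start coordinate (bp)
    end: int    # 1-based end coordinate (bp), inclusive


@dataclass(frozen=True)
class Marker:
    marker_id: str
    chrom: str
    pos: int    # marker position (bp)


def genes_near_marker(marker, genes, flank_bp=FLANK_BP):
    """Return IDs of genes on the marker's chromosome that overlap
    the interval [pos - flank_bp, pos + flank_bp] (assumed rule)."""
    window_start = marker.pos - flank_bp
    window_end = marker.pos + flank_bp
    return [
        g.gene_id
        for g in genes
        if g.chrom == marker.chrom
        and g.start <= window_end
        and g.end >= window_start
    ]


# Hypothetical example data (not from the study).
example_marker = Marker("marker_1", "chr1", 1_160_000)
example_genes = [
    Gene("geneA", "chr1", 1_120_000, 1_125_000),  # fully inside the window
    Gene("geneB", "chr1", 1_215_000, 1_220_000),  # starts beyond +50 kb
    Gene("geneC", "chr2", 1_150_000, 1_155_000),  # different chromosome
    Gene("geneD", "chr1", 1_100_000, 1_111_000),  # overlaps the window edge
]

candidates = genes_near_marker(example_marker, example_genes)
print(candidates)
```

With low LD, a narrow window of this kind keeps the candidate list short; a wider `flank_bp` would be the natural adjustment in a population where LD extends further.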

References

Arsuffi G, Braybrook SA (2018) Acid growth: an ongoing trip. J Exp Bot 69:137–146. https://doi.org/10.1093/jxb/erx390
Auzanneau J, Huyghe C, Julier B, Barre P (2007) Linkage disequilibrium in synthetic varieties of perennial ryegrass. Theor Appl Genet 115:837–847. https://doi.org/10.1007/s00122-007-0612-3
Bao S, Hua C, Shen L, Yu H (2020) New insights into gibberellin signalling in regulating flowering in Arabidopsis. J Integr Plant Biol 62:118–131. https://doi.org/10.1111/jipb.12892
Barbier FF, Dun EA, Kerr SC, Chabikwa TG, Beveridge CA (2019) An update on the signals controlling shoot branching. Trends Plant Sci 24:220
Barratt DHP, Derbyshire P, Findlay K, Pike M, Wellner N, Lunn J, Feil R, Simpson C, Maule AJ, Smith AM (2009) Normal growth of Arabidopsis requires cytosolic invertase but not sucrose synthase. PNAS 106:13124–13129. https://doi.org/10.1073/pnas.0900689106
Borges F, Martienssen RA (2015) The expanding world of small RNAs in plants. Nature Rev Mol Cell Biol 16:727. https://doi.org/10.1038/nrm4085
Bouché F, Woods DP, Amasino RM (2017) Winter memory throughout the plant kingdom: different paths to flowering. Plant Physiol 173:27–35. https://doi.org/10.1104/pp.16.01322
Byrne S, Czaban A, Studer B, Panitz F, Bendixen C, Asp T (2013) Genome wide allele frequency fingerprints (GWAFFs) of populations via genotyping by sequencing. PLoS ONE 8:e57438
Chao Y, Yang Q, Kang J, Zhang T, Sun Y (2013) Expression of the alfalfa FRIGIDA-Like Gene, MsFRI-L delays flowering time in transgenic Arabidopsis thaliana. Mol Biol Rep 40:2083–2090. https://doi.org/10.1007/s11033-012-2266-8
Choi I, Jeon Y, Yoo Y, Cho H-S, Pai H-S (2020) The in vivo functions of ARPF2 and ARRS1 in ribosomal RNA processing and ribosome biogenesis in Arabidopsis. J Exp Bot 71:2596–2611. https://doi.org/10.1093/jxb/eraa019
De Vega JJ, Ayling S, Hegarty M, Kudrna D, Goicoechea JL, Ergon Å, Rognli OA, Jones C, Swain M, Geurts R, Lang C, Mayer KFX, Rössner S, Yates S, Webb KJ, Donnison IS, Oldroyd GED, Wing RA, Caccamo M, Powell W, Abberton MT, Skøt L (2015) Red clover (Trifolium pratense L.) draft genome provides a platform for trait improvement. Sci Rep 5:17394
Die JV, Gil J, Millan T (2018) Genome-wide identification of the auxin response factor gene family in Cicer Arietinum. BMC Genomics 19:301. https://doi.org/10.1186/s12864-018-4695-9
Dorant Y, Benestan L, Rougemont Q, Normandeau E, Boyle B, Rochette R, Bernatchez L (2019) Comparing Poolseq, Rapture, and GBS genotyping for inferring weak population structure: the American lobster (Homarus americanus) as a case study. Ecology Evol 9:6606–6623. https://doi.org/10.1002/ece3.5240
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE (2011) A robust simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6:e19379
Ergon Å, Skøt L, Sæther VE, Rognli OA (2019) Allele frequency changes provide evidence for selection and identification of candidate loci for survival in red clover (Trifolium pratense L.). Front Plant Sci 10:718. https://doi.org/10.3389/fpls.2019.00718
Ergon Å, Solem S, Uhlen AK, Bakken AK (2016) Generative development in red clover in response to temperature and photoperiod. In: Roldan-Ruiz I, Baert J, Reheul D (eds.) Breeding in a world of scarcity. Proceedings of the 2015 meeting of the section “Forage crops and amenity grasses” of Eucarpia. Springer, 243–247
Fejer SO (1960) Response of some New Zealand pasture species to vernalization. New Zeal J Agr Res 3:656–662. https://doi.org/10.1080/00288233.1960.10427145
Ferretti L, Ramos-Onsins SE, Perez-Enciso M (2013) Population genomics from pool sequencing. Mol Ecol 22:5561–5576. https://doi.org/10.1111/mec.12522 (PMID: 24102736)
Foll M, Gaggiotti OE (2008) A genome scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics 180:977–993. https://doi.org/10.1534/genetics.108.092221
Francis D, Sorrell DA (2001) The interface between the cell cycle and plant growth regulators: a mini Review. Plant Growth Reg 33:1–12
Galindo-Trigo S, Gray JE, Smith LM (2016) Conserved roles of CrRLK1L receptor-like kinases in cell expansion and reproduction from algae to angiosperms. Front Plant Sci 7:1269. https://doi.org/10.3389/fpls.2016.01269
Gao B, Wang L, Oliver M, Chen M, Zhang J (2019) Evolution of auxin response factors 1 in plants characterized by phylogenomic synteny network analyses. Biology 9:1–22. https://doi.org/10.1101/603175
Gautier M, Foucaud J, Gharbi K, Cezard T, Galan M, Loiseau A, Thomson M, Pudlo P, Kerdelhué C, Estoup A (2013) Estimation of population allele frequencies from next-generation sequencing data: pool-versus individual-based genotyping. Mol Ecol 22:3766–3779. https://doi.org/10.1111/mec.12360
Gomes GLB, Scortecci KC (2021) Auxin and its role in plant development: structure, signalling, regulation and response mechanisms. Plant Biol 23:894–904
Greenberg MVC, Ausin I, Chan SWL, Cokus SJ, Cuperus JT, Feng S, Law JA, Chu S, Pellegrini M, Carrington JC, Jacobsen S (2011) Identification of genes required for de novo DNA methylation in Arabidopsis. Epigen 6:344–354
Hamblin M, Jannink J-L (2011) Factors affecting the power of haplotype markers in association studies. Plant Genome 4:145–153. https://doi.org/10.3835/plantgenome2011.03.0008
Hauvermale AL, Ariizumi T, Steber CM (2014) The roles of the GA receptors GID1a, GID1b, and GID1c in sly1-independent GA signaling. Plant Sign Behav 9(2):e28030. https://doi.org/10.4161/psb.28030
He J, Zhao H, Cheng Z, Ke Y, Liu J, Ma H (2019) Evolution analysis of the fasciclin-like arabinogalactan proteins in plants shows variable fasciclin-AGP domain constitutions. Int J Mol Sci. https://doi.org/10.3390/ijms20081945
Hecht V, Foucher F, Ferrándiz C, Macknight R, Navarro C, Morin J, Vardy ME, Ellis N, Beltrán JP, Rameau C, Weller JL (2005) Conservation of Arabidopsis flowering genes in model legumes. Plant Physiol 137:1420–1434
Hedden P, Sponsel V (2015) A Century of gibberellin research. J Plant Growth Regul 34:740–760
Hedrick PW (2011) Genetics of populations, 4th edn. Jones and Bartlett Publishers, Sudbury, USA, p 675
Herr AJ, Molnàr A, Jones A, Baulcombe DC (2006) Defective RNA processing enhances RNA silencing and influences flowering of Arabidopsis. PNAS 103:14994–15001. https://doi.org/10.1073/pnas.0606536103
Herrmann D, Boller B, Studer B, Widmer F, Kölliker R (2006) QTL analysis of seed yield components in red clover (Trifolium pratense L.). Theor Appl Genet 112:536–545. https://doi.org/10.1007/s00122-005-0158-1
Huang G-Q, Gong S-J, Xu W-L, Li W, Li P, Zhang C-J, Li D-D, Zheng Y, Li F-G, Li X-B (2013) A fasciclin-like arabinogalactan protein, GhFLA1, is involved in fiber initiation and elongation of cotton. Plant Physiol 161:1278–1290. https://doi.org/10.1104/pp.112.203760
Ištvánek J, Jaroš M, Křenek A, Řepková J (2014) Genome assembly and annotation for red clover (Trifolium pratense; Fabaceae). Amer J Bot 101:327–337
Jaillais Y, Fobis-Loisy I, Miège C, Gaude T (2008) Evidence for a sorting endosome in Arabidopsis root cells. Plant J 53:237–247. https://doi.org/10.1111/j.1365-313X.2007.03338.x
Jaudal M, Zhang L, Che C, Li G, Tang Y, Wen J, Mysore KS, Putterill J (2018) A SOC1-like gene MtSOC1a promotes flowering and primary stem elongation in Medicago. J Exp Bot 69:4867–4880. https://doi.org/10.1093/jxb/ery284
Jiang D, Wang Y, He Y (2008) Repression of FLOWERING LOCUS C and FLOWERING LOCUS T by the Arabidopsis Polycomb repressive complex 2 components. PLoS ONE 3:e3404
Jiao Z, Wang L, Du H, Wang Y, Wang W, Liu J, Huang J, Huang W, Ge L (2020) Genome-wide study of C2H2 zinc finger gene family in Medicago Truncatula. BMC Plant Biol 20:401. https://doi.org/10.1186/s12870-020-02619-6
Jung C-H, Wong CE, Singh MB, Bhalla PL (2012) Comparative genomic analysis of soybean flowering genes. PLoS ONE 7:e38250. https://doi.org/10.1371/journal.pone.0038250
Kandasamy MK, Deal RB, McKinney EC, Meagher RB (2004) Plant actin-related proteins. Trends Plant Sci 9:196–202. https://doi.org/10.1016/j.tplants.2004.02.004
Kawaguchi F, Kigoshi H, Nakajima A, Matsumoto Y, Uemoto Y, Fukushima M, Yoshida E, Iwamoto E, Akiyama T, Kohama N, Kobayashi E, Honda T, Oyama K, Mannen H, Sasazaki S (2018) Pool-based genome-wide association study identified novel candidate regions on BTA9 and 14 for oleic acid percentage in Japanese Black cattle. Anim Sci J 89:1060–1066. https://doi.org/10.1111/asj.13035
Kim MY, Kang YJ, Lee T, Lee S-H (2013) Divergence of flowering-related genes in three legume species. Plant Genome 6:1–12. https://doi.org/10.3835/plantgenome2013.03.0008
Kumar M, Le DT, Hwang S, Seo PJ, Kim HU (2019) Role of the INDETERMINATE DOMAIN genes in plants. Int J Mol Sci 20:2286. https://doi.org/10.3390/ijms20092286
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Subgroup (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079
Li C, Wu H-M, Cheung AY (2017) FERONIA and her pals: functions and mechanisms. Plant Physiol 171:2379–2392. https://doi.org/10.1104/pp.16.00667
Li H (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Available at arXiv:1303.3997v2 [q-bio.GN]
Liang SC, Hartwig B, Perera P, Mora-García S, de Leau E, Thornton H, Lima de Alves F, Rapsilber J, Yang S, James GV, Schneeberger K, Finnegan EJ, Turck F, Goodrich J (2015) Kicking against the PRCs – A domesticated transposase antagonises silencing mediated by polycomb group proteins and is an accessory component of polycomb repressive complex 2. PLoS Genet 11:e1005660. https://doi.org/10.1371/journal.pgen.1005660
Liew LC, Singh MB, Bhalla PL (2014) Unique and conserved features of floral evocation in legumes. J Integr Plant Biol 56:714–728. https://doi.org/10.1111/jipb.12187
Maekawa S, Ueda Y, Yanagisawa S (2018) Overexpression of a brix domain-containing ribosome biogenesis factor ARPF2 and its interactor ARRS1 causes morphological changes and lifespan extension in Arabidopsis thaliana. Front Plant Sci 9:1177. https://doi.org/10.3389/fpls.2018.01177
Majda M, Robert S (2018) The role of auxin in cell wall expansion. Int J Mol Sci 19:951. https://doi.org/10.3390/ijms19040951
Mantegazza O, Gregis V, Mendes MA, Morandini P, Alves-Ferreira M, Patreze CM, Nardeli SM, Kater MM, Colombo L (2014) Analysis of the arabidopsis REM gene family predicts functions during flower development. Ann Bot 114:1507–1515. https://doi.org/10.1093/aob/mcu124
Matzke MA, Mosher RA (2014) RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat Rev Genet 15:394–408
McKim SM (2019) How plants grow up. J Integr Plant Biol 61:257–277. https://doi.org/10.1111/jipb.12786
McKim SM (2020) Moving on up—controlling internode growth. New Phytol 226:672–678. https://doi.org/10.1111/nph.16439
Montag K, Hornbergs J, Ivanov R, Bauer P (2020) Phylogenetic analysis of plant multi-domain SEC14-like phosphatidylinositol transfer proteins and structure–function properties of PATELLIN2. Plant Mol Biol 104:665–678. https://doi.org/10.1007/s11103-020-01067-y
Nan Q, Qian D, Niu Y, He Y, Tong S, Niu Z, Ma J, Yang Y, An L, Wan D, Xiang Y (2017) Biochemical properties arising from key amino acid changes throughout evolution. Plant Cell 29:395–408. https://doi.org/10.1105/tpc.16.00690
Putterill J, Zhang AL, Yeoh CC, Balcerowicz M, Jaudal M, Gasic EV (2013) FT genes and regulation of flowering in the legume Medicago truncatula. Funct Plant Biol 40:1199–1207. https://doi.org/10.1071/FP13087
Rafalski A (2002) Applications of single nucleotide polymorphisms in crop genetics. Curr Opin Plant Biol 5:94–100
Raineri E, Ferretti L, Esteve-Codina A, Nevado B, Heath S, Pérez-Enciso M (2012) SNP calling by sequencing pooled samples. BMC Bioinf 13:239. https://doi.org/10.1186/1471-2105-13-239
Reinecke DM, Wickramarathna AD, Ozga JA, Kurepin LV, Jin AL, Good AG, Pharis RP (2013) Gibberellin 3-oxidase gene expression patterns influence gibberellin biosynthesis, growth, and development in pea. Plant Physiol 163:929–945. https://doi.org/10.1104/pp.113.225987
Romanel EAC, Schrago CG, Couñago RM, Russo CAM, Alves-Ferreira M (2009) Evolution of the B3 DNA binding superfamily: new insights into REM family gene diversification. PLoS ONE 4:e5791. https://doi.org/10.1371/journal.pone.0005791
Roudier F, Schindelman G, DeSalle R, Benfey PN (2005) The COBRA family of putative GPI-anchored proteins in Arabidopsis. A new fellowship in expansion. Plant Physiol 130:538–548. https://doi.org/10.1104/pp.007468
Rowley MJ, Avrutsky MI, Sifuentes CJ, Pereira L, Wierzbicki AT (2011) Independent chromatin binding of ARGONAUTE4 and SPT5L/KTF1 mediates transcriptional gene silencing. PLoS Genet 7:e1002120. https://doi.org/10.1371/journal.pgen.1002120
Saqib A, Scheller HV, Fredslund F, Welner DH (2019) Molecular characteristics of plant UDP-arabinopyranose mutases. Glycobiol 29:839–846. https://doi.org/10.1093/glycob/cwz067
Sato S, Isobe S, Asamizu E, Ohmido N, Kataoka R, Nakamura Y et al (2005) Comprehensive structural analysis of the genome of red clover (Trifolium pratense L.). DNA Res 12:301–364. https://doi.org/10.1093/dnares/dsi018
Schaumont D, Veeckman E, Van der Jeugt F, Haegeman A, van Glabeke S, Bawin Y, Lukasiewic J, Blugeon S, Barre P, de la O Leyva-Pérez M, Byrne S, Dawyndt P, Ruttink T (2022) Stack Mapping Anchor Points (SMAP): a versatile suite of tools for read-backed haplotyping. bioRxiv preprint, doi: https://doi.org/10.1101/2022.03.10.483555
Schultz CJ, Rumsewicz MP, Johnson KL, Jones BJ, Gaspar YM, Bacic A (2002) Using genomic resources to guide research directions. The arabinogalactan protein gene family as a test case. Plant Physiol 129:1448–1463
Serrano-Mislata A, Sablowski R (2018) The pillars of land plants: new insights into stem development. Curr Opin Plant Biol 45:11–17. https://doi.org/10.1016/j.pbi.2018.04.016
Shiratake K, Notaguchi M, Makino H, Sawai Y, Borghi L (2019) Petunia PLEIOTROPIC DRUG RESISTANCE 1 is a strigolactone short-distance transporter with long-distance outcomes. Plant Cell Physiol 60:1722–1733. https://doi.org/10.1093/pcp/pcz081
Solis-Miranda J, Fonseca-García C, Nava N, Pacheco R, Quinto C (2020) Genome-wide identification of the CrRLK1L subfamily and comparative analysis of its role in the legume-rhizobia symbiosis. Genes 11:793. https://doi.org/10.3390/genes11070793
Staiger CJ, Blanchoin L (2006) Actin dynamics: old friends with new stories. Curr Opin Plant Biol 9:554–562. https://doi.org/10.1016/j.pbi.2006.09.013
Taylor NL (1982) Stability of S alleles in a doublecross hybrid of red clover. Crop Sci 22:1222–1225
Tejos R, Rodriguez-Furlán C, Adamowski M, Sauer M, Norambuena L, Friml J (2018) PATELLINS are regulators of auxin-mediated PIN1 relocation and plant development in Arabidopsis thaliana. J Cell Sci. https://doi.org/10.1242/jcs.204198
Van Dobben WH (1964) Influence of photoperiod and temperature on the flowering of red clover. Instituut voor Biologisch en Scheikundig Onderzoek van Landbouwgewassen, Wageningen. Mededeling 241. Jaarboek 1964:77–85
Verma S, Bhatia S (2019) A comprehensive analysis of the B3 superfamily identifies tissue-specific and stress-responsive genes in chickpea (Cicer arietinum L.). 3 Biotech 9:346. https://doi.org/10.1007/s13205-019-1875-5
Vestad R (1990) Rødkløver i norsk engdyrking. Fortid og framtid, Norwegian Agricultural Research, 165–172.
Wang G, Ellendorff U, Kemp B, Mansfield JW, Forsyth A, Mitchell K, Bastas K, Liu C-M, Woods-Tör A, Zipfel C, de Wit PJGM, Jones JDG, Tör M, Thomma BPHJ (2008) A Genome-wide functional investigation into the roles of receptor-like proteins in Arabidopsis. Plant Physiol 147:503–517. https://doi.org/10.1104/pp.108.119487
Wang S, Chang Y, Ellis B (2016) Overview of OVATE FAMILY PROTEINS, a novel class of plant-specific growth regulators. Front Plant Sci 7:417. https://doi.org/10.3389/fpls.2016.00417
Wang L, Yang T, Lin Q, Wang B, Li X, Luan S, Yu F (2020a) Receptor kinase FERONIA regulates flowering time in Arabidopsis. BMC Plant Biol 20:26. https://doi.org/10.1186/s12870-019-2223-y
Wang H, Jiang H, Xu Y, Wang Y, Zhu L, Yu X, Kong F, Zhou C, Han L (2020b) Systematic analysis of gibberellin pathway components in Medicago truncatula reveals the potential application of gibberellin in biomass improvement. Int J Mol Sci 21:7180. https://doi.org/10.3390/ijms21197180
Weis BL, Palm D, Missbach S, Bohnsack MT, Schleiff E (2015) atBRX1-1 and atBRX1-2 are involved in an alternative rRNA processing pathway in Arabidopsis thaliana. RNA 21:415–425. https://doi.org/10.1261/rna.047563.114
Welham T, Pike J, Horst I, Flemetakis E, Katinakis P, Kaneko T, Sato S, Tabata S, Perry J, Parniske M, Wang TL (2009) A cytosolic invertase is required for normal growth and cell development in the model legume, Lotus japonicus. J Exp Bot 60:3353–3365. https://doi.org/10.1093/jxb/erp169
Weller JL, Ortega R (2015) Genetic control of flowering time in legumes. Front Plant Sci 6:207. https://doi.org/10.3389/fpls.2015.00207
Weller JL, Reid JB, Taylor SA, Murfet C (1997) The genetic control of flowering in pea. Trends Plant Sci 2:412–418
Wojciechowski MF, Lavin M, Sanderson MJ (2004) A phylogeny of legumes (Leguminosae) based on analysis of the plastid MATK gene resolves many well supported subclades within the family. Amer J Bot 91:1846–1862
Zhang L, Sun L, Zhang X, Zhang S, Xie D, Liang C, Huang W, Fan L, Fang Y, Chang Y (2018) OFP1 interaction with ATH1 regulates stem growth, flowering time and flower basal boundary formation in Arabidopsis. Genes 9:399. https://doi.org/10.3390/genes9080399
Zhou H, Duan H, Liu Y, Sun X, Zhao J, Lin H (2019) Patellin protein family functions in plant development and stress response. J Plant Physiol 234–235:94–97. https://doi.org/10.1016/j.jplph.2019.01.012
Zou J-J, Zheng Z-Y, Xue S, Li H-H, Wang Y-R, Le J (2016) The role of Arabidopsis actin-related protein 3 in amyloplast sedimentation and polar auxin transport in root gravitropism. J Exp Bot 67:5325–5337. https://doi.org/10.1093/jxb/erw294

Funding

Open access funding provided by Norwegian University of Life Sciences. This study was funded by the Norwegian Research Council (Project AGROPRO—Grant Agreement Number 225330).

Author information

Authors and Affiliations

Department of Plant Sciences, Faculty of Biosciences, Norwegian University of Life Sciences, P.O. Box 5003, N-1432 Ås, Norway
Åshild Ergon & Øystein W. Milvang
Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth, UK
Leif Skøt
Flanders Research Institute for Agriculture, Fisheries and Food (ILVO), Plant Sciences Unit, Caritasstraat 39, B-9090 Melle, Belgium
Tom Ruttink

Authors

Åshild Ergon
Øystein W. Milvang
Leif Skøt
Tom Ruttink

Contributions

ÅE conceived the idea and designed the experiment. ØWM conducted the growth experiment, extracted DNA, and performed initial data analysis with supervision from ÅE. TR performed SNP and haplotype calling, quality assessment, and filtering. ÅE performed data analysis and investigated candidate loci. TR and LS gave advice on analysis and interpretation of the results. ÅE wrote the manuscript and TR contributed to the writing. All authors revised the manuscript and approved the final version.

Corresponding author

Correspondence to Åshild Ergon.

Ethics declarations

Conflicts of interest

The authors have no relevant financial or non-financial interests to disclose.

Data availability

The raw sequence data split per DNA pool (three early, three late) per library preparation method (PstI, ApeKI) are available in NCBI SRA under accession number PRJNA784180. The SNP and haplotype allele frequencies per pool are deposited in the NMBU Open Research Data database (https://doi.org/10.18710/P6FYU7).

Additional information

Communicated by Bing Yang.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 18 KB)

Supplementary file2 (DOCX 18 KB)

Supplementary file3 (XLSX 36 KB)

Supplementary file4 (XLSX 24 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ergon, Å., Milvang, Ø.W., Skøt, L. et al. Identification of loci controlling timing of stem elongation in red clover using genotyping by sequencing of pooled phenotypic extremes. Mol Genet Genomics 297, 1587–1600 (2022). https://doi.org/10.1007/s00438-022-01942-x

Received: 01 March 2022
Accepted: 07 August 2022
Published: 24 August 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s00438-022-01942-x
