Next Article in Journal
Effect of CUX1 on the Proliferation of Hu Sheep Dermal Papilla Cells and on the Wnt/β-Catenin Signaling Pathway
Previous Article in Journal
An Efficient Feature Selection Algorithm for Gene Families Using NMF and ReliefF
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Communication

Genome-Wide Identification and Evolutionary Analyses of SrfA Operon Genes in Bacillus

1
State Key Laboratory of Enhanced Oil Recovery, PetroChina Research Institute of Petroleum Exploration & Development, Beijing 100083, China
2
College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing 210095, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2023, 14(2), 422; https://doi.org/10.3390/genes14020422
Submission received: 18 November 2022 / Revised: 21 January 2023 / Accepted: 3 February 2023 / Published: 7 February 2023
(This article belongs to the Section Microbial Genetics and Genomics)

Abstract

:
A variety of secondary metabolites contributing to plant growth are synthesized by bacterial nonribosomal peptide synthases (NRPSs). Among them, the NRPS biosynthesis of surfactin is regulated by the SrfA operon. To explore the molecular mechanism for the diversity of surfactins produced by bacteria within the genus Bacillus, we performed a genome-wide identification study focused on three critical genes of the SrfA operon—SrfAA, SrfAB and SrfAC—from 999 Bacillus genomes (belonging to 47 species). Gene family clustering indicated the three genes can be divided into 66 orthologous groups (gene families), of which a majority comprised members of multiple genes (e.g., OG0000009 had members of all three SrfAA, SrfAB and SrfAC genes), indicating high sequence similarity among the three genes. Phylogenetic analyses also found that none of the three genes formed monophyletic groups, but were usually arranged in a mixed manner, suggesting the close evolutionary relationship among the three genes. Considering the module structure of the three genes, we propose that self-duplication, especially tandem duplications, might have contributed to the initial establishment of the entire SrfA operon, and further gene fusion and recombination as well as accumulated mutations might have continuously shaped the different functional roles of SrfAA, SrfAB and SrfAC. Overall, this study provides novel insight into metabolic gene clusters and operon evolution in bacteria.

1. Introduction

A nonribosomal peptide synthase (NRPS) is a multifunctional mega-enzyme that usually includes many modules and functions to assemble the peptide backbones of structurally diverse and biologically active natural products. It is well known that surfactin is one of the most important biosurfactants; it has the ability to efficiently decrease the surface tension of water and has good heat endurance and salt tolerance, meaning that it has great potential for use in enhanced oil recovery (EOR) [1,2,3]. The SrfA operon, consisting of four adjacent genes, SrfAA, SrfAB, SrfAC and SrfAD, is required for the production of surfactin [4]. Among these four genes, SrfAA and SrfAB are both composed of three functional modules, and SrfAC consists of only one module. Each module is comprised of an adenylation (A) domain, a thiolation (T) domain and a condensation (C) domain [5,6], and has the ability to incorporate a specific amino acid into a peptide backbone, so the order of the modules in assembly lines is always in accordance with the product peptide sequence. SrfAD contains a thioesterase (TE) domain, and is responsible for catalyzing the cyclization of peptide chains. Additionally, a few single-module NRPS-like enzymes that lack the important C domain have been reported in fungi and bacteria [7]. The A domain, C domain and T domain in the functional module have been associated with selective substrate activation, peptide bond formation and substrate shuttling in activation sites, respectively. They are vital for biosynthesis pathways, and the TE domain alone has been shown to be responsible for the release of compounds [8]. Thus, the SrfAA, SrfAB and SrfAC genes are considered as critical components in the SrfA operon.
Bacillus bacteria, known as a remarkable source of bioactive lipopeptides, are widely applied in the biosynthesis of a wealth of diverse compounds, including antibiotics, biosurfactants and antitumor agents via gene-cluster mining [9,10,11,12]. Bacillus bacteria produce a diversity of surfactins; however, it remains poorly understood how such a diversity is generated. In this study, we performed genome-wide identification of SrfAA, SrfAB and SrfAC genes from 999 Bacillus genomes, searched for conserved motifs in whole genes and conducted phylogenetic analyses to clarify the gene fusion of SrfAA, SrfAB and SrfAC genes in the SrfA operon, and proposed that self-duplication within the operon has been a driving force in the formation of the SrfA operon. Overall, our findings provide evidence that self-duplication could be an important mechanism in the formation of metabolic gene clusters and operon evolution in bacteria.

2. Materials and Methods

2.1. Data Source

A total of 999 Bacillus genomes (belonging to 47 species), including sequences and annotation files, were downloaded from the NCBI database (https://www.ncbi.nlm.nih.gov/, accessed on 7 June 2022). All data sources are shown in Table S1.

2.2. Genome-Wide Identification of SrfAA, SrfAB and SrfAC Genes

SrfAA.hmm was built using hundreds of representative SrfA gene sequences from various bacteria downloaded from the NCBI database, using a hidden Markov model build (HMMbuild). SrfAB.hmm and SrfAC.hmm were built in the same way (Supplementary Datasets S1–S6). Briefly, a two-step strategy was adopted to identify SrfAA genes. The first step was to perform an hmmsearch for protein sequences in each genome, with SrfAA.hmm as a query. Then, the remaining sequences were used to conduct a Blast search against representative SrfAA genes, using the amino acid sequences of every genome as a query for potential SrfAA genes. The threshold expectation value was set to 10−4 for the BLAST search. The same strategy was adopted for the identification of SrfAB and SrfAC genes.

2.3. Classification of Orthologous Genes and Phylogenetic Analysis

Orthologous genes encoded in 999 Bacillus genomes were classified using OrthoFinder (version 2.2.7) [13] with default parameters. SrfAA, SrfAB and SrfAC genes were selected from orthologous groups (OGs) containing SrfAA, SrfAB and SrfAC genes. Amino acid sequences of target genes were used for multiple alignments using ClustalW integrated in MEGA 7.0 with default parameter settings [14]. Overly divergent sequences were removed to prevent interference with the alignments and subsequent phylogenetic inference. Then, the resulting alignments were manually corrected using MEGA 7.0 for further improvement. Phylogenetic analyses were conducted using IQ-TREE and the maximum likelihood method [15]. The best-fit model was estimated using ModelFinder [16]. Branch support values were assessed using UFBoot2 tests [17]. The scale bar in the model indicates genetic distance.

2.4. Conserved Motif Analysis

The conserved protein motifs in the whole SrfAA, SrfAB and SrfAC genes were analyzed using Multiple Expectation Maximization for Motif Elicitation (MEME) and WebLogo with the default settings [18,19].

3. Results

3.1. Genome-Wide Identificaiton and Orthology Classification of SrfAA, SrfAB and SrfAC Genes

Altogether, 4680 SrfAA genes, 3522 SrfAB genes and 3558 SrfAC genes were identified from 999 Bacillus genomes using OrthoFinder. They were scattered across 66 orthologous groups (OGs) (Table 1), indicating that these genes can be classified into 66 evolutionary gene families and suggesting a great diversity within the SrfA operon. Different OGs contained varying numbers of genes, with two OGs (OG0000009 and OG0000006) accounting for nearly 50% (5864/11,760) of all identified genes; the 18 largest OGs contained 98.3% of the genes, and the remaining 48 OGs contained fewer than 15 genes each. SrfAA, SrfAB and SrfAC genes were distributed in 49, 45 and 35 OGs, respectively. Of the 66 OGs, 39 contained more than one gene type; 21 OGs contained SrfAA, SrfAB and SrfAC genes, 11 contained only SrfAA and SrfAB genes, 4 contained only SrfAA and SrfAC genes, and 3 contained only SrfAB and SrfAC genes (Supplementary Table S1). Such mixed compositions of genes suggest high sequence similarity between SrfAA, SrfAB and SrfAC genes, and implies potential intraspecies recombination within the same SrfA operon.
Among the OGs containing mixtures of gene members, the gene compositions were usually not balanced; sometimes SrfAA dominated the others, e.g., in OG0000009 and OG0000878, and sometimes SrfAB outnumbered the others, e.g., in OG0000006 and OG0000736. The largest family, OG0000009, had 1867 copies of SrfAA and 1269 copies of SrfAB, but had only 9 copies of SrfAC. Such discrepancy of gene numbers within OGs may suggest differential degrees of duplication events among species and/or specificity of different gene families.

3.2. Phylogenetic Analysis and Gene Duplications of SrfA Operon

Phylogenetic analyses were conducted on the 17 largest OGs containing members of all three gene families (OG0004815 was excluded because it had no SrfAC genes). In each of the constructed phylogenetic trees, it was observed that SrfAA, SrfAB and SrfAC genes often clustered with each other while seldom forming separate monophyletic branches for different gene types, suggesting a high identity of the sequences within the SrfA operon and close evolutionary relationships among different genes (Figure 1 and Supplementary Figures S1–S17). These results are in accordance with the above-mentioned high sequence similarity among the SrfA operon genes. All evidence supports a hypothesis that the complete operon is probably derived from self-duplication of only one gene, and further module fusion and recombination may have led to the formation of the complete operon.
Gene duplication events within the SrfA operon and the corresponding mechanisms were therefore analyzed. By randomly investigating 10 genomes, it was observed that a complete SrfA operon usually comprises multiple copies of SrfAA, SrfAB and SrfAC genes at a close distance (fewer than 50 bp) to each other, forming a gene cluster (Figure 2, Supplementary Table S2 and Supplementary Figures S18–S26). These physically close genes are normally the results of tandem duplications, which have likely made the biggest contribution to the establishment of a complete operon; they do not result from segmental or ectopic duplications.

3.3. Conserved Motifs in SrfAA, SrfAB and SrfAC Genes

Although the three genes of the SrfA operon have very similar gene sequence, their functional roles are different and thus they can be expected to have evolved distinct important motifs that are critical to their differentiation of functions. To explore the critical motifs, we searched for conserved motifs in the amino acid sequences of SrfAA, SrfAB and SrfAC proteins. For each gene, the five most conserved motifs were identified and displayed using WebLogo. We found that some of these conserved motifs were shared by different genes, suggesting conserved functional roles in these regions. For instance, motif 1 was shared among all three genes, motif 4 was shared between SrfAA and SrfAB. There were conserved motifs specific to one gene, for instance, motif 2 showed absolutely different sequences among three genes, while motif 3 exhibited a homology among the three genes but had a low level of sequence similarity (Figure 3). Therefore, these divergent motifs may be derived from different regions or the results of different degrees of mutations among different genes, and should indicate functional divergence of the three genes.

4. Discussion

Microorganisms produce a large variety of products, the majority of which are derived via biosynthesis assembly lines known as NRPSs. The SrfA operon comprises a number of NRPS genes producing diverse metabolic products with a broad application, e.g., surfactin can be used to break the long carbon chains found in crude oil, degrading it into small molecules and thereby improving oil recovery [20]. SrfAA, SrfAB and SrfAC genes harbor entire modules (C-A-T) or didomains (e.g., T-A, T-C), the order and specificity of which determine the amino acid sequence in final peptide products [21,22]. Using a total of 999 Bacillus genomes, we identified all SrfAA, SrfAB and SrfAC genes and performed comparative and phylogenetic analyses to understand how these genes have evolved in the genus. The usual way in which prokaryotes acquire new genes is through horizontal gene transfer [3,23,24,25]. However, this study reported another potential mechanism—self-duplication, which does not require any novel genetic material from the environment or other organisms, and thus may be considered an alternative approach for genomic innovation.
Certainly, to equip a complete SrfA operon is a complicated process, and gene duplication is just the beginning. Duplicated gene copies need to fuse to form multimodule genes like SrfAA and SrfAB, which comprise three duplicated modules each, and different genes have to accumulate mutations to achieve functional divergence, allowing them to be functionally responsible for different metabolic products. Meanwhile, tandem duplications should result in high-density gene clusters, and the multiple gene copies of a cluster likely increase the expressional efficiency and improve the yield of final products. Recent studies have provided a body of evidence supporting the opinion that gene duplication serves as an important driving force in the evolution of operons in bacteria. For instance, gene duplication within the mamAB operon in Alphaproteobacteria, the EPS operon in Gram-negative bacteria, and PE and PPE gene duplication in the ppe-pe operon in Mycobacterium genus has resulted in functional diversification and increased complexity of biological pathways [26,27,28].
Among the studied Bacillus genomes, it is true that some species had a complete SrfA operon (the Bacillus albus operon comprised all four genes, SrfAA, SrfAB, SrfAC and SrfAD), and some had incomplete SrfA operons with certain genes missing (e.g., Bacillus beveridgei). However, a similar scenario can be observed within the same Bacillus species: certain stains/accessions of one Bacillus species had the complete SrfA operon (Bacillus altitudinis strain: Ba1449), but other strains/accessions of the same species possessed incomplete operons (Bacillus altitudinis strain: SCU11). This phenomenon reflects the highly active and complicated evolution of operons, which are subject to gene duplication, loss, rearrangement and horizontal gene transfer [29].
Moreover, a broad repertoire of synthetic compounds depends on module self-function and module–module interaction among the SrfAA, SrfAB and SrfAC genes. Whole-module deletions/insertions or individual domain changes in a module are likely to re-engineer the biosynthetic pathway, resulting in the generation of novel products. A previous study reported that to reorganize the molecular structure in gene clusters and to deliver new products, gene editing was applied to swap the conserved flavodoxin-like subdomain (FSD), which still preserved the overall structure of the A domain [30]. However, it is possible that deletion of domains was followed by module skipping [31]. To date, the engineering of biosynthesis pathways and production of novel peptides needs further exploration.
It is well known that the majority of the metabolic gene clusters in bacterial genomes are organized into operons and clusters, which probably benefits the co-regulation and co-expression of adjacent genes with related functions [32]. Additionally, gene order and structure in operons are vital to the products and synthetic efficiency of protein complexes [33]. Thus, complicated metabolic networks including regulatory checkpoints and cascades are dominant in bacterial metabolism. To sum up, our findings suggest another means by which genomic innovation can be achieved, and provide novel insight into bacterial genome evolution.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/genes14020422/s1, Figure S1. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000006; Figure S2. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000009; Figure S3. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000249; Figure S4. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000642; Figure S5. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000675; Figure S6. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000736; Figure S7. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000753; Figure S8. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000878; Figure S9. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0000898; Figure S10. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0001033; Figure S11. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0001616; Figure S12. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0002144; Figure S13. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0002176; Figure S14. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0002286; Figure S15. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0003280; Figure S16. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0003557; Figure S17. Detailed phylogenetic tree of SrfAA, SrfAB and SrfAC genes in OG0004686; Figure S18. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus subtilis subsp. subtilis str. 168 (GCF_000009045) genome; Figure S19. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus pseudomycoides DSM 12442 (GCF_000161455) genome; Figure S20. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus amyloliquefaciens Y2 (GCF_000262385) genome; Figure S21. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus sp. Pc3 (GCF_000815145) genome; Figure S22. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus velezensis (GCF_003073255) genome; Figure S23. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus velezensis (GCF_003390415) genome; Figure S24. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus amyloliquefaciens (GCF_ 008824205) genome; Figure S25. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus subtilis (GCF_012647565) genome; Figure S26. Distribution of SrfAA, SrfAB and SrfAC genes in the Bacillus velezensis (GCF_016904925) genome. Table S1. A list of 999 Bacillus species genomes used in this study; Table S2. Number of SrfAA, SrfAB and SrfAC genes in 999 Bacillus species genomes. Dataset S1. Representative SrfAA gene sequences used to build HMM profiles of SrfAA genes; Dataset S2. Representative SrfAB gene sequences used to build HMM profiles of SrfAB genes; Dataset S3. Representative SrfAC gene sequences used to build HMM profiles of SrfAC genes; Dataset S4. HMM profiles of SrfAA genes built in this study; Dataset S5. HMM profiles of SrfAB genes built in this study; Dataset S6. HMM profiles of SrfAC genes used in this study.

Author Contributions

Y.X. and Q.-J.L. designed the study. J.-Y.W. and J.-Y.X. analyzed the data and wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated for this study can be found in the National Center for Biotechnology Information (NCBI) (https://www.ncbi.nlm.nih.gov/, accessed on 7 June 2022).

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Zou, A.; Liu, J.; Garamus, V.M.; Yang, Y.; Willumeit, R.; Mu, B. Micellization Activity of the Natural Lipopeptide [Glu1, Asp5] Surfactin-C15 in Aqueous Solution. J. Phys. Chem. B 2010, 114, 2712–2718. [Google Scholar] [CrossRef]
  2. Liu, Q.; Lin, J.; Wang, W.; Huang, H.; Li, S. Production of surfactin isoforms by Bacillus subtilis BS-37 and its applicability to enhanced oil recovery under laboratory conditions. Biochem. Eng. J. 2015, 93, 31–37. [Google Scholar] [CrossRef]
  3. Blondel, L.; Jones, T.E.; Extavour, C.G. Bacterial contribution to genesis of the novel germ line determinant oskar. eLife 2020, 9, e45539. [Google Scholar] [CrossRef] [PubMed]
  4. Hu, F.; Liu, Y.; Li, S. Rational strain improvement for surfactin production: Enhancing the yield and generating novel structures. Microb. Cell Fact. 2019, 18, 42. [Google Scholar] [CrossRef] [PubMed]
  5. Marahiel, M.A.; Stachelhaus, T.; Mootz, H.D. Modular Peptide Synthetases Involved in Nonribosomal Peptide Synthesis. Chem. Rev. 1997, 97, 2651–2674. [Google Scholar] [CrossRef] [PubMed]
  6. Roongsawang, N.; Washio, K.; Morikawa, M. Diversity of Nonribosomal Peptide Synthetases Involved in the Biosynthesis of Lipopeptide Biosurfactants. Int. J. Mol. Sci. 2010, 12, 141–172. [Google Scholar] [CrossRef] [PubMed]
  7. Shi, J.; Xu, X.; Liu, P.Y.; Hu, Y.L.; Zhang, B.; Jiao, R.H.; Bashiri, G.; Tan, R.X.; Ge, H.M. Discovery and biosynthesis of guanipiperazine from a NRPS-like pathway. Chem. Sci. 2021, 12, 2925–2930. [Google Scholar] [CrossRef]
  8. Marahiel, M.A. A structural model for multimodular NRPS assembly lines. Nat. Prod. Rep. 2016, 33, 136–140. [Google Scholar] [CrossRef]
  9. Kumar, A.; Johri, B.N. Antimicrobial Lipopeptides of Bacillus: Natural Weapons for Biocontrol of Plant Pathogens. In Microorganisms in Sustainable Agriculture and Biotechnology; Prakash, A., Satyanarayana, T., Johri, B.N., Eds.; Springer: Dordrecht, The Netherlands, 2012; pp. 91–111. ISBN 978-94-007-2213-2. [Google Scholar]
  10. Medema, M.H.; Kottmann, R.; Yilmaz, P.; Cummings, M.; Biggins, J.B.; Blin, K.; de Bruijn, I.; Chooi, Y.H.; Claesen, J.; Coates, R.C.; et al. Minimum Information about a Biosynthetic Gene cluster. Nat. Chem. Biol. 2015, 11, 625–631. [Google Scholar] [CrossRef]
  11. Mongkolthanaruk, W. Classification of Bacillus Beneficial Substances Related to Plants, Humans and Animals. J. Microbiol. Biotechnol. 2012, 22, 1597–1604. [Google Scholar] [CrossRef] [Green Version]
  12. Théatre, A.; Hoste, A.C.R.; Rigolet, A.; Benneceur, I.; Bechet, M.; Ongena, M.; Deleu, M.; Jacques, P. Bacillus sp.: A Remarkable Source of Bioactive Lipopeptides. In Biosurfactants for the Biobased Economy; Hausmann, R., Henkel, M., Eds.; Springer International Publishing: Cham, Switzerland, 2021; Volume 181, pp. 123–179. ISBN 978-3-031-07336-6. [Google Scholar]
  13. Emms, D.M.; Kelly, S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019, 20, 238. [Google Scholar] [CrossRef] [PubMed]
  14. Kumar, S.; Stecher, G.; Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol. Biol. Evol. 2016, 33, 1870–1874. [Google Scholar] [CrossRef] [PubMed]
  15. Nguyen, L.-T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  16. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef]
  17. Minh, B.Q.; Nguyen, M.A.T.; von Haeseler, A. Ultrafast Approximation for Phylogenetic Bootstrap. Mol. Biol. Evol. 2013, 30, 1188–1195. [Google Scholar] [CrossRef]
  18. Bailey, T.L.; Williams, N.; Misleh, C.; Li, W.W. MEME: Discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 2006, 34, W369–W373. [Google Scholar] [CrossRef] [PubMed]
  19. Crooks, G.E.; Hon, G.; Chandonia, J.-M.; Brenner, S.E. WebLogo: A Sequence Logo Generator: Figure 1. Genome Res. 2004, 14, 1188–1190. [Google Scholar] [CrossRef] [PubMed]
  20. Hu, F.; Liu, Y.; Lin, J.; Wang, W.; Yu, D.; Li, S. Acetoin modulates conformational change of surfactin: Interfacial assembly and crude oil-washing performance. Colloids Surf. B Biointerfaces 2021, 200, 111602. [Google Scholar] [CrossRef]
  21. Strieker, M.; Tanović, A.; Marahiel, M.A. Nonribosomal peptide synthetases: Structures and dynamics. Curr. Opin. Struct. Biol. 2010, 20, 234–240. [Google Scholar] [CrossRef]
  22. Marahiel, M.A. Working outside the protein-synthesis rules: Insights into non-ribosomal peptide synthesis: Insights into non-ribosomal peptide synthesis. J. Peptide Sci. 2009, 15, 799–807. [Google Scholar] [CrossRef]
  23. Michael, A.J. Evolution of biosynthetic diversity. Biochem. J. 2017, 474, 2277–2299. [Google Scholar] [CrossRef] [PubMed]
  24. Morris, P.F.; Schlosser, L.R.; Onasch, K.D.; Wittenschlaeger, T.; Austin, R.; Provart, N. Multiple Horizontal Gene Transfer Events and Domain Fusions Have Created Novel Regulatory and Metabolic Networks in the Oomycete Genome. PLoS ONE 2009, 4, e6133. [Google Scholar] [CrossRef]
  25. Ocaña-Pallarès, E.; Najle, S.R.; Scazzocchio, C.; Ruiz-Trillo, I. Reticulate evolution in eukaryotes: Origin and evolution of the nitrate assimilation pathway. PLoS Genet. 2019, 15, e1007986. [Google Scholar] [CrossRef] [PubMed]
  26. Du, H.; Zhang, W.; Zhang, W.; Zhang, W.; Pan, H.; Pan, Y.; Bazylinski, D.A.; Wu, L.-F.; Xiao, T.; Lin, W. Magnetosome Gene Duplication as an Important Driver in the Evolution of Magnetotaxis in the Alphaproteobacteria. mSystems 2019, 4, e00315-19. [Google Scholar] [CrossRef] [PubMed]
  27. Bundalovic-Torma, C.; Whitfield, G.B.; Marmont, L.S.; Howell, P.L.; Parkinson, J. A systematic pipeline for classifying bacterial operons reveals the evolutionary landscape of biofilm machineries. PLoS Comput. Biol. 2020, 16, e1007721. [Google Scholar] [CrossRef] [PubMed]
  28. Delogu, G.; Brennan, M.J.; Manganelli, R. PE and PPE Genes: A Tale of Conservation and Diversity. In Strain Variation in the Mycobacterium tuberculosis Complex: Its Role in Biology, Epidemiology and Control; Gagneux, S., Ed.; Springer International Publishing: Cham, Switzerland, 2017; Volume 1019, pp. 191–207. ISBN 978-3-319-64369-4. [Google Scholar]
  29. Nguyen, H.N.; Jain, A.; Eulenstein, O.; Friedberg, I. Tracing the ancestry of operons in bacteria. Bioinformatics 2019, 35, 2998–3004. [Google Scholar] [CrossRef]
  30. Thong, W.L.; Zhang, Y.; Zhuo, Y.; Robins, K.J.; Fyans, J.K.; Herbert, A.J.; Law, B.J.C.; Micklefield, J. Gene editing enables rapid engineering of complex antibiotic assembly lines. Nat. Commun. 2021, 12, 6872. [Google Scholar] [CrossRef]
  31. Gao, L.; Guo, J.; Fan, Y.; Ma, Z.; Lu, Z.; Zhang, C.; Zhao, H.; Bie, X. Module and individual domain deletions of NRPS to produce plipastatin derivatives in Bacillus subtilis. Microb. Cell Fact. 2018, 17, 84. [Google Scholar] [CrossRef]
  32. Bibb, M.J. Regulation of secondary metabolism in streptomycetes. Curr. Opin. Microbiol. 2005, 8, 208–215. [Google Scholar] [CrossRef]
  33. Wells, J.N.; Bergendahl, L.T.; Marsh, J.A. Operon Gene Order Is Optimized for Ordered Protein Complex Assembly. Cell Rep. 2016, 14, 679–685. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Phylogenetic trees of SrfAA, SrfAB and SrfAC genes in OG0000006 (A) and OG0000009 (B).
Figure 1. Phylogenetic trees of SrfAA, SrfAB and SrfAC genes in OG0000006 (A) and OG0000009 (B).
Genes 14 00422 g001
Figure 2. Distribution of SrfAA, SrfAB and SrfAC genes in Bacillus amyloliquefacieus UMAF6614 (GCF_001593785) genome.
Figure 2. Distribution of SrfAA, SrfAB and SrfAC genes in Bacillus amyloliquefacieus UMAF6614 (GCF_001593785) genome.
Genes 14 00422 g002
Figure 3. Conserved motifs of SrfAA, SrfAB and SrfAC proteins.
Figure 3. Conserved motifs of SrfAA, SrfAB and SrfAC proteins.
Genes 14 00422 g003
Table 1. The number of SrfAA, SrfAB and SrfAC genes in 66 orthogroups.
Table 1. The number of SrfAA, SrfAB and SrfAC genes in 66 orthogroups.
SrfAASrfABSrfACTotal
OG00000030101
OG000000685411377482739
OG00000091867124993125
OG00002011001
OG000024947320564931
OG0000642191422442
OG00006757859315452
OG00007366917795341
OG00007531429195346
OG000087856262570
OG0000898256587348
OG000103313228232
OG0001616349267310
OG00021445925128338
OG0002176496291346
OG000228619715142390
OG00026701001
OG000328075181193
OG00035571241133168
OG000468694213109
OG00047520101
OG0004815141400181
OG00102097209
OG00102520909
OG00104780505
OG001087882414
OG00109954419
OG001101336211
OG00115001001
OG001154043512
OG00119661001
OG001212847011
OG00128710202
OG00130961023
OG00131446309
OG00135851001
OG00136451708
OG00137633003
OG00138432068
OG00139580213
OG00140740426
OG00141144015
OG00144470303
OG00144494004
OG00146011034
OG00146110022
OG00146250033
OG00147433306
OG00148641506
OG00153932103
OG00155663003
OG00172200314
OG00174390022
OG00176602204
OG00180223003
OG00180422002
OG00181153003
OG00187191102
OG00193241102
OG00199982002
OG00203860101
OG00204900011
OG00206350202
OG00228520101
OG00362720011
OG00363700011
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, Y.; Wu, J.-Y.; Liu, Q.-J.; Xue, J.-Y. Genome-Wide Identification and Evolutionary Analyses of SrfA Operon Genes in Bacillus. Genes 2023, 14, 422. https://doi.org/10.3390/genes14020422

AMA Style

Xu Y, Wu J-Y, Liu Q-J, Xue J-Y. Genome-Wide Identification and Evolutionary Analyses of SrfA Operon Genes in Bacillus. Genes. 2023; 14(2):422. https://doi.org/10.3390/genes14020422

Chicago/Turabian Style

Xu, Ying, Jia-Yi Wu, Qing-Jie Liu, and Jia-Yu Xue. 2023. "Genome-Wide Identification and Evolutionary Analyses of SrfA Operon Genes in Bacillus" Genes 14, no. 2: 422. https://doi.org/10.3390/genes14020422

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop