Transporter genes in biosynthetic gene clusters predict metabolite characteristics and siderophore activity

  1. Jillian F. Banfield2,4,7,8,9
  1. 1Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA;
  2. 2Innovative Genomics Institute, Berkeley, California 94720, USA;
  3. 3Department of Mathematics, University of California, Berkeley, California 94720, USA;
  4. 4Department of Microbiology and Immunology, Stanford University, California 94305, USA;
  5. 5Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, California 94720, USA;
  6. 6Department of Statistics, University of California, Berkeley, California 94720, USA;
  7. 7Chan Zuckerberg Biohub, San Francisco, California 94158, USA;
  8. 8Department of Environmental Science, Policy, and Management, University of California, Berkeley, California 94720, USA;
  9. 9Earth Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
  • Corresponding author: jbanfield{at}berkeley.edu
  • Abstract

    Biosynthetic gene clusters (BGCs) are operonic sets of microbial genes that synthesize specialized metabolites with diverse functions, including siderophores and antibiotics, which often require export to the extracellular environment. For this reason, genes for transport across cellular membranes are essential for the production of specialized metabolites and are often genomically colocalized with BGCs. Here, we conducted a comprehensive computational analysis of transporters associated with characterized BGCs. In addition to known exporters, in BGCs we found many importer-specific transmembrane domains that co-occur with substrate binding proteins possibly for uptake of siderophores or metabolic precursors. Machine learning models using transporter gene frequencies were predictive of known siderophore activity, molecular weights, and a measure of lipophilicity (log P) for corresponding BGC-synthesized metabolites. Transporter genes associated with BGCs were often equally or more predictive of metabolite features than biosynthetic genes. Given the importance of siderophores as pathogenicity factors, we used transporters specific for siderophore BGCs to identify both known and uncharacterized siderophore-like BGCs in genomes from metagenomes from the infant and adult gut microbiome. We find that 23% of microbial genomes from premature infant guts have siderophore-like BGCs, but only 3% of those assembled from adult gut microbiomes do. Although siderophore-like BGCs from the infant gut are predominantly associated with Enterobacteriaceae and Staphylococcus, siderophore-like BGCs can be identified from taxa in the adult gut microbiome that have rarely been recognized for siderophore production. Taken together, these results show that consideration of BGC-associated transporter genes can inform predictions of specialized metabolite structure and function.

    Footnotes

    • [Supplemental material is available for this article.]

    • Article published online before print. Article, supplemental material, and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.268169.120.

    • Freely available online through the Genome Research Open Access option.

    • Received June 30, 2020.
    • Accepted December 16, 2020.

    This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    | Table of Contents
    OPEN ACCESS ARTICLE

    Preprint Server