Unusual peptide-binding proteins guide pyrroloindoline alkaloid formation in crocagin biosynthesis

Adam, Sebastian; Zheng, Dazhong; Klein, Andreas; Volz, Carsten; Mullen, William; Shirran, Sally L.; Smith, Brian O.; Kalinina, Olga V.; Müller, Rolf; Koehnke, Jesko

doi:10.1038/s41557-023-01153-w

Download PDF

Article
Open access
Published: 09 March 2023

Unusual peptide-binding proteins guide pyrroloindoline alkaloid formation in crocagin biosynthesis

Nature Chemistry volume 15, pages 560–568 (2023)Cite this article

9484 Accesses
11 Citations
49 Altmetric
Metrics details

Subjects

Abstract

Ribosomally synthesized and post-translationally modified peptide natural products have provided many highly unusual scaffolds. This includes the intriguing alkaloids crocagins, which possess a tetracyclic core structure and whose biosynthesis has remained enigmatic. Here we use in vitro experiments to demonstrate that three proteins, CgnB, CgnC and CgnE, are sufficient for the production of the hallmark tetracyclic crocagin core from the precursor peptide CgnA. The crystal structures of the homologues CgnB and CgnE reveal them to be the founding members of a peptide-binding protein family and allow us to rationalize their distinct functions. We further show that the hydrolase CgnD liberates the crocagin core scaffold, which is subsequently N-methylated by CgnL. These insights allow us to propose a biosynthetic scheme for crocagins. Bioinformatic analyses based on these data led to the discovery of related biosynthetic pathways that may provide access to a structurally diverse family of peptide-derived pyrroloindoline alkaloids.

Emergence of fractal geometries in the evolution of a metabolic enzyme

Article Open access 10 April 2024

Streptomyces umbrella toxin particles block hyphal growth of competing species

Article Open access 17 April 2024

Elucidation of genes enhancing natural product biosynthesis through co-evolution analysis

Article 12 April 2024

Main

Natural products and their derivatives have been vital for modern medicine and a main contributor to the increase in our quality of life over the past century^1,2. Alkaloids are a profoundly important class of natural products that includes prominent compounds such as morphine and strychnine. A common motif found in alkaloids is the hexahydropyrrolo[2,3-b]indole, also referred to as pyrroloindoline. Pyrroloindoline-containing natural products (Fig. 1a) possess diverse and potent bioactivities, but have presented formidable synthetic challenges^3,4,5. A better understanding of their biosynthesis may advance synthetic efforts or provide semi-synthetic routes to access these valuable molecules. Pyrroloindolines are biosynthetically derived from tryptophan, and different enzymatic routes have been reported. They all result in bond formation between the tryptophan’s α-amino group and its indoles’ C2/δ-carbon: in dibrevianamide⁶, naseseazine⁶ and (−)-ditryptophenaline biosynthesis⁷, P450 enzymes create indole radicals (Supplementary Fig. 1).

**Fig. 1: Natural products containing pyrroloindoline moieties and the crocagin BGC.**

In the case of okaramine⁸ or himastatin⁹, a flavin-dependent monooxygenase or P450 enzyme, respectively, has been proposed to catalyse the 2,3-epoxidation of the indole, which, after ring-opening of the epoxide, leads to a hydroxylated indole C3 and the formation of the pyrroloindoline (Supplementary Fig. 1). Finally, enzymatic methylation (cyclic ditryptophan¹⁰, physostigmine¹¹), prenylation (ardeemin¹², acetylaszonalenin¹³) or farnesylation (drimentine¹⁴) of the indole C3 can trigger pyrroloindoline formation (Supplementary Fig. 1).

Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a rapidly growing, major class of natural products and can possess strong and varied bioactivities^15,16. Their biosynthesis begins with the ribosomal expression of a short gene to yield a precursor peptide that usually consists of an N-terminal ‘leader’ and a C-terminal ‘core’ peptide. While the leader peptide is important for substrate recognition by the enzyme(s) installing class-defining modifications, the core peptide is ultimately converted into the final product and can be extensively modified^17,18,19. Few pyrroloindoline-containing RiPP classes have been identified to date (for example, omphalotins²⁰, ComX²¹ and kawaguchipeptin A²²), and in each case, a prenyltransferase appears to trigger pyrroloindoline formation as described above²³. Crocagins (1 and 2; Fig. 1b)²⁴ are very small, intriguing RiPPs that possess a tetracyclic ring system, which consists of a pyrroloindoline fused to a tetrahydropyrimidinone moiety. This rare scaffold, identified during metabolome mining, is thus far unique among RiPPs²⁴. While the biosynthetic gene cluster (BGC) for 1 and 2 could be identified²⁴, it does not encode for an enzyme previously associated with pyrroloindoline formation. The steps involved in the biosynthesis of 1 and 2 from the precursor peptide CgnA are unknown, as are the functions of the predicted proteins found in their BGC (Fig. 1c) with the exception of the carbamoyltransferase CgnI²⁴. It has been demonstrated that 1 inhibits the highly conserved, key global regulator of the bacterial transcriptome, CsrA, in vitro²⁴. CsrA coordinates the expression of a variety of proteins including specific virulence factors as well as biofilm formation and represents a novel target^25,26,27.

We were intrigued by the novelty of the scaffold of 1 and 2 and the possibility of establishing an unprecedented biosynthetic route to peptide-derived pyrroloindoline alkaloids. Here, we demonstrate that four proteins, CgnB, CgnC, CgnD and CgnE, are sufficient to produce the core scaffold of 1 and 2 in vitro and propose a biosynthetic scheme for 1 and 2. We identify CgnB and CgnE as unusual peptide-binding proteins, which enabled genome mining that led to the in silico discovery of putative related and unrelated biosynthetic pathways.

Results

In vitro biosynthesis of the tetracyclic crocagin core

It had been reported that the protein CgnB binds to the precursor peptide CgnA, and that a cgnB knockout abolished the production of crocagins A and B²⁴. Interestingly, a knockout of the cgnB homologue cgnE (38% sequence identity; Supplementary Fig. 2) resulted in a reduction of crocagin production and the detection of two new species: one had a mass suggestive of 1 unmodified at the Trp Cβ position, while the other agreed with a des-N-methyl, des-carbamoyl 1 (Supplementary Fig. 2).

To begin our investigation of the biosynthesis of 1 and 2, we decided to focus on the operon containing cgnA, which encodes the precursor peptide CgnA, and putative biosynthetic gene products CgnB, CgnC, CgnD, and CgnE. CgnC was predicted to be a dioxygenase, and the putative esterase CgnD was a likely candidate for leader peptide removal. We thus initially focused on CgnB, CgnC and CgnE. Incubation of CgnA with either CgnB, CgnE or both did not result in any observable product peak (Fig. 2a and Supplementary Fig. 3). Expression of CgnC required the use of an alternative, upstream start codon (Supplementary Fig. 4), and incubation with CgnA did not lead to observable product formation, unless FeCl₂, ascorbic acid and α-ketoglutarate were added (Fig. 2a and Supplementary Fig. 5). The addition of these cofactors, always implied in CgnC reactions from this point forward, yielded a new peak with a mass suggestive of hydroxylation (+16 Da), but the predominant peak remained unmodified CgnA (Fig. 2a and Supplementary Fig. 5). The addition of CgnE to CgnA–CgnC led to the formation of an additional, minor peak with a mass shift of +28 Da, which could indicate two oxidation events, for example, oxidation to ketones at the Cβ atoms of the Tyr and Trp residues of the core peptides (Fig. 2a and Supplementary Fig. 5). Intriguingly, the addition of the CgnE homologue CgnB led to a different result, with two additional product peaks detected when compared to CgnA–CgnE or CgnA–CgnC reactions (Fig. 2a and Supplementary Fig. 5). One of these had a mass shift of +14 Da, which could indicate the formation of the tetracyclic core structure (3a; Fig. 2a and Supplementary Fig. 5). These data agree with the knockout studies, where the inactivation of cgnE led to reduced product formation, while the inactivation of cgnB abolished the production of 1 and 2. The addition of both CgnB and CgnE to CgnA–CgnC reactions had a profound effect and led to the formation of a major product with high turnover (Fig. 2a and Supplementary Fig. 5). This product, 3a, had a mass shift of +14 Da, the same retention time as that observed for the +14 Da product of CgnA–CgnB–CgnC reactions, and lost its UV absorption at 280 nm, implying loss of aromaticity of the Trp side chain (Supplementary Fig. 5).

**Fig. 2: In vitro biosynthesis of the crocagin core structure.**

Tandem mass spectrometry (MS²) of the +16 Da species suggested hydroxylation of the core peptide’s Trp (Supplementary Fig. 6). MS² of the +28 Da species was not possible due to low abundance, but MS² of the +30 Da peak placed +14 Da on the core peptide’s Tyr (keto group) and +16 Da on the core peptide’s Trp residue (hydroxylation; Supplementary Fig. 6). We were able to identify a low abundance +32 Da species, and, as expected, MS² analysis placed hydroxylations on the core peptide’s Tyr and Trp (Supplementary Fig. 6). All mass errors can be found in Supplementary Table 1. A minor product with a mass shift of +12 Da, 3b, was also observed. We reasoned that this could be an intermediate towards 2, with a mass of –2 Da compared to crocagin A due to its additional double bond (Supplementary Fig. 5). NMR analysis of 3a confirmed the presence of the hexahydropyrrolo[2,3-b]indole fused to a tetrahydropyrimidinone moiety (Fig. 2b and Supplementary Table 2) and thus the successful in vitro biosynthesis of the tetracyclic crocagin core.

CgnB and CgnE may have evolved from Zn²⁺-dependent aminopeptidases

We were very intrigued by the different behaviour of the homologues CgnB and CgnE in biochemical assays. CgnB and CgnE did not show similarity to any family in the Pfam database, and a sequence-based search using the HHpred server²⁸ returned only three hits with more than 50% sequence coverage, which were aminopeptidases (Pfam family Peptidase_M29). We did not observe aminopeptidase activity, and the sequence identities of CgnB and CgnE with the aminopeptidases were <25% (Supplementary Fig. 7). It was thus apparent that structural information would be valuable to better characterize these two proteins. First, the structure of CgnE was determined to 2.0 Å (Fig. 3a and Supplementary Fig. 8). Details for all reported protein structures can be found in the experimental section, and all data collection and refinement statistics can be found in Supplementary Table 3.

**Fig. 3: Structures of CgnB and CgnE, affinity measurements and genome mining.**

CgnB was recalcitrant to crystallization until a slight molar excess of CgnA was added prior to crystallization trials. We determined the CgnB structure to 2.3 Å resolution but observed no electron density for CgnA, so the peptide may simply have acted as an additive to promote crystallization. As expected for homologous proteins, the Cα root-mean-square deviation (r.m.s.d.) between the structures of CgnB and CgnE was only 1.2 Å over 284 residues (Supplementary Fig. 9). Curiously, CgnB contained a dinuclear metal centre that was partially occupied. The CheckMyMetal validation server²⁹ suggested that either Zn²⁺ or Co²⁺ could be bound and inductively coupled plasma mass spectrometry (ICP-MS) identified the metals as zinc, with only traces of cobalt found (Supplementary Fig. 10). The binding sites comprised CgnB residues Glu161, Glu252 (shared between Zn²⁺ 1 and 2), Glu274 and His280 (Zn²⁺ 1) and His291 and Asp293 (Zn²⁺ 2; Fig. 3b and Supplementary Fig. 10). In CgnE, the metal binding site of Zn²⁺ 1 was disrupted by mutations in two metal-coordinating residues (CgnB numbering), Glu274Lys and His280Tyr, and no density for metal ions was observed in the CgnE structure (Fig. 3b and Supplementary Fig. 10). To exclude the possibility that CgnE may bind Fe²⁺, which is added as a CgnC cofactor to the reactions and could be oxidized and released from CgnE before structure determination, we incubated the protein with a large excess of fresh FeCl₂ and processed the sample immediately for analysis by ICP-MS. No iron could be detected (Supplementary Fig. 10).

A search for structural homologues of CgnB and CgnE using the DALI server³⁰ returned only three viable hits, which concurred with the sequence-based results obtained from HHpred²⁸: aminopeptidase T (Protein Data Bank (PDB) no. 2ayi), aminopeptidase PepS (PDB no. 4ics) and aminopeptidase AMPS (PDB no. 1zjc), which are structural homologues (Cα r.m.s.d. 4ics/1zjc = 2.2 Å; Cα r.m.s.d. 4ics/2ayi = 4.3 Å over >90% of residues; Supplementary Fig. 10). The key difference between CgnB or CgnE and the aminopeptidases was found in the N-terminal portions of the proteins (Supplementary Fig. 11). The aminopeptidases form an extensive dimer interface, and the residues involved in its formation are largely missing in CgnB and CgnE, which rationalizes their monomeric state in the crystals. All three aminopeptidases harbour dinuclear metal centres in the same position as CgnB and contain either Zn²⁺ or Co²⁺. To test the effect of different metal ions on the reaction among CgnA, CgnB, CgnC and CgnE, we dialysed CgnB extensively with ethylenediaminetetraacetic acid and reduced the Zn²⁺ concentration to background (buffer) levels (Supplementary Fig. 10). Unfortunately, the protein denatured in the process, and activity could not be restored.

Inference of protein complex formation between CgnB, CgnC and CgnE

The requirement for all three components—CgnB, CgnC and CgnE—to be present for efficient catalysis of the formation of 3a or 3b implied that the proteins may interact. We probed each protein’s ability to bind the substrate CgnA by microscale thermophoresis (MST) and found that CgnB and CgnE, but not CgnC, were able to bind CgnA (Fig. 3c and Supplementary Fig. 12). The equilibrium dissociation constant (K_D) of 10.5 µM for the CgnA–CgnB interaction is approximately one order of magnitude lower than that measured by surface plasmon resonance²⁴ and five times weaker than the one we determined for the CgnA–CgnE interaction (1.8 µM; Fig. 3c). When we investigated the binding of the homologues CgnB and CgnE to CgnC, we found the CgnE–CgnC K_D to be ~1 µM (Fig. 3c and Supplementary Fig. 13). The CgnB–CgnC K_D was weaker (~6 µM), and no interaction could be detected for CgnB–CgnE (Fig. 3c and Supplementary Fig. 13). We wondered if the presence of CgnA would change this behaviour and determined the K_D of CgnC to CgnB in the presence of CgnA (400 µM). The effect was an ~60-fold improved affinity to ~100 nM (Fig. 3c and Supplementary Fig. 13). Addition of CgnA (400 µM) to CgnC–CgnE had very little effect (2-fold increase) on the CgnC–CgnE affinity (Fig. 3c and Supplementary Fig. 14). It therefore appeared that CgnB and CgnE might be recruiting the substrate CgnA to CgnC. When we analysed the binding of CgnA to CgnB in the presence of a large excess of CgnE (250 µM), we did not observe binding (Fig. 3c and Supplementary Fig. 13). This implied that CgnB and CgnE cannot bind simultaneously to the same substrate molecule. To support our MST data, we attempted pull-down assays. While these confirmed the interactions between CgnA and CgnB or CgnE, we could not find conditions that prevented CgnC from binding non-specifically to resin materials. The same was observed for surface plasmon resonance measurements (Supplementary Fig. 13).

Size-exclusion chromatography data suggested that CgnB and CgnE were monomers in solution (as observed in the crystal structures), while CgnC was a dimer (Supplementary Fig. 13). Unfortunately, putative CgnB–CgnC–CgnE complexes were unstable in size-exclusion chromatography, and experiments to stabilize them for structure elucidation and detailed mechanistic studies are currently underway. The question we then asked was whether binding of CgnB and CgnE would occur at the same binding site on CgnC, as might be expected for homologues. We thus determined the affinity of CgnB–CgnC in the presence of a constant, large excess (250 µM) of CgnE. We obtained a K_D of 28 µM, which was approximately five times weaker than in the absence of CgnE, but these data still suggested that CgnB and CgnE can concurrently bind to CgnC (Supplementary Fig. 13). Taken together, our data imply that a complex may form during crocagin biosynthesis. If CgnC bound CgnB and CgnE at the same time, as MST measurements suggest it could, the dimeric nature of CgnC would likely result in a complex with the stoichiometry CgnB₂/CgnC₂/CgnE₂ and have the ability to bind up to four CgnA molecules.

Genome mining using CgnB and CgnE reveals crocagin-like BGCs

The unique features of CgnB and CgnE prompted us to investigate their evolution. Sequence similarity searches in bacterial whole-genome sequencing contigs revealed that CgnB and CgnE homologues can be found in ~100 genomes from Actinobacteria, Cyanobacteria, Betaproteobacteria, Gammaproteobacteria and Deltaproteobacteria. In all cases, the similarity stretched over almost the entire protein length, and the homologues did not contain any additional domains, which makes CgnB and CgnE the founding members of a new protein family.

In all genomes containing only one copy of a CgnB or CgnE homologue, the metal binding site was intact, despite an overall low sequence identity (between 21% and 42%). Finding two CgnB and/or CgnE homologues in the same genomic contig was much rarer (Supplementary Table 4), and in all cases, these two copies lie at a distance <2.1 kbp. To investigate the evolution of the duplicated CgnB and CgnE homologues, we constructed a bootstrapped maximum likelihood phylogenetic tree (Supplementary Fig. 14 and Supplementary Information). Low bootstrap values did not allow many internal branches to be resolved with certainty, but it appears probable that three to five duplication events occurred during the evolution of this protein family. In every case, the second copy of the protein acquired deleterious mutations in the metal binding site that were different in each case. Mutations designed to restore metal binding in CgnE or disrupt metal binding in CgnB led to insoluble protein for all mutations tested. It is remarkable that at least three duplication events happened independently, which may be a sign of strong evolutionary pressure to acquire a second, mutated copy of the protein.

Genome mining for novel RiPPs using instances of a single copy of CgnB or CgnE with the program RiPPER³¹ resulted in the discovery of a number of putative precursor peptides (Supplementary Data), which were used to create a sequence similarity network (Supplementary Fig. 15). In genomic contigs harbouring two CgnB or CgnE copies, we searched for a potential precursor and were able to find a candidate within <1.5 kbp in all cases (Supplementary Table 4). Aligning these candidate peptides at their C termini revealed a strong conservation pattern for the Tyr and Trp positions of the core peptide (Ile-Tyr-Trp for crocagin; Supplementary Table 4). The Ile position is variable, and we found an Ile to Gly mutation to be a very good substrate for CgnB, CgnC or CgnE (Supplementary Fig. 16). Two other single leader peptide point mutations in highly conserved positions also had no effect on processing (Lys2Glu and Gly17Phe; Supplementary Fig. 16).

The leader peptide is enriched with positively charged amino acids, and both CgnB and CgnE show a large, strongly negatively charged patch near the possible binding site (Supplementary Fig. 16). Of the six positively charged residues in the 18-amino-acid-long CgnA leader peptide (CgnA^LP), three are found in the N-terminal half, while the other three are close to the core peptide (Fig. 3d). Deleting residues 1–5 of CgnA (and thus two lysines; Fig. 3d) had no effect on turnover, but removing residues 1–10, which include a third lysine, abolished processing (Supplementary Fig. 17). To analyse the importance of the positively charged residues, we reacted a CgnA variant with the three N-terminal lysines mutated to alanine (CgnA^N3A; Lys2Ala, Lys3Ala and Lys6Ala) with CgnBCgnC–CgnE and found the reaction to be somewhat impaired (Supplementary Fig. 17). Exchanging the three C-terminal positively charged residues to alanine (CgnA^C3A; Arg13Ala, Lys15Ala and Lys16Ala), on the other hand, severely reduced product formation (Supplementary Fig. 17). These data imply that essential CgnA residues for substrate recognition are located after position five and include four positively charged residues.

The structures of the putative crocagin-like BGCs identified here differ in all genomes, but one can identify five key architectures (Fig. 3e). The crocagin BGC is an outlier, and intriguingly all other clusters contain a predicted glycosyltransferase instead of the carbamoyltransferase present in the crocagin BGC. This suggests that crocagin-like natural products may be glycosylated, which has thus far been a very rare modification in RiPPs. Some BGCs contain a predicted decarboxylase, which may remove the negative charge of crocagin-like molecules and thus improve membrane permeability in addition to providing biosynthetic access to new structural variants.

Protease CgnD releases the crocagin core scaffold

The product of the CgnB–CgnC–CgnE reaction, 3a, contained the crocagin tetracyclic ring system, and we thus suspected that proteolytic removal of the leader peptide by CgnD might be possible. When CgnD was added to the product of CgnA–CgnB–CgnC–CgnE reactions containing 3a and 3b, we observed the slow production of 4a ([M]⁺ calculated, 495.2238; observed, 495.2239; mass error (Δ) = 0.2 ppm), which represents the crocagin core scaffold (des-methyl and des-carbamoyl) and also corresponds to the mass produced by the cgnE knockout (Fig. 4a and Supplementary Figs. 2 and 18). Interestingly, we again observed a minor peak of –2 Da (4b; [M]⁺ calculated, 493.2082; observed, 493.2083; Δ = 0.2 ppm), which may be the core scaffold of crocagin B (Supplementary Fig. 18). Slow proteolytic processing of modified precursor peptides has been observed in a number of RiPP biosynthetic pathways and may be linked to biosynthetic timing³². In an attempt to accelerate the production of 4a, we replaced the Asn residue preceding the core peptide with a Lys. This mutant was a very poor substrate for CgnB–CgnC–CgnE, possibly because the introduced positive charge interfered with native substrate binding. The addition of trypsin did lead to the formation of traces of 4a (Supplementary Fig. 18). Rapid production of crocagin-like molecules will thus require the selection of alternative proteases or enzyme engineering of CgnD. To rationalize the slow turnover, we attempted to determine the affinity of CgnD for 3a, CgnA and the leader peptide, but were unable to detect binding by microscale thermophoresis (Supplementary Fig. 19). We hoped that the crystal structure of CgnD might yield additional insights and determined the structure to 2.35 Å resolution (Supplementary Fig. 20). The two protomers found in the asymmetric unit form an intricate dimer interface (Supplementary Table 6), which is unlike that of the closest structural homologues identified by a Dali server³⁰ search. Intriguingly, it appeared as if the dimer provides two antiparallel channels for substrate binding that terminated at the respective active sites (Fig. 4b). The catalytic triad comprised Asp325, His328 and Ser79, and the distance observed between the Asp325 atom Oδ2 and His328 atom Hδ2 was within the expected range (1.8 Å). The distance between His328 atom Nε2 and Ser79 atom Hγ could not be determined because a sulfate ion was bound at the active site, which led to a rotation of the Ser side chain. Removal of the sulfate in silico and subsequent rotation of the Ser side chain did not allow the distance to decrease below 2.8 Å (Supplementary Fig. 20). The resulting lack of nucleophilicity appeared a probable cause for the slow turnover and was supported by our inability to label CgnD with phenylmethylsulfonyl fluoride (Supplementary Fig. 20). This situation is reminiscent of PatA from the patellamide pathway, which also turns over very slowly³².

**Fig. 4: Processing of 3a and 3b by CgnD and CgnL.**

N-methylation of 4a and 4b is catalysed by CgnL

The final unresolved step in the biosynthesis of 1 and 2 was the N-methylation of 4a and 4b, as the carbamoyltransferase function of CgnI had been determined previously²⁴. Based on sequence alignments, CgnL, named ‘MT’ in the original crocagin publication²⁴, was the likely candidate. Indeed, incubation of 4a and 4b with CgnL and S-adenosylmethionine led to the formation of 5a ([M]⁺ calculated, 509.2395; observed, 509.2391; Δ = –0.8 ppm) and the –2 Da 5b ([M]⁺ calculated, 507.2238; observed, 507.2239; Δ = 0.2 ppm; Fig. 4c and Supplementary Fig. 21). The mass increase of +14 Da suggested methylation and thus the production of des-carbamoyl 1 and 2.

The crystal structure of CgnL in complex with S-adenosylhomocysteine was determined to a resolution of 1.96 Å and appeared monomeric, which was consistent with the closest structural homologues of CgnL identified in a Dali search³⁰. The overall shape of CgnL was typical for class I S-adenosylmethionine-dependent methyltransferases with a Rossman fold (Fig. 4d and Supplementary Fig. 22). The electron density for S-adenosylhomocysteine was unambiguous as expected for a well-coordinated ligand (Supplementary Fig. 22). Interestingly, the closest structural homologues with a defined function are involved in natural product biosynthetic pathways: CcbJ (PDB no. 4hh4) N-methylates the antibiotic celesticetin³³, while KedS8 (PDB no. 5bsz) N-methylates the chromophore of the anticancer chromoprotein kedaricidin³⁴.

A proposed mechanism for the biosynthesis of 3a and 3b

The precise molecular mechanism of the initial step in crocagin biosynthesis, leading to the formation of 3a and 3b, will require further study. We favour the following sequence of events based on the knockout, biochemical and binding data (Fig. 5a): CgnA binds to CgnE, which may either be part of a CgnC–CgnE complex, or bind to CgnC after binding CgnA. We found the binding of CgnE to the CgnA^LP to be almost 300 times weaker than binding to CgnA (K_D = 514 µM; Supplementary Fig. 23). The main driver for binding thus appears to be the core peptide, and it is tempting to suggest that CgnE’s role may be to recruit the substrate peptide to CgnC and present the core peptide in a particular way that enables or accelerates correctly timed catalysis. This would agree with our knockout and in vitro biochemical data, which show that the presence of CgnE without CgnB does not enable formation of the crocagin core structure. When we tested the mutant CgnA Y20F, we could only detect trace amounts of a +14 Da species, and the major product peak had a mass increase of 16 Da. (Supplementary Fig. 23). Unlike the +16 Da product of a CgnA–CgnC–CgnE reaction (Trp hydroxylation; Supplementary Fig. 6), MS² analysis of this intermediate revealed a mixture with hydroxylation of either Phe or Trp (Supplementary Fig. 24). Hydroxylation at the Tyr20 Cβ position allows CgnB to catalyse an elimination reaction via its bound Zn²⁺ (or Co²⁺) ions, which results in a double bond between the Tyr20 Cα and Cβ atoms, creating a conjugated amine. The transition state, in which the Tyr20 Cα hydrogen would possess a lowered pK_a, could be stabilized by the Tyr side chain, rationalizing why a Tyr20Phe mutation is detrimental to turnover (Supplementary Fig. 23). Tyr20 then undergoes conjugate addition with the Trp21 indole nitrogen, and subsequent proton transfer completes the first ring-forming reaction. The second ring-closure reaction between the Trp α-nitrogen and δ₁ carbon atom could then occur spontaneously to complete the formation of the tetracyclic ring system. It is unclear at which point hydroxylation of the Trp21 Cβ position occurs, but the inferred biosynthetic complex would enable fast transfer from one substrate binding site to the next. Interestingly, CgnA binding by CgnB is not dependent on the core peptide to the same extent as observed for CgnE, with an 8-fold reduction in affinity for CgnA^LP compared to CgnA (Supplementary Fig. 23).

**Fig. 5: Biosynthetic proposal leading to the formation of crocagins A (1) and B (2).**

It has been observed in a number of RiPP systems that the leader peptide can activate the processing enzymes when it is supplied in trans³⁵. This was not the case for CgnB–CgnC–CgnE reactions, where turnover of the core peptide was marginal when the leader peptide was added in trans (Supplementary Fig. 23). This suggests that a physical connection between leader and core peptide is essential for catalysis, perhaps to enable efficient transfer of the substrate between CgnE and CgnB.

Discussion

It is intriguing to see how the world of small alkaloids merges with that of ribosomal natural products. The recent discovery that the pyrroloquinoline alkaloids ammosamides are in fact RiPPs³⁶ and crocagins may just be the beginning. We propose the following biosynthetic scheme for crocagins (Fig. 5b): After expression of CgnA, the precursor peptide is processed by the proteins CgnB, CgnC and CgnE and undergoes hydroxylation at the Tyr20 and Trp21 Cβ positions, which triggers the two ring-closing reactions. It is possible that pyrroloindoline formation is the first ring-closure reaction, but we think that the seeming absence of an indole ‘activator’ (for example, C3 prenylation; Supplementary Fig. 1) and the disfavoured nature of 5-endo-trig ring-forming reactions makes it more likely that the pyrroloindoline forms second. In a small fraction of CgnA, the hydroxylated Tyr20 Cβ is oxidized to the ketone, which results in a double bond between the Tyr20 Cα and Cβ upon ring closure and thus ultimately the formation of 2. This reaction would not be catalysed by CgnB, but rather be spontaneous and slow. In our view, it is likely that the observed CgnA +28 Da peak is a shunt product and an artefact of a system missing either CgnB or CgnE. It is unclear if the CgnA +16 Da peak is a shunt product or true intermediate, because we do not observe it when CgnB and CgnE are both present. The +30 Da peak, which we observe only in reactions where 3a and 3b are formed, appears to be an intermediate towards 2, because MS² analysis placed the keto group on the core peptide’s Tyr, while the Trp is hydroxylated (Supplementary Fig. 6). The intermediates 3a and 3b are then cleaved by CgnD to release the core scaffold and make the N terminus available for methylation by CgnL. It remains unclear if N-methylation or carbamoylation occurs first, or whether both modifications are independent of each other, since the des-carbamoyl 1 can be observed after knocking out cgnI in vivo²⁴. It is also possible that carbamoylation occurs prior to leader peptide cleavage, which would offer an alternative explanation for the slow turnover of CgnD that we observed.

CgnE appears to recruit the substrate peptide CgnA to CgnC and assist in catalysis, perhaps by presenting the core peptide in a specific conformation or by ensuring appropriate timing of the hydroxylation reactions, but does not seem to possess catalytic activity itself. The CgnE homologue CgnB on the other hand appears to possess catalytic activity. Unfortunately, all attempts to obtain crystal structures of CgnA–CgnB or CgnA–CgnE complexes, which would reveal more details of the interactions, failed. A search for CgnB and CgnE in available genomic data places them in a variety of genomes that warrant further investigation. In all cases where two copies were found, they were placed in a crocagin-like BGC with a CgnC homologue, and we thus expect them to install the hallmark tetracyclic core. While these BGCs also contained a methyltransferase, likely to cap the N terminus as observed in crocagins, we were surprised by the variety of different putative modifying enzymes. These may be used for the modification of the hydroxyl group found at the Trp Cβ (glycosylation), the Trp carboxy group (decarboxylation) or the Tyr hydroxyl group. Further studies will be required to identify the natural products of these BGCs, explore their bioactivities and link predicted open reading frames to biosynthetic functions. Interestingly, the gene annotated as G in the BGC identified in Anabaena (Fig. 3d) shares 61% sequence identity with a bifunctional 6′-aminoglycoside-N-acetyltransferase/aminoglycoside-2′-phosphotransferase³⁷, which confers broad-spectrum resistance to aminoglycoside antibiotics. It may serve as a self-resistance gene and hint at the bioactivity of glycosylated crocagin derivatives.

It will be fascinating to investigate the crocagin biosynthetic system in more detail, as these data could then be used in conjunction with enzymes from the newly discovered crocagin-like BGCs to access a structurally diverse family of peptide-derived pyrroloindoline alkaloids.

Methods

The Supplementary Methods contain additional information on protein purification, production, crystallography and biophysical and biochemical methods, as well as bioinformatics.

Protein purification of CgnB, CgnC, CgnD, CgnE and CgnL

For protein purification, the cell pellets were resuspended in the respective lysis buffers (for CgnD and CgnE, 500 mM NaCl, 20 mM Tris buffer with pH 8.0, 20 mM imidazole, 3 mM β-mercaptoethanol; for CgnB, CgnC and CgnL, 200 mM NaCl, 20 mM Tris, 20 mM imidazole, 10% glycerol, 3 mM β-mercaptoethanol). For every 25 g of wet cell mass, 100 ml of lysis buffer was added and supplemented with two cOmplete ethylenediaminetetraacetic-acid-free protease inhibitor tablets (Sigma-Aldrich) and 4 mg g⁻¹ DNase (Sigma-Aldrich). Cell lysis was carried out via passage through a cell disruptor (30 kpsi, Constant Systems), and the cell debris was removed by centrifugation (43,000g, 15 min, 4 °C). The supernatant was decanted, filtered through a 0.45 µm filter and applied to a HisTrap HP 5 ml column (GE Healthcare) pre-equilibrated in lysis buffer at a flow rate of 5 ml min⁻¹. The column was extensively washed with 150 ml lysis buffer, and the target protein was eluted using lysis buffer supplemented with 250 mM imidazole. The proteins were passed over a desalting column (16/10, GE Healthcare) at a flow rate of 10 ml min⁻¹, pre-equilibrated in lysis buffer. To remove the His-SUMO tag, the desalted protein was incubated with TEV protease for 14 h and 4 °C at a 1:10 mass ratio of TEV/target-protein. Subsequent passage of the solution over a 5 ml HisTrap HP column allowed for a separation of the digested target protein and the His₆-tagged SUMO. The proteins were then passed over a Superdex 200 16/600 size-exclusion column (GE Healthcare) at a flow rate of 1 ml min⁻¹, pre-equilibrated in gel filtration buffer (150 mM NaCl, 10 mM HEPES buffer, 0.5 mM TCEP reducing agent, pH 7.4). The resulting peak was collected and concentrated to the desired concentration using a 30 kDa cut-off filter (Thermo Scientific). The protein concentration was determined using photometric analysis (Nanodrop 2000, Thermo Scientific) and subsequently analysed by sodium dodecyl-sulfate polyacrylamide gel electrophoresis. Selenomethionine variants of CgnD and CgnE were purified analogously to the native proteins.

Biochemical reactions

All biochemical reactions were performed in 150 mM NaCl, 10 mM HEPES, 0.5 mM TCEP, pH 7.4. For CgnA–CgnB–CgnC reactions, a mixture of 50 µM CgnA, 10 µM CgnB, 5 µM CgnC, 4 mM ascorbic acid, 2.5 mM FeCl₂ and 1 mM 2-oxoglutaric acid was incubated at 37 °C for 2 h. For CgnA–CgnB–CgnC–CgnE reactions, a mixture of 50 µM CgnA, 10 µM CgnB, 5 µM CgnC, 5 µM CgnE, 4 mM ascorbic acid, 2.5 mM FeCl₂ and 1 mM 2-oxoglutaric acid was incubated at 37 °C for 2 h. The standard reaction volume was 50 µl. Proteins were precipitated by adding 50% acetonitrile and freezing the mixture at −80 °C for 2 h prior to liquid chromatography and mass spectrometry (LC-MS) analysis.

CgnD activity was analysed by adding 20 µM CgnD to a 100 µM solution of purified 3a and 3b and incubating this mixture at 37 °C for up to 96 h. Remaining enzyme was precipitated by adding 50% acetonitrile and freezing the mixture at −80 °C for 2 h prior to LC-MS analysis. CgnL reaction was performed by adding 20 µM CgnL to 100 µM 4a and 4b (product of CgnD reaction), supplemented by 1 mM S-adenosyl-l-methionine and 1 mM MgCl₂. The mixture was incubated at room temperature for 24 h. Remaining enzyme was precipitated by adding 50% acetonitrile and freezing the mixture at −80 °C for 2 h prior to LC-MS analysis.

For large-scale purification of 3a and 3b, three 9 ml reactions of the aforementioned condition were pooled and precipitated by addition of 50% acetonitrile and the mixture being frozen at −80 °C for 16 h. Thawed samples were centrifuged at 3,000g for 20 min at room temperature. The supernatant was removed and sent to Protein Peptide Research for high-performance liquid chromatography purification.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Data supporting the main findings of this work are available within the Article and Supplementary Information. Diffraction data and refined structural models (Supplementary Table 3) have been deposited to the PDB: CgnB (6zsv), CgnD (8a2n), CgnE (6zsu) and CgnL (7pd7). The very large raw mass spectrometry data files are available from the authors upon request.

Code availability

The code/scripts used for genome mining are available in Supplementary Code 1.

References

Demain, A. L. Antibiotics: natural products essential to human health. Med. Res. Rev. 29, 821–842 (2009).
Article CAS PubMed Google Scholar
Hutchings, M. I., Truman, A. W. & Wilkinson, B. Antibiotics: past, present and future. Curr. Opin. Microbiol. 51, 72–80 (2019).
Article CAS PubMed Google Scholar
Repka, L. M. & Reisman, S. E. Recent developments in the catalytic, asymmetric construction of pyrroloindolines bearing all-carbon quaternary stereocenters. J. Org. Chem. 78, 12314–12320 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ruiz-Sanchis, P., Savina, S. A., Albericio, F. & Alvarez, M. Structure, bioactivity and synthesis of natural products with hexahydropyrrolo[2,3-b]indole. Chemistry 17, 1388–1408 (2011).
Article CAS PubMed Google Scholar
Mei, G. J., Koay, W. L., Tan, C. X. A. & Lu, Y. Catalytic asymmetric preparation of pyrroloindolines: strategies and applications to total synthesis. Chem. Soc. Rev. 50, 5985–6012 (2021).
Article CAS PubMed Google Scholar
Tian, W. et al. Efficient biosynthesis of heterodimeric C³-aryl pyrroloindoline alkaloids. Nat. Commun. 9, 4428 (2018).
Article PubMed PubMed Central Google Scholar
Saruwatari, T. et al. Cytochrome P450 as dimerization catalyst in diketopiperazine alkaloid biosynthesis. ChemBioChem 15, 656–659 (2014).
Article CAS PubMed Google Scholar
Lai, C. Y. et al. Biosynthesis of complex indole alkaloids: elucidation of the concise pathway of okaramines. Angew. Chem. Int. Ed. 56, 9478–9482 (2017).
Article CAS Google Scholar
Ma, J. et al. Biosynthesis of himastatin: assembly line and characterization of three cytochrome P450 enzymes involved in the post-tailoring oxidative steps. Angew. Chem. Int. Ed. 50, 7797–7802 (2011).
Article CAS Google Scholar
Li, H. et al. Pyrroloindoline cyclization in tryptophan-containing cyclodipeptides mediated by an unprecedented indole C3 methyltransferase from Streptomyces sp. HPH0547. Chem. Commun. 55, 8390–8393 (2019).
Article CAS Google Scholar
Liu, J., Ng, T., Rui, Z., Ad, O. & Zhang, W. Unusual acetylation-dependent reaction cascade in the biosynthesis of the pyrroloindole drug physostigmine. Angew. Chem. Int. Ed. 53, 136–139 (2014).
Article CAS Google Scholar
Haynes, S. W., Gao, X., Tang, Y. & Walsh, C. T. Complexity generation in fungal peptidyl alkaloid biosynthesis: a two-enzyme pathway to the hexacyclic MDR export pump inhibitor ardeemin. ACS Chem. Biol. 8, 741–748 (2013).
Article CAS PubMed PubMed Central Google Scholar
Yin, W. B., Grundmann, A., Cheng, J. & Li, S. M. Acetylaszonalenin biosynthesis in Neosartorya fischeri. Identification of the biosynthetic gene cluster by genomic mining and functional proof of the genes by biochemical investigation. J. Biol. Chem. 284, 100–109 (2009).
Article CAS PubMed Google Scholar
Yao, T. et al. Genome mining of cyclodipeptide synthases unravels unusual tRNA-dependent diketopiperazine-terpene biosynthetic machinery. Nat. Commun. 9, 4091 (2018).
Article PubMed PubMed Central Google Scholar
Montalban-Lopez, M. et al. New developments in RiPP discovery, enzymology and engineering. Nat. Prod. Rep. 38, 130–239 (2020).
Article PubMed PubMed Central Google Scholar
Arnison, P. G. et al. Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Nat. Prod. Rep. 30, 108–160 (2013).
Article CAS PubMed PubMed Central Google Scholar
McIntosh, J. A., Donia, M. S. & Schmidt, E. W. Ribosomal peptide natural products: bridging the ribosomal and nonribosomal worlds. Nat. Prod. Rep. 26, 537–559 (2009).
Article CAS PubMed PubMed Central Google Scholar
Walsh, C. T. Blurring the lines between ribosomal and nonribosomal peptide scaffolds. ACS Chem. Biol. 9, 1653–1661 (2014).
Article CAS PubMed Google Scholar
Hudson, G. A. & Mitchell, D. A. RiPP antibiotics: biosynthesis and engineering potential. Curr. Opin. Microbiol. 45, 61–69 (2018).
Article CAS PubMed PubMed Central Google Scholar
Büchel, E., Martini, U., Mayer, A., Anke, H. & Sterner, O. Omphalotins B, C and D, nematicidal cyclopeptides from Omphalotus olearius. Absolute configuration of omphalotin A. Tetrahedron 54, 5345–5352 (1998).
Article Google Scholar
Okada, M. et al. Structure of the Bacillus subtilis quorum-sensing peptide pheromone ComX. Nat. Chem. Biol. 1, 23–24 (2005).
Article CAS PubMed Google Scholar
Ishida, K., Matsuda, H., Murakami, M. & Yamaguchi, K. Kawaguchipeptin A, a novel cyclic undecapeptide from cyanobacterium Microcystis aeruginosa (NIES-88). Tetrahedron 52, 9025–9030 (1996).
Article CAS Google Scholar
Okada, M., Sugita, T. & Abe, I. Posttranslational isoprenylation of tryptophan in bacteria. Beilstein J. Org. Chem. 13, 338–346 (2017).
Article CAS PubMed PubMed Central Google Scholar
Viehrig, K. et al. Structure and biosynthesis of crocagins: polycyclic posttranslationally modified ribosomal peptides from Chondromyces crocatus. Angew. Chem. Int. Ed. 56, 7407–7410 (2017).
Article CAS Google Scholar
Maurer, C. K. et al. Discovery of the first small-molecule CsrA–RNA interaction inhibitors using biophysical screening technologies. Future Med. Chem. 8, 931–947 (2016).
Article CAS PubMed Google Scholar
Heroven, A. K., Bohme, K. & Dersch, P. The Csr/Rsm system of Yersinia and related pathogens: a post-transcriptional strategy for managing virulence. RNA Biol. 9, 379–391 (2012).
Article CAS PubMed Google Scholar
Vakulskas, C. A., Potts, A. H., Babitzke, P., Ahmer, B. M. & Romeo, T. Regulation of bacterial virulence by Csr (Rsm) systems. Microbiol. Mol. Biol. Rev. 79, 193–224 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hildebrand, A., Remmert, M., Biegert, A. & Soding, J. Fast and accurate automatic structure prediction with HHpred. Proteins 77, 128–132 (2009).
Article CAS PubMed Google Scholar
Zheng, H. et al. CheckMyMetal: a macromolecular metal-binding validation tool. Acta Crystallogr. D Struct. Biol. 73, 223–233 (2017).
Article CAS PubMed PubMed Central Google Scholar
Holm, L. Using Dali for protein structure comparison. Methods Mol. Biol. 2112, 29–42 (2020).
Article CAS PubMed Google Scholar
Santos-Aberturas, J. et al. Uncovering the unexplored diversity of thioamidated ribosomal peptides in Actinobacteria using the RiPPER genome mining tool. Nucleic Acids Res. 47, 4624–4637 (2019).
Article CAS PubMed PubMed Central Google Scholar
Houssen, W. E. et al. The discovery of new cyanobactins from Cyanothece PCC 7425 defines a new signature for processing of patellamides. ChemBioChem 13, 2683–2689 (2012).
Article CAS PubMed Google Scholar
Bauer, J. et al. Structure and possible mechanism of the CcbJ methyltransferase from Streptomyces caelestis. Acta Crystallogr. D Biol. Crystallogr. 70, 943–957 (2014).
Article CAS PubMed Google Scholar
Delvaux, N. A., Thoden, J. B. & Holden, H. M. Molecular architecture of KedS8, a sugar N-methyltransferase from Streptoalloteichus sp. ATCC 53650. Protein Sci. 24, 1593–1599 (2015).
Article CAS PubMed PubMed Central Google Scholar
Czekster, C. M., Ge, Y. & Naismith, J. H. Mechanisms of cyanobactin biosynthesis. Curr. Opin. Chem. Biol. 35, 80–88 (2016).
Article CAS PubMed PubMed Central Google Scholar
Jordan, P. A. & Moore, B. S. Biosynthetic pathway connects cryptic ribosomally synthesized posttranslationally modified peptide genes with pyrroloquinoline alkaloids. Cell Chem. Biol. 23, 1504–1514 (2016).
Article CAS PubMed PubMed Central Google Scholar
Smith, C. A., Toth, M., Weiss, T. M., Frase, H. & Vakulenko, S. B. Structure of the bifunctional aminoglycoside-resistance enzyme AAC(6′)-Ie-APH(2″)-Ia revealed by crystallographic and small-angle X-ray scattering analysis. Acta Crystallogr. D Biol. Crystallogr. 70, 2754–2764 (2014).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by the European Research Council (Consolidator Grant 101002326 to J.K.). We are grateful for A. Heckmanns’s help with the knockout studies and to V. Olive at the Scottish Universities Environmental Research Centre for ICP-MS analysis. We acknowledge the College of Medical, Veterinary & Life Sciences Structural Biology and Biophysical Characterisation Facility, University of Glasgow, for the use of Microscale Thermophoresis. We thank J. Prunet and A. Jamieson for helpful discussions, M. Molinari for help running RiPPER and D. France and M. Groll for critical reading of the paper.

Author information

Authors and Affiliations

Workgroup Structural Biology of Biosynthetic Enzymes, Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), Saarland University, Saarbrücken, Germany
Sebastian Adam, Andreas Klein & Jesko Koehnke
School of Chemistry, University of Glasgow, Glasgow, UK
Dazhong Zheng & Jesko Koehnke
Department of Microbial Natural Products, HIPS; HZI; Department of Pharmacy, Saarland University, Saarbrücken, Germany
Carsten Volz & Rolf Müller
Institute of Cardiovascular and Medical Sciences, University of Glasgow, Glasgow, UK
William Mullen
Biomedical Sciences Research Complex, University of St Andrews, St Andrews, UK
Sally L. Shirran
School of Molecular Biosciences, University of Glasgow, Glasgow, UK
Brian O. Smith
Drug Bioinformatics Group, HIPS, HZI, Saarland University, Saarbrücken, Germany
Olga V. Kalinina
Medical Faculty, Saarland University, Homburg, Germany
Olga V. Kalinina
Center for Bioinformatics, Saarbrücken, Germany
Olga V. Kalinina
Hannover-Braunschweig Site, German Centre for Infection Research (DZIF), Hanover, Germany
Rolf Müller

Authors

Sebastian Adam
View author publications
You can also search for this author in PubMed Google Scholar
Dazhong Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Klein
View author publications
You can also search for this author in PubMed Google Scholar
Carsten Volz
View author publications
You can also search for this author in PubMed Google Scholar
William Mullen
View author publications
You can also search for this author in PubMed Google Scholar
Sally L. Shirran
View author publications
You can also search for this author in PubMed Google Scholar
Brian O. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Olga V. Kalinina
View author publications
You can also search for this author in PubMed Google Scholar
Rolf Müller
View author publications
You can also search for this author in PubMed Google Scholar
Jesko Koehnke
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.A., A.K., D.Z., C.V., R.M. and J.K. designed the study. C.V. and R.M. oversaw and analysed the cgnE knockout data; S.A. and A.K. established and optimized protein production. S.A., A.K. and D.Z. performed and analysed biochemical experiments. S.A., A.K., D.Z. and J.K. determined and analysed the protein crystal structures. S.L.S. recorded and interpreted MS² data. S.A. and W.M. recorded and interpreted mass spectrometry data. O.V.K. was responsible for the bioinformatics analysis. B.O.S. acquired and interpreted the NMR data. S.A., O.V.K., R.M. and J.K. wrote the paper.

Corresponding author

Correspondence to Jesko Koehnke.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Chemistry thanks Yanyan Li, Satish Nair and Weixin Tang for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–24, Tables 1–8 and Methods.

Reporting Summary

Supplementary Data 1

Phylogenetic tree of CgnB and CgnE homologues with accession codes.

Supplementary Table 1

Detailed data for sequence similarity network in Supplementary Fig. 15 (sheet 1). List of reagents and primer (sheet 2).

Supplementary Code 1

Custom scripts used for genome mining in this study.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Adam, S., Zheng, D., Klein, A. et al. Unusual peptide-binding proteins guide pyrroloindoline alkaloid formation in crocagin biosynthesis. Nat. Chem. 15, 560–568 (2023). https://doi.org/10.1038/s41557-023-01153-w

Download citation

Received: 13 December 2021
Accepted: 27 January 2023
Published: 09 March 2023
Issue Date: April 2023
DOI: https://doi.org/10.1038/s41557-023-01153-w