Introduction

The groundnut, or peanut (Arachis hypogaea), is a species in the legume or bean categorized under the family Fabaceae. The groundnut was probably first domesticated and cultivated in the valleys of Paraguay1. Aspergillus is a large genus composed of more than 180 accepted anamorphic species, with teleomorphs described in nine different genera2. The genus is subdivided in 7 sub genera, which in turn are further divided into sections3. Groundnuts are known to colonize and contaminated by Aspergillus spp. at pre and post harvesting stage4. Development of resistant varieties to seed invasion by Aspergillus spp. is one of the feasible means to minimize the aflatoxin contamination during groundnut storage with no extras input cost for the farmers. A. flavus is a fungus pathogenic to important agricultural crops and its products are a key concern on human health due to its ability to secret and synthesize the hazardous secondary metabolite called as aflatoxin5,6. The contaminated peanut with the mold A. flavus which produces a carcinogenic substance called aflatoxin B1 is considered to be the major cause of liver cancer and other chronic and acute diseases related to aflatoxin poisoning7,8. Stunting in children also reported due to sub lethal chronic exposure to aflatoxin9. Aspergillus genus is an important fungal genus with prolific sources of secondary metabolites of industrial and commercial importance10. A. flavus is well-known mycotoxin, ochratoxin A (OTA) producing species11. OTA is a mycotoxin possessing immunosuppressive and carcinogenic properties in humans as well as in animals and is responsible for oxidative DNA damage12. In rodents, OTA can generate life-threatening renal adenomas and hepatocellular carcinomas13.

It is very crucial to understand the regulation and biosynthesis of secondary metabolites due to their importance in plant pathogenesis and animal toxicities but still hindered by important information gaps. A. flavus possess 56 SM biosynthesis gene clusters14, but very few secondary metabolites have been assigned to a particular gene cluster. A. flavus hence could produce metabolites other than well-known mycotoxins that could be underrated contributors to its toxicity to humans and animals. It is a challenging task to develop aflatoxin resistant groundnut varieties due to the non-availability of reliable resistance sources, lack of information on plant-fungus interactions, and different environmental factors.

The whole genome of A. flavus was sequenced15, and annotation of the genome of the fungus showed various proteins, genes and other regulatory factors that are potentially related to conidial development and aflatoxin biosynthesis16. In the previous study, large numbers of strains were found non-aflatoxigenic while testing the aflatoxin-producing ability of A. flavus17. The ability of fungal species to produce aflatoxins is strain-specific and a different source of the strain that contains a different virulence genetic material18,19. Mostly such a variation may be due to hindrance in the aflatoxin biosynthesis genes or belong to other non non-aflatoxin producing species. The sequencing of aflatoxin biosynthesis gene cluster was extensively studied in order to understand the mechanism and biosynthesis pathway of aflatoxin in aflatoxigenic fungi20,21. The presence of these genes is required by the Aspergillus spp. to produce aflatoxin, and any changes therein might cause disruption in the biosynthetic pathway.

A. flavus is a major threat to food safety in study region with warm and humid climates that are affected by high levels of contamination. In this study, we sequence of highly toxic and virulent Aspergillus isolates JAM-JKB-B HA-GG2022 and also compared with other known aflatoxin producing and non-producing genome of Aspergillus spp. and total eight genes and 61 secondary metabolite (SM) gene clusters were annotated with toxic strain JAM-JKB-BHA-GG20 which showed similarity with other Aspergillus spp. Our analysis is useful to understand the aflatoxin gene pathway in comparison to other Aspergillus spp. and predicted the presence of other secondary metabolites clusters viz. Nrps, T1pks, etc. genes associated with biosynthesis of the OTA mycotoxin. The outcomes of the present study are briefly highlighted and thoroughly described below with important outputs.

Results

Screening of Aspergillus for aflatoxin using groundnut variety GG-20

Out of 21 Aspergillus isolates, 12 isolates were found to produce aflatoxin in the biochemical ammonium hydroxide test (Table 1; Additional file: Table S1).

Table 1 Total aflatoxin production by Aspergillus spp. (µg kg−1) after infections with GG-20 groundnut variety.

Aflatoxin detection by LCMS-Q TOF

The LCMS-Q TOF assay revealed gradual increase in aflatoxins production and accumulation up to 10 days incubation period and then decrease at 15 days incubation in isolates number 3 (JAM-JKB-BHA-GG20), 12 (RJD-UPA-KUN-TJ37A), 14 (RJD-DHO-KAN-GG45) and 18 (JND-MEN-MEN-TJ45) While, isolate number 1 (JND-VAD-VAD-GG45), 2 (RJD-UPA-KUN-G-2) and 5 (DWK-DWK-GG20A) showed increase in aflatoxins production and accumulation was observed up to 15 days. On the other hand, isolate number 13 (RJD-DHO-PAR-GG37), 15 (JND-MAN-LIM-GG20A) and 21 (JND- JAU –ISS-1) illustrates the production and accumulation of aflatoxins only at 15 days (Table 1).

Among Aspergillus isolates, isolate number 3 (JAM-JKB-BHA-GG20) was found to be the most toxic, which produced high level of total aflatoxin (1, 43, 270 µg kg−1) followed by isolate 14 (RJD-DHO-KAN-GG45) (1, 21, 862 µg kg−1). The lowest level of aflatoxin was produced by isolate 9 (JND- YARD-1-GG20) and 16 (JND-MAN-LIM-GG20 B) (565 µg kg−1 and 199 µg kg−1, respectively). The comparative chromatogram represented the high production of B1 aflatoxin among isolates (Additional file: Figure S1A). Thus, the isolate 3 (JAM-JKB-BHA-GG20) was selected as a highly virulent and toxigenic aflatoxin producing Aspergillus which was collected from Village: Bhanvad; Taluka: Jamkhubhalia and District: Jamnagar from groundnut variety GG-20 (Table 1). Hence, isolate number 3 was used for further studies.

Gene specific molecular characterization

For the differentiation of aflatoxin producers and non-producers, gene specific PCR was tested with several genes related to the aflatoxin biosynthetic pathway. The list of synthesized primers along with amplicon size is depicted in Table 2. A total 21 different isolates of Aspergillus were screened for aflatoxin production by using specific primers. Out of 21, 8 samples showed the aflatoxin production by amplifying the three target genes (Additional file: Figure S1B).

Table 2 Aflatoxin producing gene specific primers and their PCR products.

PCR was applied using four sets of primers (Table 5) for different genes involved in the aflatoxin biosynthetic pathway. Additional file: Figure S1B shows the PCR products obtained from each primer. Bands of the fragments of tub1, aflD, aflO and aflP genes can be visualized at 1495, 985, 1326 and 1487 bp, respectively. The tub1 gene producing band of 1495 bp was amplified in all 21 isolates (Additional file: Figure S1C). The housekeeping gene tub1 coding β-tubulin was served as control in the aflatoxin PCR study. The 3 genes aflD, aflO and aflP were tested as markers for discriminating between aflatoxin producers and non-producer. Out of 21 Aspergillus isolates, 08 isolates (isolate number 2, 3, 12, 13, 14 15, 16 and 21) showed similar pattern indicating the presence of the four genes and other strains failed to produce amplification patterns.

Whole genome sequencing of identified most toxic Aspergillus isolate

The most toxic and highly virulent Aspergillus isolate JAM-JKB-BHA-GG20 was considered for the whole genome sequencing. The genome assembly is approximately 37.3 Mb in size with an N50 length of 49,272 bp (Table 3). Comparatively, A. flavus (JAM-JKB-BHA-GG20) had a similar GC% and proteins to A. flavus and the sister group A. oryzae, which differ considerably from the others.

Table 3 Contig measurements of genome of A. flavus JAM-JKB-BHA-GG20.

Quality coverage obtained from raw data of genome sequence

The number of sequences that support (cover) individual base positions is represented in Additional file: Figure S1D in that 100 percentages was observed at 100 bp positions and was good up to 300 bp. While, Additional file: Figure S1E indicates duplication level distribution, duplication levels are simply the count of how often a particular sequence has been found. Quality distribution observed at a base position above the median and percentiles of quality scores were neglected. The phred score median was found at 103 indicating 1 ambiguous base out of 1000 bases (Additional file: Figure S1F).

Trimming of whole genome sequence

Trimming of fungus genome sequences was done by CLC Genomics Workbench 9. The trimmed data showed 1, 172 bp (17.67%) trimmed reads with average trimmed length 30, 136. 3 bp (Table 4).

Table 4 Whole genome sequence trimming of Aspergillus flavus JAM-JKB-BHA-GG20.

Assembly of genome sequencing

After quality control of the raw NGS data, the most toxic and highly virulent Aspergillus flavus (JAM-JKB-BHA-GG20) genome assembly of reads was done by using CLC Genome Workbench 9. The assembled contigs smaller than 300 bp were discarded from the further analysis. The result of CLC Genome Workbench 9 showed maximum contig of 2, 26, 257 bp and minimum contig of 200 bp, with N75, N50 and N25 value of 25, 320 bp, 49, 272 bp and 84, 433 bp respectively (Table 5). N50 denotes a statistic that defines assembly quality. Given a set of contigs, each with its own length, the N50 length is defined as the shortest sequence length at 50% of the genome. The N75 and N25 statistic is less than or equal to the N50 statistic; it is the length for which the collection of all contigs of that length or longer contains at least 75% or 25% of the sum of the lengths of all contigs. The distribution of nucleotides in the raw sequence was 26.0% Adenine 24.0% Cytosine, 24.1% Guanine and 25.9% Thymine making 48.1% of GC (Fig. 1).

Table 5 Contig measurements of whole genome in Aspergillus flavus JAM-JKB-BHA-GG20.
Figure 1
figure 1

Coverage's for the four DNA nucleotides and ambiguous bases. x: base position; y: number of nucleotides observed per type normalized to the total number of nucleotides observed at that position.

Genome annotation of Aspergillus flavus JAM-JKB-BHA-GG20

The draft genome of the most toxic and highly virulent Aspergillus JAM-JKB-BHA-GG20 was annotated by using the widely used AUGUSTUS version 2.7. A total of 3, 73, 54, 834 bp genome was assembled, which consists of 2, 26, 257 contigs. The software annotated 12,400 genes from the total fungal genome with 12,717 mRNA (Table 6). Thus, while the number of genes might be slightly overestimated because of initio prediction limitations, at least 70.8% of the annotations were supported by experimental evidence. It had confirmed that the assembly was able to capture full-length genes by searching the predictions for full open reading frames (ORFs), finding that most of the genes contained start (12,500) and stop codons (12, 483) (Table 6). Sequence comparisons in RepeatMasker were performed by ABBlast/WUBlast search engine. The RepeatMasker search determined 7050 repeat regions in total genome of the most toxic fungus A. flavus (Table 6). Models of putative inter spread repeats were calculated by RepeatModeler which determined 416 repeat regions. Different putative clusters were found for this strain, suggesting that the Aspergillus fungus is capable of producing a great many more compounds than just the aflatoxins (Table 6). A total 5, 33, 371 open reading frame (ORF) were identified by using the program GETORF details, which is part of EMBOSS package. Further, 237 tRNA’s were sorted out by using tRNAscan software (Table 6).

Table 6 Genome annotation results of A. flavus JAM-JKB-BHA-GG20.

Gene identification related to aflatoxin biosynthesis pathway

Total 8 genes related to aflatoxin biosynthesis pathway were identified (Table 7). These include the ver-1 gene, which is involved in the conversion of versicolorin A to sterigmatocystin, a regulatory gene, aflR (previously named afl-2 for A. flavus and apa-2 for A. parasiticus), that codes for a regulatory factor (AFLR protein) and involved in the activation of pathway gene transcription. The pksA gene which codes for a polyketide synthase whereas, a putative fatty acid synthase gene uvm8 is potentially involved in polyketide backbone synthesis. The omt1 gene coding for an S-adenosylmethionine-dependent O-methyltransferase that converts sterigmatocystin to O-methylsterigmatocystin and dihydrosterigmatocystin to dihydro-O-methylsterigmatocystin and the nor-1 gene, which codes for a reductase that converts norsolorinic acid to averantin. Vha and aflP genes were potentially dehydrogenase involved in an intermediate step of aflatoxin biosynthesis. Genes were searched for the corresponding homologizes in NCBI data base. The BLAST results illustrated that the identified genes resembled close phylogenetic relationship to maximum (100%) similarity with A. oryzae and low (85%) similarity with A. nidulans etc. The genes annotation and their association with homologous nucleotide in different fungal classes are shown in Table 8.

Table 7 Identified gene related to aflatoxin biosynthesis pathway in Aspergillus flavus JAM-JKB-BHA-GG20.
Table 8 Comparison of aflatoxin specific genes with other Aspergillus spp. based on blast identities and base pair score.

Comparative analysis of aflatoxin pathway genes

Pairwise comparisons between the aflatoxin pathway genes viz. nor-1, pksA, ver-1 and Uvm8 of A. flavus JAM-JKB-BHA-GG20 with A. flavus NRRL 3357, A. parasiticus, A. nomius NRRL 13,137 and A. oryzae RIB 40, was conducted by applying to the CLC Genomics Workbench 9 (Table 8; Fig. 2). The highest sequence similarity 99.64% was found with A. flavus NRRL 3357 whereas, the least sequence similarity 13.05% was found in A. parasiticus nor-1gene.

Figure 2
figure 2

The putative cluster- 39 of A. flavus JAM-JKB-BHA-GG20 found in this study and its comparison aflatoxin biosynthesis gene of A. parasiticus (AY371490), A. nomius isolate AN13137 (AY510454) and A. flavus isolate BN008 (AY510452) isolates.

Dotplot comparison of the ver-1 gene of A. flavus JAM-JKB-BHA-GG20 to A. flavus NRRL 3357 revealed sequence similarity (84.76%) and synteny over a region of at least 1452 bp, A. parasiticus revealed sequence similarity (39.63%) and synteny over a region of at least 732 (Additional file: Figure S1G).

Dotplot comparison of the pksA gene of A. flavus JAM-JKB-BHA-GG20 to A. flavus NRRL 3357 revealed sequence similarity (86.57%) and synteny over a region of at least 1424 bp, A. oryzae RIB 40 revealed sequence similarity (79.42%) and synteny over a region of at least 1374 bp whereas, A. nomius NRRL 13,137 revealed sequence similarity (76.47%) and synteny over a region of at least 1258 bp (Additional file: Figure S1G). Dotplot comparison of the Uvm8 gene of A. flavus JAM-JKB-BHA-GG20 to A. flavus NRRL 3357 revealed sequence similarity 80.56%) and synteny over a region of at least 924 bp, whereas, A. oryzae RIB 40 revealed sequence similarity (65.18%) and synteny over a region of at least 848 bp (Fig. 3). As such the nucleic acid similarity matches above 98% had a continuous straight diagonal line in the center of the matrix. Dotplot comparison of the nor-1 gene of A. flavus JAM-JKB-BHA-GG20 to A. flavus NRRL 3357 revealed sequence similarity (37.21%) and synteny over a region of at least 995 bp, A. parasiticus revealed sequence similarity (13.05%) and synteny over a region of at least 342 bp whereas, A. oryzae RIB 40 revealed sequence similarity (98.77%) and synteny over a region of at least 1289 bp (Additional file: Figure S1G).

Figure 3
figure 3

Genes and enzymes in the aflatoxin B1 and B2 biosynthetic pathway.

Secondary biosynthetic gene clusters of A. flavus JAM-JKB-BHA-GG20

Using the whole genome of A. flavus JAM-JKB-BHA-GG20 as the query sequences of the antiSMASH 3.0 platform, many short and dense fragments are present in some SM gene clusters, suggesting low prediction accuracy. Therefore, SM gene clusters were constructed. Many of the putative SM gene clusters belong to type Ipks (t1pks) or non-ribosomal peptide synthase (nrps) gene clusters, while the remaining were terpene, indole, or other hybrid gene clusters (Table 9). The number of putative nrps gene (16) pks gene (14) and hybrid nrps/pks gene (07) of A. flavus JAM-JKB-BHA-GG20 were noted. The hybrid T1pks-indole-nrps gene cluster encoded Dehydrogenase, Beta ketoacyl synthase, Drug, cytochrome 450 genes. The genes present in this cluster showed 20% homology towards aflatoxin pathway gene clusters of A. parasiticus (AY371490), A. nomius isolate AN13137 (AY510454) and A. flavus isolate BN008 (AY510452) isolates whereas, 42% homology towards known gene cluster of A. flavus aflatoxin (BGC0000006, BGC0000008) AY510451 and AY510453 was acquired from Minimum Information about a Biosynthetic Gene cluster (MIBiG) database (Fig. 3). The generally accepted aflatoxin B1 and B2 biosynthetic pathway in A. parasiticus and A. flavus, enzymes for some specific conversion steps, and cloned genes are schematically presented. The regulatory gene, aflR, coding for the regulatory factor (AFLR protein), controls the expression of all of the characterized structural genes (nor-1, ver-1, and omtA). The ver-1 gene product(s) has not been fully characterized; the catalytic steps in the aflatoxin biosynthetic pathway of the uvm8 and the aad gene products, fatty acid synthase and a dehydrogenase, respectively, are not defined; and the ord-1 and ord-2 gene products are under investigation. The approximate sizes, relative locations, and directions of transcription of the identified genes, are indicated. Transcription of structural genes nor-1, ver-1, and omtA is regulated by AFLR. Abbreviations: NOR, norsolorinic acid; AVN, averantin; AVNN, averufanin; AVF, averufin; VHA, versiconal hemiacetal acetate; VHOH versiconal; Ver B, versicolorin B; Ver A, versicolorin A; ST, sterigmatocystin; DHST, dihydrosterigmatocystin; OMST, O-methylsterigmatocystin; DHOMST, dihydro-O-methylsterigmatocystin; AFB1 aflatoxin B1; AFB2, aflatoxin B2 (Fig. 3).

Table 9 Classification of putative secondary metabolites clusters of predicted by anti-SMASH.

Discussion

Groundnut is grown on a large scale in almost all the tropicals and sub-tropical countries in the world. A groundnut is the third important oil seed crop of the world in production after the soybean and cotton23. India shares 23% of the world’s groundnut area and production. In India, it is grown over an area of 43.16 lakh hectares with total production of 51.27 lakh tonnes24. The groundnut crop is attacked by a number of diseases. The economically important foliar fungal diseases are early and late blight rusts etc. Seed and soil borne diseases like collar rot, stem rot and dry root rot are also restricted the production of groundnut. Viral diseases like bud necrosis, peanut mottle diseases etc. are also affecting groundnut producing25. Groundnut may be contaminated with the mold Aspergillus flavus which produces a carcinogenic substance called aflatoxin and it detonates the export quality of a groundnut. Aflatoxin accumulation and production by the fungus in groundnut poses a serious threat to human health and agriculture commodities. Variability in production of aflatoxins, especially among A. flavus isolates, has often been reported25,26,27,28. For example, only half of A. flavus strains produce aflatoxins, however, many of these strains produce more than 106 μg kg−129.The production of aflatoxin involves a complex biosynthetic pathway consisting of at least 25 genes30. Identifying key genes and respective product involve in aflatoxin production has been a debate over the past issue. Therefore, the proposed study was aimed to identify key genes involve in highly infected A. flavus strain using molecular aspects in the selected area as the first approach.

Among 21 isolates studied, 12 of them were reported producing aflatoxin confirmed by biochemical test as well as LCMS-Q TOF. These strains are reported producing aflatoxin in the range of 1, 43, 270 µg kg−1 to 199 µg kg−1 till 15 days of infection. This report is somewhat similar or dissimilar to other studies presented earlier. The variation in the ability of isolates to produce aflatoxin may be due to genetic differences between fungal isolates as well as different physiological conditions31. Likewise, Harry and Hugo32 demonstrated the production or accumulation of aflatoxins in vitro by four isolates on three substrates (acid-delinted cottonseed, shelled Spanish peanut, and rough rice) with relation to temperature in the range of 100–400 °C. Within the first 10 days after inoculation, the optimal temperature range for aflatoxin production was between 200 and 350 °C. Only small amounts of the toxins were produced at 100 and 400 °C. Within the optimal temperature range, the time required for toxin production and for significant accumulation decreased as the temperature increased. More aflatoxin G was produced or accumulated in relation to aflatoxin B at low temperatures.

Nagarajan and Ramesh33 studied toxin production by the different fungal isolates varied considerably. Isolates of A. flavus produced only B1 and B2, whereas A. parasiticus produced B1, B2, G1 and G2. The results indicate clearly that there are species and varietal differences in toxin production. The variation in toxin production appears to be intimately related to the inherent ability of the fungal isolate to produce the toxin. The fungal isolates used by Rao and Tulpule34 was perhaps so low in toxin-producing capability that no detectable toxin could be detected when it was grown on US-26, although measurable amounts of the toxin were produced on other varieties of peanuts. The toxin production was also related to the species of Aspergillus used; A. parasiticus always produced greater amounts as compared to A. flavus.

Aflatoxins may be produced but not detected because of the inherent detection limits of the analytical systems. This approach aims at selecting a solid molecular marker for discriminating Aspergillus flavus species. For the differentiation of aflatoxin-producers and non-producers, gene specific PCR were tested with several genes related to the aflatoxin biosynthetic pathway. In this study, aflatoxin producing Aspergillus isolates were conformed for its toxicity with molecular approach. For diffracting isolated Aspergillus for aflatoxin producing and non-producing, the 3 genes viz. aflD, aflO and aflP specific primers were tested by earlier researcher35. The 3 genes aflD, aflO and aflP were tested as markers for discriminating between aflatoxin producers and non-producers36. Out of 21 Aspergillus isolates, 08 isolates such as isolate no. 2, 3, 12, 13, 14 15, 16 and 21 showed a similar pattern indicating the presence of the four genes and other strains failed to produce amplification patterns.

Researchers have reported use of PCR technology as rapid and sensitive method for detection and diagnosis of aflatoxin production, to detect aflatoxigenic strain from non-aflatoxigenic strain in other food commodities apart from the groundnut37. PCR analysis was able to amplify two structural genes i.e. nor1 and ver1 and regulatory gene aflR by using specially designed primers. The aflR is a positive regulatory gene which is required for transcriptional activation of most of the structural genes38,39.

Adeela et al.37 evaluated fungal flora of peanuts to test aflatoxin contamination. 10 isolates out of 40 were aflatoxin producers when tested by PCR analysis. The study revealed that nor1 gene was being amplified in almost all of them as it was a structural gene required in the initial step. Three primers were carefully selected to be highly specific for these three genes known to be essential for aflatoxin biosynthesis. Each primer pairs yielded a single DNA fragment of the expected size of 400, 600 and 1000 bp for nor1, ver1 and aflR respectively.

In present study, β-tubulin gene found to be expressed in all the isolates. Kanbe et al.40 reported single-copy conserved genes can also be used as targets for taxonomic studies within the A. flavus group, when multi-copy segments from the rRNA gene complex lack variability. Universal β-tubulin, calmodulin and topoisomerase II genes have been used in fungal species identification. Genes involved in secondary metabolism are considered to be more variable within closely related species41.

Yu et al.36 surveyed fourteen strains of A. flavus as a test sample and three samples of other fungi, such as A. ngier, P. expansium, F. verticillioides, which were assumed a positive control through TLC technique and PCR method, working with nor-1, ver-1, omt-1, aflR primers. Nor-1, ver-1, omt-1 are three structural genes in cluster genes in biosynthesis aflatoxin pathway that coding for key enzymes in production of aflatoxin, thus they are essential for aflatoxin production whereas, aflR is a positive regulatory gene required for activation of structural genes. The result revealed that three samples of fourteen strains of A. flavus were positive using TLC technique and totally twelve samples with the four mentioned primers used in PCR technique showed positive results. None of the other fungal strains using TLC and PCR did show any positive results. The positive control in both techniques was positive.

The similar results of tub1, aflD, aflO and aflP genes expression was studied which revealed those applied conditions were set up perfectly, as though each primer formed sharp and distinct bands in its specific area. Other studies suggest that regulation of aflatoxin biosynthesis in Aspergillus spp. involves a complex pattern of positive and negative acting transcriptional regulatory factors, which are affected by environmental and nutritional parameters38,39.

The same result has been achieved by using PCR and multiplex PCR procedures42,43 The interpretation of the results revealed that PCR is a rapid and sensitive method (sensitivity 100%, specificity 75%) in diagnosis of aflatoxinogenic molds but, this technique (PCR) cannot differentiate between toxic and nontoxic fungi. Geiser et al.44 suggests that the lack of aflatoxin production could also be due to simple mutations including substitution of some bases and Liu and Chu45 suggest that a variety of different physiological conditions affecting aflatoxin biosynthesis.

In previous study a multiplex real time-PCR protocol has been standardized based on aflD, aflO, and aflQ genes to differentiate aflatoxin-producing strains of A. flavus from the aflatoxin-nonproducing ones. They found a good correlation between the target genes expression in nearly all samples38,39.

The result of whole genome sequencing revealed approximately 37.3 Mb in size with an N50 length of 49,272 bp. Comparatively, the species isolated in our study had a similar GC% and proteins to that of A. flavus and the sister group A. oryzae, somewhat differ considerably from the others. A total of 3, 73, 54, 834 bp genome was assembled, which consists of 2, 26, 257 contigs. The software annotated 12,400 genes from total fungal genome with 12,717 mRNA. Thus, while the number of genes might be slightly overestimated because of initio prediction limitations, at least 70.8% of the annotations were supported by experimental evidences. It had confirmed that the assembly was able to capture full-length genes by searching the predictions for full open reading frames (ORFs), finding that most of the genes contained start (12,500) and stop codons (12, 483). Sequence comparisons in RepeatMasker were performed by ABBlast/WUBlast search engine. The RepeatMasker search determined 7050 repeat regions in total genome of most toxic fungus A. flavus. Models of putative interspread repeats were calculated by RepeatModeler which determined 416 repeat regions. Different putative clusters were found for this strain, suggesting that the Aspergillus fungus is capable of producing a great many more compounds than just the aflatoxins. The result are agreed with This agreed with the preceding research results of Nierman et al.46, who reported 5 × genome coverage, which was assembled into 958 contigs comprising 331 scaffolds ranging in size from 4.46 Mbp to 211 bp and containing a total of 958 contigs. The genome size 37 Mbp which matches with our findings i.e.37.3 Mb genome size. The average GC content was 44% which was less than our results (48.1%). The number of predicted protein-coding genes is 13,485. Similarly, Yoko et al.47 performed genome sequencing on a Pacific Biosciences RS II (Pacific Biosciences, Menlo Park, CA, USA) using libraries prepared with the SMRTbell template prep kit 1.0 (Pacific Biosciences). A draft genome of A. lentulus was assembled using SMRT Analysis 2.3 (Pacific Biosicences). Eventually, 19 scaffolds were obtained, summing up to 30,956,128 bp with an overall GC content of 49.45% greater than our report (48.1%). The N50 and maximum scaffold were 4,166,724 bp, and 5,062,644 bp, which are more as compare to our findings (49,272) respectively. Gene annotation using AUGUSTUS program 2.5.5 trained with the parameters of the species Aspergillus fumigatus resulted in 9860 genes. 200 tRNAs and 106 rRNAs were predicted by tRNAscan-SE 1.3.1 and RNAmmer 1.2, respectively. Maria et al.48 assembled a 54.9 Mb P. lycopersici draft genome sequence based on Illumina short reads, and annotated approximately 17,000 genes. The P. lycopersici genome is closely related to hemibiotrophs and necrotrophs, in agreement with the phenotypic characteristics of the fungus and its lifestyle.

Filamentous fungi A. flavus produce many bioactive secondary metabolites, such as various mycotoxins or other bioactive compounds that have been exploited for pathogenicity. The genes responsible for the production of the secondary metabolites tend to be organized in biosynthetic gene clusters49. Using the whole genome of A. flavus JAM-JKB-BHA-GG20 as the query sequences of the antiSMASH 3.0 platform, many short and dense fragments present in some SM gene clusters, suggesting low prediction accuracy. Therefore, SM gene clusters were constructed.

From the present study, it can be summarized that the Aspergillus flavus, a most toxic and virulent fungal pathogen exist in Saurashtra region of Gujarat, which escort the threat of crashing down the export of groundnut seed market of this region in the future. This study identified fungal pathogen on basis of macroscopic, microscopic and molecular characters. The LC–MS QTOF demonstrated maximum presence of aflatoxin fractions B1, B2 than G1, G2 of total aflatoxin.

The whole genome data helped in understanding of aflatoxin gene pathway in comparison to other Aspergillus spp. and predicted presence of other secondary metabolites clusters viz. Nrps, T1pks etc. genes associated with biosynthesis of the OTA mycotoxin.

To get abdicated from such notorious fungal pathogen, an ecofriendly tool had been demonstrated by this study. The green AgNP’s formulated from the best antagonist bacterium were able to inhibit the most toxic and virulent fungus paving a new biological tool against a fungal pathogen and having capacity to hold down future economical crises in groundnut dependent families.

Material and methods

Extraction of aflatoxin using groundnut variety GG-20

The seeds of groundnut susceptible variety GG-20 were collected from Main Oil Seed Research Station, Junagadh Agricultural University, Gujarat. 10 g seed of GG-20 variety of groundnut were rehydrated with 10 ml of water, sterilized at 12 °C for 15 min, and inoculated with 1 ml of a spore suspension (approximately 6 × 10–5 spores/ml) from 21 isolates of a fungus50. The flasks were incubated at 28 °C for 5th, 10th and 15th day in an incubator with 80% relative humidity. After incubation for 5th, 10th and 15th day the seeds were crushed in 70% methanol and shaken at 150 rpm for 30 min at room temperature on a shaker in amber color 100 ml flasks and Filtered by Whatman filter paper. Samples filtrate were concentrated under N2 evaporator till it became to 2 ml and store in amber color 2 ml bottles until further used for aflatoxin detection by LCMS Q TOF. All methods were carried out in accordance with relevant guidelines by utilizing strict protocols as determined by the institute.

Aflatoxin assay by LCMS Q TOF Analysis

Aflatoxin analyses were performed using an Agilent 6540UHD Accurate-Mass Q-TOF LC/MS51. The LC system consisted of a binary pump (G1312B), vacuum degasser (G1379B), a low carryover automatic liquid sampler (G1367D), thermostatted column compartment (G1316B) and Mass Hunter data system. Purified aflatoxin standards (B1, B2, G1 and G2) were obtained from Sigma-Aldrich and run 0.1–25 ppb of each simultaneously for quantifying aflatoxin fractions from unknown samples. Detailed procedure for aflatoxin assay using LCMS ids briefly highlighted in Additional File Text S2.

Whole genome sequencing of highly virulent and toxigenic aflatoxin producing A. flavus

Isolation of genomic DNA

The fresh genomic DNA was isolated from seven day old culture grown on PDA medium at 28 °C52. Isolation of genomic DNA for whole genome sequencing was performed as described in Additional File Text S1.

Quantification and purity of genomic DNA

The genome DNA was quantified by Nanodrop machine and the concentration at A260/A280 was measured as described in supplementary material I. Pure DNA has an A260/A280 ratio of 1.8 in TE. DNA integrity was measure on 0.8% agarose gel electrophoresis. The concentration of DNA was adjusted to 100 ng µl−1 for further work.

Genome sequencing and assembly

Genome sequencing and assembly was performed into 3 steps i.e. 1: Library preparation; 2: Template preparation; 3: Sequencing. The detail of procedure employed is described in the Additional File Text S3.

Genome assembly and functional annotations

After quality control of the raw NGS data, the most toxic and highly virulent Aspergillus flavus JAM-JKB-BHA-GG20) genome assembly of reads was done by using CLC Genome Workbench 9. The draft genome was annotated by using the widely used AUGUSTUS version 2.7. To evaluate the potential of a gene to produce secondary metabolites, the assembled genome of the A. flavus was supplied to antiSMASH (v3.0.5, with default parameters, except for checking the “DNA of Eukaryotic Origin” box). RepeatMasker is program that screens DNA sequences for interspersed repeats and low complexity DNA sequence. Sequence comparisons in RepeatMasker were performed by ABBlast/WUBlast search engine. The Modeles of putative inter spread repeats were calculated by RepeatModeler which determined 416 repeat regions. The open reading frame (ORF) were identified by using programmed getorf details, which is part of EMBOSS package and tRNA’s were sorted out by using tRNA scans software.

Comparative evaluation of aflatoxin producing predicted genes with other Aspergillus spp. genome

The genes predicted by software AUGUSTUS version 2.7 were converted into fasta format and the Basic Local Alignment Search (BLAST) analysis was carried with NCBI. The resulting hits were recorded to demonstrate percent identity and base pair coverage of test fungal genome with other Aspergillus spp.

Dotplot analysis using aflatoxin specific gene sequences

Dotplot is a way to visualize the similarity between two protein or nucleic acid sequences53. Dotplots compare two sequences by organizing one sequence on the x-axis, and another on the y-axis, of a plot. When the residues of both sequences match at the same location on the plot, a dot is drawn at the corresponding position. To visualize the aflatoxin genes similarity within Aspergillus spp. a dotplot analysis was conducted. The aflatoxin genes derived from NCBI gene bank were utilized for creating dotplot.

Ethical approval and consent to participate

The groundnut seeds varieties used in the proposed work were developed and released by Junagadh Agriculture University, India. Therefore, no specific permission was required to obtain seeds for research. Also, the work doesn’t involve any endangered or protected plant species so no further authorization is needed.

Conclusion

In this study, we generated genome assembly of a highly toxic Aspergillus flavus isolate JAM-JKB-B HA-GG20. The whole genome data helped in understanding of aflatoxin gene pathway in comparison to other Aspergillus spp. and predicted presence of other secondary metabolites clusters viz Nrps, T1pks etc. genes associated with biosynthesis of the OTA mycotoxin. An aflatoxin producer isolate JAM-JKB-B HA-GG20 found morphologically similar to closely related non aflatoxigenic species, hence, will be useful for developing genetic screens to identify this toxic mold in crops and stored food products.