Introduction

Listeria monocytogenes is a ubiquitous Gram-positive bacterium and an opportunistic pathogen that causes listeriosis in humans and in some animal species1. In humans, listeriosis is a potentially lethal infection, with the vulnerable populations such as the elderly, pregnant women, and immunocompromised persons at particular risk for meningitis, sepsis, premature birth, or abortion2. In animals, particularly in ruminants, listeriosis can manifest as rhombencephalitis and in its septicemic form may cause abortion, stillbirth, and death3.

Listeria monocytogenes is classified into four major evolutionary lineages and four PCR based serogroups4,5. The majority of L. monocytogenes isolates belong to lineage I that is associated with serotypes 1/2b, 3b and 4b, and to lineage II which includes serotype 1/2a, 1/2c, 3a, and 3c5. Further, multilocus sequence typing (MLSTs) subdivides these categories into numerous clonal complexes (CCs) and sequence types (STs). Certain serotypes specifically 1/2a, 1/2b and 4b, and certain CCs including hypervirulent strains assigned to CC1, CC4 and CC6, are frequently encountered in clinical cases5,6.

Following survival within the gastrointestinal (GI) tract, multiple virulence factors (VFs) enable L. monocytogenes to invade and survive within mammalian host cells7. Key VFs include InlA and InlB, which belong to a family of internalins that allow the bacteria to incorporate within vacuoles of the host cells, listeriolysin O (LLO) and phospholipases PlcA and PlcB which are involved in vacuolar lysis, as well as ActA, which provides motility within the cytosol1,8,9.

Listeria monocytogenes virulence genes are mostly organized in clusters located throughout the chromosome, such as the internalin gene operon, several listerial pathogenicity islands (LIPI-1to LIPI-4), and stress survival islets (SSIs). L. monocytogenes harboring LIPI-4 are considered hypervirulent and are associated with enhanced invasion and neural and placental infection6. By contrast, L. monocytogenes isolated from food processing environments and from food are frequently associated with reduced pathogenicity due to truncated and non-functional major virulence factors such as inlA/B10,11. Such InlA truncations are partially accountable for hypovirulence in L. monocytogenes CC9 and CC121, which are major CCs associated with a food origin6.

Consumption of food, notably ready-to-eat food, fresh raw produce, and animal-derived food products that are consumed raw is the primary route of exposure of humans to pathogenic L. monocytogenes 12. The occurrence and persistence of Listeria in processing plants are frequently caused by environmental recontamination at plant or farm level13. At farm level, the route of L. monocytogenes infection in ruminants is understood to be contaminated silage and contamination of farm environments including cattle bedding and water throughs14. The introduction sources of L. monocytogenes to farm animals, and the pathways permitting pathogenic L. monocytogenes entering the food chain are currently not completely understood. Listeria is widely distributed in the natural environment, with soil representing a key niche for the persistence of globally distributed L. monocytogenes isolates13. Soil runoff may contaminate water sources which then serve as reservoirs that transfer Listeria through the environment15. In addition, contaminated sewage or wastewater effluent is reportedly an important cause of Listeria in rivers16,17, with L. monocytogenes found more commonly than other Listeria species in surface waters in urban environments18. The aquatic ecosystem therefore provides an ideal setting for the circulation of L. monocytogenes between the habitats such as soil, plants, animals, natural and urban environments19. A number of studies have shown that Listeria occurs in waterways in farm environments and agricultural areas, including water which could be used for irrigation20,21. Several studies have reported the prevalence of L. monocytogenes in river water in the US and Canada, with serogrouping and pulsed field gel electrophoresis (PFGE) analysis revealing serogroups and pulsotypes that were similar to human L. monocytogenes isolates16,21. Furthermore, surface water strains may carry functional inlA genes thus being potentially virulent15,17. Although it is therefore recognized that pathogenic L. monocytogenes occur in surface water, whole genome sequencing-based information regarding the molecular diversity of L. monocytogenes occurring within the aquatic environment is currently limited.

This study was designed to evaluate the occurrence of L. monocytogenes in flowing surface waters throughout Switzerland and to characterize the isolated strains using whole genome analyses. We aimed to identify in the aquatic environment, any epidemic clones associated with human and animal infections in order to assess the relevance of such strains to public health as well as the health of animals. Further emphasis was placed on identifying virulence factors (VFs).

Results

Occurrence of L. monocytogenes in flowing surface water

A total of 191 water samples were collected and analyzed. The geographical distribution of the 191 sampling sites is depicted in Fig. 1. Of the 190 sites, 141 (74%) were located downstream of a wastewater treatment plant (WWTP) (see Supplementary Table S1).

Figure 1
figure 1

source geographic information system (GIS) software QGIS version 3.10 (https://qgis.org).

Map of Switzerland showing surface waters, sampling locations, and sites of strain isolation. The map was created using the open

After enrichment, 25 (13%) samples revealed one to three presumptive colonies on OCLA plates. From each plate, one colony was selected for species identification and further analysis using whole genome sequencing (WGS). The positive samples were from sites situated between 280 and 1560 m above sea level, and 19 (76%) were located downstream of WWTPs (Table 1).

Table 1 Key features of 25 surface water sampling sites testing positive for Listeria monocytogenes.

Assignment of L. monocytogenes lineages and serotypes

After implementation of WGS for the 25 isolates, in silico analyses including serotyping identified thirteen (52%) strains belonging to lineage I, including 11 strains representing serotype 4b, and two belonging to serotype 1/2b (Table 2). The remaining 12 (48%) strains belonged to lineage II and serotype 1/2a (Table 2). The general features of the 25 Listeria monocytogenes draft genomes are listed in Table 3.

Table 2 Key features of 25 Listeria monocytogenes isolates from flowing surface water and of five reference strains.
Table 3 General features of the 25 Listeria monocytogenes draft genomes and number of comparable genomes available in the NENT databasea.

Identification of epidemic and outbreak clones

To assess the clinical relevance of L. monocytogenes occurring in the aquatic environment, the genomes of the 25 strains were subjected to detailed in silico analysis. Based on the seven-gene MLST scheme provided by the BIGSdb-L. monocytogenes platform (https://bigsdb.pasteur.fr/listeria), a total of 11 STs which corresponded to 11 CCs were identified (Table 2). The predominant CCs were CC1 (20%), CC4 (16%), and CC412 (16%) (Table 2). L. monocytogenes serotype 4b CC1 and CC4 are epidemic clones associated with human listeriosis outbreaks worldwide, and both clones are also frequent among strains causing listeriosis in ruminants6,22,23. By contrast, L. monocytogenes serotype 1/2a CC412 is only sporadically associated with human infections but is prevalent among strains causing rhombencephalitis in cattle6,22. Two further strains (8%) were assigned to CC6, which is an emerging epidemic L. monocytogenes serotype 4b clone causing major outbreaks and severe forms of human listerial meningitis worldwide6,24. Other CCs identified among the strains in this study included L. monocytogenes serotype 1/2a CC7, CC11 and CC29, all of which have been associated with outbreaks occurring in the US between 1987 and 2011, and further found to represent environmental strains that persist within food production and livestock environments globally23,25,26,27. On the other hand, L. monocytogenes serotype 1/2a CC37 found in two samples in this study, is associated with abortion in small ruminants and cattle and found frequently in wildlife and ruminant environments13,25,28. Further, L. monocytogenes CC37 is prominent in milk samples from US dairy farms29.

Analysis of strain relatedness

The 25 L. monocytogenes strains belonged to 25 different cgMLST types (CTs), as shown in Table 2. The population structure of the strains was visualized by constructing a phylogenetic tree based on cgMLST. The isolates grouped according to lineages and serotypes, but were phylogenetically clearly distinct from each other, with ≥ 10 different alleles between each pair of neighboring isolates (Fig. 2). The genomes of the CC1, CC4, CC6, CC7, CC37, CC59, and CC412 isolates were compared with the available genomes of corresponding CCs present in the database of the Swiss National Reference Centre for Enteropathogenic Bacteria and Listeria (NENT) which collects all L. monocytogenes strains from confirmed human listeriosis cases nationwide and performs Illumina-based whole genome sequencing. The cgMLST-based phylogenetic trees are shown in Fig. 3 and the number of L. monocytogenes genomes in the NENT database are listed in Table 3. None of the 25 strains from this study clustered with a strain in the database, thereby ruling out a direct match with any L. monocytogenes reported from a case of human disease in Switzerland.

Figure 2
figure 2

Minimum-spanning tree based on cgMLST allelic profiles of 25 Listeria monocytogenes isolated from surface waters. Each circle represents an allelic profile based on sequence analysis of 1,701 cgMLST target genes. The numbers on connecting lines represent the number of allelic differences between two strains. Each circle contains the strain ID, and CCs are color coded. Lineages and serotypes are indicated.

Figure 3
figure 3

Minimum-spanning trees based on cgMLST allelic profiles of 25 Listeria monocytogenes isolated from surface water and 368 genome-sequenced clinical L. monocytogenes isolated during 2011–2020 available in the database of the Swiss National Reference Centre for Enteropathogenic Bacteria and Listeria (NENT) in Switzerland. Strains are grouped by clonal complex (CC). Each circle represents an allelic profile based on sequence analysis of 1,701 cgMLST target genes. The numbers on connecting lines represent the number of allelic differences between two strains. Clusters were defined as isolates containing ≤ 10 different alleles between a pair of isolates. Strains from this study are indicated with a pink star.

Assessment of virulence attributes

Pan-genomic analysis was performed to identify the presence of virulence genes associated with invasion of mammalian host cells, hypervirulence, or neural and placental infections.

Internalin A, encoded by inlA is a key factor associated with pathogenic L. monocytogenes. Premature stop codons within inlA, often observed in food-associated and environmental strains, result in the presence of truncated InlA and reduced or loss of virulence30. In this study, all isolates contained full length inlA genes and were therefore potentially virulent. Among the 25 strains, there were several lineage and clonal complex specific SNP-introduced amino acid substitutions in inlA compared to the reference strain EDG-e. For example, the amino acid substitution R3K was only detected in strains belonging to lineage I, whilst Y774D was only detected in L. monocytogenes CC4 and CC6 (see Supplementary Table S2).

Further, L. monocytogenes CC6 (strains L12 and L164, respectively) all had a characteristic triple amino acid deletion in the pre-anchor region of InlA, the impact of which is not yet known30 (see Supplementary Table S2).

In addition to inlA, the L. monocytogenes genomes in this study showed a variable combination of other internalin genes due to the presence or absence of inlG and inlL (see Supplementary Fig. S2). The inlG gene was lacking in lineage I strains, with the exception of two CC6 strains (L12 and L164). Similarly, inlL was absent in all lineage I strains as well as in three lineage II strains (L36, L86, and L180) (Supplementary Table 2 and Supplementary Fig. S2).

LIPI-1 is a PrfA dependent virulence gene cluster consisting of six genes (prfA, plcA, hly, mpl, actA and plcB) that are crucial for the infection cycle of L. monocytogenes31. In this study, LIPI-1was present in all 25 strains (Supplementary Table 2 and Supplementary Fig. S2). All strains contained full length prfA genes (see Table S3). There were lineage and clonal complex specific SNP-introduced amino acid substitutions in hly, for example L35S and I438V were only detected in lineage I strains whilst V433I was unique to CC412 strains (see Supplementary Table S4).

There were various other minor lineage and strain specific genetic differences detected in further virulence factors including some amino acid changing SNPs in plcA, plcB and actA, but none that have previously been reported to alter functions of these proteins.

LIPI-3 encodes a biosynthetic cluster involved in the production of listeriolysin S (LLS), a hemolytic and cytotoxic factor postulated to play a role in Listeria gastrointestinal colonization32. LIPI-3 is strongly associated with lineage I strains and was present in all lineage I strains of this study with the exception of strain L41 which belonged to serotype 1/2b ST59 (Supplementary Table 2 and Supplementary Fig. S2).

LIPI-4 contains six genes encoding cellobiose-type phosphotransferase systems (PTS) that can enhance invasion, leading to neural and placental listeriosis6. The presence of LIPI-4 is strongly associated with hypervirulence of CC4 strains. Accordingly, LIPI-4 was identified among all CC4 strains in this study (Table 2 and Fig. S2).

SSI-1 is a five gene islet that contributes to the growth of L. monocytogenes under suboptimal conditions33,34. This islet has been previously shown to be a feature of L. monocytogenes CC7 and CC8 associated with persistence in food-processing plants but is also found in sporadic environmental strains26. In this study, SSI-1 was identified only in strains L50 and L72 both (CC7), and in L52 belonging to an ST2332 (Supplementary Table 2 and Supplementary Fig. S2).

Other VF including genes involved in adherence, intracellular survival, regulation of transcription and translation and surface protein anchoring were present in most of the strains, with some notable lineage, clonal complex, and serotype associated differences (see Supplementary Fig. S2). For instance, the adherence gene ami35 was lacking in lineage I except in strain L41. Further, strains belonging to serotype 4b all contained aut_IVb which is an allele of the invasion gene aut 36, and all lacked tagB, a gene involved in teichoic acid biosynthesis37. All serotype 1/2a and 1/2b strains were as expected without the genes gltA and gltB which are serotype 4b-specific genes involved in teichoic acid biosynthesis38. Finally, all L. monocytogenes CC412 lacked the adherence gene lapB 35.

Identification of antimicrobial, heavy metal and disinfectant resistance determinants

All 25 L. monocytogenes strains from this study contained four intrinsic antibiotic resistance genes, including the fosfomycin hydrolase gene fosX, the antibiotic efflux pump gene lin, the quinolone resistance efflux pump gene norB, and the sulfonamide resistance gene sul. Cadmium resistance genes cadA1 and cadC1 were detected in strain L86 (CC14). No arsenic or benzalkonium chloride resistance genes were detected (see Supplementary Fig. S2).

Discussion

In this nationwide study, we recovered L. monocytogenes from water samples from rivers and streams localized within agricultural and nonagricultural environments, urbanized areas, and mountainous regions up to altitudes of 1560 m above sea level. The majority (21 /84%) of the positive samples was retrieved within the geographical region of Switzerland that belongs to the so-called Central Plain. This area is characterized by settlement and urban areas, as well as agricultural areas, and represents the most densely populated region of Switzerland39. Further, with 76% of the positive samples located in the proximity to WWTP, anthropogenic sources of L. monocytogenes strains retrieved in this study appear likely, however, this observation needs to be confirmed by additional investigations that include further data collection. Indeed, previous studies suggest that Listeria species survive conventional wastewater treatment processes and that effluents of WWTPs are potential sources of clinically important L. monocytogenes 40,41. However, the prevalence and diversities of the L. monocytogenes strains may have been influenced by the recovery procedure. In our study, the methodology included enrichment in HFB, as opposed to previous studies using other protocols to isolate Listeria from water samples, such as Universal Pre-enrichment Broth (UPB)21, or selective Listeria Enrichment Broth (LEB)15,17. It cannot be excluded that some lineages or serotypes of L. monocytogenes may differ in their ability to recover in HFB, and caution should be applied when comparing results obtained using different methodologies.

The prevalence of L. monocytogenes in surface water in this study was 13%. By comparison, L. monocytogenes was identified in 10% of surface water sampled in Canada15, in 13% in New York State42, 31% in Mid-Atlantic US21. Notably, L. monocytogenes was not detected in surface water in Austria13. However, prevalence of L. monocytogenes in natural water bodies may vary according to recovery methodology and sampling season20,21. Therefore, the lack of periodic sampling and the lack data for the summer and autumn seasons in this study could have influenced the results and it cannot be excluded that extending the study period to include warm seasons may have had an impact on prevalence and variety of the L. monocytogenes present in flowing surface waters. Therefore, our results may not apply directly to L. monocytogenes recovered from water during warm seasons. Nevertheless, the data from this study highlights the broad geographical distribution of clinically relevant L. monocytogenes in the aquatic ecosystem.

Among the isolates, the majority belonged to serotypes and clonal complexes corresponding to those from human listeriosis outbreaks and sporadic cases of human and animal infection. In our study, the majority of the strains were either serotype 1/2a (48%), or 4b (44%). These results are similar to those reported for surface waters analyzed in Canada, where serotypes 1/2a/3a and 4b/4d/4e constituted 49% and 32% of the isolates, respectively15. By contrast, other investigations found serotype 1/2a among 67% of water-derived L. monocytogenes in a further Canadian study, while L. monocytogenes recovered from water samples in California belonged predominantly to serotype 4b/4d/4e16. In spite of the limited number of isolates in the present study and the differences in methodologies and study settings used by previous investigators, these results of these earlier studies are supportive of our observation that L. monocytogenes populations in the aquatic environment contain serotypes that may cause listeriosis. Further, virulence gene profiling revealed that all the strains harbored intact genes associated with invasion and infection, underlining the virulence potential and clinical relevance of L. monocytogenes from the aquatic environment. These findings, although based on the analysis of a small number of isolates, are different from data on VFs found in many but not all isolates of serotypes 1/2a, 1/2b and 1/2c from food and food processing environments for example in Ireland and in the US, where inlA is truncated in 31% and in 45% of the isolates, respectively10,43. Our data are also supportive of a previous report regarding the integrity of the virulence gene inlA in L. monocytogenes recovered from natural waters17. Notably, there was a lack of benzalkonium and arsenic resistance genes and a very low prevalence of cadmium resistance genes among the strains in this study. These determinants are frequently associated with L. monocytogenes isolated from food and from humans, and their scarcity among the isolates in this study may reflect specific adaptations in the natural environment, consistent with a recent study that observed a very low prevalence of cadmium resistance genes among L. monocytogenes from wildlife44.

A considerable proportion (16%) of the clones belonged to CC4 which contains LIPI-4 and is considered hypervirulent based on its neurovirulence and capacity for placental infection6. CC4 is highly associated with human isolates and, in contrast to other serotype 4b strains, to our knowledge has not been described in surface water so far45.

Likewise, CC1 and CC6 are major contributors to human listeriosis, and recent years have seen an increase of severe listeriosis cases related to L. monocytogenes CC6, a clone which was first implicated in a multistate outbreak in the US in 1998–199946, and has since disseminated globally, causing one of the world’s largest listeriosis outbreaks in South Africa in 2017–2018, a large outbreak in Germany during 2018–2019, a local outbreak in Switzerland in 201647,48,49 , and very recently, a nationwide outbreak in Switzerland in 2018–2020 that caused 34 cases and 10 deaths50.

The occurrence in surface water of L. monocytogenes belonging to CCs associated with disease highlights the potential of rivers, streams and inland canals as a reservoir for pathogenic L. monocytogenes. In particular, the use of river water for crop irrigation during dry seasons may allow L. monocytogenes in the water to enter the food chain. Irrigation has repeatedly been associated with an increased risk of pre-harvest produce contamination by L. monocytogenes, particularly if the irrigation water is drawn from surface water19,20,51,52. Further, irrigation within three days of harvest is associated with L. monocytogenes in produce production environments42. Therefore, river sourced irrigation water may indicate a public health risk, should contaminated product be consumed raw.

Similarly, the surface water samples in this study contained particular CCs that have reportedly caused bovine listeriosis, with L. monocytogenes CC1, CC4, and CC412 among the most common causes of ruminant encephalitis in central Europe28. These CCs are also frequently detected in the cattle farm environment including feed, drinking troughs, ruminant feces and manure25,28. However, the introduction routes of these CCs to the farm environment are currently not well understood, although spoiled silage is considered to be the primary source3,14. The occurrence of L. monocytogenes in rivers reported in this study points towards surface water a possible further source of clones causing disease in cattle. Notably, the use of river water for watering cattle may represent a possible exposure to bovine pathogenic L. monocytogenes.

Conclusions

This study demonstrates that L. monocytogenes circulating in the aquatic environment belong to CCs and contain the same virulence traits as L. monocytogenes that are frequently isolated from human and animal clinical cases and from globally occurring outbreaks, including hypervirulent clones CC1, CC4, and CC6.

Our data contribute to a better understanding of the diversity of L. monocytogenes present in flowing surface waters. The results may provide information to improve crop irrigation strategies and cattle watering practices that aim to reduce the transmission of foodborne pathogens from surface water to fresh produce and to the farm environment.

Material and methods

Sampling

A total of 191 water samples from different rivers and streams and inland canals throughout Switzerland were collected between January and May 2020. Sampling sites were located between 200 m and 1,730 m above sea level (see Supplementary Table S1). Water was taken from large rivers at 1 m depth using collection poles and sterile 500 mL containers. Smaller water bodies were sampled at 0.2–0.3 m depth using sterile 500 mL containers. The water samples were transported to the laboratory in a cool box. Each sample was stored at 4 °C for a maximum of 18 h. At each sampling site, weather conditions, ambient temperature and proximity to wastewater treatment plants were recorded.

Bacterial isolation

From each water sample 100 ml were passed through sterile, 0.22 μm membrane filters (Millipore, Billerica, MA, USA). For enrichment, the filters were incubated in 50 ml Half Frazer Broth (HFB, BioRad, Cressier, Switzerland) in sterile blender bags (Seward, Worthing, UK) at 30 °C for 48 h. One loopful each of the enriched cultures was streaked on Oxoid chromogenic Listeria agar (OCLA) plates (Oxoid, Pratteln, Switzerland) and incubated under aerobic conditions at 37 °C for 48 h. Presumptive Listeria colonies exhibiting green morphologies and opaque halos were subcultured on OCLA plates at 37 °C for 48 h. L. monocytogenes isolates were grown on sheep blood agar (Difco Laboratories) at 37 °C for 24 h, and kept at − 80 °C in brain heart infusion (BHI) broth stocks (Oxoid, Hampshire, UK) containing 15% glycerol.

Reference strains

To confirm lineages and serotypes, an average nucleotide identity (ANI) comparison was performed to create a phylogenetic tree that was calibrated with five L. monocytogenes genomes of known lineages and serotypes as reference strains (see Supplementary Fig. S1). The L. monocytogenes serotype 4b and 1/2b isolates from this study were compared to previously described outbreak strains LL195 (GenBank accession no. HF558398) and N2306 (CP011004). L. monocytogenes LL195 (CC1) was isolated from Swiss Vacherin Mont d’Or cheese during an outbreak 1983–198753, and L. monocytogenes N2306 was detected in ready-to-eat salad during an outbreak 2013–201454.

The L. monocytogenes serotype 1/2a isolates were compared to the reference strain L. monocytogenes EGD-e (NC003210), and to strains N1546 (CP013724) and N12-1273 (QYFZ00000000). L. monocytogenes EGD-e (CC9) was originally isolated from a rodent and is a widely used reference strain55. L. monocytogenes N1546 (CC8) is a clinical isolate recovered during the 2011 outbreak linked to ham products56,57. N12-1273 (CC412) is a previously characterized human listeriosis isolate49.

Whole genome sequencing

Genomic DNA was isolated using the DNeasy Blood and Tissue Kit (Qiagen, Hilden, Germany). Sequencing libraries were prepared using the Illumina Nextera DNA preparation kit and sequencing was performed on an Illumina MiSeq sequencer (Illumina, San Diego, CA, USA). Genomes were sequenced with a minimal coverage of 50x. Following a quality assessment with FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/), the reads were assembled with Shovill 1.0.9 and Spades 3.12.058 and integrated into the Ridom SeqSphere + software version 5.1.0 (Ridom, Münster, Germany)59.

Serotyping and multilocus sequence typing

In silico serotyping was performed in SeqSphere + using gene targets described previously4. Classic MLST determination based on seven housekeeping genes were performed in accordance with the L. monocytogenes BIGSdb-L. monocytogenes platform (https://bigsdb.pasteur.fr/listeria).

Core genome MLST and Single-Nucleotide Polymorphism Analysis

Core genome MLST (cgMLST) analysis was performed in accordance to the core genome defined by Ruppitsch et al.59. Sequences were blasted against 1701 genes of the reference genome of strain EGD-e, using the standard settings59. Cluster types (CTs) were determined upon submission to the L. monocytogenes cgMLST Ridom SeqSphere + server (http://www.cgmlst.org/ncs/schema/690488/). Missing genes were ignored in all samples. Minimum spanning trees (MSTs) were generated in Ridom SeqSphere + version 5.1.0 for visualization of strain relatedness. Clusters were defined as isolates containing ≤ 10 different alleles between a pair of isolates59.

Single-nucleotide polymorphism (SNP) analysis was performed using tparsnp in the harvest suite with option -c ignore MUMi activated60. An ANI was calculated based on MUMmer alignments as described previously61. Each isolate was compared to a reference strain of the same serotype, as listed above.

Pan- and Core Genome Profiling

The Bacterial Isolate Genome Sequence database (BIGSdb)62 was used to generate the pan-genome of the 25 surface water isolates together with the five reference strains. The presence or absence of genes, including virulence genes, pathogenicity islands, antimicrobial resistance, heavy metal resistance, and biocide resistance genes across each genome was verified by manual curation using Basic Local Alignment Search Tool (BLAST) version 2.10.1 on a CLC genomics Workbench version 20.0.3.

Geographical map

Geospatial visualization was carried out by plotting GPS coordinates of the sampling sites onto a geographical map using the open source geographic information system (GIS) software QGIS (https://qgis.org).