Long-read Sequencing and de novo Genome Assembly of Three Aspergillus fumigatus Genomes

Hemmings, Samuel J.; Rhodes, Johanna L.; Fisher, Matthew C.

doi:10.1007/s11046-023-00740-2

Mycopathologia GENOME
Open access
Published: 25 May 2023

Volume 188, pages 409–412, (2023)

Abstract

Aspergillus fumigatus is a genetically diverse fungal species that is near ubiquitous in its global distribution and is the major cause of the life-threatening disease invasive aspergillosis. We present three de novo genome assemblies of isolates selected to be representative of the genetic diversity of clinical and environmental A. fumigatus. Long-read Oxford Nanopore sequencing and subsequent assembly of the genomes yielded 10–23 contigs with an N50 of 4.05 Mbp to 4.93 Mbp.

Introduction

Aspergillus fumigatus is a globally ubiquitous environmental mould that was recently highlighted in the World Health Organization (WHO) fungal priority pathogens list as a species of critical concern [1]. A. fumigatus can cause invasive and chronic forms of the disease aspergillosis, which results in more than 300,000 deaths per year [2]. Unfortunately, resistance of A. fumigatus to triazole antifungals (the first-line therapy for aspergillosis) is emerging worldwide [3].

Previous phylogenomic analysis has shown that the population of A. fumigatus is genetically diverse and clusters into two clades (A and B) [4, 5]. This extensive genetic diversity provides ample opportunity for new drug resistance polymorphisms to arise. However, the current reference genomes, Af293 [6] and A1163 [7], do not span the known diversity. To assist in investigating why most environmental triazole resistance occurs in clade A, we have resequenced three isolates from our laboratory's in-house A. fumigatus collection that are representative of the main diversity of A. fumigatus [4] (Fig. 1). Deep nanopore sequencing was used to generate de novo assemblies of two clade A isolates (one of which contains the predominant resistance allele TR₃₄/L98H) and a single clade B isolate. Although 321 A. fumigatus isolates were available on NCBI at the time of original submission [8], the three de novo assemblies we present here are assembled into fewer contigs than > 99% of them. Moreover, these genomes were sequenced cheaply and in-house using long-read sequencing, and we provide a freely available, downloadable bioinformatic pipeline for research groups who also wish to produce de novo genome assemblies of fungal species with small genomes from long-read sequencing data (https://github.com/SJHemmings/afasont).

Method

The isolates selected for sequencing were C6 (a clinical wildtype isolate from clade A, U.K.), C87 (a clinical isolate from clade A carrying the TR₃₄/L98H resistance allele, U.K.) and E142 (an environmental wildtype isolate from clade B, U.S.).

A. fumigatus isolates were inoculated in vented 25 cm² tissue culture flasks containing Sabouraud dextrose agar (Oxoid, Hampshire, U.K.) and incubated for 48 h at 37 °C. Spores were harvested in PBS + 0.01% Tween-20 by filtration through glass wool (Thermo Fisher Scientific, Massachusetts, U.S.). Spores were centrifuged (5000 rpm for 10 min), resuspended in Yeast Cell Lysis Solution (Biosearch Technologies, Hoddesdon, U.K.) and vortexed at maximum speed for 10 min with 1.0 mm zirconia/silica beads (Thistle Scientific, Glasgow, U.K.). The suspension was then centrifuged (14,000 rpm for 2 min), and the supernatant was removed and treated with RNase Cocktail™ Enzyme Mix (Thermo Fisher Scientific) according to the manufacturer's instructions. DNA was then isolated on spin columns using AW1 and AW2 wash buffers (Qiagen, Venlo, Netherlands) and eluted in nuclease-free water. To achieve the absorbance ratios required for Oxford Nanopore sequencing and to concentrate the DNA, additional washing steps were carried out using 0.6X AMPure reagent (Beckman Coulter, California, U.S.) and 70% ethanol. An SRE XS kit (Pacific Biosciences, California, U.S.) was used to deplete any remaining DNA fragments below 10 kbp. For quality control, DNA was visualised using Genomic DNA ScreenTape on a TapeStation (Agilent Technologies, California, U.S.) to ensure the average DNA length was above 20 kbp. For each isolate, 1 μg of DNA was prepared for sequencing using an SQK-LSK110 ligation sequencing kit (Oxford Nanopore Technologies, Oxford, U.K.) and NEBNext Companion Module (New England Biolabs, Massachusetts, U.S.) following the manufacturers' instructions. Isolates were sequenced on a MinION using an R10.4 flow cell (Oxford Nanopore Technologies) for a total of 18 h.

Live base calling was performed using Guppy v6.3.9. Porechop v0.2.4 [9] and NanoLyse v1.2.1 [10] were used to remove adapter sequences and DNA CS control reads (Oxford Nanopore Technologies) from the raw fastq files. Reads were filtered using NanoFilt v2.8.0 [10] to remove reads with a quality score below Q10 or a length of less than 1 kbp. Reads passing quality control were assembled de novo using Canu v2.2 [11] with a specified genome size of 29 Mbp. The assembled genomes were polished with Pilon v1.24 [12] using Illumina paired-end reads (150 bp) sequenced on a NovaSeq 6000 SP v1.5 (185X coverage for C6, 29X coverage for C87 and 51X coverage for E142) at the Earlham Institute (UK). Illumina paired-end reads can be accessed from the European Nucleotide Archive at EMBL-EBI under accession code PRJEB27135. The numbers of tRNA and protein-coding genes within the assemblies were then estimated using tRNAscan-SE v2.0.9 [13] and AUGUSTUS v3.8.0 [14], respectively. Genome completeness was then predicted using BUSCO coupled with the ascomycota_odb10 lineage dataset [15]. Finally, BLAST+ v2.12.0 [16] was used to screen the individual contigs for contamination. The full pipeline ('afasont') used to generate these assemblies is available from https://github.com/SJHemmings/afasont.
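The read-filtering criterion described above (discard reads below Q10 or shorter than 1 kbp) can be illustrated with a minimal Python sketch. This is not the NanoFilt implementation or part of the afasont pipeline; the function names are hypothetical, Phred+33 quality encoding is assumed, and the mean read quality is assumed to be derived by averaging per-base error probabilities rather than raw Phred scores.

```python
import io
import math


def read_fastq(handle):
    """Yield (header, sequence, quality) tuples from a 4-line-per-record FASTQ stream."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            return
        seq = handle.readline().rstrip("\n")
        handle.readline()  # '+' separator line, ignored
        qual = handle.readline().rstrip("\n")
        yield header, seq, qual


def mean_quality(qual):
    """Mean read quality: average the per-base error probabilities, then convert back to Phred."""
    if not qual:
        return 0.0
    probs = [10 ** (-(ord(ch) - 33) / 10) for ch in qual]  # assumes Phred+33
    return -10 * math.log10(sum(probs) / len(probs))


def passes_filter(seq, qual, min_q=10.0, min_len=1000):
    """True if the read meets both thresholds (Q10 and 1 kbp by default)."""
    return len(seq) >= min_len and mean_quality(qual) >= min_q


def filter_reads(handle, min_q=10.0, min_len=1000):
    """Yield only the FASTQ records that pass the length and quality thresholds."""
    for header, seq, qual in read_fastq(handle):
        if passes_filter(seq, qual, min_q, min_len):
            yield header, seq, qual


# Synthetic demonstration data (not real reads): '5' encodes Q20, '%' encodes Q4.
demo = io.StringIO(
    "@keep\n" + "A" * 1200 + "\n+\n" + "5" * 1200 + "\n"
    "@too_short\n" + "A" * 500 + "\n+\n" + "5" * 500 + "\n"
    "@low_quality\n" + "A" * 1200 + "\n+\n" + "%" * 1200 + "\n"
)
kept = [header for header, _, _ in filter_reads(demo)]
```

Averaging error probabilities penalises reads with a few very poor bases more strongly than a simple mean of Phred scores would, which is why the two approaches can disagree near the Q10 threshold.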

Genome Details

After quality control, reads from the MinION gave coverage of 82X for C6, 73X for C87 and 149X for E142. Isolates C6, C87 and E142 were then assembled into genomes of 29,266,253 bp, 28,591,451 bp and 28,644,426 bp in length, respectively.
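As a rough guide to how coverage relates to sequencing yield, mean coverage can be taken as the total number of sequenced bases divided by the genome length. The sketch below back-calculates the approximate yield implied by the figures above. It assumes coverage was computed against the final assembly length, which the text does not state, so the resulting yields are illustrative estimates rather than reported values; the function and variable names are hypothetical.

```python
def mean_coverage(total_read_bases, genome_length):
    """Mean depth of coverage: sequenced bases divided by genome length."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return total_read_bases / genome_length


def bases_needed(target_coverage, genome_length):
    """Approximate number of sequenced bases required to reach a target coverage."""
    return round(target_coverage * genome_length)


# Assembly lengths (bp) and nanopore coverage as reported in the text.
ASSEMBLY_LENGTHS = {"C6": 29_266_253, "C87": 28_591_451, "E142": 28_644_426}
REPORTED_COVERAGE = {"C6": 82, "C87": 73, "E142": 149}

# Implied yield per isolate, ASSUMING coverage is relative to the assembly length.
implied_yield = {
    isolate: bases_needed(REPORTED_COVERAGE[isolate], ASSEMBLY_LENGTHS[isolate])
    for isolate in ASSEMBLY_LENGTHS
}
```

Under this assumption, each isolate corresponds to roughly 2 to 4 Gbp of reads passing quality control.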

C6 was assembled into 10 contigs with an N50 of 3.99 Mbp and a longest contig of 4.93 Mbp. AUGUSTUS [14] predicted 8,802 protein-coding genes and tRNAscan-SE [13] detected 211 genes that encode transfer RNA (tRNA). Using the ascomycota_odb10 lineage dataset, BUSCO estimated genome completeness to be 98.5% [15].

C87 was assembled into 23 contigs with an N50 of 2.55 Mbp and a longest contig of 4.05 Mbp. AUGUSTUS [14] predicted 8,830 protein-coding genes and tRNAscan-SE [13] detected 200 genes that encode tRNA. BUSCO estimated genome completeness to be 97.6% [15].

E142 was assembled into 15 contigs with an N50 of 2.67 Mbp and a longest contig of 4.20 Mbp. AUGUSTUS [14] predicted 8,806 protein-coding genes and tRNAscan-SE [13] detected 208 genes that encode tRNA. BUSCO estimated genome completeness to be 98.1% [15].
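For readers unfamiliar with the N50 statistic quoted for each assembly, it is the length of the contig at which the cumulative length of contigs, sorted from longest to shortest, first reaches at least half of the total assembly length. A minimal Python sketch follows; the function name and the example lengths are illustrative and are not taken from these assemblies.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly given a list of contig lengths.

    Contigs are sorted from longest to shortest; the N50 is the length of the
    contig at which the running total first reaches half of the assembly length.
    """
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running * 2 >= total:  # integer comparison avoids floating-point halves
            return length


# Illustrative contig lengths only (arbitrary units).
example_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
example_n50 = n50(example_lengths)
```

By construction the N50 can never exceed the longest contig, and fewer, longer contigs push it upwards.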

Raw read files sequenced with Oxford Nanopore Technologies and de novo genome assemblies can be accessed from the European Nucleotide Archive at EMBL-EBI under the accession code PRJEB59410. The sequence accessions for the individual assemblies are: CASBLW01 (GCA_949125545) (C6); CASBLU01 (GCA_949125165) (E142); CASBLV01 (GCA_949125185) (C87).

References

1. World Health Organization. WHO fungal priority pathogens list to guide research, development and public health action. 2022. https://www.who.int/publications/i/item/9789240060241. Accessed 17 Jan 2023.
2. Bongomin F, Gago S, Oladele RO, Denning DW. Global and multi-national prevalence of fungal diseases—estimate precision. J Fungi. 2017;3(4):57.
3. Fisher MC, Hawkins NJ, Sanglard D, Gurr SJ. Worldwide emergence of resistance to antifungal drugs challenges human health and food security. Science. 2018;360(6390):739–42.
4. Rhodes J, Abdolrasouli A, Dunne K, Sewell TR, Zhang Y, Ballard E, Brackin AP, van Rhijn N, Chown H, Tsitsopoulou A, Posso RB. Population genomics confirms acquisition of drug-resistant Aspergillus fumigatus infection by humans from the environment. Nat Microbiol. 2022;7(5):663–74.
5. Sewell TR, Zhu J, Rhodes J, Hagen F, Meis JF, Fisher MC, Jombart T. Nonrandom distribution of azole resistance across the global population of Aspergillus fumigatus. MBio. 2019;10(3):e00392-19.
6. Nierman WC, Pain A, Anderson MJ, Wortman JR, Kim HS, Arroyo J, Berriman M, Abe K, Archer DB, Bermejo C, Bennett J. Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature. 2005;438(7071):1151–6.
7. Fedorova ND, Khaldi N, Joardar VS, Maiti R, Amedeo P, Anderson MJ, Crabtree J, Silva JC, Badger JH, Albarraq A, Angiuoli S. Genomic islands in the pathogenic filamentous fungus Aspergillus fumigatus. PLoS Genet. 2008;4(4):e1000046.
8. National Center for Biotechnology Information. Aspergillus fumigatus genome list - genome - NCBI. https://www.ncbi.nlm.nih.gov/genome/browse/#!/eukaryotes/18/. Accessed 2 Feb 2023.
9. Wick RR, Judd LM, Gorrie CL, Holt KE. Completing bacterial genome assemblies with multiplex MinION sequencing. Microb Genomics. 2017;3(10):e000132.
10. De Coster W, D’hert S, Schultz DT, Cruts M, Van Broeckhoven C. NanoPack: visualizing and processing long-read sequencing data. Bioinformatics. 2018;34(15):2666–9.
11. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27(5):722–36.
12. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9(11):e112963.
13. Chan PP, Lin BY, Mak AJ, Lowe TM. tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res. 2021;49(16):9077–96.
14. Stanke M, Diekhans M, Baertsch R, Haussler D. Using native and syntenically mapped cDNA alignments to improve de novo gene finding. Bioinformatics. 2008;24(5):637–44.
15. Seppey M, Manni M, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness. Methods Mol Biol. 2019;1962:227–45. https://doi.org/10.1007/978-1-4939-9173-0_14.
16. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinform. 2009;10:1–9.

Acknowledgements

We thank all reviewers for taking the time to assess our manuscript. We also acknowledge funding from the Wellcome Trust and the Natural Environment Research Council (NERC). We also thank Dr S. R. Lockhart (CDC Mycotic Diseases Branch) for provision of the isolate E142.

Funding

Supported by the Wellcome Trust Collaborative Awards in Science grant: “Understanding and mitigating the impact of emerging antifungal resistance” & NERC: “Understanding the eco-evolutionary drivers of emerging antifungal resistance”. MCF is a CIFAR Fellow in the ‘Fungal Kingdom’ programme.

Author information

Authors and Affiliations

Department of Infectious Disease Epidemiology, Imperial College London, London, UK
Samuel J. Hemmings & Matthew C. Fisher
Department of Medical Microbiology, Radboud University Medical Centre, Nijmegen, Netherlands
Johanna L. Rhodes

Contributions

All authors contributed to the study conception, design and writing of the manuscript. Material preparation and data collection were performed by SJH. Analysis was performed by SJH and JLR.

Corresponding author

Correspondence to Samuel J. Hemmings.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Hemmings, S.J., Rhodes, J.L. & Fisher, M.C. Long-read Sequencing and de novo Genome Assembly of Three Aspergillus fumigatus Genomes. Mycopathologia 188, 409–412 (2023). https://doi.org/10.1007/s11046-023-00740-2

Received: 24 March 2023
Accepted: 21 April 2023
Published: 25 May 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s11046-023-00740-2
