Repository logo
 

DNAscent v2: detecting replication forks in nanopore sequencing data with deep learning.

Accepted version
Peer-reviewed

Type

Article

Change log

Authors

Abstract

BACKGROUND: Measuring DNA replication dynamics with high throughput and single-molecule resolution is critical for understanding both the basic biology behind how cells replicate their DNA and how DNA replication can be used as a therapeutic target for diseases like cancer. In recent years, the detection of base analogues in Oxford Nanopore Technologies (ONT) sequencing reads has become a promising new method to supersede existing single-molecule methods such as DNA fibre analysis: ONT sequencing yields long reads with high throughput, and sequenced molecules can be mapped to the genome using standard sequence alignment software. RESULTS: This paper introduces DNAscent v2, software that uses a residual neural network to achieve fast, accurate detection of the thymidine analogue BrdU with single-nucleotide resolution. DNAscent v2 also comes equipped with an autoencoder that interprets the pattern of BrdU incorporation on each ONT-sequenced molecule into replication fork direction to call the location of replication origins termination sites. DNAscent v2 surpasses previous versions of DNAscent in BrdU calling accuracy, origin calling accuracy, speed, and versatility across different experimental protocols. Unlike NanoMod, DNAscent v2 positively identifies BrdU without the need for sequencing unmodified DNA. Unlike RepNano, DNAscent v2 calls BrdU with single-nucleotide resolution and detects more origins than RepNano from the same sequencing data. DNAscent v2 is open-source and available at https://github.com/MBoemo/DNAscent . CONCLUSIONS: This paper shows that DNAscent v2 is the new state-of-the-art in the high-throughput, single-molecule detection of replication fork dynamics. These improvements in DNAscent v2 mark an important step towards measuring DNA replication dynamics in large genomes with single-molecule resolution. Looking forward, the increase in accuracy in single-nucleotide resolution BrdU calls will also allow DNAscent v2 to branch out into other areas of genome stability research, particularly the detection of DNA repair.

Description

Keywords

Budding yeast, DNA replication, DNAscent, Oxford nanopore, Replication forks, Replication origins, Residual neural networks, Deep Learning, High-Throughput Nucleotide Sequencing, Nanopore Sequencing, Nanopores, Sequence Analysis, DNA, Software

Journal Title

BMC Genomics

Conference Name

Journal ISSN

1471-2164
1471-2164

Volume Title

22

Publisher

Springer Science and Business Media LLC

Rights

All rights reserved
Sponsorship
Isaac Newton Trust (19.39(b))
Royal Society (RGS\R1\201251)
Engineering and Physical Sciences Research Council (EP/P020259/1)
Royal Society grant RGS\R1\201251 Isaac Newton Trust grant 19.39b Startup funds from the University of Cambridge Department of Pathology This work was performed using resources provided by the Cambridge Service for Data Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service (www.csd3.cam.ac.uk), provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/P020259/1), and DiRAC funding from the Science and Technology Facilities Council (www.dirac.ac.uk).
Relationships
Is derived from: