Abstract
Motivation In molecular epidemiology, the identification of clusters of transmissions typically requires the alignment of viral genomic sequence data. However, existing methods of multiple sequence alignment scale poorly with respect to the number of sequences.
Results ViralMSA is a user-friendly reference-guided multiple sequence alignment tool that leverages the algorithmic techniques of read mappers to enable the multiple sequence alignment of ultra-large viral genome datasets. It scales linearly with the number of sequences, and it is able to align tens of thousands of full viral genomes in seconds.
Availability ViralMSA is freely available at https://github.com/niemasd/ViralMSA as an open-source software project.
Contact a1moshir{at}ucsd.edu
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
The first page got messed up again after uploading. This should hopefully fix it