Skip to main content

AsmVar: tools and exemplar data.

Dataset type: Software
Data released on December 03, 2015

Liu S; Huang S; Rao J; Ye W; The Genome Denmark Consortium ; Krogh A; Wang J; Schierup MH; Villesen P; Xu X; Li N; Kristiansen K; Soerensen TIA; Hansen T; Pedersen O; Brunak S; Gupta R; Rasmussen S; Lund O; Bolund L; Borglum AD; Eiberg H; Flindt EN; Xu R; Sun J; Liu H; Besenbacher S; Grove J; Als TD; Lescai F; Mailund T; Friborg RM; Pedersen CNS; Chang Y; Li S; Guo X; Cao H; Ye C; Maretty L; Sibbesen JA; Albrechtsen A; Bork-Jensen J; Have CT; Izarzugaza JMG; Belling K; Yadav R (2015): AsmVar: tools and exemplar data. GigaScience Database. https://doi.org/10.5524/100173

DOI10.5524/100173

Comprehensive characterization of genomic variation in a human individual is important for understanding disease and for development of personalized approaches to treatment. Many tools exist for identification of single nucleotide polymorphism (snps), small indels and large deletions based on DNA re-sequencing strategy. However, those approaches consistently display significant bias for recovery of complex structural variants and novel sequence in the individual genomes and lack sequence interpretation such as ancestral state and mechanism. Here we present a novel approach implemented in a single software package, AsmVar, to discover, genotype and characterize different forms of structural variants and novel sequence in population-scale de novo assemblies at single nucleotide resolution. Our approach displays good scalability and makes it applicable for investigations in large population studies of species with complex genomes, such as homo sapiens. Application of AsmVar to several human de novo assemblies captures a wide spectrum of structural variants and novel sequences present in the human population with high sensitivity and specificity. Our method provides a direct solution to investigate the structural variations and novel sequences from de novo assemblies, which is important for construction of population-scale pan genome. Our study also suggests the advantages of the de novo assembly strategy for definition of genome structure.
This software has been released under the MIT License Copyright 2014-2015.

Additional details

Read the peer-reviewed publication(s):

  • Liu, S., Huang, S., Rao, J., Ye, W., Krogh, A., & Wang, J. (2015). Discovery, genotyping and characterization of structural variation and novel sequence at single nucleotide resolution from de novo genome assemblies on a population scale. GigaScience, 4(1). https://doi.org/10.1186/s13742-015-0103-4

Additional information:

https://github.com/bioinformatics-centre/AsmVar

Click on a table column to sort the results.

Table Settings
Sample ID Common Name Scientific Name Sample Attributes Taxonomic ID Genbank Name
NA12878 Human Homo sapiens Biomaterial provider:Coriell
Cell line:GM12878
Sex:female [PATO:0000383]
...
9606 human

Click on a table column to sort the results.

Table Settings

File Name Description Sample ID Data Type File Format Size Release Date File Attributes Download
Readme TEXT 2.65 kB 2015-10-27 MD5 checksum: 6a4b87711bab8836174fea3408cad7e1
GitHub repository archived copy taken at time of publication, please see current project page for most recent updates https://github.com/ShujiaHuang/AsmVar Software zip 26.48 MB 2015-10-27 MD5 checksum: 9da5ad2831666852b286f8bc721d8e87
compressed archive of example data and example results files for the public sample NA12878 Other zip 84.25 MB 2015-10-27 MD5 checksum: 3068d5dd76c483ecdf27aff65b3155c3
Funding body Awardee Award ID Comments
Danish National Advanced Technology Foundation
Danish National Research Foundation
Novo Nordisk UK Research Foundation
State Key Development Program for Basic Research of China-973 Program
Date Action
December 3, 2015 Dataset publish
January 12, 2016 Manuscript Link added : 10.1186/s13742-015-0103-4