The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus.

Llanes, Alejandro; Restrepo, Carlos Mario; Vecchio, Gina Del; Anguizola, Franklin José; Lleonart, Ricardo

doi:10.1038/srep08550

Download PDF

Article
Open access
Published: 24 February 2015

The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus.

Alejandro Llanes^1,2,3,
Carlos Mario Restrepo^1,3,
Gina Del Vecchio²,
Franklin José Anguizola² &
…
Ricardo Lleonart¹

Scientific Reports volume 5, Article number: 8550 (2015) Cite this article

7527 Accesses
52 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Kinetoplastid parasites of the Leishmania genus cause several forms of leishmaniasis. Leishmania species pathogenic to human are separated into two subgenera, Leishmania (Leishmania) and L. (Viannia). Species from the Viannia subgenus cause predominantly cutaneous leishmaniasis in Central and South America, occasionally leading to more severe clinical presentations. Although the genomes of several species of Leishmania have been sequenced to date, only one belongs to this rather different subgenus. Here we explore the unique features of the Viannia subgenus by sequencing and analyzing the genome of L. (Viannia) panamensis. Against a background of conservation in gene content and synteny, we found key differences at the genomic level that may explain the occurrence of molecular processes involving nucleic acid manipulation and differential modification of surface glycoconjugates. These differences may in part explain some phenotypic characteristics of the Viannia parasites, including their increased adaptive capacity and enhanced metastatic ability.

Colonization and genetic diversification processes of Leishmania infantum in the Americas

Article Open access 29 January 2021

Philipp Schwabl, Mariana C. Boité, … Elisa Cupolillo

Chromosome-scale genome sequencing, assembly and annotation of six genomes from subfamily Leishmaniinae

Article Open access 06 September 2021

Hatim Almutairi, Michael D. Urbaniak, … Derek Gatherer

Major changes in chromosomal somy, gene expression and gene dosage driven by SbIII in Leishmania braziliensis and Leishmania panamensis

Article Open access 01 July 2019

Luz H. Patino, Hideo Imamura, … Juan David Ramírez

Introduction

Leishmaniasis is a broad term describing several clinical presentations, ranging from mild cutaneous lesions to the life-threatening visceral form. The causative agents are kinetoplastid parasites of the genus Leishmania, transmitted to human by phlebotomine sandflies. During their life cycle the parasites exist in two life stages: a flagellated promastigote within the digestive tract of the sandfly vector and a non-motile amastigote infecting macrophages of the vertebrate host. The Leishmania (Viannia) subgenus encompasses several species distributed across the Neotropics, including L. braziliensis, L. guyanensis, L. panamensis and L. peruviana. This subgenus was defined by Lainson and Shaw¹ based on differences in the site of propagation of promastigotes inside the digestive tract of the insect vector. These species primarily cause cutaneous leishmaniasis, but the parasites may occasionally migrate to nasopharyngeal tissues leading to highly disfiguring lesions in a presentation known as mucocutaneous leishmaniasis. This clinical form is exclusively associated with species of the L. (Viannia) subgenus. However, it is thought to be caused by a variety of factors that may contribute to enhance the metastatic ability of the parasites², including their infection by a specific retrovirus called Leishmania RNA virus (LRV)³.

The genomes of Old World Leishmania species such as L. major have 36 chromosomes, whereas those of L. (Viannia) species have only 35, due to the fusion of chromosomes corresponding to 20 and 34 in L. major⁴. All Leishmania genomes sequenced to date have ~8,000 protein-coding genes which lack introns and are organized in relatively large polycistronic units called directional gene clusters (DGCs)⁵. Mature mRNA coding for each gene is formed by 5'-trans-splicing coupled to 3'-polyadenylation of the gene immediately upstream in the polycistronic primary transcript⁶. Gene expression is not primarily regulated at the level of transcription, but rather post-transcriptionally at the levels of mRNA stability and translation⁷.

L. braziliensis is the only species of the Viannia subgenus whose genome has been completed⁸. This study highlighted several differences at the genomic level as compared to other Leishmania species, such as the presence of potentially active mobile elements and several genes possibly involved in an RNA-mediated interference (RNAi) machinery. RNAi activity was later experimentally confirmed in Viannia parasites and further proposed to serve as a protective mechanism against the effect of transposable elements and viruses such as LRV⁹. Here we present a high quality draft of the L. (Viannia) panamensis genome. This is the second genome sequenced for species of the Viannia subgenus, therefore allowing us to conduct a more detailed study of subgenus-specific features at the genomic level. L. panamensis is the main causative agent of tegumentary leishmaniasis in Panama¹⁰ and Colombia¹¹ and is responsible for a relatively large number of cases in other neighboring countries. In Panama, there are about 3,000 new cases per year, 5% of which progress to the mucocutaneous presentation.

Results

Genome sequencing and assembly

The PSC-1 strain (MHOM/PA/94/PSC-1) of L. panamensis was selected to sequence its genome as it has been used as a reference strain for previous epidemiological studies in Panama¹². We obtained two 454 read libraries with a total of 4 million reads and an estimated 30X coverage of the genome, one with single (shotgun) reads and the other with mate-paired reads and a median insert size of 8 kb. Additionally, a library of paired-end 100-bp Illumina reads was obtained, in this case not intended for de novo assembly, but for assembly validation, error correction and chromosome or gene copy number analyses. The 454 and Illumina read libraries were deposited in the Sequence Read Archive (SRA) under the accession codes SRX681913, SRX681914 and SRX681983.

De novo assembly of the 454 reads by using Newbler¹³ yielded 108 scaffolds with an N50 size of 674 kb, a total size of 30.9 Mb and 2.29% of N (~600 gaps) (Supplementary Table 1). The assembly was validated by using REAPR¹⁴, a program that uses the information contained in mate-paired reads to detect scaffolding errors and to break the original scaffolds at those points. The program flagged 81 errors, producing a fragmented assembly with a re-calculated N50 size of 555 kb. The REAPR-validated scaffolds were pooled together with the unassigned contigs and submitted to iCORN¹⁵ for correction of 454 pyrosequencing errors, resulting in several thousand corrections during 10 iterations (Supplementary Fig. 1).

A total of 140 fragments could be further oriented and contiguated into 35 pseudochromosomes by using ABACAS¹⁶ and the genome of L. braziliensis as a reference. Several relatively short fragments could not be assembled into pseudochromosomes due to conflicting or low sequence similarity. Twelve percent of these fragments matched kinetoplast DNA (kDNA) sequences and were not considered further. The remaining fragments have a concatenated size of 350 kb and are generally too short to span complete genes, but some of them are indeed fragments of genes organized in tandem arrays (further discussed ahead). It is important to stress that the scaffolds and not the individual contigs were submitted to ABACAS, so that the original assembly information surviving REAPR would be preserved. This served to detect several rearrangements between our assembled pseudochromosomes and the corresponding L. braziliensis chromosomes (Supplementary Figure 2). Comparisons regarding synteny and the chromosomal position of genetic elements throughout this article refer to regions assembled de novo for L. panamensis. The assembled pseudochromosomes were deposited in GenBank under the accession codes CP009370 to CP009404.

Annotation of protein-coding and non-coding RNA genes

RATT¹⁷ was used to transfer the gene models annotated in the L. major and L. braziliensis genomes to the L. panamensis pseudochromosomes. Approximately 95% of the L. braziliensis annotated gene models could be transferred either completely or partially, compared to only 65% of those from the more distant L. major genome. However, 6% of the genes from L. braziliensis were transferred to regions apparently non-syntenic, although gene order is rather conserved within them. With a few exceptions, the synteny in our assembly matches that of the L. major genome. These regions account for many of the rearrangements we previously found during contiguation with ABACAS, together comprising nearly 490 annotated gene models (Supplementary Data 1). The relatively high sequence similarity between corresponding genes in these regions, as well as the conserved synteny between our assembly and the L. major genome, suggest that some of these segments may have been incorrectly placed in the L. braziliensis assembly.

Gene models transferred by RATT and those predicted de novo on the basis of codon usage bias were combined and manually curated, resulting in 7,748 predicted protein-coding genes and 185 suspected pseudogenes, distributed in 132 DGCs (Table 1 and Supplementary Fig. 3a). We were also able to annotate most of the non-coding RNA genes previously described in L. major, including the vast majority of tRNA genes¹⁸ and those encoding the 28S rRNA (LSU-α, β, γ, δ, ε and ζ)¹⁹. Likewise, most of the clusters of small nucleolar RNA genes (snoRNA) detected in L. major²⁰ are conserved in L. panamensis, with a few exceptions that appear to have degenerated to a point in which the typical signatures of these molecules are no longer recognizable. Interestingly, we found two genes encoding H/ACA-like snoRNAs in chromosome 35 that are similar to two T. brucei genes (TB10Cs5H2 and TB10Cs5H3). This cluster might be a Viannia-specific trait, as it is absent from L. major but present in L. braziliensis —although currently not annotated.

Table 1 Summary of general statistics for the five Leishmania genomes considered in this study

Full size table

Functional and comparative analysis of protein-coding genes

Functional analysis of the predicted gene models allowed us to ascribe a putative function to nearly 300 genes transferred from L. braziliensis and L. major without functional annotation. Running OrthoMCL on the annotated gene models resulted in 7,157 ortholog groups with orthologs in all Leishmania species included in OrthoMCL-DB (version 5), with only ~430 groups differing in at least one species (Fig. 1 and Supplementary Data 2). As expected, the number of groups shared between pairs of species of the same subgenus is larger than the number of groups shared between pairs from different subgenera.

Considering all Leishmania species analyzed, genes found to be differentially present or absent in L. braziliensis are similar to those in L. panamensis. One of the most discussed examples are the genes suspected to be involved in the RNAi machinery, including those encoding a Dicer-like endonuclease (DCL1) (LbrM.23.0390/LpmP.23.0400) and an Argonaute-like protein (AGO1) (LbrM.11.0360/LpmP.11.0590). Conversely, an example of genes previously found to be absent in L. braziliensis are those of the HASP/SHERP locus⁸. This locus encodes a family of surface proteins differentially expressed throughout the L. major life stages, which have been shown to be critical for parasite differentiation in the sandfly vector²¹. It was later demonstrated that L. braziliensis has a stage-regulated HASP ortholog (oHASP), divergent in sequence but with similar biochemical properties to that of L. major²². This L. braziliensis gene (LbrM.23.1120) also has a relatively similar ortholog in L. panamensis (LpmP.23.1160). The corresponding loci are syntenic to those from L. (Leishmania) species, but unlike the L. major locus, we found no evidence supporting the presence of several copies in L. (Viannia) (see Gene copy number variation analysis).

Divergence in orthologous genes might also explain other differences than have been found when conducting comparative studies among species of L. (Leishmania) and L. (Viannia), especially with respect to surface proteins. For example, the PSA-2/GP46 gene encoding a major promastigote surface antigen could not be found in L. (Viannia) by using antibodies and hybridization probes specific for L. (Leishmania)²³. The authors attempted to explain the putative loss of this gene in L. (Viannia) by chromosomal deletion, but also suggested the possibility of rapid evolution leading to sequence divergence. The latter seems to be the correct explanation, since L. braziliensis and L. panamensis share two orthologs (LbrM.12.0760 and LpmP.12.0760) with low sequence similarity to the PSA-2 genes from L. (Leishmania) but located in approximately the same position in chromosome 12. In addition, the sequence similarity is higher in the region occupied by the leucine-rich domains (LRR) in the genes from L. (Leishmania), which have been demonstrated to be critical for the interaction of PSA-2 with the macrophages of the mammalian host²⁴. Evidence supporting rapid differential evolution was in fact reported later for this family²⁵, although LbrM.12.0760 was not considered in that study.

Another example of divergent orthologous genes related to surface components are those involved in the synthesis and modification of the lipophosphoglycan (LPG). LPG is a major surface glycoconjugate in Leishmania and is considered to be a critical factor for parasite survival both in the insect vector and the mammalian host²⁶. It is commonly modified by addition of carbohydrate side chains that vary significantly among life cycle stages and species. Genes encoding enzymes involved in LPG side chain modification therefore exhibit high variability among species, with a tandem array in chromosome 2 and several loci located in subtelomeric regions²⁷. The largest family, SCG, encompasses several genes encoding β1,3-galactosyltransferases involved in the addition of galactose residues to the LPG side chains. In L. major, these modifications are thought to promote attachment of the non-infective promastigotes to the sandfly midgut, whereas further capping of the side chains with arabinose residues causes the highly infective forms to detach²⁸. We were able to find several loci for SCG genes in L. panamensis, including a gene in the tandem array of chromosome 2 that is very similar to its ortholog in L. braziliensis, but very different to those from L. (Leishmania). However, we did not find the genes encoding the β1,2-arabinosyltransferases (SCA1 and SCA2) involved in arabinose capping of the LPG side chains. This is consistent with the finding that LPG side chains in L. braziliensis are not apparently modified with arabinose, but with glucose²⁹.

In agreement to previous studies⁸, pseudogene formation appears to be the main cause of gene loss in L. panamensis. Most genes annotated as pseudogenes appear to be deteriorated coding sequences, with intact orthologs in other Leishmania species. We noticed a relatively high number of pseudogenes in L. panamensis as compared to L. major or L. infantum. Although this finding may be attributed to errors introduced by next-generation sequencing, the L. braziliensis genome, which was completed by using the more accurate traditional Sanger sequencing, also appears to have a relatively large number of pseudogenes³⁰. Pseudogenization seems to be frequent in duplicated genes, in which case a duplicated copy is converted into a pseudogene by diversification and eventual deterioration. A relevant example is an adenine phosphoribosyltransferase gene in L. braziliensis (LbrM.26.0120), previously suggested to have arisen by tandem duplication⁸, whose ortholog in L. panamensis appears to have become a pseudogene (LpmP.26.0130). We also noticed a few examples of gene loss by apparent deletion, such as a gene encoding a guanine nucleotide-binding protein present in all the species included in this study (LbrM.14.0740 in L. braziliensis) but absent from L. panamensis with no detectable deteriorated sequence.

To better emphasize the differences between the two subgenera, a one-tailed Fisher's exact test for Gene Ontology (GO) term enrichment was performed for the L. panamensis genes uniquely shared with L. braziliensis —excluding those suspected to be separately clustered due to high sequence divergence— using the whole theoretical proteome of L. panamensis as the reference set (Fig. 2). Many of the enriched GO terms are associated with functions or processes involving nucleic acids, which in these species is due to the presence of an active RNAi pathway (endoribonuclease activity and double-stranded RNA-specific ribonuclease activity) and possibly active mobile elements (DNA integration, DNA recombination and RNA-dependent DNA replication).

Several terms related to transmembrane transport are related to a gene putatively encoding an equilibrative nucleoside transporter (LbrM.28.0580/LpmP.28.0570) with no direct ortholog in L. (Leishmania) species. This gene has only weak similarity to the four nucleoside/nucleobase transporters (NT1 to NT4) previously characterized in all Leishmania³¹ and also predicted to be present in L. panamensis. This plethora of genes coding for nucleoside/nucleobase transporters may be due to the fact that, like most parasitic protozoa, Leishmania is not able to synthesize purines de novo and therefore relies upon the salvage of purines from their hosts³². The occurrence of several processes involving manipulation of nucleic acids in L. (Viannia) could result in an increased demand of nucleosides, which may be in part fulfilled by acquisition via this additional transporter.

Another interesting finding is the enrichment of GO terms related to metabolic processes involving nitrogen compounds, in part associated with an additional gene coding for a glutathione peroxidase, as well as a gene putatively encoding a tyrosine/DOPA decarboxylase. Although Leishmania species have several enzymes with peroxidase activity, the presence of an additional glutathione peroxidase gene in L. (Viannia) (LpmP.26.0780/LbrM.26.0810) suggests an enhanced resistance to oxidative stress in these parasites, another factor that has been implicated in the development of metastatic clinical presentations². Conversely, the tyrosine/DOPA decarboxylase gene (LbrM.30.2460/LpmP.30.2430) putatively codes for a function apparently exclusive to L. (Viannia), since the orthologous loci in L. (Leishmania) species are likely to be pseudogenes. Theoretically, such an enzyme would mediate the conversion of tyrosine into tyramine. This would be an alternative way of processing tyrosine, because Leishmania can only convert tyrosine into 4-hydroxyphenylpiruvate by using two different aminotransferases³³. Additionally, this enzyme can theoretically mediate the conversion of dihydroxyphenylalanine (DOPA) into dopamine. Inferences on the biological relevance of such an enzyme in these parasites would require further experiments.

Repetitive sequences and mobile elements

De novo detection of repetitive sequences resulted in over 220 repeat families, with repeat units of length ranging from 50 to 1,200 bp —excluding protein domains and other repetitive regions within coding sequences. Bases in repetitive sequences represent ~4% of the total base content of the L. panamensis genome, with many families uniformly distributed across chromosomes (Supplementary Fig. 3b). We identified 70% of all repetitive sequences as short interspersed degenerated retroelements (SIDERs), 31% from the SIDER1 subfamily and 41% from the SIDER2 subfamily. The abundance of these extinct retroposons in Leishmania genomes has been attributed to their role in regulation of gene expression³⁴ and, more recently, to their ability to participate in recombinational events leading to genetic amplification³⁵. We could also confirm the presence of telomere-associated mobile elements (TATEs) in the L. panamensis genome. Unlike SIDERs, TATEs belong to a family of putatively active mobile elements described for the first time in L. braziliensis genome⁸. It is important to mention that we found repeat families similar or related to TATEs located in internal positions of chromosomes, both in L. panamensis and L. braziliensis, thus indicating that these elements may not be associated specifically with telomeres as suggested by their name.

In addition, we found several relatively large repeat families with predicted protein-coding genes, later clustered in OrthoMCL groups that exclude L. (Leishmania) species. An example is OG5_141602, which contains hypothetical proteins from different chromosomes of L. braziliensis, L. panamensis and Trypanosoma spp. At least two of the proteins from the L. (Viannia) species in this group (LbrM.05.0960 and LpmP.19.1310) were predicted to have reverse transcriptase (RNA-dependent DNA polymerase) domains. These proteins have vague similarity to those from retroposons of the ingi/L1Tc clade of T. brucei gambiense, T. congolense and T. cruzi³⁶. Although the repeats we found in L. (Viannia) species are shorter and lack other features previously described in such retroposons, we consider the presence of these hypothetical proteins to be a unique and intriguing trait of this subgenus. Furthermore, the region containing the gene LbrM.05.0960 (LpmP.05.0960 in L. panamensis) is flanked by two inverted copies of the MST gene, which encode a 3-mercaptopyruvate sulfurtransferase involved in the Leishmania defense against oxidative stress³⁷. The length of the region is conserved in L. (Leishmania) species, but only one copy of the MST gene is present in these species.

Variations in chromosome somy

Alignments of the Illumina reads to the assembled pseudochromosomes were used for ploidy or chromosome somy estimations. Despite local spikes associated with repetitive features, distribution of read depth is relatively uniform along the sequence of all chromosomes (Supplementary Figure 4). Median read depth does not appear to be globally affected by local variations in depth of coverage or GC content bias (Supplementary Figure 5). Several studies in our laboratory have shown that the PSC-1 strain is mostly diploid, thus we arbitrarily assigned the most frequent value of median read depth within all chromosomes to a disomic state. This value was used to compute a normalized median read depth for each chromosome, which was considered to be an estimation of its somy (Fig. 3 and Supplementary Data 3). Most chromosomes seem to be disomic, with the exception of chromosomes 4 and 23, which appear to be trisomic and chromosome 31, which seems to be tetrasomic. Chromosome 31 has been previously found to have an unusually larger somy in Leishmania; in fact, this is the only chromosome that has been found to be supernumerary in all Leishmania species in previous studies^30,38. As reported by Rogers et al.³⁰, we also noticed irregular values for median read depth in some of the smallest chromosomes, suggestive of chromosomes that are not fully disomic or trisomic. However, this situation is likely to be a consequence of mosaic aneuploidy³⁹, a phenomenon recently described for Leishmania, characterized by a variation in the somy of the same chromosomes among cells within the population.

We also noticed two relatively large regions with a uniform increase in read depth in chromosome 34, spanning approximately 45 and 100 kb, respectively (Fig. 4a). These regions are likely to be amplified, either duplicated within the chromosomes or as extrachromosomal elements. The first one encompasses ten genes of unknown function, with the exception of one (LpmP.34.3370) putatively encoding a protein of the SMC (structural maintenance of chromosome) family. However, the second one contains the LD1 region, a well-studied region prone to numerous types of amplifications in Leishmania⁴⁰. The “canonical” LD1 region was defined as a 27-kb segment from chromosome 35 of an L. infantum strain, which occurs as two inverted repeats in an amplified circular episomal element⁴¹. However, many different types of amplicons containing this region have been described⁴², all with an inverted repeat dimer organization. The amplification found in this work resembles a 245-kb linear minichromosome previously described in L. braziliensis⁴³, consisting of two inverted repeats of ~120 kb arranged so that the original telomeric region of chromosome 34 is placed at both ends (Fig. 4b). The abundance of LD1 amplifications among Leishmania species has been attributed to the presence of a BT1 gene encoding a biopterin transporter (LpmP.34.4980 in L. panamensis), which may contribute to an improved capture of pterins when it is amplified. This particular type of minichromosome may be considered a Viannia-specific feature, although it is not present in all strains or in all lineages of the same strains in L. braziliensis. It has been demonstrated that the presence of the minichromosome in L. braziliensis favors the survival of parasites and improves their infectivity in macrophages and in the sandfly vector⁴⁴.

Gene copy number variation analysis

The haploid copy number of protein-coding genes was estimated from the alignments of Illumina reads by using an approach similar to that described in the previous section. Results showed ~400 putative gene arrays in this strain, of which 285 (71%) have only two copies of the duplicated gene (Fig. 5; Table 2; Supplementary Data 4). Gene arrays with a relatively low copy number are not rare among eukaryotes⁴⁵. This situation typically occurs in cases where the protein products are normally required in relatively large doses, such as ribosomal proteins or histones. Genes encoding such proteins has been found duplicated in other Leishmania species and also here in L. panamensis. One example of functionally related genes found to be tandemly duplicated in L. panamensis are those encoding most of the glycolytic enzymes. Their duplication might be associated with the finding that, in Leishmania, these enzymes are not only part of glycolysis but also appear to participate in a number of “moonlighting” extracellular functions, including cell adherence, hemoglobin binding and modulation of the host cell immune system⁴⁶.

Table 2 Gene arrays with more than four estimated copies per chromosome whose function could be inferred

Full size table

Conversely, larger tandem gene arrays are less common and are thought to be maintained during evolution in order to fulfill particular cellular demands, commonly in the presence of stressors⁴⁷. Due to the lack of transcriptional control in Leishmania, increase in copy number within gene arrays has also been suggested to serve as a way of increasing the level of critical proteins³⁰, such as those important for parasite survival and infectivity in the sandfly vector and the mammalian host. Several genes that are widely known to occur as relatively large tandem arrays in Leishmania species also appear to be multicopy in L. panamensis (Fig. 5). Furthermore, gene arrays seem to be similar in their haploid number of copies for both L. braziliensis and L. panamensis, including the array of genes encoding the GP63 metalloprotease, previously reported to have an increased number of copies in L. braziliensis when compared to other Leishmania species^8,30. Another notable example is the gene coding for the NAD(P)H-dependent fumarate reductase (FRD), found to be multicopy in L. braziliensis and L. panamensis but not in all other Leishmania species. The distinctive increase in copy number for this gene is one of the factors that have been associated with an enhanced metastatic ability in these parasites². Interestingly, we found a higher number of copies for ubiquitin and histone H3 in L. panamensis, although these genes are also typically multicopy in Leishmania.

As in other Leishmania species, our results show that amastins comprise the largest family of multicopy genes in L. panamensis, with 19 genes assembled in 10 loci (Fig. 5), but an haploid number of copies estimated to be around 80. Although the function of these surface glycoproteins is not known, they are thought to participate in the interaction with macrophages of the mammalian host due to their preferential expression in amastigotes⁴⁸. Jackson⁴⁹ classified the amastin genes in four subfamilies (α, β, γ and δ) and a proto-δ group. Analysis of the δ subfamily has shown that it is not only expanded in Leishmania, but also that some member genes vary greatly among species^49,50. Accordingly, the genes we found in L. panamensis are more similar to those from L. braziliensis, with the majority of δ amastin genes clustered into subgenus-specific clades in a maximum likelihood phylogeny (Supplementary Fig. 6). In fact, for all amastin subfamilies, the genes from the two L. (Viannia) species tend to cluster together and are separated from those corresponding to L. (Leishmania) species, regardless of their shared genomic position. This strongly suggests diversifying evolution of these genes between subgenera, as opposed to concerted evolution within subgenera. The evolution of these genes is probably shaped by the different environmental and immunological niches to which parasites are exposed.

Discussion

Completely sequencing the genome of L. panamensis allowed us to explore the genomic background of the L. (Viannia) subgenus, with previous studies traditionally relying only on the L. braziliensis genome. We confirmed several general characteristics described for Leishmania genomes and, at the same time, several subgenus-specific features. Our results are in agreement with recent paradigms regarding the extensive genome plasticity in Leishmania. This genome plasticity is evidenced at several levels, including mosaic aneuploidy³⁹ and stochastic amplification of genomic regions mediated by homologous recombination³⁵. Repetitive sequences spread throughout the genome are thought to provide favorable conditions for several types of recombinational events leading to extrachromosomal amplifications. This dynamic environment provides the parasite population with an immediate availability of genetic variants to cope with potentially deleterious conditions, such as toxic drugs or attack by host immune system factors. In addition, the lack of transcriptional control is thought to be balanced with the ability to change gene dosage by increasing the number of critical genes in tandem arrays³⁰, which can be viewed as a form of “intrachromosomal amplification”. Maintaining relatively large gene arrays, however, suggests the existence of a mechanism of concerted evolution, probably with an adaptive advantage.

In addition, comparative analysis also suggests that diversification after duplication is an important source of variation in this species, in this case assuming neutral evolution. Pseudogene formation after duplication has been recognized as a relevant event responsible for species-specific differences in gene content among Leishmania⁸. As it was previously reported for L. braziliensis, we found a relatively large number of pseudogenes in L. panamensis as compared to L. (Leishmania) species. Although this may be explained by faster divergence and deterioration, it also suggests that specific genetic features of the Viannia subgenus, such as potentially active mobile elements, may contribute to a faster generation of pseudogenes. Additionally, pseudogenes generated after duplication may participate in the endogenous production of small interference RNAs (siRNAs) used in RNAi, a role that has been previously described in several eukaryotes, including T. brucei⁵¹.

We found relatively large sequence divergence when comparing several orthologs from L. (Viannia) species to those from L. (Leishmania), a situation that has often resulted in the L. (Viannia) orthologs clustered into different groups or considered to be different genes. This seems to be particularly relevant for proteins involved in surface components or in their modification, which vary significantly among species. Acting as an interface between the parasite and its hosts, surface components are largely responsible for differences in life cycle, range of vectors, tissue tropism and infective capabilities among species of Leishmania^26,52,53,54. Differences in genes involved in LPG side chain modifications may cause a different pattern of glycosylation of this surface glycoconjugate, which in turn may alter the way in which Viannia parasites develop in the digestive tract of the insect vector. This may explain the additional step of development in the insect hindgut, a feature that was used as a criterion to define the Viannia subgenus. Furthermore, these differences in surface glycolipids, together with divergence and/or copy number variation in several critical genes — including those encoding GP63, 3-mercaptopyruvate sulfurtransferase, glutathione reductase and NAD(P)H-dependent fumarate reductase — may enhance parasite survival and metastatic abilities, promoting the development of more severe disease outcomes, such as the mucocutaneous presentation.

On the other hand, the finding of TATE-related sequences in internal positions of chromosomes, as well as repetitive sequences comprising predicted protein-coding genes with reverse transcriptase domains, suggest that the impact of mobile elements — either autonomous, non-autonomus or extinct — may be stronger than previously suspected in L. (Viannia). The presence of these elements, together with the RNAi activity and several other features considered to be specific of L. (Viannia) species, resemble traits that have been typically associated with Trypanosoma species. This supports the previously formulated hypothesis of an early divergence of the Viannia subgenus during the evolution of the Leishmania genus^55,56, although alternative scenarios have been suggested⁵⁷.

The sequence of the L. panamensis genome is a valuable resource to understand the biology and evolution of the L. (Viannia) subgenus. In addition, it provides the foundation for future studies regarding epidemiology, pathogenesis and drug resistance in countries where this parasite is a notable causative agent of leishmaniasis, including Panama and Colombia.

Methods

Genome sequencing

High-quality genomic DNA was extracted from the PSC-1 strain (MHOM/PA/1994/PSC-1) of L. panamensis, originally isolated from a skin lesion on the arm of a male subject in Panama. Genomic DNA was extracted from stationary phase promastigotes using a commercial salting out procedure as recommended by the manufacturer (Wizard Genomic DNA purification kit, Promega). Size check, integrity and presence of contaminants in the DNA samples were assessed through gel electrophoresis. DNA concentration was estimated by the picogreen method using Victor 3 fluorometry (PerkinElmer). DNA purity was measured using a NanoDrop 2000 spectrophotometer (Thermo Scientific). Two libraries were prepared for 454 pyrosequencing from 25 μg of genomic DNA, one with shotgun (single-end) reads and the other with mate-paired reads with a median insert size of 8 kb, respectively following the GS rapid library and the GS 8-kb span paired end library preparation protocols from Roche. Genomic DNA was fragmented by nebulization in the case of the shotgun read library, or in a HydroShear apparatus (Genomic Solutions Inc.), followed by circularization and nebulization in the case of the mate-paired library. Fragment size was experimentally confirmed by using a DNA 12000 LabChip with a 2100 Bioanalyzer (Agilent Technologies). Both libraries were then sequenced on a GS-FLX Titanium instrument. An additional library was prepared from 2 μg of genomic DNA by using the TruSeq DNA HT sample preparation protocol (Illumina) and then sequenced in a HiSeq 2000 instrument for a total throughput of 5 Gb.

Reference genomes

Four reference genomes were used at different stages of this work, corresponding to L. major strain Friedlin⁵⁸ (version 6.1), L. infantum strain JPCM5⁸ (version 5.0), L. braziliensis strain M2904⁸ (version 3.0) and L. mexicana strain U1103³⁰ (version 5.0). All genomes were downloaded from the FTP site of the Wellcome Trust Sanger Institute (UK) (ftp.sanger.ac.uk/pub/pathogens/Leishmania/).

De novo assembly, post-assembly improvements and short read mapping

The 454 reads were assembled de novo by using Newbler¹³ (version 2.6), with the built-in gap-filling and heterozygotic modes enabled (-scaffold and -het flags, respectively). The use of these modes was chosen after several tests to assess their beneficial effect on the assembly metrics. Only contigs larger than 500 bp were used during scaffolding and the subsequent steps in assembly.

The Illumina reads were used to validate the de novo assemblies with REAPR¹⁴. After running REAPR, iCORN¹⁵ was used to detect and correct errors derived from the 454 pyrosequencing using information from the Illumina reads. The Mauve Contig Mover⁵⁹ was used to initially order the de novo fragments against the L. braziliensis chromosomes. Each group of fragments assigned to particular L. braziliensis chromosomes was submitted independently to ABACAS¹⁶ to contiguate them into individual pseudochromosomes, based on comparison at the nucleotide level.

Sequence reads were mapped back to the assembled pseudochromosomes or reference chromosomes by using SMALT (version 0.7.2) (https://www.sanger.ac.uk/resources/software/smalt/), with the parameters suggested in the manual for each type of read.

Annotation of protein-coding genes and non-coding features

RATT¹⁷ was used to transfer the annotated genes from the genomes of L. major and L. braziliensis. In addition to RATT, Artemis⁶⁰ was used for de novo prediction of open reading frames (ORFs) larger than 225 bp correlating to the expected codon usage of other Leishmania species, taking into account the suggestions given by Aggarwal et al.⁶¹ ORFs with a codon usage correlation score lower than 55 were discarded. The three lines of evidence for gene models were manually revised and combined during a three-way comparison of the corresponding genomes using the Artemis Comparison Tool (ACT)⁶⁰. Pseudogenes were identified based on frameshifts or in-frame stop codons disrupting the corresponding ORFs for the transferred genes, in cases where these artifacts could be confirmed in the majority of reads mapping to the corresponding loci.

Non-coding RNA genes were predicted by scanning the sequences against the Rfam database⁶² (version 11.0), using the rfam_scan.pl script included in the distribution. tRNAscan-SE⁶³ (version 1.21) was also used to predict tRNA genes.

A combined strategy was used to identify repetitive regions. First, RepeatScout⁶⁴ was used for de novo detection of repeat families; then, a sequence similarity search against RepBase Update⁶⁵ (volume 18, issue 9) was performed with RepeatMasker (http://www.repeatmasker.org), both with the default options. Families predicted de novo were manually correlated to hits produced by RepeatMasker and to repeat families previously described in trypanosomatids, on the basis of sequence similarity and chromosomal location.

Ortholog clustering and functional analysis

The OrthoMCL web assignment tool was used to assign the L. panamensis predicted proteins to the ortholog groups pre-defined in the OrthoMCL-DB database⁶⁶ (ortholog ID version 5). Functional analysis of the annotated gene models was performed by using Blast2GO⁶⁷ and InterProScan⁶⁸ (version 5). Gene Ontology (GO) terms and Enzyme Commission (EC) numbers were obtained from the predicted domains whenever possible. Additionally, EC numbers that could not be inferred from domain architecture were transferred from orthologous annotations in other Leishmania genomes. Blast2GO was also used to perform the GO enrichment analysis.

Chromosome somy and gene copy number variation analysis

SAMtools⁶⁹ depth (version 0.1.19) was used to record the read depth per base along the assembled pseudochromosomes. These values were then used to compute the median read depth for each chromosome. To obtain an estimate of chromosome somy, the values of median read depth for each chromosome were normalized dividing by the median read depth expected for a monosomic chromosome. This value in turn was calculated from the most frequent median read depth among all the chromosomes, which was arbitrarily considered to correspond to a disomic state.

Read depth for annotated features was computed as described above, but submitting an additional file to SAMtools with the corresponding annotations in BED format. The haploid copy number for each feature was estimated by the ratio between its median read depth and the median read depth of its corresponding chromosome. For genes within the same ortholog group and located in the same chromosome, the total haploid copy number was considered to be the sum of their individual estimates.

Additional information

Accession codes: This project has been registered in NCBI as bioproject PRJNA235344. The raw reads have been deposited in the Sequence Read Archive (SRA) under the accession codes SRX681913, SRX681914 and SRX681983. The assembled and annotated pseudochromosomes have been deposited in GenBank under the accession codes CP009370 to CP009404.

References

Lainson, R. & Shaw, J. J. Evolution, classification and geographical distribution. in The Leishmaniasis in Biology and Medicine. (eds Peters, W. & Killick-Kendrick, R.) Vol. 1, 1–120 (Academic Press, 1987).
Google Scholar
Hartley, M. A., Drexler, S., Ronet, C., Beverley, S. M. & Fasel, N. The immunological, environmental and phylogenetic perpetrators of metastatic leishmaniasis. Trends Parasitol. 30, 412–22 (2014).
CAS PubMed PubMed Central Google Scholar
Scott, P. Leishmania — A Parasitized Parasite. N. Engl. J. Med. 364, 1773–1775 (2011).
CAS PubMed Google Scholar
Britto, C. et al. Conserved linkage groups associated with large-scale chromosomal rearrangements between Old World and New World Leishmania genomes. Gene 222, 107–117 (1998).
CAS PubMed Google Scholar
Smith, D. F., Peacock, C. S. & Cruz, A. K. Comparative genomics: from genotype to disease phenotype in the leishmaniases. Int J Parasitol 37, 1173–1186 (2007).
CAS PubMed PubMed Central Google Scholar
Martínez-Calvillo, S., Vizuet-de-Rueda, J. C., Florencio-Martínez, L. E., Manning-Cela, R. G. & Figueroa-Angulo, E. E. Gene expression in trypanosomatid parasites. J. Biomed. Biotechnol. 2010, 525241 (2010).
PubMed PubMed Central Google Scholar
Haile, S. & Papadopoulou, B. Developmental regulation of gene expression in trypanosomatid parasitic protozoa. Curr. Opin. Microbiol. 10, 569–77 (2007).
CAS PubMed Google Scholar
Peacock, C. S. et al. Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat. Genet. 39, 839–847 (2007).
CAS PubMed PubMed Central Google Scholar
Lye, L. F. et al. Retention and loss of RNA interference pathways in trypanosomatid protozoans. PLoS Pathog 6, e1001161 (2010).
PubMed PubMed Central Google Scholar
Vásquez, A., Paz, H., Alvar, J., Pérez, D. & Hernández, C. Informe Final: Estudios Sobre la Epidemiología de la Leishmaniasis en la Parte Occidental de la República de Panamá. (Instituto Conmemorativo Gorgas de Estudio de la Salud; Ministerio de Salud, 1998).
Davies, C. R. et al. The epidemiology and control of leishmaniasis in Andean countries. Cad. Saude Publica 16, 925–50 (2000).
CAS PubMed Google Scholar
Restrepo, C. M. et al. AFLP polymorphisms allow high resolution genetic analysis of American Tegumentary Leishmaniasis agents circulating in Panama and other members of the Leishmania genus. PLoS One 8, e73177 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Quinn, N. L. et al. Assessing the feasibility of GS FLX Pyrosequencing for sequencing the Atlantic salmon genome. BMC Genomics 9, 404 (2008).
PubMed PubMed Central Google Scholar
Hunt, M. et al. REAPR: a universal tool for genome assembly evaluation. Genome Biol. 14, R47 (2013).
PubMed PubMed Central Google Scholar
Otto, T. D., Sanders, M., Berriman, M. & Newbold, C. Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology. Bioinformatics 26, 1704–1707 (2010).
CAS PubMed PubMed Central Google Scholar
Assefa, S., Keane, T. M., Otto, T. D., Newbold, C. & Berriman, M. ABACAS: algorithm-based automatic contiguation of assembled sequences. Bioinformatics 25, 1968–9 (2009).
CAS PubMed PubMed Central Google Scholar
Otto, T. D., Dillon, G. P., Degrave, W. S. & Berriman, M. RATT: Rapid Annotation Transfer Tool. Nucleic Acids Res. 39, e57 (2011).
CAS PubMed PubMed Central Google Scholar
Padilla-Mejía, N. E. et al. Gene organization and sequence analyses of transfer RNA genes in Trypanosomatid parasites. BMC Genomics 10, 232 (2009).
PubMed PubMed Central Google Scholar
Martínez-Calvillo, S. et al. Genomic organization and functional characterization of the Leishmania major Friedlin ribosomal RNA gene locus. Mol. Biochem. Parasitol. 116, 147–57 (2001).
PubMed Google Scholar
Liang, X. et al. Genome-wide analysis of C/D and H/ACA-like small nucleolar RNAs in Leishmania major indicates conservation among trypanosomatids in the repertoire and in their rRNA targets. Eukaryot. Cell 6, 361–77 (2007).
CAS PubMed Google Scholar
Sádlová, J. et al. The stage-regulated HASPB and SHERP proteins are essential for differentiation of the protozoan parasite Leishmania major in its sand fly vector, Phlebotomus papatasi. Cell. Microbiol. 12, 1765–79 (2010).
PubMed PubMed Central Google Scholar
Depledge, D. P. et al. Leishmania- specific surface antigens show sub-genus sequence variation and immune recognition. PLoS Negl. Trop. Dis. 4, e829 (2010).
Google Scholar
McMahon-Pratt, D., Traub-Cseko, Y., Lohman, K. L., Rogers, D. D. & Beverley, S. M. Loss of the GP46/M-2 surface membrane glycoprotein gene family in the Leishmaniabraziliensis complex. Mol. Biochem. Parasitol. 50, 151–60 (1992).
CAS PubMed Google Scholar
Kedzierski, L. et al. A leucine-rich repeat motif of Leishmania parasite surface antigen 2 binds to macrophages through the complement receptor 3. J. Immunol. 172, 4902–6 (2004).
CAS PubMed Google Scholar
Devault, A. & Bañuls, A.-L. The promastigote surface antigen gene family of the Leishmania parasite: differential evolution by positive selection and recombination. BMC Evol. Biol. 8, 292 (2008).
PubMed PubMed Central Google Scholar
Franco, L. H., Beverley, S. M. & Zamboni, D. S. Innate immune activation and subversion of Mammalian functions by Leishmania lipophosphoglycan. J. Parasitol. Res. 2012, 165126 (2012).
PubMed PubMed Central Google Scholar
Dobson, D. E., Scholtes, L. D., Myler, P. J., Turco, S. J. & Beverley, S. M. Genomic organization and expression of the expanded SCG/L/R gene family of Leishmaniamajor: internal clusters and telomeric localization of SCGs mediating species-specific LPG modifications. Mol Biochem Parasitol 146, 231–241 (2006).
CAS PubMed Google Scholar
Goswami, M., Dobson, D. E., Beverley, S. M. & Turco, S. J. Demonstration by heterologous expression that the Leishmania SCA1 gene encodes an arabinopyranosyltransferase. Glycobiology 16, 230–6 (2006).
CAS PubMed Google Scholar
Soares, R. P. et al. Differential midgut attachment of Leishmania (Viannia) braziliensis in the sand flies Lutzomyia (Nyssomyia) whitmani and Lutzomyia (Nyssomyia) intermedia. J. Biomed. Biotechnol. 2010, 439174 (2010).
PubMed Google Scholar
Rogers, M. B. et al. Chromosome and gene copy number variation allow major structural change between species and strains of Leishmania. Genome Res. 21, 2129–2142 (2011).
CAS PubMed PubMed Central Google Scholar
Ortiz, D. et al. Molecular genetic analysis of purine nucleobase transport in Leishmaniamajor. Mol. Microbiol. 64, 1228–43 (2007).
CAS PubMed Google Scholar
Landfear, S., Ullman, B., Carter, N. & Sanchez, M. Nucleoside and nucleobase transporters in parasitic protozoa. Eukaryot. Cell 3, 245–254 (2004).
CAS PubMed PubMed Central Google Scholar
Nowicki, C. & Cazzulo, J. J. Aromatic amino acid catabolism in trypanosomatids. Comp. Biochem. Physiol. A. Mol. Integr. Physiol. 151, 381–390 (2008).
PubMed Google Scholar
Bringaud, F., Ghedin, E., El-Sayed, N. M. A. & Papadopoulou, B. Role of transposable elements in trypanosomatids. Microbes Infect. 10, 575–81 (2008).
CAS PubMed Google Scholar
Ubeda, J.-M. et al. Genome-wide stochastic adaptive DNA amplification at direct and inverted DNA repeats in the parasite Leishmania. PLoS Biol. 12, e1001868 (2014).
PubMed PubMed Central Google Scholar
Bringaud, F., Berriman, M. & Hertz-Fowler, C. Trypanosomatid genomes contain several subfamilies of ingi-related retroposons. Eukaryot. Cell 8, 1532–42 (2009).
CAS PubMed PubMed Central Google Scholar
Williams, R. A. M., Kelly, S. M., Mottram, J. C. & Coombs, G. H. 3-Mercaptopyruvate sulfurtransferase of Leishmania contains an unusual C-terminal extension and is involved in thioredoxin and antioxidant metabolism. J. Biol. Chem. 278, 1480–6 (2003).
CAS PubMed Google Scholar
Downing, T. et al. Whole genome sequencing of multiple Leishmaniadonovani clinical isolates provides insights into population structure and mechanisms of drug resistance. Genome Res. 21, 2143–2156 (2011).
CAS PubMed PubMed Central Google Scholar
Sterkers, Y. et al. Novel insights into genome plasticity in Eukaryotes: mosaic aneuploidy in Leishmania. Mol Microbiol 86, 15–23 (2012).
CAS PubMed Google Scholar
Segovia, M. & Ortiz, G. LD1 amplifications in Leishmania. Parasitol. Today 13, 196–199 (1997).
Google Scholar
Myler, P. J., Lodes, M. J., Merlin, G., de Vos, T. & Stuart, K. D. An amplified DNA element in Leishmania encodes potential integral membrane and nucleotide-binding proteins. Mol. Biochem. Parasitol. 66, 11–20 (1994).
CAS PubMed Google Scholar
Sunkin, S. M. et al. Conservation of the LD1 region in Leishmania includes DNA implicated in LD1 amplification. Mol. Biochem. Parasitol. 113, 315–21 (2001).
CAS PubMed Google Scholar
Fu, G., Melville, S., Brewster, S., Warner, J. & Barker, D. C. Analysis of the genomic organisation of a small chromosome of Leishmaniabraziliensis M2903 reveals two genes encoding GTP-binding proteins, one of which belongs to a new G-protein family and is an antigen. Gene 210, 325–33 (1998).
CAS PubMed Google Scholar
Sampaio, M. C. R. et al. A 245kb mini-chromosome impacts on Leishmaniabraziliensis infection and survival. Biochem. Biophys. Res. Commun. 382, 74–8 (2009).
CAS PubMed Google Scholar
Innan, H. & Kondrashov, F. The evolution of gene duplications: classifying and distinguishing between models. Nat. Rev. Genet. 11, 97–108 (2010).
CAS PubMed Google Scholar
Gómez-Arreaza, A. et al. Extracellular functions of glycolytic enzymes of parasites: Unpredicted use of ancient proteins. Mol. Biochem. Parasitol. 193, 75–81 (2014).
PubMed Google Scholar
Kondrashov, F. A. Gene duplication as a mechanism of genomic adaptation to a changing environment. Proc. Biol. Sci. 279, 5048–57 (2012).
PubMed PubMed Central Google Scholar
Rochette, A. et al. Characterization and developmental gene regulation of a large gene family encoding amastin surface proteins in Leishmania spp. Mol. Biochem. Parasitol. 140, 205–20 (2005).
CAS PubMed Google Scholar
Jackson, A. P. The evolution of amastin surface glycoproteins in trypanosomatid parasites. Mol. Biol. Evol. 27, 33–45 (2010).
CAS PubMed Google Scholar
Real, F. et al. The genome sequence of Leishmania (Leishmania) amazonensis: functional annotation and extended analysis of gene models. DNA Res. 20, 567–81 (2013).
CAS PubMed PubMed Central Google Scholar
Wen, Y. Z. et al. Pseudogene-derived small interference RNAs regulate gene expression in African Trypanosomabrucei. Proc. Natl. Acad. Sci. U. S. A. 108, 8345–50 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Descoteaux, A. & Turco, S. J. Glycoconjugates in Leishmania infectivity. Biochim. Biophys. Acta 1455, 341–352 (1999).
CAS PubMed Google Scholar
Chang, K. & McGwire, B. Molecular determinants and regulation of Leishmania virulence. Kinetoplastid Biol. Dis. 1, 1, 10.1186/1475-9292-1-1 (2002).
Article PubMed PubMed Central Google Scholar
Naderer, T., Vince, J. E. & McConville, M. J. Surface determinants of Leishmania parasites and their role in infectivity in the mammalian host. Curr. Mol. Med. 4, 649–65 (2004).
CAS PubMed Google Scholar
Noyes, H. Implications of a Neotropical origin of the genus Leishmania. Mem. Inst. Oswaldo Cruz 93, 657–61 (1998).
CAS PubMed Google Scholar
Momen, H. & Cupolillo, E. Speculations on the origin and evolution of the genus Leishmania. Mem. Inst. Oswaldo Cruz 95, 583–8 (2000).
CAS PubMed Google Scholar
Kerr, S. F. Palaearctic origin of Leishmania. Mem. Inst. Oswaldo Cruz 95, 75–80 (2000).
CAS PubMed Google Scholar
Ivens, A. C. et al. The genome of the kinetoplastid parasite, Leishmaniamajor. Science 309, 436–42 (2005).
ADS PubMed PubMed Central Google Scholar
Rissman, A. I. et al. Reordering contigs of draft genomes using the Mauve aligner. Bioinformatics 25, 2071–3 (2009).
CAS PubMed PubMed Central Google Scholar
Carver, T. et al. Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. Bioinformatics 24, 2672–6 (2008).
CAS PubMed PubMed Central Google Scholar
Aggarwal, G., Worthey, E. A., McDonagh, P. D. & Myler, P. J. Importing statistical measures into Artemis enhances gene identification in the Leishmania genome project. BMC Bioinformatics 4, 23 (2003).
PubMed PubMed Central Google Scholar
Burge, S. W. et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 41, D226–32 (2012).
PubMed PubMed Central Google Scholar
Lowe, T. M. & Eddy, S. R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997).
CAS PubMed PubMed Central Google Scholar
Price, A. L., Jones, N. C. & Pevzner, P. A. De novo identification of repeat families in large genomes. Bioinformatics 21 Suppl 1, i351–8 (2005).
PubMed Google Scholar
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
CAS PubMed Google Scholar
Chen, F., Mackey, A. J., Stoeckert, C. J. & Roos, D. S. OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic Acids Res. 34, D363–D368 (2006).
CAS PubMed Google Scholar
Götz, S. et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 36, 3420–3435 (2008).
PubMed PubMed Central Google Scholar
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–40 (2014).
CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–9 (2009).
PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Authors want to thank Dr. Matthew Berriman and Dr. Thomas Otto from the Wellcome Trust Sanger Institute (UK) for their valuable suggestions on the methodology; Dr. Fatima Silva-Franco from Liverpool University (UK) for her help during annotation; Dr. Jagannatha Rao and Dr. Arturo Melo for their valuable logistic support and Dr. Gabrielle Britton for revision of the manuscript. This work was funded by an institutional grant from INDICASAT AIP, Panama.

Author information

Authors and Affiliations

Centro de Biología Celular y Molecular de Enfermedades, Instituto de Investigaciones Científicas y Servicios de Alta Tecnología (INDICASAT AIP), Ciudad del Saber, Panamá, Panamá
Alejandro Llanes, Carlos Mario Restrepo & Ricardo Lleonart
Facultad de Ciencias de la Salud Dr. William C. Gorgas, Universidad Latina de Panamá, Panamá, Panamá
Alejandro Llanes, Gina Del Vecchio & Franklin José Anguizola
Department of Biotechnology, Acharya Nagarjuna University, Guntur, India
Alejandro Llanes & Carlos Mario Restrepo

Authors

Alejandro Llanes
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Mario Restrepo
View author publications
You can also search for this author in PubMed Google Scholar
Gina Del Vecchio
View author publications
You can also search for this author in PubMed Google Scholar
Franklin José Anguizola
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Lleonart
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.L., C.M.R. and R.L. performed the experiments, analyzed the data and wrote the paper; G.D.V. and F.J.A. helped in the manual revision of gene models during annotation; R.L. conceived and directed the project.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Data 1

Supplementary Information

Supplementary Data 2

Supplementary Information

Supplementary Data 3

Supplementary Information

Supplementary Data 4

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Llanes, A., Restrepo, C., Vecchio, G. et al. The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus.. Sci Rep 5, 8550 (2015). https://doi.org/10.1038/srep08550

Download citation

Received: 18 November 2014
Accepted: 26 January 2015
Published: 24 February 2015
DOI: https://doi.org/10.1038/srep08550

This article is cited by

Screening of the antileishmanial and antiplasmodial potential of synthetic 2-arylquinoline analogs
- Roger Espinosa-Saez
- Sara M. Robledo
- Camilo Guzmán-Teran
Scientific Reports (2023)
Identification of a unique conserved region from a kinetoplastid genome of Leishmania orientalis (formerly named Leishmania siamensis) strain PCM2 in Thailand
- Pornchai Anuntasomboon
- Suradej Siripattanapipong
- Teerasak E-kobon
Scientific Reports (2023)
Comparative analysis of the transcriptional responses of five Leishmania species to trivalent antimony
- Julián Medina
- Lissa Cruz-Saavedra
- Juan David Ramírez
Parasites & Vectors (2021)
Study of VIPER and TATE in kinetoplastids and the evolution of tyrosine recombinase retrotransposons
- Yasmin Carla Ribeiro
- Lizandra Jaqueline Robe
- Adriana Ludwig
Mobile DNA (2019)
Comparative genomics of Leishmania (Mundinia)
- Anzhelika Butenko
- Alexei Y. Kostygov
- Vyacheslav Yurchenko
BMC Genomics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Genome sequencing and assembly

Annotation of protein-coding and non-coding RNA genes

Functional and comparative analysis of protein-coding genes

Repetitive sequences and mobile elements

Variations in chromosome somy

Gene copy number variation analysis

Discussion

Methods

Genome sequencing

Reference genomes

De novo assembly, post-assembly improvements and short read mapping

Annotation of protein-coding genes and non-coding features

Ortholog clustering and functional analysis

Chromosome somy and gene copy number variation analysis

Additional information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links