Transcriptome sequencing reveals signatures of positive selection in the Spot-Tailed Earless Lizard

Jose A. Maldonado; Thomas J. Firneno Jr; Corey E. Roelke; Nathan D. Rains; Juliet Mwgiri; Matthew K. Fujita

doi:10.1371/journal.pone.0234504

Abstract

The continual loss of threatened biodiversity is occurring at an accelerated pace. High-throughput sequencing technologies are now providing opportunities to address this issue by aiding in the generation of molecular data for many understudied species of high conservation interest. Our overall goal of this study was to begin building the genomic resources to continue investigations and conservation of the Spot-Tailed Earless lizard. Here we leverage the power of high-throughput sequencing to generate the liver transcriptome for the Northern Spot-Tailed Earless Lizard (Holbrookia lacerata) and Southern Spot-Tailed Earless Lizard (Holbrookia subcaudalis), which have declined in abundance in the past decades, and their sister species, the Common Lesser Earless Lizard (Holbrookia maculata). Our efforts produced high quality and robust transcriptome assemblies validated by 1) quantifying the number of processed reads represented in the transcriptome assembly and 2) quantifying the number of highly conserved single-copy orthologs that are present in our transcript set using the BUSCO pipeline. We found 1,361 1-to-1 orthologs among the three Holbrookia species, Anolis carolinensis, and Sceloporus undulatus. We carried out dN/dS selection tests using a branch-sites model and identified a dozen genes that experienced positive selection in the Holbrookia lineage with functions in development, immunity, and metabolism. Our single-copy orthologous sequences additionally revealed significant pairwise sequence divergence (~.73%) between the Northern H. lacerata and Southern H. subcaudalis that further supports the recent elevation of the Southern Spot-Tailed Earless Lizard to full species.

Citation: Maldonado JA, Firneno TJ Jr, Roelke CE, Rains ND, Mwgiri J, Fujita MK (2020) Transcriptome sequencing reveals signatures of positive selection in the Spot-Tailed Earless Lizard. PLoS ONE 15(6): e0234504. https://doi.org/10.1371/journal.pone.0234504

Editor: Axel Janke, Senckenberg am Meer Deutsches Zentrum fur Marine Biodiversitatsforschung, GERMANY

Received: March 17, 2020; Accepted: May 26, 2020; Published: June 15, 2020

Copyright: © 2020 Maldonado et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Raw sequence reads are available from Sequence Read Archive: Accession PRJNA636108; the assemblies and other additional files are available from Dryad DOI: 10.5061/dryad.kwh70rz1m.

Funding: This work was funded by a Texas Parks & Wildlife Section Six Grant awarded to Corey E. Roelke and Matthew K. Fujita (TPWD 474241).

Competing interests: The authors have declared that no competing interests exist.

Introduction

The continuing reduction of per-base costs of high-throughput sequencing (HTS) methodologies has provided new opportunities to generate large-scale genomic and transcriptomic resources. This is especially beneficial for data deficient, non-model organisms that may require conservation action, are commercially valuable, or are excellent systems to investigate ecological and evolutionary questions [1–4]. The field of conservation genetics has historically relied on the use of few dozen genetic markers (allozymes, microsatellites, mitochondrial DNA, and a few nuclear genes) to understand population structure, viability, and evolutionary processes (e.g. genetic drift, selection, and migration) [5,6]. With the low cost of HTS, it is now possible to rapidly generate hundreds to thousands of genetic markers from multiple individuals and populations to assay genetic diversity for virtually any species [7]. These larger datasets can overcome some of the limitations of traditional methods used in conservation genetics that yield only a few variable loci [8]. For example, with hundreds to thousands of variable sites, HTS datasets can be beneficial for species management by providing high resolution and accurate inferences of important parameters such as genetic diversity, inbreeding depression, effective population size [9,10], as well as historical demography and local adaptations. All of this can provide insight into resolving taxonomic uncertainties to determine which species need immediate attention and protection [8,11]. Using RNA-Seq methodologies to address the types of questions for species with no available genomic resources is becoming increasingly favorable [12–14]. As long as appropriately preserved tissue is available, it is possible to sequence the expressed genes by sequencing complementary DNA (cDNA) libraries. By using millions of short reads generated by massive parallel sequencing of cDNA libraries and robust assembly methods [15,16], one can generate a high coverage de novo transcriptome assembly without the need for a reference genome. Because transcriptome sequencing is versatile, it is a desirable option for developing conservation genetics tools because it largely circumvents the time-consuming process of identifying and optimizing genetic markers (e.g. primer development and testing) [12,17]. Furthermore, transcriptome sequencing captures protein coding regions with functional significance [2,18]. The usage of gene expression data for conservation biology is an emerging field that will significantly benefit wildlife management [19].

Our focus in this paper is the development of genomic resources for the Northern and Southern Spot-tailed Earless Lizard (Holbrookia lacerata and Holbrookia subcaudalis), whose historic range extends from central to southern Texas, and into northeastern Mexico. For the past several decades, H. lacerata has faced taxonomic uncertainties and confusion. Initially, there was a single species, H. lacerata, that included the currently-recognized H. lacerata as well as its sister species, H. maculata [20]. Holbrookia lacerata was then reduced to a subspecies under H. maculata [21], and subsequently elevated back to full species status [22]. Subsequently, two subspecies of H. lacerata were recognized, H. l. lacerata and H. l. subcaudalis, that are geographically separated by the Balcones Escarpment fault line. Renewed interest in resolving their taxonomic status revealed that the disjunct populations are reciprocally monophyletic with an approximately 4% mitochondrial sequence divergence [3]. Most recently, Hibbits et al. (2019) took an integrative approach and used morphology, mitochondrial, and nuclear data to elevate each subspecies to full species status [23].

Both species have experienced a sharp decline in distribution and abundance through their historical range and are labeled as near threatened by the International Union for Conservation of Nature organization. The most notable decrease has occurred in southern Texas, where H. subcaudalis populations have become increasingly fragmented and isolated, with fewer observations being made throughout its historic range. Hypotheses for the decline of H. lacerata include pesticides and agriculture practices [24], and urbanization, exotic grasses, and invasive fauna are additional factors contributing to their decline [25]. The conservation concerns of both species have led to recent studies utilizing H. lacerata as a focal organism for better land management practices in Texas [25–27], and efforts are being made to protect both species under the Endangered Species Act [28].

The decline in H. lacerata and H. subcaudalis abundance can have a substantial impact on population viability. Small and fragmented populations can lead to an increase in homozygosity, the disappearance of genetic diversity, and an increase in the frequency deleterious variants become fixed can lead to inbreeding depression, and thus a reduction in individual fitness. The goal of this study is to provide a detailed characterization of a transcriptome as a means to generate molecular resources for conservation studies of H. lacerata and H. subcaudalis. We provide an annotated liver transcriptome, identify adaptive loci, and estimate genetic distance, all of which are of value for the conservation efforts of the North and Southern Spot-tailed Earless Lizard.

Materials and methods

Sampling

Samples for this study were collected in summer 2015 from three different localities. Lizards were caught by hand or loops and humanely euthanized under our IACUC protocol (#A16.010) approved by the University of Texas at Arlington. We harvested skeletal muscle, liver, heart, blood, and integument from one individual per species and stored the tissue in RNAlater (Sigma-Aldrich, St. Louis, MO). We preserved the specimens with 10% formalin and deposited them in the Amphibian and Reptile Diversity Research Center (ARDRC) at the University of Texas at Arlington (see S1 Table for locality data and field numbers).

RNA extraction and RNA-Seq library preparation for sequencing

Total RNA was extracted from liver tissue using a Promega SV Total RNA Isolation kit (Promega, Madison, WI) following the manufacturer’s protocol. We quantified our RNA extractions on the Qubit 2.0 Fluorometer (Life Technologies, Carlsbad, CA), and assessed RNA quality and size distribution on an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA). All RNA extractions yielded high-quality RNA, with all samples having an RNA integrity number (RIN) >8. RNA isolates from each sample were sent to the Brigham Young University DNA Sequencing Center to generate cDNA libraries using the Kapa Biosystems RNA depletion kit (Kapa Biosystems, Wilmington, MA) and sequenced on the Illumina HiSeq 2500 (Illumina, San Diego, CA) generating 100bp paired-end sequences.

De novo transcriptome assembly and quality assessments

The data were processed to remove low quality reads using Trimmomatic v.32 [29]. We used the following parameters to trim and remove failed reads: a 4-base sliding window trimming nucleotides with a Q score <5 and discarding reads <25bp long [30]. To ensure successful quality trimming and filtering, we ran the processed reads through FASTQC v0.10.1 (Babraham Bioinformatics) to evaluate read quality, length, and the number of reads retained. With no reference genomes for any Holbrookia species and given that previous results have demonstrated that guided transcriptome assembly methods for diverged species typically perform worse than de novo assembly [31,32], we carried out de novo assemblies using Trinity short-read assembler V2.2.1 [15] for each sample.

To measure the completeness of our assembled transcripts, we performed two quality assessments as suggested by the Trinity package. First, to evaluate the assembly quality of each transcriptome, we mapped the input processed RNA-Seq reads back to their corresponding assemblies for each species of Holbrookia and quantified the number of input reads represented in our de novo transcriptome assemblies. Reads were mapped back to the transcriptome using the short read aligner Bowtie 2 v2.3.4 using the–local and–no-unal options [33]. Second, we ran our Holbrookia protein coding transcript set (see below for protein coding transcript set generation) through CD-Hit v4.8.1 (90% sequence identity threshold with all other parameters set to default) [34] to produce a non-redundant protein coding transcript set for each individual. We compared our non-redundant transcripts with a set of 3,950 highly conserved single-copy tetrapod orthologs using the BUSCO (benchmarking universal single-copy orthologs) v3 pipeline [35]. Fig 1 is an overview of our transcriptome assembly and analysis pipeline described below.

Download:

Fig 1. Schematic overview of our workflow to assemble the transcriptome of H. lacerata, H. subcaudalis, and H. maculata, and analysis carried out in this study.

https://doi.org/10.1371/journal.pone.0234504.g001

Identification of orthologs and paralogs and pairwise distance

To identify candidate coding genes from our assembled transcript sets with the longest open reading frames (ORFs), we used the TransDecoder v3.0.1 pipeline [36]. To maximize the number of ORFs captured and to ensure we did not lose any potential coding genes, we ran TransDecoder optional homology search against the PFAM database [37]. We ran the TransDecoder pipeline on our three de novo Holbrookia transcriptomes, the Anolis carolinensis transcriptome downloaded from NCBI server (GCF_000090745.1), and the publicly available Sceloporus undulatus transcriptome [38].

We used OrthoFinder v2.2.3to identify orthologous genes between all five species from the protein coding transcript set generated by TransDecoder. OrthoFinder uses a BLAST all-vs-all algorithm and performs reciprocal best-hit BLAST searches that normalize the bit scores to overcome transcript length bias in the ortholog detection [39]. We extracted all of the 1-to-1 orthologs found between all five species into Fasta files and performed an amino-acid guided alignment for each ortholog using MACSE v1.2 [40]. We inspected our multiple species codon alignments and removed ten orthologs for having a poor alignment. We used Geneious R9 (https://www.geneious.com) to calculate the uncorrected pairwise distance across the remaining orthologous alignments.

To identify paralogs (duplicated single-copy orthologs) in our Holbrookia transcripts, we used our non-redundant protein coding transcript set generated by CD-Hit to reduce false positive detection of paralogs. We searched against the tetrapod single-copy ortholog database to determine the number of transcripts that are likely paralogs using the BUSCO pipeline [35] and OrthoFinder all-vs-all BLAST [39] algorithm.

DAVID functional analysis

We submitted all A. carolinensis gene IDs from the 1-to-1 orthologs set to the bioinformatics database DAVID [41] to identify genes in our orthologous set, cluster these genes into their biological processes, and their inclusion in biological pathways. We ran DAVID using all default parameters for enrichment and pathway analysis.

Selection test

To identify positive selection in Holbrookia, we estimated dN/dS in all 1-to-1 orthologous genes found by OrthoFinder across all five individuals by following the recommendations of Yang et al. (2006) [42]. We used the branch-sites model (CODEML: M2 and NSites2) from the PAML v4.9 package [43], which requires an a priori phylogenetic tree to test for positive selection in foreground branches. We utilized a phylogenetic tree generated by OrthoFinder, by reconciling over a thousand gene trees into a species tree and designated the Holbrookia lineage as the foreground branches. The remaining Anolis and Sceloporus branches on the species tree were labeled as background branches. We compared the likelihood of two different models for each orthologous gene: 1) our alternative model that allows for a proportion of the sites to be under positive selection (ω₂ ≥ 1) along the Holbrookia branch, and the background branches having a proportion of sites being under purifying selection (ω₁<1) or neutrally evolving (ω₀ = 1); and 2) the simplistic null model that has the ω₂ fixed at 1 on the foreground branches, and all other branches having ω₀ = 1 and ω₁<1 [43,44]. We obtained likelihood values after running each transcript under two different models and carried out likelihood ratio tests (2×ΔlnL) between the models to evaluate whether the alternative model outperforms the null model. We performed a Bonferroni correction to account for multiple comparisons. A significant result from the branch-sites model is indicative that a subset of the sites in the coding gene has gone through episodic positive selection, with the selected sites providing an advantage to Holbrookia lineage.

Gene annotation

We used the Blast2Go v5 pipeline [45] to annotate and assign functions to our complete protein coding gene set produced by TransDecoder: 34,214 transcripts for H. maculata, 33,379 transcripts for H. lacerata, and 29,149 transcripts for H. subcaudalis. We used the Blastp function to blast our transcript set against the NCBI NR-protein database using an e-value cutoff of 1e-5 and all other parameters set to default. Annotation was performed using an e-value cutoff of 1e-3, an annotation score of 45, and a GO weight of 5. To generate biological processes, molecular functions, and cellular components graphs we first grouped our Go annotation into GO-slim terms to simplify the input and filtered out nodes containing >10 sequences cellular component, nodes containing >10 sequences were filtered out.

Results

Sequencing and De novo transcriptome assembly

Our sequenced cDNA libraries on the Illumina Hi-Seq 2500 yielded a total of 223 million paired-end reads between all three samples, and we retained 99.5% of our reads after filtering out low-quality reads from our raw data set. FastQC verified that only high-quality reads with a Q score >30 were kept and assembled by the short-read assembler Trinity. The total number of assembled transcripts for H. maculata, H. lacerata, and H. subcaudalis are respectively 107,863, 99,821, and 91,278. The average maximum transcript length between all individuals was ~18,340 bp. Assembly statics for all three Holbrookia samples used in this study are listed in Table 1.

Download:

Table 1. Assembly statistics for H. maculata, H. lacerata, and H. subcaudalis.

https://doi.org/10.1371/journal.pone.0234504.t001

Assembly quality assessment

Our first assembly quality check involved mapping the processed reads with Bowtie2 back to their assembled transcriptome. Across all samples, an average of 97% of the reads mapped back to their respective assemblies. If ≥70% of the input reads mapped back to their assembly, it is indicative of a robust transcriptome assembly by Trinity [15]. To further quantify the completeness of our Holbrookia transcript set, we ran our protein coding gene set for each individual through CD-HIT to remove redundant transcripts from the assemblies. Our Holbrookia protein coding transcript set (96,742) decreased by a factor of one-third to generate a non-redundant transcript list of 63,957 sequences. We ran our three non-redundant Holbrookia transcript sets against a conserved set of 3,950 universal tetrapod single-copy orthologs using the BUSCO pipeline [35,46]. We recovered 60–70% complete and 8–11% partial orthologs from the BUSCO tetrapod database. Between 831 to 1,113 orthologs were classified as missing from our transcript set. The high number of complete conserved single-copy orthologs present in each transcript set is indicative of high coverage and high-quality protein coding transcriptome assemblies for each Holbrookia species. All BUSCO statistics are detailed in Table 2.

Download:

Table 2. Benchmarking Universal Single-Copy Orthologs (BUSCO) summary of complete, duplicated, fragmented, and missing orthologs search against the 3950 single-copy orthologs.

https://doi.org/10.1371/journal.pone.0234504.t002

Orthologs and paralogs identification

To identify the 1-to-1 orthologous groups among our coding genes identified by TransDecoder between the three Holbrookia species, Anolis carolinensis, and Sceloporus undulatus, we ran all five individuals through OrthoFinder. TransDecoder found between 29,149 to 35,594 protein coding genes with the longest open reading frame from our assembled transcript set. We submitted a total of 160,639 coding transcripts to OrthoFinder to identify orthologous groups. OrthoFinder identified 19,401 orthogroups (defined as containing both orthologs and paralogs) containing 125,845 transcripts. We found that 43% (8,243) of the orthogroups had all five individuals present, and 56% (10,916) of the total orthogroups had at least two individuals present (Fig 2). We found 6,805 transcripts (1,361 orthologs) that were true 1-to-1 orthologs between all five individuals, and a low proportion of our transcripts that were unique to their given transcriptome assemblies.

Download:

Fig 2. Venn diagram showing the number of shared orthologous groups identified by OrthoFinder between all four species assembled transcriptome.

https://doi.org/10.1371/journal.pone.0234504.g002

To identify paralogs in our Holbrookia transcriptome, we ran our protein coding set through CD-Hit [34] to remove redundancy in our transcript set for each of our Holbrookia samples. BUSCO found that between 3% to 3.5% of the 3950 single-copy tetrapod orthologs that we searched against were detected as duplicated in our transcript assemblies. Our second method to identify paralogs in the Holbrookia transcriptome was using the OrthoFinder BLAST algorithm only on our Holbrookia protein coding gene list, by counting the number of self-BLAST hits identified between each transcript. We discovered 4590 transcripts in H. maculata, 4797 transcripts in H. lacerata, and 4271 transcripts in H. subcaudalis as paralogous transcripts within each species.

DAVID functional analysis and selection test

DAVID clustered our orthologous genes into separate biological processes, 409 genes into metabolic processes, 107 genes associated with stress response processes, 190 genes that play a role in gene expression, and 216 genes that are involved in cellular component organization. DAVID could not associate 838 genes with any biological process as they did not meet the enrichment threshold (P-values < .1). DAVID identified 156 genes involved in different metabolic pathways (purine/pyrimidine metabolism, amino acid metabolism, and the Citric acid cycle). DAVID identified thirteen genes associated with the nucleotide excision repair pathway, six genes linked to nucleotide mismatch repair mechanism, and seven genes that have a role in DNA replication.

To determine if any of the 1-to-1 orthologous genes have undergone positive selection in Holbrookia, we carried out selection tests using PAML [44] and performed likelihood-ratio tests to assess significance. We adjusted our P-values by performing a Bonferroni correction test to account for multiple comparisons and committing a type I error. We found twelve genes from our ortholog set that have undergone positive selection with a significance value of < .05, after adjusting our P-values. Table 3 has a complete list of all twelve genes with the alternative and null likelihood values and the Bayes Empirical Bayes (BEB) score for sites under selection.

Download:

Table 3. The alternative and null model likelihood values for the twelve genes that show footprints of positive selection in the Holbrookia lineage.

https://doi.org/10.1371/journal.pone.0234504.t003

Pairwise distance

We calculated the uncorrected pairwise distance using the complete single copy ortholog set. The most significant sequence divergence was found between A. carolinensis and all other species being around ~11%. The split between Anolis (family Dactyloidae) and the four other species in the family Phrynosomatidae occurred ~72MYA and accounts for the vast amount of genetic divergence [47]. There is an average 5.57% sequence divergence between S. undulatus and all Holbrookia species, with the pairwise distance between H. maculata and H. lacerata and H. subcaudalis is ~1.5%. The genetic distance between H. lacerata and H. subcaudalis 0.73% (Table 4).

Download:

Table 4. The uncorrected pairwise distance calculated across all orthologous gene sets between all species.

https://doi.org/10.1371/journal.pone.0234504.t004

Annotations

Out of the 96,742 transcripts submitted to BLAST2GO, around ~85% were successfully blasted against the NCBI NR-protein database, with the top BLAST hit being A. carolinensis with the remaining orthologs blasting to other reptile species (e.g. Python bivittatus, Gecko japonicus). We successfully annotated a total of 71,314 transcripts into 55 functional categories using GO-Slim assignments within the three categories of the GO classification system biological process, cellular component, and molecular function. The primary categories our coding genes were clustered into are cellular metabolic processes (30,145), nitrogen compound metabolic process (28,816), primary metabolic process (23,249), and ion binding (21,228). A breakdown of GO terms for each category for each Holbrookia sample is shown in Fig 3.

Download:

Fig 3.

Gene ontology annotations for the liver transcriptome of H. maculata (yellow), H. subcaudalis (red), and H. lacerata (blue).

https://doi.org/10.1371/journal.pone.0234504.g003

Discussion

Genetic data are extremely useful in conservation studies because they can provide estimates of genetic diversity, population size (and viability), and local adaptations for imperiled populations and species. However, many taxa of conservation significance have little to no genetic data available, impeding studies that can inform conservation planning and action. The speed and ease of generating molecular data for the use of species management and policy have quickly increased with the transition to HTS technologies [8,48]. The decline of Northern and Southern Spot-Tailed Earless Lizard led us to generate and profile the liver transcriptome for each species. By doing so, we establish a genomic resource for future genome-scale studies on Holbrookia as well as provide initial insights into the divergence and evolution of H. lacerata and H. subcaudalis. Because divergence and adaptation are important components of conservation genetics, we (1) quantify the genetic distance between H. lacerata and H. subcaudalis, and (2) identify potential genes under selection in H. lacerata and H. subcaudalis.

Transcriptome annotation

Our gene ontology analyses revealed that the biological functions of our liver transcripts closely follow similar patterns of other liver transcriptomes in squamates [49]. A large number of genes expressed in the liver play a role in metabolism, proteolysis, and nitrogen compound synthesis and breakdown. Oxidative damage can lead to DNA lesions and strand breaks, and are caused by endogenous reactive oxygen species produced during normal cellular metabolism [50]. Here we found both heath shock and glutathione peroxidase proteins in our transcriptome set that are well documented as having roles in protecting against oxidative damage [51].

Pairwise distances

The U.S. Fish and Wildlife Service (FWS) is currently reviewing if H. lacerata and H. subcaudalis require protection under the endangered species act [28]. Using their historical classification as subspecies, the FWS can place either the Northern or Southern Spot-Tailed Earless Lizard in need of protection, or both. Here we have found significantly more divergence (0.73%) between the Northern and Southern Spot-Tailed Earless Lizard using conserved orthologous sequences than previous study has estimated between two allopatric species using nuclear genes [52]. The results in this study, alongside Roelke et al. (2018) finding that both the northern and southern lizards are genetically distinct using mitochondrial genomes, all further support the Hibbitts et al. (2019) recent reclassification of their taxonomy. The elevation of the Northern and Southern Spot-Tailed Earless Lizard from subspecies to full species status requires the FWS consider them separately for protection, as each species faces unique threats [23,24]. The southern H. subcaudalis has two isolated populations that were once part of a larger distribution across southern Texas, while the more robust northern species has more observations in multiple surveys of the region [23].

DAVID analysis of orthologs and positive selection

Established methods to find signatures of adaptive evolution across the genome coupled with the increased number of molecular markers captured with HTS techniques are facilitating the discovery of adaptive variation. Identifying populations from at-risk species that show signs of local adaptive genetic variation and subsequently maintaining it are essential parts of any conservation strategy [53]. Adaptive variation can inform conservation managers on how to perform genetic rescues or assisted gene flow to raise the fitness of small populations that lack genetic diversity [54]. As the use of large scale genomic data becomes more widely adopted in the field of conservation biology, it will have a significant role in conservation management and policy.

The bulk of nonsynonymous mutations are generally considered deleterious as they alter protein structure and function, which can reduce an organism's fitness, and purifying selection should remove these mutations. Genes such as RAD23 protein [55] and Damage-specific DNA Binding protein I [56] are involved in the repair of DNA lesions, and were clustered by DAVID into the nucleotide repair pathways that had no fingerprints of positive selection. Seldomly occurring, some amino acid replacements can lead to changes in proteins that are advantageous in new environments and selected for and kept in the species lineage.

Our branch-sites model selection test identified 12 positively selected genes in the Holbrookia lineage that have roles in immunity, development, and metabolism. From the twelve positively selected genes, three genes, BNIP3L, LBHD2, and PHGDH, are of particular interest for their potential role in shaping morphology, resistance against viruses, and amino acid production. The LBH (Limb-bud-and-heart) Domain-2 is part of a protein family that are key transcription regulators in embryonic development. The LBH gene is expressed early in embryogenesis with proteins playing a role in the development of limb buds and heart formation in a mice-model system [57]. A previous study has identified a single nonsynonymous mutation in an LBH homolog [58] associated with an adaptive variation, and here we have found multiple amino acid replacements selected for in the Holbrookia lineage. While we do not know the functional consequences of these amino acid changes, they may play an important role in local adaptation to desert and grassland environments that Holbrookia inhabit.

Immune genes are known hotspots for selection to act on, having persistent selective pressures by pathogens that elicit immune responses. BNIP3L promotes apoptotic activity, targeting dysfunctional mitochondria organelles. Specific viruses produced anti-apoptotic proteins that suppress the apoptosis of virally infected cells, allowing viruses to replicate and multiply. BNIP3L proteins interact with viral and cellular anti-apoptosis proteins and overcome the suppression to initiate apoptosis [59,60].

Amino acids are the essential building blocks to protein, and here we found the PHGDH (D-3-phosphoglycerate dehydrogenase) gene that has a role in amino synthesis has experienced positive selection with multiple amino acid replacements. The protein encoded by this gene is responsible for producing the non-essential amino acid L-serine. While described as non-essential, L-serine is a critical precursor required for the synthesis of D-serine, amino acids, and other metabolites in animal cells [61]. Deficiencies in the D-3-phosphoglycerate dehydrogenase protein results in metabolic defects affecting the nervous systems [62,63]. The kidneys produce the majority of L-serine under normal conditions [61]. When dietary protein is limited, the liver becomes the primary production of the metabolite. While this has been shown only in mammals, it is of interest to see if a similar pattern will be observed in Holbrookia species as the alteration of grassland for agriculture purposes and the use of pesticides, can impact the invertebrate populations and reduce their food availability.

Conclusions

While there are still problems in conservation genetics that can still be answered successfully using conventional conservation genetics techniques, the low cost of HTS technologies allows us to address questions of genetic diversity, adaptation, and taxonomy. High-throughput sequencing technologies are improving on traditional approaches generating extensive molecular resources for organisms of conservation interest. Scaling up from just a few loci to genomics and transcriptomics allows for better inferences and conservation practices. Here we have generated the first transcriptome for three lizards in the genus Holbrookia and identified multiple genes under positive selection. These transcriptomes have already provided insight into potentially adaptive loci in Holbrookia and will continue to contribute to future population-based and systematics studies of iguanian lizards.

Supporting information

S1 Table.

https://doi.org/10.1371/journal.pone.0234504.s001

(XLSX)

S1 Data.

https://doi.org/10.1371/journal.pone.0234504.s002

(TXT)

S1 File.

https://doi.org/10.1371/journal.pone.0234504.s003

(PHY)

S2 File.

https://doi.org/10.1371/journal.pone.0234504.s004

(PHY)

S3 File.

https://doi.org/10.1371/journal.pone.0234504.s005

(PHY)

S4 File.

https://doi.org/10.1371/journal.pone.0234504.s006

(PHY)

S5 File.

https://doi.org/10.1371/journal.pone.0234504.s007

(PHY)

S6 File.

https://doi.org/10.1371/journal.pone.0234504.s008

(PHY)

S7 File.

https://doi.org/10.1371/journal.pone.0234504.s009

(PHY)

S8 File.

https://doi.org/10.1371/journal.pone.0234504.s010

(PHY)

S9 File.

https://doi.org/10.1371/journal.pone.0234504.s011

(PHY)

S10 File.

https://doi.org/10.1371/journal.pone.0234504.s012

(PHY)

S11 File.

https://doi.org/10.1371/journal.pone.0234504.s013

(PHY)

S12 File.

https://doi.org/10.1371/journal.pone.0234504.s014

(PHY)

Acknowledgments

We thank the biologists of the Texas Parks and Wildlife Department. We thank the many landowners and land managers, both public and private, who granted us access to their land for lizard surveys. This work was funded by a Texas Parks & Wildlife Section Six Grant awarded to Corey E. Roelke and Matthew K. Fujita (TPWD 474241).

References

1. Tollis M, DeNardo DF, Cornelius JA, Dolby GA, Edwards T, Henen BT, et al. The Agassiz’s desert tortoise genome provides a resource for the conservation of a threatened species. PLOS ONE. 2017. p. e0177708. pmid:28562605
- View Article
- PubMed/NCBI
- Google Scholar
2. Cheng T, Fu B, Wu Y, Long R, Liu C, Xia Q. Transcriptome sequencing and positive selected genes analysis of Bombyx mandarina. PLoS One. 2015;10: e0122837. pmid:25806526
- View Article
- PubMed/NCBI
- Google Scholar
3. Roelke CE, Maldonado JA, Pope BW, Firneno TJ, Laduc TJ, Hibbitts TJ, et al. Mitochondrial genetic variation within and between Holbrookia lacerata lacerata and Holbrookia lacerata subcaudalis, the spot-tailed earless lizards of Texas. J Nat Hist. 2018;52: 1017–1027.
- View Article
- Google Scholar
4. Carruthers M, Yurchenko AA, Augley JJ, Adams CE, Herzyk P, Elmer KR. De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species. BMC Genomics. 2018;19: 32. pmid:29310597
- View Article
- PubMed/NCBI
- Google Scholar
5. Frankham R, Briscoe DA, Ballou JD. Introduction to Conservation Genetics. Cambridge University Press; 2002.
6. Avise JC, Hamrick JL. Conservation genetics: case histories from nature. 1996.
- View Article
- Google Scholar
7. Allendorf FW, Hohenlohe PA, Luikart G. Genomics and the future of conservation genetics. Nat Rev Genet. 2010;11: 697–709. pmid:20847747
- View Article
- PubMed/NCBI
- Google Scholar
8. Funk WC, McKay JK, Hohenlohe PA, Allendorf FW. Harnessing genomics for delineating conservation units. Trends Ecol Evol. 2012;27: 489–496. pmid:22727017
- View Article
- PubMed/NCBI
- Google Scholar
9. Höglund J. Evolutionary Conservation Genetics. Oxford University Press; 2009.
10. Primmer CR. From conservation genetics to conservation genomics. Ann N Y Acad Sci. 2009;1162: 357–368. pmid:19432656
- View Article
- PubMed/NCBI
- Google Scholar
11. Taberlet P, Coissac E, Pompanon F, Brochmann C, Willerslev E. Towards next-generation biodiversity assessment using DNA metabarcoding. Mol Ecol. 2012;21: 2045–2050. pmid:22486824
- View Article
- PubMed/NCBI
- Google Scholar
12. Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, et al. Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol. 2008;17: 1636–1647. pmid:18266620
- View Article
- PubMed/NCBI
- Google Scholar
13. Miller HC, Biggs PJ, Voelckel C, Nelson NJ. De novo sequence assembly and characterisation of a partial transcriptome for an evolutionarily distinct reptile, the tuatara (Sphenodon punctatus). BMC Genomics. 2012;13: 439. pmid:22938396
- View Article
- PubMed/NCBI
- Google Scholar
14. Schunter C, Vollmer SV, Macpherson E, Pascual M. Transcriptome analyses and differential gene expression in a non-model fish species with alternative mating tactics. BMC Genomics. 2014;15: 167. pmid:24581002
- View Article
- PubMed/NCBI
- Google Scholar
15. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29: 644–652. pmid:21572440
- View Article
- PubMed/NCBI
- Google Scholar
16. Henschel R, Nista PM, Lieber M, Haas BJ, Wu L-S, LeDuc RD. Trinity RNA-Seq assembler performance optimization. Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment on Bridging from the eXtreme to the campus and beyond—XSEDE ‘12. 2012. https://doi.org/10.1145/2335755.2335842
17. Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA. Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics. 2010;11: 180. pmid:20233449
- View Article
- PubMed/NCBI
- Google Scholar
18. Renaut S, Nolte AW, Bernatchez L. Mining transcriptome sequences towards identifying adaptive single nucleotide polymorphisms in lake whitefish species pairs (Coregonus spp. Salmonidae). Mol Ecol. 2010;19 Suppl 1: 115–131.
- View Article
- Google Scholar
19. Kristensen TN, Sørensen P, Pedersen KS, Kruhøffer M, Loeschcke V. Inbreeding by environmental interactions affect gene expression in Drosophila melanogaster. Genetics. 2006;173: 1329–1336. pmid:16624914
- View Article
- PubMed/NCBI
- Google Scholar
20. Cope ED. The Crocodilians, Lizards, and Snakes of North America. Smithsonian; 1900.
21. Stejneger L. Annotated List of Reptiles and Batrachians collected by Dr. C. Hart Merriam and party in Idaho, 1890. N Am Fauna. 1891; 109–114.
- View Article
- Google Scholar
22. Axtell RW. A solution to the long neglected Holbrookia lacerata problem, and the description of two new subspecies of Holbrookia. Chicago Academy of Sciences; 1956. Available: http://webapps.fhsu.edu/ksherp/bibFiles/854.pdf
23. Hibbitts TJ, Ryberg WA, Harvey JA, Voelker G, Lawing AM, Adams CS, et al. Phylogenetic structure of Holbrookia lacerata (Cope 1880) (Squamata: Phrynosomatidae): one species or two? Zootaxa. 2019;4619: zootaxa.4619.1.6.
- View Article
- Google Scholar
24. Duran M, Axtell RW. A Rangewide Inventory and Habitat Model for the Spot-tailed Earless Lizard, Holbrookia lacerata. Report submitted to Texas Parks and Wildlife Department. 2010.
- View Article
- Google Scholar
25. Wolaver BD, Pierre JP, Labay BJ, LaDuc TJ, Duran CM, Ryberg WA, et al. An approach for evaluating changes in land-use from energy sprawl and other anthropogenic activities with implications for biotic resource management. Environ Earth Sci. 2018;77: 171.
- View Article
- Google Scholar
26. Pierre JP, Wolaver BD, Labay BJ, LaDuc TJ, Duran CM, Ryberg WA, et al. Comparison of Recent Oil and Gas, Wind Energy, and Other Anthropogenic Landscape Alteration Factors in Texas Through 2014. Environ Manage. 2018;61: 805–818. pmid:29504039
- View Article
- PubMed/NCBI
- Google Scholar
27. Wolaver BD, Pierre JP, Ikonnikova SA, Andrews JR, McDaid G, Ryberg WA, et al. An Improved Approach for Forecasting Ecological Impacts from Future Drilling in Unconventional Shale Oil and Gas Plays. Environ Manage. 2018;62: 323–333. pmid:29654362
- View Article
- PubMed/NCBI
- Google Scholar
28. Ingram M. The Endangered Species Act in Texas: a survey and history. Austin (Texas): Texas Policy Foundation. 2017. Available: https://www.texaspolicy.com/library/doclib/2017-06-study-endangeredspeciessurvey-acee-mingram-2.pdf
29. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinforma. Oxf. Engl. 30, 2114–2120. 2014.
- View Article
- Google Scholar
30. Macmanes MD. On the optimal trimming of high-throughput mRNA sequence data. Front Genet. 2014;5: 13. pmid:24567737
- View Article
- PubMed/NCBI
- Google Scholar
31. Visser EA, Wegrzyn JL, Steenkmap ET, Myburg AA, Naidoo S. Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome. BMC Genomics. 2015;16: 1057. pmid:26652261
- View Article
- PubMed/NCBI
- Google Scholar
32. Chopra R, Burow G, Farmer A, Mudge J, Simpson CE, Burow MD. Comparisons of de novo transcriptome assemblers in diploid and polyploid species using peanut (Arachis spp.) RNA-Seq data. PLoS One. 2014;9: e115055. pmid:25551607
- View Article
- PubMed/NCBI
- Google Scholar
33. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357–359. pmid:22388286
- View Article
- PubMed/NCBI
- Google Scholar
34. Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22: 1658–1659. pmid:16731699
- View Article
- PubMed/NCBI
- Google Scholar
35. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31: 3210–3212. pmid:26059717
- View Article
- PubMed/NCBI
- Google Scholar
36. Haas B, Papanicolaou A, Others. TransDecoder (find coding regions within transcripts). Github, nd https://githubcom/TransDecoder/TransDecoder (accessed May 17, 2018). 2015.
37. Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44: D279–85. pmid:26673716
- View Article
- PubMed/NCBI
- Google Scholar
38. McGaugh SE, Bronikowski AM, Kuo C-H, Reding DM, Addis EA, Flagel LE, et al. Rapid molecular evolution across amniotes of the IIS/TOR network. Proc Natl Acad Sci U S A. 2015;112: 7055–7060. pmid:25991861
- View Article
- PubMed/NCBI
- Google Scholar
39. Emms DM, Kelly S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015;16: 157. pmid:26243257
- View Article
- PubMed/NCBI
- Google Scholar
40. Ranwez V, Harispe S, Delsuc F, Douzery EJP. MACSE: Multiple Alignment of Coding SEquences accounting for frameshifts and stop codons. PLoS One. 2011;6: e22594. pmid:21949676
- View Article
- PubMed/NCBI
- Google Scholar
41. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4: 44–57. pmid:19131956
- View Article
- PubMed/NCBI
- Google Scholar
42. Yang Z. Computational Molecular Evolution: Oxford University Press. New York. 2006.
43. Zhang J. Evaluation of an Improved Branch-Site Likelihood Method for Detecting Positive Selection at the Molecular Level. Molecular Biology and Evolution. 2005. pp. 2472–2479. pmid:16107592
- View Article
- PubMed/NCBI
- Google Scholar
44. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24: 1586–1591. pmid:17483113
- View Article
- PubMed/NCBI
- Google Scholar
45. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21: 3674–3676. pmid:16081474
- View Article
- PubMed/NCBI
- Google Scholar
46. Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, et al. BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics. Mol Biol Evol. 2018;35: 543–548. pmid:29220515
- View Article
- PubMed/NCBI
- Google Scholar
47. Blankers T, Townsend TM, Pepe K, Reeder TW, Wiens JJ. Contrasting global-scale evolutionary radiations: phylogeny, diversification, and morphological evolution in the major clades of iguanian lizards. Biol J Linn Soc Lond. 2013;108: 127–143.
- View Article
- Google Scholar
48. Garner BA, Hand BK, Amish SJ, Bernatchez L, Foster JT, Miller KM, et al. Genomics in Conservation: Case Studies and Bridging the Gap between Data and Application. Trends in ecology & evolution. 2016. pp. 81–83.
- View Article
- Google Scholar
49. Duan J, Sanggaard KW, Schauser L, Lauridsen SE, Enghild JJ, Schierup MH, et al. Transcriptome analysis of the response of Burmese python to digestion. Gigascience. 2017;6: 1–18.
- View Article
- Google Scholar
50. Cooke MS, Evans MD, Dizdaroglu M, Lunec J. Oxidative DNA damage: mechanisms, mutation, and disease. FASEB J. 2003;17: 1195–1214. pmid:12832285
- View Article
- PubMed/NCBI
- Google Scholar
51. Pamplona R, Costantini D. Molecular and structural antioxidant defenses against oxidative stress in animals. Am J Physiol Regul Integr Comp Physiol. 2011;301: R843–63. pmid:21775650
- View Article
- PubMed/NCBI
- Google Scholar
52. Sullivan JP, Lundberg JG, Hardman M. A phylogenetic analysis of the major groups of catfishes (Teleostei: Siluriformes) using rag1 and rag2 nuclear gene sequences. Mol Phylogenet Evol. 2006;41: 636–662. pmid:16876440
- View Article
- PubMed/NCBI
- Google Scholar
53. Crandall KA, Bininda-Emonds OR, Mace GM, Wayne RK. Considering evolutionary processes in conservation biology. Trends Ecol Evol. 2000;15: 290–295. pmid:10856956
- View Article
- PubMed/NCBI
- Google Scholar
54. Flanagan SP, Forester BR, Latch EK, Aitken SN, Hoban S. Guidelines for planning genomic assessment and monitoring of locally adaptive variation to inform species conservation. Evol Appl. 2018;11: 1035–1052. pmid:30026796
- View Article
- PubMed/NCBI
- Google Scholar
55. Watkins JF, Sung P, Prakash L, Prakash S. The Saccharomyces cerevisiae DNA repair gene RAD23 encodes a nuclear protein containing a ubiquitin-like domain required for biological function. Mol Cell Biol. 1993;13: 7757–7765. pmid:8246991
- View Article
- PubMed/NCBI
- Google Scholar
56. Iovine B, Iannella ML, Bevilacqua MA. Damage-specific DNA binding protein 1 (DDB1): a protein with a wide range of functions. Int J Biochem Cell Biol. 2011;43: 1664–1667. pmid:21959250
- View Article
- PubMed/NCBI
- Google Scholar
57. Briegel KJ, Joyner AL. Identification and characterization of Lbh, a novel conserved nuclear protein expressed during early limb and heart development. Dev Biol. 2001;233: 291–304. pmid:11336496
- View Article
- PubMed/NCBI
- Google Scholar
58. Powder KE, Cousin H, McLinden GP, Craig Albertson R. A nonsynonymous mutation in the transcriptional regulator lbh is associated with cichlid craniofacial adaptation and neural crest cell development. Mol Biol Evol. 2014;31: 3113–3124. pmid:25234704
- View Article
- PubMed/NCBI
- Google Scholar
59. Ney PA. Mitochondrial autophagy: Origins, significance, and role of BNIP3 and NIX. Biochim Biophys Acta. 2015;1853: 2775–2783. pmid:25753537
- View Article
- PubMed/NCBI
- Google Scholar
60. Imazu T, Shimizu S, Tagami S, Matsushima M, Nakamura Y, Miki T, et al. Bcl-2/E1B 19 kDa-interacting protein 3-like protein (Bnip3L) interacts with Bcl-2/Bcl-xL and induces apoptosis by altering mitochondrial membrane permeability. Oncogene. 1999. pp. 4523–4529. pmid:10467396
- View Article
- PubMed/NCBI
- Google Scholar
61. Grant GA. D-3-Phosphoglycerate Dehydrogenase. Front Mol Biosci. 2018;5: 110. pmid:30619878
- View Article
- PubMed/NCBI
- Google Scholar
62. Tabatabaie L, de Koning TJ, A J J, van den Berg IET, Berger R, Klomp LWJ. Novel mutations in 3-phosphoglycerate dehydrogenase (PHGDH) are distributed throughout the protein and result in altered enzyme kinetics. Human Mutation. 2009. pp. 749–756. pmid:19235232
- View Article
- PubMed/NCBI
- Google Scholar
63. Fuchs SA, Dorland L, de Sain-van der Velden MG, Hendriks M, Klomp LWJ, Berger R, et al. D-serine in the developing human central nervous system. Ann Neurol. 2006;60: 476–480. pmid:17068790
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Tollis M, DeNardo DF, Cornelius JA, Dolby GA, Edwards T, Henen BT, et al. The Agassiz’s desert tortoise genome provides a resource for the conservation of a threatened species. PLOS ONE. 2017. p. e0177708. pmid:28562605
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Cheng T, Fu B, Wu Y, Long R, Liu C, Xia Q. Transcriptome sequencing and positive selected genes analysis of Bombyx mandarina. PLoS One. 2015;10: e0122837. pmid:25806526
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Roelke CE, Maldonado JA, Pope BW, Firneno TJ, Laduc TJ, Hibbitts TJ, et al. Mitochondrial genetic variation within and between Holbrookia lacerata lacerata and Holbrookia lacerata subcaudalis, the spot-tailed earless lizards of Texas. J Nat Hist. 2018;52: 1017–1027.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref4] 4. Carruthers M, Yurchenko AA, Augley JJ, Adams CE, Herzyk P, Elmer KR. De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species. BMC Genomics. 2018;19: 32. pmid:29310597
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Frankham R, Briscoe DA, Ballou JD. Introduction to Conservation Genetics. Cambridge University Press; 2002.

[ref6] 6. Avise JC, Hamrick JL. Conservation genetics: case histories from nature. 1996.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref7] 7. Allendorf FW, Hohenlohe PA, Luikart G. Genomics and the future of conservation genetics. Nat Rev Genet. 2010;11: 697–709. pmid:20847747
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref8] 8. Funk WC, McKay JK, Hohenlohe PA, Allendorf FW. Harnessing genomics for delineating conservation units. Trends Ecol Evol. 2012;27: 489–496. pmid:22727017
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref9] 9. Höglund J. Evolutionary Conservation Genetics. Oxford University Press; 2009.

[ref10] 10. Primmer CR. From conservation genetics to conservation genomics. Ann N Y Acad Sci. 2009;1162: 357–368. pmid:19432656
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref11] 11. Taberlet P, Coissac E, Pompanon F, Brochmann C, Willerslev E. Towards next-generation biodiversity assessment using DNA metabarcoding. Mol Ecol. 2012;21: 2045–2050. pmid:22486824
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref12] 12. Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, et al. Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol. 2008;17: 1636–1647. pmid:18266620
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref13] 13. Miller HC, Biggs PJ, Voelckel C, Nelson NJ. De novo sequence assembly and characterisation of a partial transcriptome for an evolutionarily distinct reptile, the tuatara (Sphenodon punctatus). BMC Genomics. 2012;13: 439. pmid:22938396
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref14] 14. Schunter C, Vollmer SV, Macpherson E, Pascual M. Transcriptome analyses and differential gene expression in a non-model fish species with alternative mating tactics. BMC Genomics. 2014;15: 167. pmid:24581002
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref15] 15. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29: 644–652. pmid:21572440
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref16] 16. Henschel R, Nista PM, Lieber M, Haas BJ, Wu L-S, LeDuc RD. Trinity RNA-Seq assembler performance optimization. Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment on Bridging from the eXtreme to the campus and beyond—XSEDE ‘12. 2012. https://doi.org/10.1145/2335755.2335842

[ref17] 17. Parchman TL, Geist KS, Grahnen JA, Benkman CW, Buerkle CA. Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery. BMC Genomics. 2010;11: 180. pmid:20233449
View Article
PubMed/NCBI
Google Scholar

[55] View Article

[56] PubMed/NCBI

[57] Google Scholar

[ref18] 18. Renaut S, Nolte AW, Bernatchez L. Mining transcriptome sequences towards identifying adaptive single nucleotide polymorphisms in lake whitefish species pairs (Coregonus spp. Salmonidae). Mol Ecol. 2010;19 Suppl 1: 115–131.
View Article
Google Scholar

[59] View Article

[60] Google Scholar

[ref19] 19. Kristensen TN, Sørensen P, Pedersen KS, Kruhøffer M, Loeschcke V. Inbreeding by environmental interactions affect gene expression in Drosophila melanogaster. Genetics. 2006;173: 1329–1336. pmid:16624914
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref20] 20. Cope ED. The Crocodilians, Lizards, and Snakes of North America. Smithsonian; 1900.

[ref21] 21. Stejneger L. Annotated List of Reptiles and Batrachians collected by Dr. C. Hart Merriam and party in Idaho, 1890. N Am Fauna. 1891; 109–114.
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref22] 22. Axtell RW. A solution to the long neglected Holbrookia lacerata problem, and the description of two new subspecies of Holbrookia. Chicago Academy of Sciences; 1956. Available: http://webapps.fhsu.edu/ksherp/bibFiles/854.pdf

[ref23] 23. Hibbitts TJ, Ryberg WA, Harvey JA, Voelker G, Lawing AM, Adams CS, et al. Phylogenetic structure of Holbrookia lacerata (Cope 1880) (Squamata: Phrynosomatidae): one species or two? Zootaxa. 2019;4619: zootaxa.4619.1.6.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref24] 24. Duran M, Axtell RW. A Rangewide Inventory and Habitat Model for the Spot-tailed Earless Lizard, Holbrookia lacerata. Report submitted to Texas Parks and Wildlife Department. 2010.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref25] 25. Wolaver BD, Pierre JP, Labay BJ, LaDuc TJ, Duran CM, Ryberg WA, et al. An approach for evaluating changes in land-use from energy sprawl and other anthropogenic activities with implications for biotic resource management. Environ Earth Sci. 2018;77: 171.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref26] 26. Pierre JP, Wolaver BD, Labay BJ, LaDuc TJ, Duran CM, Ryberg WA, et al. Comparison of Recent Oil and Gas, Wind Energy, and Other Anthropogenic Landscape Alteration Factors in Texas Through 2014. Environ Manage. 2018;61: 805–818. pmid:29504039
View Article
PubMed/NCBI
Google Scholar

[80] View Article

[81] PubMed/NCBI

[82] Google Scholar

[ref27] 27. Wolaver BD, Pierre JP, Ikonnikova SA, Andrews JR, McDaid G, Ryberg WA, et al. An Improved Approach for Forecasting Ecological Impacts from Future Drilling in Unconventional Shale Oil and Gas Plays. Environ Manage. 2018;62: 323–333. pmid:29654362
View Article
PubMed/NCBI
Google Scholar

[84] View Article

[85] PubMed/NCBI

[86] Google Scholar

[ref28] 28. Ingram M. The Endangered Species Act in Texas: a survey and history. Austin (Texas): Texas Policy Foundation. 2017. Available: https://www.texaspolicy.com/library/doclib/2017-06-study-endangeredspeciessurvey-acee-mingram-2.pdf

[ref29] 29. Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinforma. Oxf. Engl. 30, 2114–2120. 2014.
View Article
Google Scholar

[89] View Article

[90] Google Scholar

[ref30] 30. Macmanes MD. On the optimal trimming of high-throughput mRNA sequence data. Front Genet. 2014;5: 13. pmid:24567737
View Article
PubMed/NCBI
Google Scholar

[92] View Article

[93] PubMed/NCBI

[94] Google Scholar

[ref31] 31. Visser EA, Wegrzyn JL, Steenkmap ET, Myburg AA, Naidoo S. Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome. BMC Genomics. 2015;16: 1057. pmid:26652261
View Article
PubMed/NCBI
Google Scholar

[96] View Article

[97] PubMed/NCBI

[98] Google Scholar

[ref32] 32. Chopra R, Burow G, Farmer A, Mudge J, Simpson CE, Burow MD. Comparisons of de novo transcriptome assemblers in diploid and polyploid species using peanut (Arachis spp.) RNA-Seq data. PLoS One. 2014;9: e115055. pmid:25551607
View Article
PubMed/NCBI
Google Scholar

[100] View Article

[101] PubMed/NCBI

[102] Google Scholar

[ref33] 33. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9: 357–359. pmid:22388286
View Article
PubMed/NCBI
Google Scholar

[104] View Article

[105] PubMed/NCBI

[106] Google Scholar

[ref34] 34. Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22: 1658–1659. pmid:16731699
View Article
PubMed/NCBI
Google Scholar

[108] View Article

[109] PubMed/NCBI

[110] Google Scholar

[ref35] 35. Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31: 3210–3212. pmid:26059717
View Article
PubMed/NCBI
Google Scholar

[112] View Article

[113] PubMed/NCBI

[114] Google Scholar

[ref36] 36. Haas B, Papanicolaou A, Others. TransDecoder (find coding regions within transcripts). Github, nd https://githubcom/TransDecoder/TransDecoder (accessed May 17, 2018). 2015.

[ref37] 37. Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44: D279–85. pmid:26673716
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref38] 38. McGaugh SE, Bronikowski AM, Kuo C-H, Reding DM, Addis EA, Flagel LE, et al. Rapid molecular evolution across amniotes of the IIS/TOR network. Proc Natl Acad Sci U S A. 2015;112: 7055–7060. pmid:25991861
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref39] 39. Emms DM, Kelly S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015;16: 157. pmid:26243257
View Article
PubMed/NCBI
Google Scholar

[125] View Article

[126] PubMed/NCBI

[127] Google Scholar

[ref40] 40. Ranwez V, Harispe S, Delsuc F, Douzery EJP. MACSE: Multiple Alignment of Coding SEquences accounting for frameshifts and stop codons. PLoS One. 2011;6: e22594. pmid:21949676
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref41] 41. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4: 44–57. pmid:19131956
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref42] 42. Yang Z. Computational Molecular Evolution: Oxford University Press. New York. 2006.

[ref43] 43. Zhang J. Evaluation of an Improved Branch-Site Likelihood Method for Detecting Positive Selection at the Molecular Level. Molecular Biology and Evolution. 2005. pp. 2472–2479. pmid:16107592
View Article
PubMed/NCBI
Google Scholar

[138] View Article

[139] PubMed/NCBI

[140] Google Scholar

[ref44] 44. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24: 1586–1591. pmid:17483113
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref45] 45. Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21: 3674–3676. pmid:16081474
View Article
PubMed/NCBI
Google Scholar

[146] View Article

[147] PubMed/NCBI

[148] Google Scholar

[ref46] 46. Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, et al. BUSCO Applications from Quality Assessments to Gene Prediction and Phylogenomics. Mol Biol Evol. 2018;35: 543–548. pmid:29220515
View Article
PubMed/NCBI
Google Scholar

[150] View Article

[151] PubMed/NCBI

[152] Google Scholar

[ref47] 47. Blankers T, Townsend TM, Pepe K, Reeder TW, Wiens JJ. Contrasting global-scale evolutionary radiations: phylogeny, diversification, and morphological evolution in the major clades of iguanian lizards. Biol J Linn Soc Lond. 2013;108: 127–143.
View Article
Google Scholar

[154] View Article

[155] Google Scholar

[ref48] 48. Garner BA, Hand BK, Amish SJ, Bernatchez L, Foster JT, Miller KM, et al. Genomics in Conservation: Case Studies and Bridging the Gap between Data and Application. Trends in ecology & evolution. 2016. pp. 81–83.
View Article
Google Scholar

[157] View Article

[158] Google Scholar

[ref49] 49. Duan J, Sanggaard KW, Schauser L, Lauridsen SE, Enghild JJ, Schierup MH, et al. Transcriptome analysis of the response of Burmese python to digestion. Gigascience. 2017;6: 1–18.
View Article
Google Scholar

[160] View Article

[161] Google Scholar

[ref50] 50. Cooke MS, Evans MD, Dizdaroglu M, Lunec J. Oxidative DNA damage: mechanisms, mutation, and disease. FASEB J. 2003;17: 1195–1214. pmid:12832285
View Article
PubMed/NCBI
Google Scholar

[163] View Article

[164] PubMed/NCBI

[165] Google Scholar

[ref51] 51. Pamplona R, Costantini D. Molecular and structural antioxidant defenses against oxidative stress in animals. Am J Physiol Regul Integr Comp Physiol. 2011;301: R843–63. pmid:21775650
View Article
PubMed/NCBI
Google Scholar

[167] View Article

[168] PubMed/NCBI

[169] Google Scholar

[ref52] 52. Sullivan JP, Lundberg JG, Hardman M. A phylogenetic analysis of the major groups of catfishes (Teleostei: Siluriformes) using rag1 and rag2 nuclear gene sequences. Mol Phylogenet Evol. 2006;41: 636–662. pmid:16876440
View Article
PubMed/NCBI
Google Scholar

[171] View Article

[172] PubMed/NCBI

[173] Google Scholar

[ref53] 53. Crandall KA, Bininda-Emonds OR, Mace GM, Wayne RK. Considering evolutionary processes in conservation biology. Trends Ecol Evol. 2000;15: 290–295. pmid:10856956
View Article
PubMed/NCBI
Google Scholar

[175] View Article

[176] PubMed/NCBI

[177] Google Scholar

[ref54] 54. Flanagan SP, Forester BR, Latch EK, Aitken SN, Hoban S. Guidelines for planning genomic assessment and monitoring of locally adaptive variation to inform species conservation. Evol Appl. 2018;11: 1035–1052. pmid:30026796
View Article
PubMed/NCBI
Google Scholar

[179] View Article

[180] PubMed/NCBI

[181] Google Scholar

[ref55] 55. Watkins JF, Sung P, Prakash L, Prakash S. The Saccharomyces cerevisiae DNA repair gene RAD23 encodes a nuclear protein containing a ubiquitin-like domain required for biological function. Mol Cell Biol. 1993;13: 7757–7765. pmid:8246991
View Article
PubMed/NCBI
Google Scholar

[183] View Article

[184] PubMed/NCBI

[185] Google Scholar

[ref56] 56. Iovine B, Iannella ML, Bevilacqua MA. Damage-specific DNA binding protein 1 (DDB1): a protein with a wide range of functions. Int J Biochem Cell Biol. 2011;43: 1664–1667. pmid:21959250
View Article
PubMed/NCBI
Google Scholar

[187] View Article

[188] PubMed/NCBI

[189] Google Scholar

[ref57] 57. Briegel KJ, Joyner AL. Identification and characterization of Lbh, a novel conserved nuclear protein expressed during early limb and heart development. Dev Biol. 2001;233: 291–304. pmid:11336496
View Article
PubMed/NCBI
Google Scholar

[191] View Article

[192] PubMed/NCBI

[193] Google Scholar

[ref58] 58. Powder KE, Cousin H, McLinden GP, Craig Albertson R. A nonsynonymous mutation in the transcriptional regulator lbh is associated with cichlid craniofacial adaptation and neural crest cell development. Mol Biol Evol. 2014;31: 3113–3124. pmid:25234704
View Article
PubMed/NCBI
Google Scholar

[195] View Article

[196] PubMed/NCBI

[197] Google Scholar

[ref59] 59. Ney PA. Mitochondrial autophagy: Origins, significance, and role of BNIP3 and NIX. Biochim Biophys Acta. 2015;1853: 2775–2783. pmid:25753537
View Article
PubMed/NCBI
Google Scholar

[199] View Article

[200] PubMed/NCBI

[201] Google Scholar

[ref60] 60. Imazu T, Shimizu S, Tagami S, Matsushima M, Nakamura Y, Miki T, et al. Bcl-2/E1B 19 kDa-interacting protein 3-like protein (Bnip3L) interacts with Bcl-2/Bcl-xL and induces apoptosis by altering mitochondrial membrane permeability. Oncogene. 1999. pp. 4523–4529. pmid:10467396
View Article
PubMed/NCBI
Google Scholar

[203] View Article

[204] PubMed/NCBI

[205] Google Scholar

[ref61] 61. Grant GA. D-3-Phosphoglycerate Dehydrogenase. Front Mol Biosci. 2018;5: 110. pmid:30619878
View Article
PubMed/NCBI
Google Scholar

[207] View Article

[208] PubMed/NCBI

[209] Google Scholar

[ref62] 62. Tabatabaie L, de Koning TJ, A J J, van den Berg IET, Berger R, Klomp LWJ. Novel mutations in 3-phosphoglycerate dehydrogenase (PHGDH) are distributed throughout the protein and result in altered enzyme kinetics. Human Mutation. 2009. pp. 749–756. pmid:19235232
View Article
PubMed/NCBI
Google Scholar

[211] View Article

[212] PubMed/NCBI

[213] Google Scholar

[ref63] 63. Fuchs SA, Dorland L, de Sain-van der Velden MG, Hendriks M, Klomp LWJ, Berger R, et al. D-serine in the developing human central nervous system. Ann Neurol. 2006;60: 476–480. pmid:17068790
View Article
PubMed/NCBI
Google Scholar

[215] View Article

[216] PubMed/NCBI

[217] Google Scholar

Figures

Abstract

Introduction

Materials and methods

Sampling

RNA extraction and RNA-Seq library preparation for sequencing

De novo transcriptome assembly and quality assessments

Identification of orthologs and paralogs and pairwise distance

DAVID functional analysis

Selection test

Gene annotation

Results

Sequencing and De novo transcriptome assembly

Assembly quality assessment

Orthologs and paralogs identification

DAVID functional analysis and selection test

Pairwise distance

Annotations

Discussion

Transcriptome annotation

Pairwise distances

DAVID analysis of orthologs and positive selection

Conclusions

Supporting information

Acknowledgments

References