Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Characterization of microsatellite markers in the coding regions of the Penaeus vannamei genome

  • Iasmim Santos Mangabeira-Silva ,

    Contributed equally to this work with: Iasmim Santos Mangabeira-Silva, Paulo Eduardo Toscano Soares

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Writing – original draft

    Affiliations Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil, Postgraduate Program in Biotechnology, RENORBIO, Natal, RN, Brazil

  • Paulo Eduardo Toscano Soares ,

    Contributed equally to this work with: Iasmim Santos Mangabeira-Silva, Paulo Eduardo Toscano Soares

    Roles Conceptualization, Data curation, Methodology, Software, Writing – review & editing

    Affiliations Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil, Postgraduate Program in Biochemistry, Federal University of Rio Grande do Norte, Natal, RN, Brazil

  • Yago Tomaz Vieira da Silva,

    Roles Investigation, Methodology

    Affiliation Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil

  • Beatriz Helena Dantas Rodrigues de Albuquerque,

    Roles Investigation, Methodology

    Affiliations Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil, Postgraduate Program in Biochemistry, Federal University of Rio Grande do Norte, Natal, RN, Brazil

  • Maryana Thalyta Ferreira Câmera de Oliveira,

    Roles Investigation, Methodology

    Affiliations Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil, Postgraduate Program in Biochemistry, Federal University of Rio Grande do Norte, Natal, RN, Brazil

  • Larissa Alves Honorato Ferreira,

    Roles Investigation, Methodology

    Affiliation Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil

  • Maria Fernanda Bezerra de Souza,

    Roles Methodology

    Affiliations Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil, Postgraduate Program in Biochemistry, Federal University of Rio Grande do Norte, Natal, RN, Brazil

  • Danyllo Vieira de Lucena,

    Roles Writing – review & editing

    Affiliation Postgraduate Program in Civil and Environmental Engineering, Federal University of Campina Grande, Campina Grande, PB, Brazil

  • Jessica Marina Paiva Pereira,

    Roles Investigation, Methodology

    Affiliation Samaria Unidade de Pós Larvas, Touros, RN, Brazil

  • Roseli Pimentel Pinheiro e Silva,

    Roles Methodology, Resources

    Affiliation Samaria Unidade de Pós Larvas, Touros, RN, Brazil

  • Daniel Carlos Ferreira Lanza

    Roles Conceptualization, Funding acquisition, Investigation, Project administration, Resources, Software, Supervision, Validation, Writing – original draft, Writing – review & editing

    danielclanza@gmail.com

    Affiliations Laboratory of Applied Molecular Biology—LAPLIC, Federal University of Rio Grande do Norte, Natal, RN, Brazil, Postgraduate Program in Biotechnology, RENORBIO, Natal, RN, Brazil, Postgraduate Program in Biochemistry, Federal University of Rio Grande do Norte, Natal, RN, Brazil

Abstract

In this study, an extensive analysis of microsatellite markers (Single Tandem Repeats—STRs) in Penaeus vannamei was conducted at an advanced level. The markers were thoroughly examined, characterized, and specific markers located within coding regions were identified. Out of a total of 306 STRs, 117 were classified as perfect markers based on their single repeat motif. Among these perfect markers, 62 were found to be associated with predicted coding genes (mRNA), which were involved in various functions such as binding, catalytic activity, ATP-dependent activity, transcription, structural and molecular regulation. To validate the accuracy of the findings, a sample of nine markers was subjected to in vitro testing, which confirmed the presence of polymorphisms within the population. These results suggest the existence of different protein isoforms within the population, indicating the potential of these markers for application in both population and phenotype-genotype association studies. This innovative approach opens up new possibilities for investigating the impact of genomic plasticity in populations of P. vannamei.

Introduction

An inherent characteristic of marine shrimp production systems is the increase in inbreeding within populations, which leads to a higher genetic similarity among individuals. Consequently, this can result in undesired production traits, such as increased susceptibility to pathogens [13]. Genetic selection plays a crucial role in identifying desirable phenotypes and maintaining genetic diversity, both of which are essential for successful shrimp farming [4, 5].

In this context, the study and identification of microsatellite markers, also known as Short Tandem Repeats (STRs), hold great potential for practical applications. These markers offer a cost-effective means to monitor individual genetic characteristics [6, 7]. Microsatellites possess several advantageous features, including broad genome coverage, multi-allelic nature, co-dominance, and polymorphism, making them excellent tools for studying genetic variation and constructing genetic maps [8, 9].

Among the STR markers, those composed of simple sequences (e.g., consisting of a single repeated motif like ACACAC…AC, or simply ACn) are referred to as perfect markers. These markers are preferred for panel assembly due to their high accuracy in genotyping breeding populations. Their repetitive and straightforward structure facilitates efficient genotyping, making them an excellent choice in breeding programs. These markers can be categorized into mono-, di-, tri-, tetra-, penta-, and hexanucleotides, based on the length of their repeat motif [10]. The variation in the number of motif repeats often provides insights into the possible alleles present at each locus, making it easier to determine bin sets. This information is crucial for automating the genotyping process [1113].

During their early development, STRs were primarily regarded as neutral markers [14, 15]. However, in recent years, numerous studies have demonstrated associations between these markers and known genes or specific phenotypes. This growing body of research supports the notion that certain STRs play an evolutionary role by serving as significant sources of adaptive genetic variation [1417].

The association of STR markers with specific phenotypes in marine shrimp Penaeus vannamei has been relatively unexplored, primarily due to the availability of new SNP-based genotyping methods [18]. Unlike SNPs, which typically require around 60 to 100 markers for genotyping individuals, a small number of STRs (sometimes even fewer) can accomplish the same task [19, 20]. This difference opens up a range of possibilities for discoveries, considering that approximately 24% of the P. vannamei genome is composed of STRs, and within this fraction, around 78% are found in genes that potentially encode proteins [17, 21].

Therefore, the objective of this study was to further investigate and characterize all previously described microsatellite markers in P. vannamei. Specifically, the focus was on the functional characterization of perfect markers located within coding regions. Additionally, a multiplex PCR panel using a small set of perfect STRs was proposed. The results obtained in this study emphasize the significance of STRs within protein-coding regions and contribute to the development of a new STR-based multiplex genotyping panel for P. vannamei. This panel offers a cost-effective alternative for genetic selection programs, providing valuable tools for breeders in their efforts to improve the species.

Material and methods

Microsatellite search on Penaeus vannamei

Literature on STRs in P. vannamei was obtained from the NCBI (https://www.ncbi.nlm.nih.gov/) using the following ENTREZ query: "microsatellite repeats"[MeSH Terms] OR ("microsatellite"[All Fields] AND "repeats"[All Fields]) OR "microsatellite repeats"[All Fields] OR "microsatellites"[All Fields]) AND shrimp[All Fields] AND ("penaeidae"[MeSH Terms] OR "penaeidae"[All Fields] OR "penaeus"[All Fields]) AND vannamei[All Fields].

Based on this investigation, a comprehensive database of STRs was established. These markers were categorized based on their type, whether they were perfect or composite, as well as their specific motif and number of repeats. The database also included information about the position of each marker within the genomic sequence, the primer sequences used for amplification (both forward and reverse), the melting temperature (TM) of each primer, the number of alleles detected for each STR in the tested populations, and the name of the corresponding author.

For subsequent analyses, only the markers classified as perfect STRs were chosen and utilized.

Identification of microsatellites within coding sequences

The in silico identification of the sequences flanking each marker (i.e.: the amplicon sequence) was performed using Primer-BLAST (https://www.ncbi.nlm.nih.gov/tools/primer-blast/). The forward and reverse primers for each STR were used as the search input, the ‘database’ was set to “Refseq representative genomes” and the “organism” parameter was set to “Penaeus vannamei (taxid:6689)”. The parameter “max target amplicon size” was adjusted to 500 bp based on the average size of the alleles previously described.

Sequences in which the primers showed a 100% identity match or had at most 1 base pair difference (SNP or deletion) were considered for further analysis. The flanking regions, including the primers, also known as putative amplicons, were manually extracted. This process involved using the provided coordinates of the aligned primers and ensuring that the entire amplicon region was captured, starting from the left-most coordinate, and ending at the right-most coordinate in the corresponding sequence. In cases where confirmation of microsatellites was necessary, the Geneious software version 7.1.3 [22] was employed.

Subsequently, the sequences of the putative amplicons were functionally annotated using BLASTx [23] against the ’nr’ database, employing default parameters. Protein functions were assigned to the sequences based on their best alignment (best hit). This analysis aimed to identify putative genes encoded by the sequences aligning with each marker. By associating the marker sequences with known protein functions, we gained insights into the potential genetic elements represented by the markers.

Next, we determine genomic locations of these marker sequences in the P. vannamei genome, unveiling the associated genomic features based on the provided genome annotations [21].

For this analysis, we exclusively selected markers that yielded a single hit during the Primer-BLAST step to ensure accuracy and eliminate potential ambiguities caused by multiple targets within the same or different genomic scaffolds. In essence, we conducted a BLASTn search to obtain the genomic location of each marker, employing a ’best hit’ approach. Subsequently, we examined the genome annotation file (GFF file) directly using the BEDTools software v2.26.0 to identify any overlapping annotations or features associated with the markers [24]. First, BLASTn from the NCBI BLAST+ package v2.9.0 [25] was run locally (parameters: -task blastn -outfmt 6) to align the marker sequences to the P. vannamei genome, assigning each marker’s location to the one in its best hit.

A second BLASTn analysis was conducted using the Lvan99 sequence, which exhibited 90% low complexity and initially failed to align. To overcome this issue, the second BLASTn search was performed with the same parameters as before but with the DUST filter deactivated (-dust no), resulting in successful alignments. The results from both BLASTn analyses were then merged.

For each aligned marker sequence, the corresponding subject ID (representing the scaffold ID), start and end positions of the alignment in the subject, and query ID (representing the marker ID) were extracted. These details were formatted into a BED format consisting of four columns. The BEDtools program was utilized to search for features based on this BED file, allowing for further analysis of the aligned marker sequences in relation to genomic features [24].

The association of markers to linkage groups was performed using the information provided in the P. vannamei genome [21]. With the obtained BLAST best hits, we utilized the accession IDs of the subjects to retrieve the corresponding scaffold IDs. Subsequently, we mapped these scaffold IDs to their respective linkage groups, enabling the assignment of markers to specific linkage groups within the genome.

The BEDtools software was employed using the ’intersect’ function, following the pseudocommand with the specified parameters: $ bedtools intersect -a <Intervals>.bed -b <GenomicAnnotations.gff> -wo. This allowed the identification of all genomic annotations within the P. vannamei genome that overlapped with the genomic regions where the markers aligned. Detailed information regarding these genomic annotations was extracted from the genomic feature file (GFF file) associated with the P. vannamei genome. The results obtained from the ’BEDtools intersect’ analysis were subsequently analyzed using Python scripts, utilizing data analysis packages such as Pandas and Jupyter.

The protein identified through functional annotation using BLASTn from BEDtools was further subjected to a search in InterPro (https://www.ebi.ac.uk/interpro/). This allowed for the retrieval of Gene Ontology (GO) terms associated with the protein, providing insights into its molecular functions.

Selection of polymorphic STR markers, PCR and genotyping

After establishing the initial database, a series of filtering steps was applied to select the perfect markers for in vitro validation in the population of P. vannamei. The selected markers were those with the highest number of repeat motifs, the highest number of alleles, and located within coding regions.

DNA extraction was carried out using total DNA from 50 individuals of P. vannamei collected from shrimp farms in the state of Rio Grande do Norte, Brazil. Approximately 30 mg of abdominal muscle from each shrimp was utilized for DNA extraction, following the instructions provided by the Nucleospin® Tissue Macherey-Nagel kit.

The selected microsatellite markers were amplified under the specific conditions previously described for each marker. Genotyping was performed using a 6.5% polyacrylamide gel stained with silver nitrate. The electrophoresis runs were conducted in a vertical tank (Owl Separation System, Thermo Fisher) for an approximate duration of 5 hours, with a voltage of 2000 V and a power of 400 W.

Allelic and genotypic frequencies and the number of expected and observed heterozygotes were calculated using the software GenAlEx v 6.503 [26, 27]. The Genepop v 4.7 software was used to calculate inbreeding coefficient values (FIS) for all loci in the population [28], available online at the website <https://genepop.curtin.edu.au/index.html>. Sequential Bonferroni corrections were performed to test the significance of the results obtained [29].

Results

Perfect microsatellite markers in P. vannamei

We have identified 57 articles that describe microsatellites in P. vannamei from 1996 to 2021 (Table 1). Out of the microsatellites described, 306 have been characterized in population studies and exhibit polymorphism (S1 Table), enabling a more comprehensive investigation in this study. Among these 306 microsatellite markers, 117 (38.88%) are considered perfect markers due to their single repeat motif (S2 Table).

thumbnail
Table 1. Articles containing information about STRs in P. vannamei.

https://doi.org/10.1371/journal.pone.0289351.t001

Within the perfect markers, the highest prevalence was observed for dinucleotide (46.2%), followed by trinucleotides (30.8%) and tetranucleotides (11.1%). The mononucleotide repeats consisted of thymine repeat motifs (T), ranging from 15 to 20 repeats. One microsatellite marker exhibited a hexanucleotide repeat motif (Fig 1A). The number of uninterrupted repeats ranged from 3 to 62, with the majority consisting of short repeats. Both groups with up to 5 repetitions and the category of 6 to 10 repetitions were more frequent, comprising 34 and 43 markers, respectively (Fig 1B).

thumbnail
Fig 1. Overview and distribution of short tandem repeats (STRs).

(A) Summary of di-, tri-, tetra-, penta-, and hexanucleotide microsatellites. (B) Distribution of STRs by Repeat Number.

https://doi.org/10.1371/journal.pone.0289351.g001

Among the perfect markers, 55 have their sequences deposited in GenBank, while 62 do not. Therefore, we identified their sequences in the P. vannamei genome using Primer-BLAST. This process resulted in obtaining 135 target sequences, as seven markers aligned in multiple regions of the genome, producing similar target sequences. These markers include CNM-MG-444 (n = 3), CNM-MG-474 (n = 2), CNM-MG-369 (n = 2), c7152/f1p0/6395_1 (n = 2), c7152/f1p0/6395_2 (n = 2) c7152/f1p0/6395_3 (n = 2) and c16138/f14p37/618 (n = 19). No target sequences were found for seven loci: Livan10, Livan13, Livan56, Lvan33(H07-2), CNM-MG-489, CNM-MG-498, and c2322/f15p12/3591, consequently, these markers were excluded from further analyses. All identified sequences are provided in S3 Table.

Functional characterization of sequences containing microsatellites

Among the 135 sequences, 72 (53.33%) were associated with the prediction of gene features in functional annotation through BLASTn. Out of these, 62 were linked to predicted coding genes (mRNA), and 53 STRs were in exons. Furthermore, the linkage group with the highest number of STR markers was LG36 (n = 9), followed by LG20 (n = 6) and LG44 (n = 4) (Table 2 and S4 Table). These sequences were then assigned Gene Ontology (GO) annotations.

thumbnail
Table 2. Putative mRNA products from aligned marker sequences as inferred from the annotations in their corresponding sequences in P. vannamei genome.

https://doi.org/10.1371/journal.pone.0289351.t002

The GO terms for molecular function describe the biochemical activities of genes based on various biological features associated with a given protein, including "biological process," "cellular component," and "molecular function" [75]. However, since there was insufficient information available for these proteins regarding the GO categories of "biological process" and "cellular component," they were not considered in our GO analysis.

The 40 analyzed in the molecular function category of GO exhibited 57 predicted functions. The identified proteins were primarily associated with binding (54.39%; n = 31) and catalytic activity (31.58%; n = 18). In addition to these main functions, there were proteins with ATP-dependent activity (n = 3), structural molecule activity (n = 2), molecular function regulation (n = 1), molecular transducer activity (n = 1) and transcriptional regulatory activity (n = 1) (S5 Table). Notably, several markers, including Lvan96, Lvan183, Lvan195, LvE22X, LvE26X, Livan04, CNM-MG-367, CNM-MG-369, CNM-MG-372, CNM-MG-390, CNM-MG-548, c2438/f35p18/1618, c16138/f14p37/618, CNM-MG-479, Lvan93, TUMXLv10.484, and TUMXLv9.174, were associated with more than one molecular function (Fig 2). On the other hand, 22 putative marker proteins did not have annotations for a molecular function (S5 Table).

thumbnail
Fig 2. Gene ontology analysis.

The numbers within the diagram represent the number of characterized sequences in each category.

https://doi.org/10.1371/journal.pone.0289351.g002

Prospecting of STR markers and population analysis by genotyping

The nine markers located in exons described in Table 3 were evaluated in a population to verify their potential for population studies and potential variations in each coding region.

The average number of alleles per locus was approximately five, ranging from 1 to 8. However, the marker Livan44 did not exhibit any polymorphisms. Among the markers, Lvan93 displayed the highest heterozygosity (Ho = 1.000), while Livan04 had the lowest (Ho = 0). The markers Livan44, Lvan183, and Lvan93 showed significant departures from Hardy-Weinberg (HW) equilibrium based on the probability test. The markers CNM-MG-402 and CNM-MG-421 did not amplify in our population (Table 4). Although the primer Lvan204 could be amplified, population analysis was not conducted for this marker.

thumbnail
Table 4. Genetic parameters considering six microsatellite loci.

https://doi.org/10.1371/journal.pone.0289351.t004

The inbreeding coefficient (FIS) was less than zero for Livan04, Lvan183, and Lvan93, indicating higher genetic variability and the coexistence of different protein isoforms within the population (Table 4).

The positions where these markers overlap in the exon are depicted in Fig 3. As they are situated within exon regions, variations in the number of their repetitions can alter the codons, thus potentially modifying the function of the protein. The majority of these STRs were located at the 3’ end of the mRNAs.

thumbnail
Fig 3. STRs located in exons within the P. vannamei genome.

The marker sequences (shown in blue) represent the entire amplicon and include the primers (dark green and light green). The repeats within the markers are also indicated (orange). Exon annotations are presented in black.

https://doi.org/10.1371/journal.pone.0289351.g003

Discussion

This study aimed to gather and consolidate all the currently available information on microsatellite markers characterized for P. vannamei. We believe that this represents a significant milestone, as data on these markers were previously scattered and, in many cases, lacked information regarding their genomic location. This was primarily since most of these markers were designed prior to the first genomic draft of P. vannamei [21]. The publication of the genomic draft showed the occurrence of 44 linkage groups, mapping to 44 distinct chromosomes. This allowed us to precisely map most of the markers to a linkage group, providing valuable information for breeding programs.

Overall, our observations revealed a high occurrence of diverse repeat motifs (ranging in length from 1 to 6 bp) within perfect short tandem repeats (STRs), as depicted in Fig 1. Among these, markers featuring dinucleotide repeats were the most prevalent, followed by trinucleotide and tetranucleotide repeats, which aligns with the findings reported in the original studies [21, 69, 72].

The main challenge encountered in this study was the task of establishing the linkage between these markers and their respective genomic locations. Up until now, no prior research has been conducted to connect the existing microsatellite markers (with available sequences) to their specific genomic positions. It was quite surprising to discover that numerous microsatellites in P. vannamei are situated within genomic regions that potentially bear some adaptive significance, as demonstrated in Table 2 and Fig 2. Functional annotation analysis indicated that out of the perfect markers, 62 (45.9%) were located within coding regions. In other words, these genes are associated with various processes such as host cell response to pathogens, energy metabolism, cytoskeleton organization/reorganization, protein folding, and the efficiency of translation mechanisms.

During our analysis, we observed redundancy and ambiguity in certain markers. Utilizing the Primer BLAST search, we found that the primers for the following five markers aligned with multiple locations in the genome. These locations could either be within the same genomic scaffolds or in different scaffolds, with the possibility of being on the same chromosome or not. The markers and their respective counts are as follows: CNM-MG-369 (n = 2), CNM-MG-444 (n = 3), CNM-MG-474 (n = 2), c7152/f1p0/6395 (n = 3), and c16138/f14p37/618 (n = 19). Further details can be found in S3 Table. This result confirms the existence of regions that are likely to arise from duplication events, a common feature in the P. vannamei genome also observed in other shrimp genomes [21]. Consequently, the use of these markers requires caution since they might be inaccurate when used in the traditional way.

Five short tandem repeats (STRs) were successfully validated in a shrimp population, confirming the occurrence of protein variants within the population, as outlined in Table 4. Some markers within this panel have demonstrated potential for application in traditional population studies. However, we believe that the true value of this panel lies in the correlation between protein variations and phenotypic traits. The observed deviations from the Hardy-Weinberg equilibrium in certain markers could potentially be explained by specific selective pressures acting on these protein isoforms.

The expansion or contraction of STRs can influence various biological processes, including gene transcription, splicing, and translation, thereby contributing to genomic plasticity [17]. The following are examples of proteins that exhibit different isoforms within the analyzed shrimp population, along with their respective functions:

  1. Clp proteases are ATP-dependent enzymes that play a crucial role in the cleavage of proteins and polypeptides by hydrolyzing peptide bonds. They serve as precise regulatory mechanisms involved in protein degradation and quality control processes within cells [76, 77].
  2. Sorbitol dehydrogenase (SDH) is an enzyme that relies on zinc as a cofactor and is primarily responsible for the conversion of D-sorbitol into D-fructose. This enzymatic activity holds significant importance in the early embryonic development of invertebrates [78]. In a study conducted by [79], rainbow trout were used to investigate the effects of increasing stock density on sorbitol dehydrogenase (SDH) activity. The findings revealed that higher stock densities in tanks had a significant inhibitory effect on SDH activity. Consequently, this led to the accumulation of sorbitol, disrupting the physiological balance of the fish, and resulting in undesirable outcomes. In P. vannamei, the expression levels of SDH have been found to be associated with Taura Syndrome Virus (TSV) infection.
  3. The 26S proteasome regulatory is a large multi-protein complex that selectively degrades ubiquitin-tagged proteins, and this process is ATP-dependent [8082]. It plays a key role in biological homeostasis since removing damaged or misfolded proteins [83, 84]. Furthermore, the ubiquitin-proteasome system may play an important role in crustaceans’ immune response and can be a candidate gene involved in host viral disease defense [85, 86]. [87] found a Ubiquitin proteasome mediated response in animals infected with WSSV to reduce the deleterious damage caused by the viral infection.
  4. Protein tyrosine phosphatases (PTPs) are signaling molecules responsible for dephosphorylating tyrosine residues in proteins and play critical roles in cellular regulation and cell adhesion [88, 89]. In a study conducted by [89] using P. vannamei, it was discovered that the expression of non-receptor protein tyrosine phosphatase 6 (PTP6) was regulated by the interferon regulatory factor (IRF). Additionally, PTP6 was found to promote STAT dimerization, leading to increased expression of genes associated with STAT-targeted immune effectors. This, in turn, enhanced the antiviral immunity of shrimp.
  5. Inverted Formin plays a diverse range of roles in actin polarization and depolarization processes. It is involved in various cellular processes, including cytokinesis, cell adhesion, cell motility, endocytosis, among others. Additionally, studies have reported its interaction with microtubules, which contributes to microtubule stabilization [90].

Conclusion

This study provides an updated characterization of previously validated microsatellite markers in population studies for P. vannamei. Among the 306 markers analyzed, 117 were identified as perfect repeats/markers, with 62 located within coding regions of the genome. Functional characterization revealed their potential influence on cellular processes, including binding function, catalytic activity, and transcriptional activity, which may be linked to phenotypic traits. Additionally, a panel of five markers located in coding regions was validated in vitro, confirming the presence of different protein isoforms within the P. vannamei population. The results highlight the potential of coding region markers in establishing direct associations between phenotype and genotype. The validation of six markers in vivo further supports this notion. Moreover, the study suggests the possibility of exploring the remaining untested markers through in silico functional predictions, which would expand the potential for phenotype-genotype associations.

Supporting information

S1 Table. List of Penaeus vannamei microsatellite markers.

https://doi.org/10.1371/journal.pone.0289351.s001

(XLSX)

S2 Table. Perfect Penaeus vannamei microsatellite markers that have been validated in population studies.

https://doi.org/10.1371/journal.pone.0289351.s002

(XLSX)

S3 Table. Coding sequences in which the microsatellites are located.

https://doi.org/10.1371/journal.pone.0289351.s003

(XLSX)

S4 Table. Gene predictions obtained from the identified sequences after functional annotation.

https://doi.org/10.1371/journal.pone.0289351.s004

(XLSX)

S5 Table. Results of the gene ontology analysis.

https://doi.org/10.1371/journal.pone.0289351.s005

(XLSX)

Acknowledgments

We would like to thank Samaria Unidade de Pós Larvas Company for providing the samples that were crucial for the completion of this work.

References

  1. 1. Perez-Enriquez RH, Hernandez-Martinez F, Cruz P. Genetic diversity status of White shrimp Penaeus (Litopenaeus) vannamei broodstock in Mexico. Aquaculture. 2009;297:44–50.
  2. 2. Lima APS, Silva SMBC, Oliveira KKC. Genetics of two marine shrimp hatcheries of the Pacific White shrimp Litopenaeus vannamei (Boone, 1931) in Pernambuco, Brazil. Ciência Rural, 2010;40(2):453–456.
  3. 3. Pérez-Enríquez R, Ponce-Palafox JT, Arreola-Lizárraga JA, Beltrán-Torres AU, Ruiz-Velazco J, Rábago-Castro JL. Genetic tracing of farmed shrimp (Decapoda: Penaeidae) in wild populations from a main aquaculture region in Mexico. Rev Biol Trop. 2018;66(1):381–393.
  4. 4. Yu Y, Zhang X, Huang H, Li F, Xiang J. Genome scan for genomic regions and genes associated with growth trait in pacific white shrimp Litopeneaus vannamei. Mar Biotechnol (NY). 2019;21(3):374–383. pmid:30887268
  5. 5. Zenger KR, Khatkar MS, Jones DB, Khalilisamani N, Jerry DR, Raadsma HW. Genomic selection in aquaculture: application, limitations and opportunities with special reference to marine shrimp and pearl oysters. Front Genet. 2018;9:693. pmid:30728827
  6. 6. Xu Z, Yu Q, Dong X, et al. Genetic analyses for TSV-susceptible and TSV-resistant Pacific white shrimp Litopenaeus vannamei using M1 microsatellite. J World Aquacult Soc. 2003;34(3):332–343.
  7. 7. Santos CA, Farias F, Teixeira AK. Polymorphism in Litopenaeus vannamei genes and cross-species amplification in other shrimp species. Pesq Agropec Bras. 2018;53:1–9.
  8. 8. Zhang L, Luo K, Zhang X, Huang H, Huang X, Jin S, et al. A genetic linkage map of Pacific white shrimp (Litopenaeus vannamei): Sex-linked microsatellite markers and high recombination rates. Genetica. 2007;131(1):37–49. pmid:17043742
  9. 9. Andriantahina F, Liu X, Huang H. Genetic Map Construction and Quantitative Trait Locus (QTL) Detection of Growth-Related Traits in Litopenaeus vannamei for Selective Breeding Applications. PLoS One. 2013 Oct 23;8(10):e76411.
  10. 10. Vaseeharan B, Jayakumar R, Malaikozhundan B, Rekha RD. Molecular markers and their application in genetic diversity of penaeid shrimp. Aquacult Int. 2013;21(2):219–241.
  11. 11. Liu ZJ, Cordes JF. DNA marker technologies and their applications in aquaculture genetics. Aquaculture, 2004, 238(1–4):1–37.
  12. 12. Coutinho LL, Rosário MF, Jorge EC. Biotecnologia Animal. Estud Av. 2010;24(70):1–4.
  13. 13. Turchetto-Zolet AC, Maraschin FS, de Morais GL, Cagliari A, Andrade CM, Margis R. Molecular markers in the genomics era: methodologies and applications. Genet Mol Biol. 2017;40(1 suppl 1):S1–S18.
  14. 14. Fan H, Chu JY. A brief review of short tandem repeat mutation. Genomics Proteomics Bioinformatics. 2007;5(1):7–14. pmid:17572359
  15. 15. Maralit BA, Santos M. Molecular markers for understanding shrimp biology and populations. In: Biotechnological Advances in Shrimp Health Management in the Philippines. Springer India, 2015, pp. 135–148. https://doi.org/10.1007/978-81-322-1043-3_8
  16. 16. Yin B, Li J, Li F, Xu X, Li L, Yang Q, et al. A Polymorphic (CT)n-STR Influences the Activity of the Litopenaeus vannamei IRF Gene Implicated in Viral Resistance. Front Genet. 2019;10:1257. pmid:31921300
  17. 17. Yuan J, Zhang X, Liu C, Cheng T, Zhang H, Yang X, et al. Simple sequence repeats drive genome plasticity and promote adaptive evolution in penaeid shrimp. Commun Biol. 2021;4(1):186. pmid:33574498
  18. 18. Mangabeira-Silva IS, Soares PET, Lanza DCF (2020) Single nucleotide polymorphism associated with disease resistance in Penaeus vannamei. Journal of Invertebrate Pathology 177:107498. pmid:33137318
  19. 19. Glover KA, et al. A comparison of SNP and STR loci for delineating population structure and performing individual genetic assignment. BMC Genet. 2010;11(2). pmid:20051144
  20. 20. Pérez-Enríquez R, Max-Aguilar A. Pedigree traceability in whiteleg shrimp (Litopenaeus vannamei) using genetic markers: A comparison between microsatellites and SNPs. Cienc Mar. 2016;42(4):227–235.
  21. 21. Zhang X, Yuan J, Sun Y, Li S, Gao Y, Yu Y, et al. Penaeid shrimp genome provides insights into benthic adaptation and frequent molting. Nat Commun. 2019;10(1):356. pmid:30664654
  22. 22. Kearse M, Moir R, Wilson A, et al. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–1649. pmid:22543367
  23. 23. Altschul SF, Gish W, Miller W, et al. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–10. pmid:2231712
  24. 24. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. pmid:20110278
  25. 25. Camacho C, Coulouris G, Avagyan V, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421. pmid:20003500
  26. 26. Peakall R, Smouse PE. GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006;6:288–95.
  27. 27. Peakall R, Smouse PE. GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research-an update. Bioinformatics. 2012;28:2537–9. pmid:22820204
  28. 28. Rousset F. Genepop’007: a complete reimplementation of the Genepop software for Windows and Linux. Mol Ecol Resour. 2008;8:103–106. pmid:21585727
  29. 29. Rice WR. Analyzing tables of statistical tests. Evolution. 1989;43:223–225. pmid:28568501
  30. 30. Lyu D, Zhang X, Zhai J, et al. Identification of Growth-Associated Genes by Genome-Wide Association Study and Their Potential Application in the Breeding of Pacific White Shrimp (Litopenaeus vannamei). Frontiers in Genetics 2021, 12:611570. pmid:33897754
  31. 31. Zeng D, Chen X, Xie D. Transcriptome Analysis of Pacific White Shrimp (Litopenaeus vannamei) Hepatopancreas in Response to Taura Syndrome Virus (TSV) Experimental Infection. PLoS ONE. 2013;8(3):e57515. pmid:23469011
  32. 32. Peng J, Wei P, Chen X, Wang Y, Li F, Zhang X. Gonadal transcriptomic analysis and differentially expressed genes in the testis and ovary of the Pacific white shrimp (Litopenaeus vannamei). BMC Genomics. 2015;16:1006. pmid:26607692
  33. 33. Li QY, Li M, Zeng DG. Development of microsatellite markers of Litopenaeus vannamei and genetic polymorphism analysis of different cultured families. J Southern Agric. 2020;51(2):429–436.
  34. 34. Wolfus GM, Garcia DK, Alcivar-Warren A. Application of the microsatellite technique for analyzing genetic diversity in shrimp breeding programs. Aquaculture. 1997;152(1–4):35–47.
  35. 35. Valles-Jimenez R, Cruz P, Perez-Enriquez R. Population genetic structure of Pacific white shrimp (Litopenaeus vannamei) from Mexico to Panama: Microsatellite DNA variation. Mar Biotechnol (NY). 2004;6(5):475–484. pmid:15791491
  36. 36. Cruz P, Kilian A, Taggart JB, et al. Genetic Variability Assessed by Microsatellites in a Breeding Program of Pacific White Shrimp (Litopenaeus vannamei). Mar Biotechnol. 2004;6(2):138–149. pmid:14595549
  37. 37. Alcivar-Warren A, Warren WC, Wilson K, et al. Isolation and Mapping of Telomeric Pentanucleotide (TAACC)n Repeats of the Pacific Whiteleg Shrimp, Penaeus vannamei, Using Fluorescence In Situ Hybridization. Mar Biotechnol. 2006 May-Jun;8(3):467–80. pmid:16729213
  38. 38. Wang HX, Li YQ, Zhao LL, et al. The application of microsatellite markers for parentage determination in selective breeding of Pacific white shrimp (Litopenaeus vannamei). Mar Fish Res. 2006;28(2):179–183. pmid:16520313
  39. 39. Zhi-Ying J. et al. Isolation and characterization of microsatellite markers from Pacific white shrimp (Litopenaeus vannamei). Mol Ecol Notes. 2006 Dec;6(4):1282–1284.
  40. 40. Borrel YJ, Álvarez-Lajonchère L, Sanz-Parrilla J, Ibarra AM. Variabilidad genética de loci microsatelites em los primeiros lotes de Litopenaeus vannamei introducidos em Cuba para la acuicultura. Rev Invest Mar. 2006;27(3):237–244.
  41. 41. Garcia DK, Alcivar-Warren A. Characterization of 35 new microsatellite genetic markers for the pacific whiteleg shrimp, Litopenaeus vannamei: Their usefulness for studying genetic diversity of wild and cultured stocks, tracing pedigree in breeding programs, and linkage mapping. J Shellfish Res. 2007;26(4):1203–1216.
  42. 42. Freitas PD, et al. In silico analysis of polymorphic microsatellites in penaeid shrimp and construction of a free-access database. Genet Mol Biol. 2007a;30(4).
  43. 43. Ai HS, Li SD, Chen K, et al. Characterization of a prophenoloxidase from hemocytes of the shrimp Litopenaeus vannamei that is down-regulated by white spot syndrome virus. Fish Shellfish Immunol. 2008 Jan;25(1–2):28–39. pmid:18474432
  44. 44. Li Z-M. Genetic Diversity of Four Selected Stocks of Litopenaeus Vannamei as Revealed by SSR Marker. Thesis, Ocean University of China, 2009. Available from: https://www.semanticscholar.org/paper/Genetic-Diversity-of-Four-Selected-Stocks-of-as-by-Zhi-min/288f0c5ab7e487d6afce6bd91a071ce700665040.
  45. 45. XIN G. Genetic diversity of cultured pacific white shrimp (litopenaeus vannamei) stocks of different generations in china. 2009.
  46. 46. Ke Y. Microsatellite analysis on genetic variation of imported Litopenaeus vannamei population and cultured stocks. Thesis, Shanghai Ocean University, 2011.
  47. 47. Artiles A, Rodríguez A, Rodríguez C, et al. Low genetic variability in the fifth introduction of Litopenaeus vannamei in Cuba, as estimated with microsatellite markers. Biotecnol Apl. 2011 Jul-Sep;28(3):147–50.
  48. 48. Pérez-Beloborodova A, Galván-Tirado C, Vázquez-Juárez R, et al. Genetic characterization of six stocks of Litopenaeus used in Cuba for aquaculture by means of microsatellite loci. Int J Zool. 2012;2012:352165.
  49. 49. Maggioni R, Coimbra MRM, Costa RBC. Genetic variability of marine shrimp in the Brazilian industry. Pesquisa Agropecuária Brasileira, 2013, 48(8):984–991.
  50. 50. Vela-Avitua S, Brouwer M, Pérez-Enríquez R. Decline of genetic variability in a captive population of Pacific white shrimp Penaeus (Litopenaeus) vannamei using microsatellite and pedigree information. Electron J Biotechnol. 2013;16(4):12.
  51. 51. Zhang K, Yan G, Tan S, Liu J, Cao B. Analysis of genetic diversity and differentiation of seven stocks of Litopenaeus vannamei using microsatellite markers. J Ocean Univ China. 2014;13(4):647–656.
  52. 52. Yu Y, Zhang X, Yuan J, Li F, Chen X. Genome survey and high-density genetic map construction provide genomic and genetic resources for the Pacific White Shrimp Litopenaeus vannamei. Sci Rep. 2015;5:15612. pmid:26503227
  53. 53. Rezaee S, Farahmand H, Nematollahi MA. Genetic diversity status of Pacific white shrimp (Litopenaeus vannamei) using STR markers in Iran. Aquacult Int. 2016;24:479–489.
  54. 54. Ren S, Chen Y, Wang Z, et al. Levels of genetic diversity and inferred origin of Penaeus vannamei cultured resources in China: Implications for the production of a broad synthetic base population for genetic improvement. Aquaculture. 2018;491:221–231.
  55. 55. Fang Z, et al. Genetic diversity analysis of domestic commercial brands seedlings of Litopenaeus vannamei based on microsatellite molecular markers. J Progress Fishery Sci. 2020;41(5):101–109.
  56. 56. Peng M, Chen H, Li Q. Genetic diversity of three consecutive generations of Litopenaeus vannamei. J South Agric. 2020;5(6):1224–1450.
  57. 57. Chen X. Analysis of microsatellite DNA markers for the growth trait in White shrimp Litopenaeus vannamei. J Wuhan Univ. 2006;52(4):498–502.
  58. 58. Aungsuchawan S, Ball A, Chapman R, Shepard E, Browdy C, Withyachumnarnkul B, et al. Evaluation of published microsatellites for paternity analysis in the Pacific White Shrimp Litopenaeus vannamei. ScienceAsia. 2008; 34: 115–123.
  59. 59. Ren S, Mather PB, Tang B, Hurwood DA. Standardized microsatellite panels for pedigree management of farmed white-leg shrimp (Penaeus vannamei) stocks validated in a VIE tagged family selection line. Aquaculture. 2022; 551: 737946.
  60. 60. Lima APS, Santos ACL, Dantas HL. Genetic monitoring of broodstocks of the marine shrimp Litopenaeus vannamei in a closed rearing system in Pernambuco, Brazil. Aquacult Res. 2008;39:1461–1466.
  61. 61. Gong H, et al. Effects of dietary protein level and source on the growth and survival of two genetic lines of specific-pathogen-free Pacific white shrimp, Penaeus vannamei. Aquaculture. 2012;29:118–123.
  62. 62. Andriantahina F, Liu X, Huang H. Using microsatellite markers to identify heritability of Pacific whiteleg shrimp Litopenaeus vannamei. Acta Oceanol Sin. 2015;34(4):59–65.
  63. 63. Pérez F, Rodríguez-Ramírez A, García-Machado E, et al. Development of EST-STR Markers by Data Mining in Three Species of Shrimp: Litopenaeus vannamei, Litopenaeus stylirostris, and Trachypenaeus birdy. Mar Biotechnol. 2005;7:554–69. pmid:16027992
  64. 64. Zhang LS, Xiang JH. A preliminary study on the inheritance of microsatellite in two selective breeding families of shrimp (Litopenaeus vannamei). Yi Chuan. 2005;27(6):919–924. pmid:16378940
  65. 65. Alcivar-Warren A, Kuhn D, Pettit S, et al. Characterization and mapping of expressed sequence tags isolated from a subtracted cDNA library of Litopenaeus vannamei injected with white spot syndrome virus. J Shellfish Res. 2007 Dec;26(4):1247–54.
  66. 66. Luvesuto E, Freitas P, Galetti P Jr. Genetic variation in a closed line of the white shrimp Litopenaeus vannamei (Penaeidae). Genetics and Molecular Biology, 2007, 30(4):1097–1101.
  67. 67. Wang Y, Hu C, Zhang L. A preliminary study on microsatellite markers screening from EST sequences of Litopenaeus vannamei. J Fish China. 2011;35(6):969–976.
  68. 68. Freitas PD, Jesus CM, Galetti P Jr. Isolation and characterization of new microsatellite loci in the Pacific white shrimp Litopenaeus vannamei and cross-species amplification in other penaeid species. Mol Ecol Notes. 2007b;7:324–326.
  69. 69. Garcia DK, Dhar AK, Alcivar-Warren A. Molecular analysis of a RAPD marker (B20) reveals two microsatellites and differential mRNA expression in Penaeus vannamei. Mol Mar Biol Biotechnol. 1996;5(1):71–83. pmid:8869519
  70. 70. Bagshaw JC, Buckholz MA. A novel satellite/microsatellite combination in the genome of the marine shrimp, Penaeus vannamei. Gene. 1997;184(2):247–251. pmid:9031630
  71. 71. Cruz P, Kilian A, Tang-Nelson K, et al. Isolation and characterization of microsatellites in Pacific white shrimp Penaeus (Litopenaeus) vannamei. Mol Ecol Notes. 2002;2(3):239–241.
  72. 72. Meehan D, Shi Y, Lall SP, et al. High Frequency and Large Number of Polymorphic Microsatellites in Cultured Shrimp, Penaeus (Litopenaeus) vannamei [Crustacea: Decapoda]. Marine Biotechnology, 2003, 5(4):311–330. pmid:14719161
  73. 73. Alcivar-Warren A, Wilson K, Davidson E, et al. Shrimp genomics: development of a genetic map to identify QTLs responsible for economically important traits in Litopenaeus vannamei. In: Shimizu N, Aoki T, Hirono I, Takashima F, eds. Aquatic Genomics. Tokyo: Springer; 2003: 131–42.
  74. 74. Santos CA, Rossina BC, Marques CG et al. Characterization and genomic annotation of polymorphic EST-SSR loci in Litopenaeus vannamei shrimp. Aquaculture Research, 2012, 43(10): 1567–1570.
  75. 75. Camon E, Magrane M, Barrel D, et al. The gene ontology annotation (GOA) project: Implementation of GO SWISS-PROT, TrEMBL and InterPro. Genome Research, 2003, 13: 662–672.
  76. 76. Maurizi MR (1994) Endopeptidase Clp: ATP-dependent Clp protease from Escherichia coli. Methods in Enzymology 244:314–331. pmid:7845217
  77. 77. Porankiewicz J, Wang J, Clarke AK. New insights into the ATP-dependent Clp protease: Escherichia coli and beyond. Mol Microbiol. 1999;32:449–458. pmid:10320569
  78. 78. Wang T, Hou M, Zhao N, et al. Cloning and expression of the sorbitol dehydrogenase gene during embryonic development and temperature stress in Artemia sinica. Gene. 2013, 521(2):296–302. pmid:23562727
  79. 79. Alim Z, Aktumsek A, Yilmaz S, et al. The Relationship Between Stocking Density and Sorbitol Dehydrogenase Activity in Rainbow Trout Oncorhynchus mykiss Tissues. Hacettepe J Biol Chem. 2012;40(1):105–10.
  80. 80. Tanaka K, Tsurumi C. The 26S proteasome: subunits and function. Mol Biol Rep. 1997;24:3–11. pmid:9228274
  81. 81. Ferrell K, Wilkinson CR, Dubiel W, Gordon C. Regulatory subunit interactions of the 26S proteasome, a complex problem. Trends in biochemical sciences, 2000, 25(2), 83–88. pmid:10664589
  82. 82. Zhu Y, Wang WL, Yu D. et al. Structural mechanism for nucleotide-driven remodeling of the AAA-ATPase unfoldase in the activated human 26S proteasome. Nature communications, 2018, 9(1), 1360. pmid:29636472
  83. 83. Miller J, Gordon C. The regulation of proteasome degradation by multi-ubiquitin chain binding proteins. FEBS Lett. 2005;579(15):3224–3230. pmid:15943965
  84. 84. Gallastegui N, Groll M. The 26S proteasome: assembly and function of a destructive machine. Trends Biochem Sci. 2010;35(11):634–642. pmid:20541423
  85. 85. Robinson NA, Gopikrishna G, Baranski M, et al. QTL for white spot syndrome virus resistance and the sex-determining locus in the Indian black tiger shrimp (Penaeus monodon). BMC Genomics. 2014;15:731. pmid:25164406
  86. 86. Götz S, et al. Proteasome properties of hemocytes differ between the whiteleg shrimp Penaeus vannamei and the brown shrimp Crangon crangon (Crustacea, Decapoda). Cell Stress Chaperones. 2017;22:879–891. pmid:28646424
  87. 87. Jarrousse AS. Relationships between proteasomes and viral gene products. Mol Biol Rep. 1999;26(1–2):113–117. pmid:10363656
  88. 88. Senis YA, Barr AJ. Targeting Receptor-Type Protein Tyrosine Phosphatases with Biotherapeutics: Is Outside-in Better than Inside-Out? Molecules. 2018;23:569. pmid:29498714
  89. 89. Luo M, Wu Y, Lu K, et al. The Non-Receptor Protein Tyrosine Phosphatase PTPN6 Mediates a Positive Regulatory Approach From the Interferon Regulatory Factor to the JAK/STAT Pathway in Litopenaeus vannamei. Frontiers in Immunology, 2022, 13:913955. pmid:35844582
  90. 90. Hegsted A, Yingling CV, Pruyne D. Inverted formins: A subfamily of atypical formins. Cytoskeleton. 2017;74:405–419. pmid:28921928