From head to rootlet: comparative transcriptomic analysis of a rhizocephalan barnacle <i>Peltogaster reticulata</i> (Crustacea: Rhizocephala)

Maksim Nesterenko; Aleksei Miroliubov

doi:10.12688/f1000research.110492.2

Home Browse From head to rootlet: comparative transcriptomic analysis of a rhizocephalan...

ALL Metrics

Views

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Revised

From head to rootlet: comparative transcriptomic analysis of a rhizocephalan barnacle Peltogaster reticulata (Crustacea: Rhizocephala)

[version 2; peer review: 2 approved]

Maksim Nesterenko ^1,2, Aleksei Miroliubov²

PUBLISHED 09 Jan 2023

Author details Author details

¹ Department of Invertebrate Zoology, St Petersburg State University, St Petersburg, 199034, Russian Federation
² Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

Maksim Nesterenko
Roles: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Aleksei Miroliubov
Roles: Conceptualization, Funding Acquisition, Investigation, Methodology, Resources, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

This article is included in the Genomics and Genetics gateway.

This article is included in the Bioinformatics gateway.

This article is included in the Cell & Molecular Biology gateway.

Abstract

Background: Rhizocephalan barnacles stand out in the diverse world of metazoan parasites. The body of a rhizocephalan female is modified beyond revealing any recognizable morphological features, consisting of the interna, a system of rootlets, and the externa, a sac-like reproductive body. Moreover, rhizocephalans have an outstanding ability to control their hosts, literally turning them into “zombies”. Despite all these amazing traits, there are no genomic or transcriptomic data about any Rhizocephala.
Methods: We collected transcriptomes from four body parts of an adult female rhizocephalan Peltogaster reticulata: the externa, and the main, growing, and thoracic parts of the interna. We used all prepared data for the de novo assembly of the reference transcriptome. Next, a set of encoded proteins was determined, the expression levels of protein-coding genes in different parts of the parasite’s body were calculated and lists of enriched bioprocesses were identified. We also in silico identified and analyzed sets of potential excretory / secretory proteins. Finally, we applied phylostratigraphy and evolutionary transcriptomics approaches to our data.
Results: The assembled reference transcriptome included transcripts of 12,620 protein-coding genes and was the first for any rhizocephalan. Based on the results obtained, the spatial heterogeneity of protein-coding gene expression in different regions of the adult female body of P. reticulata was established. The results of both transcriptomic analysis and histological studies indicated the presence of germ-like cells in the lumen of the interna. The potential molecular basis of the interaction between the nervous system of the host and the parasite's interna was also determined. Given the prolonged expression of development-associated genes, we suggest that rhizocephalans “got stuck in their metamorphosis”, even at the reproductive stage.
Conclusions: The results of the first comparative transcriptomic analysis for Rhizocephala not only clarified but also expanded the existing ideas about the biology of these extraordinary parasites.

Keywords

Rhizocephala, parasitic barnacles, evolutionary transcriptomics, host manipulation, coloniality

Corresponding author: Maksim Nesterenko

Competing interests: No competing interests were disclosed.

Grant information: This study was supported by the Grant No. 21-74-00018 of the Russian Science Foundation, by a grant of the Ministry of Science and Higher Education of the Russian Federation (No. 075-15-2021-1069) and the state laboratory theme of the Zoological Institute 122031100260-0 (Biodiversity of parasites, life cycles, biology and evolution).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright: © 2023 Nesterenko M and Miroliubov A. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Nesterenko M and Miroliubov A. From head to rootlet: comparative transcriptomic analysis of a rhizocephalan barnacle Peltogaster reticulata (Crustacea: Rhizocephala) [version 2; peer review: 2 approved]. F1000Research 2023, 11:583 (https://doi.org/10.12688/f1000research.110492.2) First published: 27 May 2022, 11:583 (https://doi.org/10.12688/f1000research.110492.1) Latest published: 09 Jan 2023, 11:583 (https://doi.org/10.12688/f1000research.110492.2)

Revised Amendments from Version 1

There are no major differences between this version and the previous version. We mainly clarified the points in the text (as suggested by the reviewers). We added a phylogenetic tree with species considered during phylostratigraphic analysis in the figshare archive (Extended data, Figure S1), and we created a git-repository where all the main scripts written in Python and R languages and used in the analysis were uploaded.

See the authors' detailed response to the review by Viatcheslav Ivanenko
See the authors' detailed response to the review by Andreas Hejnol and Ferenc Kagan

Introduction

Rhizocephalan barnacles (Crustacea: Rhizocephala) stand out among metazoan parasites. In the process of adaptation to a parasitic lifestyle, they have changed beyond recognition, losing almost all the structures characteristic of other crustaceans. In particular, they have lost all normal organ systems, as well as body axes^1,2. The body of an adult rhizocephalan female is represented by the interna, a system of hollow, ramifying rootlets infiltrating the body cavity of their host (exclusively crustaceans, usually decapods), and the externa, a sac-like body protruding outside the host. The interna is responsible for absorbing nutrients from the host hemolymph and their transportation to the externa¹ as well as for interactions with the host^3,4. The externa is a temporary structure thought to be an organ for sexual reproduction⁵. It usually contains two incorporated dwarf males and a special mantle chamber with developing embryos². Some rhizocephalans can form numerous externae, sometimes as many as 2,000^6,7, which is considered a unique instance of modular/colonial organization among arthropods. Besides their unusual morphology, rhizocephalans have evolved a unique life cycle with a characteristic larval stage. The larva injects a few poorly differentiated cells into the host’s hemolymph, and what is left of the larva dies⁸. Noteworthy, the entire adult body originates from these cells and is thus a newly formed structure⁹.

In addition to their morphological adaptations and unusual life cycle, rhizocephalan barnacles show a remarkable ability to manipulate the host. These parasites can take control of the moulting cycle of the host, change its metabolism, behaviour, and even body shape^2,10–20. Specialized sites responsible for host-parasite interactions have recently been described^3,4, with a network of the host’s neurons enlacing the rootlets of the parasite, but the molecular mechanisms of these interactions remaine enigmatic. The authors suggested that the parasite may emit some signal molecules attracting the growth of the host’s neurons³.

The rapid development of high-throughput sequencing technologies has enabled detailed molecular-based biological studies of many living organisms (for example, 21–25), but until recently, molecular studies on Rhizocephala have been lacking. The research on body heterogony, host-parasite interactions and functional physiology has only been based on morphological and other classical methods.

In an attempt to fill this gap, a comparative transcriptomics analysis of different parts of the rhizocephalan female body was made. Our research object was Peltogaster reticulata Shiino, 1943 (Rhizocephala: Peltogasteridae), whose females parasitize the hermit crab Pagurus minutus Hess, 1865 (Crustacea: Decapoda) and form one or, less often, several externae. Although P. reticulata is a typical rhizocephalan, but, on the other hand, it has a lesser degree of modularity than many other representatives of this group²⁶, making it a particularly convenient research model. In this study, we present the transcriptome-based evidence of molecular and functional heterogeneity of the female rhizocephalan body. We also show that the ovary is diffused throughout the interna, and that host’s motor neurons axon are attracted to the rootlets of the interna. Phylostratigraphy and evolutionary transcriptomic analysis were performed for Rhizocephala for the first time. Our results make it possible to trace evolutionary trends in the P. reticulata gene set.

Methods

Sampling

Hermit crabs Pagurus minutus infected with Peltogaster reticulata were collected in the Sea of Japan (Marine Biological Station “Vostok” of the Institute of Marine Biology of the Russian Academy of Sciences) (N: 42.893720, E: 132.732755). All the parasites were adults with fully developed externae.

The infected crabs were dissected in filtered sea water. The parasite was removed from the host’s body cavity and the remains of the host tissues were isolated from the interna. The body of each parasite was divided into four parts: 1) the externa (it was separated at the level of the stalk), 2) distal part of the main trunk (the growing part) (approximately the last three millimeters of the main where muscular system is not developed yet²⁷), 3) the main part of the interna located in the abdomen of the host (the main trunk), and 4) the part of interna from the thorax of the host (the thoracic part) (Figure 1). For each body part the pooled sample was prepared, containing material from five parasitic individuals in two biological replicates. The samples were collected into centrifuge tubes and frozen at -80 °C in IntactRNA (Evrogen, Moscow, Russia) reagent according to the manufacturer`s protocol.

Figure 1. Molecular signatures of female Peltogaster reticulata.

(a) Generalized scheme of female P. reticulata in the host. Colour sectors indicate the body parts examined in our study: externa (red), growing part of interna (green), main trunk of interna (purple), and thoracic part (blue). (b) The number of common OMA groups. The colour key on the heatmap shows the number of shared OMA groups between species. (c, e) Venn diagram for a set of genes either included in molecular signatures of the body parts (c) or over-expressed in the body parts (e). (d, f) Multidimensional scaling (MDS) plots for molecular signatures (d) and sets of over-expressed genes (f). Different clusters in MDS plots are marked with colours. Abbreviations: MS/overexp – the molecular signature or set of over-expressed genes for the body part, respectively; rep1/2 – biological replication identifier.

Before RNA isolation, the IntactRNA-fixed samples were rinsed in 0.1M phosphate-buffered saline (PBS). The total RNA was isolated using Quick-RNA MiniPrep (R1054, Zymo Research, Irvine, California, USA) according to the manufacturer’s protocol. The libraries were synthesized using NEBNext Ultra Directional RNA Library Prep Kit for Illumina (E7760, New England BioLabs, Ipswich, Massachusetts, USA) according to the manufacturer protocol. Paired-end sequencing was carried out using Illumina HiSeq 2500 instrument (Illumina, San-Diego, California, USA).

Sampling was conducted in accordance with the European Community Council Directive of November 24, 1986 (86/609/EEC). All possible efforts were made to minimize the number of animals used.

Preparation of reads libraries and de novo transcriptome assembly

The primary quality control of paired-end reads libraries was manually assessed using FastQC (v0.11.5) [https://www.bioinformatics.babraham.ac.uk/projects/fastqc/]. The potential sequencing error identification and correction was performed by Karect²⁸ (v1.0) [https://github.com/aminallam/karect] with the following parameters: --celltype=diploid –matchtype=hamming. Trimmomatic (v0.39) [http://www.usadellab.org/cms/?page=trimmomatic] was used for removing the sequencing adaptors (ILLUMINACLIP:$ADAPTERS:2:30:10:2:TRUE), low-quality read regions (SLIDINGWINDOW:4:20 MAXINFO:50:0.8) as well as reads with a length less than 25 nucleotides (MINLEN:25). Since library preparation and sequencing were performed in the laboratory where researchers also work with human medical samples, the parasite data were checked for the presence of the read pairs with a high identity to the Homo sapiens reference transcriptome (GENCODE v.31) using BBTools (v37.02).

The prepared libraries were pooled and used for de novo reference transcriptome assembly using Trinity²⁹ (v2.5.1) [https://github.com/trinityrnaseq/trinityrnaseq] with k-mer size and required minimal contig length equal to 25 and 200 nucleotides, respectively. The assembled contigs were renamed by adding the four-digit tag of the species, “Pret”, at the beginning of IDs. Isoforms were clustered on all assembled contigs using CDHIT-est³⁰ (v4.7) [http://weizhong-lab.ucsd.edu/cd-hit/] and a sequence identity threshold equal to 95% (-c 0.95), accurate mode (-g 1), and both +/+ and +/- strands alignments (-r 1). TransRate³¹ (v1.0.1) [https://hibberdlab.com/transrate/] was used for the quality assessment of clustered sequences. Only the contigs classified as “good” by TransRate³¹ were included in further analysis.

Removal of potential contamination

The 18S and 28S ribosomal RNA sequences were searched using RNAmmer³² (v1.2) [https://services.healthtech.dtu.dk/service.php?RNAmmer-1.2] from the Trinotate pipeline (v3.1.1) [https://anaconda.org/bioconda/trinotate]. The sequences obtained this way were compared with the NCBI nucleotide database with BLASTn³³ (v2.6.0+) [https://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastDocs&DOC_TYPE=Download] to identify their possible sources.

We used MCSC (Model-based Categorical Sequence Clustering) Decontamination method³⁴ [https://github.com/Lafond-LapalmeJ/MCSC_Decontamination] for removing potential contamination, with “Arthropoda” as the target taxon and the clustering level equal to 5 (32 clusters). The parsing of BAM-files with reads alignment results created by Bowtie2³⁵ [http://bowtie-bio.sourceforge.net/bowtie2/index.shtml] was performed to extract only reads pairs that mapped to the decontaminated set of contigs.

Quantification of gene expression levels and identification of encoded amino acid sequences

Salmon³⁶ (v1.0.1) was used for expression level quantification (-l ISF –discardOrphans –seqBias –gcBias –validateMappings). The expression quantification results and the transcripts-to-genes map from the Trinity output were provided to the “tximport” package for R to obtain expression levels of genes. The tables with both unaveraged transcripts-per-million (TPM)³⁷ values and TPM values averaged between biological replicates were prepared. Only the sequences with expression levels ≥ 1 TPM in at least one sample were included in further analysis.

TransDecoder (v5.5.0) [https://github.com/TransDecoder/TransDecoder] was used for the identification of the amino acid sequences encoded by assembled contigs. Firstly, the long open reading frames with a length ≥ 100 amino acids (aa) and products of its translation were found. Secondly, identified proteins were compared with the NCBI non-redundant (DIAMOND BLASTp³⁸ (v0.9.22.123) [https://github.com/bbuchfink/diamond/], e-value = 1e-3) and the PfamA³⁹ [http://pfam.xfam.org] (HMMscan (v3.1b2)) databases. Thirdly, the comparison results were provided to the TransDecoder to identify the likely coding regions and to obtain the probable set of proteins.

Reference gene set preparation

In our analysis, the focus was only on genes that successfully passed two filters: 1) noticeable expression level (i.e., the expression is ≥ 1 TPM in at least one sample) and 2) encoding of the proteins with a length greater than or equal to 100 amino acids. Only the longest protein and its coding transcript were selected as representatives for each gene and referred to as “reference sets”. The completeness of the protein reference set was evaluated by comparison with the database of single-copy orthologues of Metazoa (odb-9) using BUSCO^40,41 (v3.0.1) [https://gitlab.com/ezlab/busco] (e-values = 1e-3, mode = proteins).

Sequence annotation

For the annotation of the genes, their nucleotide and amino acid sequences were compared with publicly available databases: NCBI nucleotide collection (nt), NCBI non-redundant (nr), and SwissProt^42,43. The similarity search was carried out with BLASTn megablast³³ (nt) and the sensitive mode of DIAMOND BLASTp³⁸ (amino acid databases), with an expected value (e-value) threshold equal to 1e-3 and a limit up to 10000 for the number of description and alignments⁴⁴. The best BLAST hits (BBH) were selected with a custom script.

The potential domain architecture of the proteins was identified using the PfamA database (HMMScan) and custom script. The proteins were also analysed using the eggNOG-mapper web-resource⁴⁵ (v2) [http://eggnog-mapper.embl.de] with default parameters.

Identification of orthogroups

Orthogroups were identified with the use of OMA standalone program (v2.5.0)⁴⁶ [https://omabrowser.org/standalone/] in three steps. Firstly, the reference proteomes of Amphibalanus amphitrite Darwin, 1854 (UP000440578), Armadillidium nasatum Budde-Lund, 1885 (UP000326759), Armadillidium vulgare Latreille, 1804 (UP000288706), Daphnia magna Straus, 1820 (Strain: Xinb3) (UP000076858), Daphnia pulex Leydig, 1860 (UP000000305), Penaeus vannamei Boone, 1931 (UP000283509), Portunus trituberculatus Miers, 1876 (UP000324222), and Tigriopus californicus Baker, 1912 (UP000318571) were downloaded from the UniProt^42,43 [https://www.uniprot.org] database [accessed 21 October 2021]. Only the sequences with a length equal to or more than 100 amino acids were analysed. The OMA standalone was run with default parameters with the “bottom-up” algorithm for inference of hierarchical orthologous groups (HOGs), without a phylogenetic tree, but with the identification of two Daphnia species as an out-group. Secondly, we reconstructed the phylogenetic tree following the protocol by Dylus et al.⁴⁷. Briefly, using the filter_groups.py provided, we selected OMA groups that included at least eight of the nine crustacean species involved in the analysis. Then, using MAFFT⁴⁸ (v7.487) [https://mafft.cbrc.jp/alignment/software/], multiple protein alignment in each orthogroup was performed (--maxiterate 1000 -localpair). The alignments were concatenated into a supermatrix using the concat_alignments.py. The selection of suitable sites in the supermatrix was carried out using tramAl⁴⁹ (-automated1) [http://trimal.cgenomics.org]. We used the ProtTest program^50,51 (v3.4.2) [https://github.com/ddarriba/prottest3] to determine the most appropriate sequence evolution model. The phylogenetic tree was reconstructed using the IQ-TREE^52,53 (v2.1.4-beta) [http://www.iqtree.org] with the following parameters: -m LG+I+G+F --seed 12345 -B 1000 --nmax 1000. The consensus tree was rooted by the out-group using the “ape”⁵⁴ [https://cran.r-project.org/web/packages/ape/index.html] library for R. Thirdly, the phylogenetic tree was used when the OMA standalone⁴⁶ was re-run with default settings. The construction of a heatmap with the number of common OMA groups between the studied species was performed in RStudio using the “ggplot2” (v3.3.5), “pheatmap” (v1.0.12), and “RColorBrewer” (v1.1-2) libraries.

Reference gene set expression analysis

The “molecular signature” of a body part was defined as a set of genes with an expression level ≥ 2 TPM in the body part. The expression threshold value was chosen in accordance with the results of studies by Wagner, Kin, and Lynch, according to which “genes with more than two transcripts per million transcripts (TPM) are highly likely from actively transcribed genes”⁵⁵. Genes that had an expression ≥ 2 TPM in all the body parts, were classified as “commonly expressed”.

Significant variation of gene expression between samples was detected using “RNentropy” library⁵⁶ (v1.2.2) [https://cran.r-project.org/web/packages/RNentropy/index.html] for R. The analysis was carried out using a table with unaveraged TPM values between replicates. The corrected global sample specificity test P < 0.01 was used according to the Benjamini-Hochberg method, and local sample specificity test P < 0.01.

The overlaps between the molecular signatures and the sets of over-expressed genes were visualized with InteractiVenn⁵⁷ [http://www.interactivenn.net].

Multidimensional scaling

A multidimensional scaling (MDS) analysis of the molecular signatures and the sets of over-expressed genes was performed. The presence / absence matrices were used as input. It was indicated in the matrix for each gene (row) whether the gene was included in the molecular signature of the body part or had an increased expression in it (“1”) or not (“0”). The optimal number of clusters was determined using the “silhouette” method implemented in the “factoextra” library (v1.0.7) for R. The metaMDS function from the “vegan” library (v2.5-7) was used with the following parameters: distance = "manhattan", try = 100, trymax = 100000, autotransform = FALSE, binary = TRUE, k = the optimal number of clusters. The seed was set to 1234 both when the optimal number of clusters was determined and in MDS. To visualize the results, the ggscatter function from “ggpubr” (v0.4.0) library for R was used.

Potential excretory/secretory proteins (ESP) identification and analysis

The in silico identification of potential ESP was performed according to the pipelines described by Garg and Ranganathan⁵⁸. Firstly, all proteins from the reference set were analysed with SignalP⁵⁹ (v5.0b) [https://services.healthtech.dtu.dk/service.php?SignalP-5.0]. Based on the analysis results, the proteins were divided into potential “classical” (SP ≥ 0.5) and “non-classical” (SP < 0.5) ESP. Secondly, the “non-classical” ESP were analysed using SecretomeP⁶⁰ (v1.0) [https://services.healthtech.dtu.dk/service.php?SecretomeP-2.0]. Only proteins that had NN-scores ≥ 0.9 and were simultaneously predicted not to contain a signal peptide were selected. Thirdly, all potential ESP were scanned for the presence of the mitochondrial transit peptide with TargetP⁶¹ (v2.0). The proteins with this signal were excluded. Fourthly, the transmembrane hidden Markov model (TMHMM)⁶² (v2.0c) [https://services.healthtech.dtu.dk/service.php?TMHMM-2.0] was used to model and predict the location and orientation of transmembrane domains in proteins, and only proteins without them were considered as potential ESP. Out of these potential ESP, only those were selected that were included in at least one molecular signature. The overlap analyses between ESP sets were performed using InteractiVenn⁵⁷.

We ran all against all BLASTp searching using DIAMOND³⁸ (--evalue 1e-3) for both classical and non-classical ESP. Similarity search results were used in SiLix⁶³ (v1.2.11) [https://lbbe-web.univ-lyon1.fr/fr/SiLix] (-r 0.9), which assigns proteins to putative gene families. For annotation, all ESP were compared against the NeuroPep⁶⁴ [http://isyslab.info/NeuroPep/] and MetazSecKB⁶⁵ [http://proteomics.ysu.edu/secretomes/animal/] databases using DIAMOND BLASTp³⁸ with the following parameters: --sensitive --max-target-seqs 10000 –evalue 1e-3. The best BLAST hits were selected using a custom script.

Gene set enrichment analysis (GSEA)

The GSEA using “topGO” library (v2.40.0) [https://bioconductor.org/packages/release/bioc/html/topGO.html] for R was performed for 1) whole molecular signatures, 2) sets of over-expressed genes, and 3) sets of potential “classical” and “non-classical” ESP. Only the Gene Ontology (GO) terms describing biological processes were considered. Fisher’s exact test was used and extracted only the terms including at least 10 significant genes (GO terms with p-value <0.01) from the results. Redundancy was reduced with the “rrvgo” library (v1.0.2) [http://bioconductor.org/packages/release/bioc/html/rrvgo.html] for R. The minus log10-transformed p-values were used as scores, org.Dm.eg.db (Genome wide annotation for Fly) as database, relevance as similarity measures methods, and 0.7 as the threshold for reduceSimMatrix. The “wordcloud” (v2.6) library [https://cran.r-project.org/web/packages/wordcloud/index.html] for R was used to build word clouds based on the redundancy reduction results. The more often the parental bioprocess was found in the list of enriched bioprocesses, the larger the word size. Each bioprocess was assigned a colour from the “viridis” (v0.6.1) palette package.

Phylostratigraphy and Transcriptome Age Index (TAI) measuring

The phylostratigraphic analysis of the P. reticulata reference protein set was performed using the “phylostratr” package⁶⁶ (v0.2.1) [https://github.com/arendsee/phylostratr] for R. We used reference proteomes of Crustacea prepared beforehand as well as the prebuilt dataset of prokaryotes, human, and yeast. In total, in addition to the P. reticulata, 139 species were included in the analysis, of which 44 were representatives of the Metazoa. The complete phylogenetic tree of the studied species is presented in the Figure S1 (Extended data⁶⁷). Similarity search between proteins was carried out with BLASTp³³ (v2.6.0+). The tables with BLAST results in “6” output format were used as input for “phylostratr” for protein distribution between phylostrata: 1) “Cellular organisms”, 2) “Eukaryota”, 3) “Opisthokonta”, 4) “Metazoa”, 5) “Eumetazoa”, 6) “Bilateria”, 7) “Protostomia”, 8) “Ecdysozoa”, 9) “Panarthropoda”, 10) “Arthropoda”, 11) “Mandibulata”, 12) “Pancrustacea”, 13) “Crustacea”, 14) “Multicrustacea”, 15) “Hexanauplia”, 16) “Cirripedia”, and 17) “Peltogaster reticulata”.

The phylostratigraphic composition was analysed for the P. reticulata reference gene set, the set of genes with noticeable (≥ 2 TPM) expression in all the female body parts considered, the sets of overexpressed genes as well as the sets of genes encoding potential “classical” and “non-classical” ESP. The results were visualized using “ggplot2”, “viridis” (v0.6.1), and “reshape” (v0.8.8) libraries for R.

Transcriptome Age Index (TAI) definition was performed for P. reticulata body parts using phylostratigraphic results and tables with averaged TPM-values. The analysis was carried out using the “myTAI”⁶⁸ (v0.9.3) [https://github.com/drostlab/myTAI] package for R. Genes with an expression level < 2 TPM at all body parts were excluded. The analysis was carried out on log2(TPM + 1) transformed values. The FlatLineTest function was used to quantify the statistical significance of the global TAI pattern. For analysis with the use PlotRE and PlotBarRE functions, the phylostrata were divided into two groups: “before” (phylostrata 1–13), and “after” (phylostrata 14–17) the division of Crustacea.

Using the pMatrix function from “myTAI”⁶⁸, the contributions of genes to the TAI of body parts were determined. For each body part, 500 genes with the largest contribution were selected out of the genes with the GO annotation. Further, GSEA for the selected gene sets was performed similarly to GSEA for the molecular signature.

Histology

For histological and light-microscopic examination, the dissected internae were fixed with Bouin solution (picric acid (trinitrophenol) 71.5%, paraformaldehyde 24% and acetic acid 4.5%). Paraffin sections (5 μm thick) were made using standard histological methods with the help of a Leica RM-2265 microtome and stained with hematoxylin-eosin. The sections were examined under a Leica DM2500 microscope. The photos were taken with a Nikon DS-Fi1 camera and processed with ImageJ software (FiJi⁶⁹).

Confocal laser scanning microscopy (CLSM)

Samples of interna for immune labelling were fixed with 4% paraformaldehyde (PFA; Sigma-Aldrich) in PBS (Fluka) at 4 °C for four hours, and then rinsed three times with PBS. Prior to immunocytochemical staining, the fixed material was incubated with PBST (PBS + 0.1 % Triton-X100; Sigma-Aldrich) during 24 hours at 4 °C. Then, the samples were incubated in primary antibodies and anti-acetylated α-tubulin (Sigma Aldrich, Germany, T6793, produced in mice) and anti-serotonin (Sigma Aldrich, Germany, S5545, produced in rabbit) for three days. After incubation the specimens were rinsed in PBS three times and incubated in secondary antibodies anti-mouse IgG CFTM 633 (Sigma Aldrich, Germany, SAB4600138) and anti-rabbit IgG CFTM 488A (Sigma Aldrich, Germany, SAB4600030).

The specimens were rinsed with PBS three times and stained with the DAPI nuclei stain (1 μg/ml; Sigma-Aldrich) for 30 min, rinsed in PBS and mounted in DABCO-glycerol. The samples were examined using a Leica TCS SP5 confocal laser scanning microscope in the Resource Center “Microscopy and Microanalysis” of the Research Park of Saint Petersburg State University. The images were processed with ImageJ software (FiJi⁶⁹).

Scanning electron microscope (SEM)

Specimens for SEM were fixed at 4 °C in 2.5% glutaraldehyde, dehydrated in a gradient ethanol series and acetone, critical point-dried in a Hitachi critical point dryer HCP- 2, mounted on stubs, coated with platinum with the use of a Giko IB-5 Ion sputter coater, and viewed under a FEI Quanta 250 scanning electron microscope in the “Taxon” Research Resource Center of the Zoological Institute of the Russian Academy of Sciences.

Results

The de novo assembled transcriptome was characterized by high quality and completeness of assembly

The transcriptomes of the whole body, the thoracic part of the interna, the growing part of the interna and the main trunk of the interna as well as that of the externa of adult P. reticulata were collected and sequenced in two biological replicates (Figure 1a). More than 87% of read pairs remained in all the paired-end libraries after the removal of adapters, poor-quality regions, and short sequences (Table S1, Underlying data⁷⁰). The potential contamination with human-derived sequences did not exceed 4.1% from the total number of trimmed read pairs in each library (Table S1, Underlying data⁷⁰).

All prepared libraries were merged, and the resulting libraries were used as input for Trinity software. In general, 353,130 contigs with lengths greater than or equal to 200 nucleotides were assembled de novo. After the clusterization of similar sequences, the P. reticulata transcriptome included 267,188 contigs. TransRate assembly and optimal scores made up 0.3331 and 0.3835, respectively. More than 95% (256047/267188) of the contigs were well-assembled (“good”) according to the TransRate quality control results. The completeness analysis for “good” contigs using a database of the metazoan single-copy orthologues revealed that 96.3% (Single: 57.8%, duplicated: 38.5%) of the orthologues were assembled completely.

Peltogaster reticulata reference transcriptome included transcripts of 12,620 protein-encoding genes

Given the parasitic lifestyle of P. reticulata, the assembled transcriptome was checked for the presence of potential contamination. According to the RNAmmer analysis results, eight and nine sequences could be classified as 18S and 28S ribosomal RNAs, respectively. The comparison with the NCBI nucleotide database revealed that the contigs aligned successfully with ribosomal sequences from Alveolata (HQ891115.2), Fungi (AY382649.1; CP033152.1; CP030254.1; GQ336996.1; MF611880.1), and Metazoa (AY265359.1; EU082415.1; KY454201.1; EU370441.1; KU052603.1). Among the metazoan hits, only sequences from Crustacea were identified. The identity percentage with the database-derived 18S host (KY454201.1) and parasite (EU082415.1) sequences made up 88% (990/1,123) and 99% (1,200 / 1,201), respectively. In this study, the MCSC hierarchical clustering algorithm was used to remove potential contamination from the assembled transcriptome. The decontaminated transcriptome included 80,779 contigs and contained 81.6% (Single: 56.9%; Duplicated: 24.7%) of completely assembled and 6.2% fragmented single-copy metazoan orthologues. The number of paired-end reads successfully aligned to selected contigs varied from 10.74 (externa, first replicate) to 30.64 million (the thoracic part of the interna, second replicate) (Table S1, Underlying data⁷⁰).

The sequence expression level quantification was performed with Salmon by mapping selected read pairs to the decontaminated transcriptome. The mapping rate ranged from 85% to 91%. The transcript-to-gene map was used to obtain gene expression levels in TPM values. After excluding genes with a low activity in all analysed samples (expression level < 1 TPM), the dataset contained TPM values for 20,980 contigs. According to the TransDecoder results, 32,990 contigs encoded proteins with lengths ≥ 100 amino acids.

Only the protein-coding genes with a noticeable expression were involved in further analysis. After filtering by expression level (≥ 1 TPM in at least one sample) and encoded protein lengths (≥ 100 aa), the reference gene set obtained for P. reticulata contained 12,620 sequences. For each gene, the longest protein encoded by its splice variants was selected as a representative sequence. The comparison with a single-copy metazoan orthologues database revealed that in the reference protein set of P. reticulata 75.6% of the orthologues were assembled completely (Single: 70.4%; Duplicated: 5.2%), whereas 3.4% and 21% of the sequences were fragmented or absent, respectively.

Most of the sequences from the reference sets were annotated successfully

The prepared reference sets were compared with publicly available databases. According to the results, 3,399, 8,502, and 9,690 genes had hits with NCBI nucleotide, SwissProt, and NCBI non-redundant databases, respectively. The overlap analysis revealed that 3,240 genes were successfully annotated using each database. Moreover, 6,090 genes belonged to at least one GO terms. The domain architecture of the encoded protein was identified for 8,803 genes based on the comparison with the PfamA database. Annotation results are presented in Table S2 (Underlying data⁷¹).

The proteins set of P. reticulata was similar to the reference proteome of another cirripede barnacle, Amphibalanus amphitrite.

The identification of orthogroups in P. reticulata and other Crustacea involved in this study was carried out in three stages. First, the proteomes of P. reticulata and eight reference crustacean species were analysed using OMA standalone. As a result, 24,840 OMA groups were discovered.

Secondly, a phylogenetic tree of the studied crustacean species was constructed based on the results (Figure S2, Extended data⁷²). A total of 609 OMA groups were selected, containing at least eight out of nine species. The multiple protein alignment results in each of the OMA groups were concatenated into a supermatrix. After site selection, the final supermatrix contained 282,907 sites. In the resulting phylogenetic tree, P. reticulata was united into the same taxon with another cirripede barnacle, Amphibalanus amphitrite, with full support (Figure S2, Extended data⁷²).

Thirdly, the resulting tree was used to refine the search results for orthogroups. More than 20,000 orthogroups were found: 24,840 and 20,874 OMA and HOGs were identified, respectively. Figure 1b shows the number of common OMA groups for pairs of species. P. reticulata had the largest number of "common" OMA Groups (4,354) with A. amphitrite. For comparison, P. reticulata had no more than 3,490 “common” OMA groups with other crustacean species, the smallest overlap being found with Portunus trituberculatus (1,262 OMA groups).

The externa was clustered separately from the interna based on the gene expression analysis results

The molecular signature of a body part was defined as a set of genes with an expression ≥ 2 TPM in the body part considered. Each molecular signature included at least 8,000 genes: the main trunk of interna (8,070 genes), the growing part (8,148), the thoracic part (8,223), and externa (9,233) (Table S3, Underlying data⁷³). Approximately 54% (6,829 / 12,620) of the genes from the reference set were included into molecular signatures of all body parts (Figure 1c). Figure 1c shows significant overlaps between the parts of the interna and an almost 10-fold difference in the number of “specific” genes between the interna parts and the externa. Based on the gene expression, the body parts were divided into two clusters (Figure 1d). The first cluster contained all parts of the interna, while the second cluster contained only two replicates of the externa.

According to the differential expression analysis results, the number of over-expressed genes varied from 204 (the main trunk of the interna) to 2,224 (the externa) (Table S3, Underlying data⁷³). The number of genes with an increased expression in the interna did not exceed 283. Figure 1e shows that (i) only one gene was over-expressed both in the externa and in the interna, (ii) the number of “shared” over-expressed genes between different interna parts was ≤ 35, (iii) only six genes were over-expressed in the whole interna. The externa clustered separately from the interna, and all interna parts were remote from each other (Figure 1f).

The results of identification of molecular signatures and differentially expressed genes are presented in Table S3 (Underlying data⁷³).

GSEA results and histological studies indicate the presence of germ-like cells in the interna

Figure 2a and b show Venn diagrams for lists of bioprocesses enriched with genes included in the molecular signatures (Figure 2a) and genes with increased expression (Figure 2b) in the female body parts considered. In contrast to the externa, where many active bioprocesses were associated with development, bioprocesses enriched in the interna were mainly connected with metabolism. Comparative analysis revealed that 50 bioprocesses were active in at least two parts of the female body. Among them were “mitotic cell cycle process” (main trunk and thoracic part of interna), “cell division” (growing and thoracic part of interna), “homeostasis of number of cells” (same), “apoptotic signalling pathway” (same), “symbiotic process” (growing and main trunk of interna), “determination of adult lifespan” (same), “immune system process” (growing part, main trunk, and thoracic part of interna), and “autophagy” (same).

Figure 2. Gene set enrichment analysis (GSEA) results for the gene sets.

(a, b) Venn diagram for sets of parental bioprocesses enriched with either genes included in the molecular signature of the body part (a) or genes over-expressed in body part considered (b). (c) Clouds of enriched parental bioprocesses for the body parts. Most often, the bioprocess was found in the list of enriched bioprocesses, the larger the word size. Abbreviations: MS/Over – bioprocesses enriched with genes included either in molecular signature or in sets of over-expressed genes; PB – parental bioprocesses. (d– f) Histological sections of the main trunk of P. reticulata 1 - the wall of the main trunk; 2 - central lumen; 3 - groups of the floating cells; 4 - Nuage body. Scale bars: d - 200µm, e - 20µm, f - 50µm.

The “germ cell development” bioprocess was enriched in the main trunk and the growing part of the interna, whereas “female gamete generation” was only found among the lists of enriched bioprocesses in the thoracic interna part (Figure 2c). At the same time, according to our results, meiosis probably only occurred in the externa (Figure 2c). The results of histological studies also confirmed the presence of germ-like cells in the central lumen of the rootlets of the interna. Groups of floating small round cells with a high nuclear cytoplasmic ratio were found in the central lumen of the main trunk and peripheral rootlets (Figure 2d–f). A Nuage body (Figure 2d–f), which is a marker of germ cells, was present in each cell next to the nucleus.

No common bioprocess was found in the different parts of the female body, enriched with over-expressed genes. The genes with over-expression in the externa were involved in bioprocesses associated with cuticle transformation and development of the nervous system. In the growing interna part, over-expressed genes were involved in “gland development,” “response to nutrient levels”, “organic acid transport”, as well as lipid and fatty acid metabolic processes. In addition to various metabolic processes, “determination of adult lifespan” and “intrinsic apoptotic signaling pathway” were also found among a set of enriched bioprocesses for the main trunk. Bioprocesses associated with responses to various stimuli (bacterium/oxygen-containing compound/wounding), as well as “interspecies interaction between organisms”, “ion transmembrane transport”, “cellular homeostasis”, and “aging” were classified as “enriched” in the thoracic part of the interna.

All GSEA results are presented in Table S4 (Extended data⁷⁴).

Hundreds of genes encoding potential excretory/secretory proteins (ESP) were identified

The identification of potential ESP was performed in silico. A total of 852 "classical" and 282 "non-classical" ESP were found, which, respectively, had or did not have classical N-terminal signal peptides. Figure 3a, b shows Venn diagrams for the sets of genes encoding “classical” (Figure 3a) and “non-classical” (Figure 3b) ESP, respectively, which were included in the molecular signatures of body parts. Approximately 35% (297/852) of the genes encoding “classical” ESP had a noticeable expression level in all body parts considered. At the same time, more than half (143/282) of genes encoding “non-classical” ESP presented this expression pattern. In both cases, the externa had the largest number of “specific” ESP: 325 and 56 “classical” and “non-classical” ESP, respectively. Dozens of ESP (101 “classical” and 35 “non-classical”) were common for the three interna parts considered.

Figure 3. Gene set enrichment analysis (GSEA) results for the identified sets of potential excretory/secretory proteins (ESP).

(a, b) Venn diagram for sets of potential “classical” (a) and “non-classical” (b) ESP encoded by genes from molecular signatures of the body parts. (c) Clouds of parental bioprocesses enriched by genes encoding “classical” ESP. Most often the bioprocess was found in the list of enriched bioprocesses, the larger the word size. (d– e) Scanning electron microscope (SEM) photos of the interna of P. reticulata, 1 - rootlets; 2 – host tissues enlacing rootlets. (f) Confocal laser scanning microscopy (CLSM) photo of the interna of P. reticulata, scale bar 100µm, 3 - host tissues stained with antibodies against α-tubulin. Abbreviations: class/nonclassES – “classical” and “non-classical” ESP, respectively.

Both “classical” and “non-classical” ESP were divided into families based on their sequence similarity. For “classical” ESP, 35 families contained two to four proteins, while only two “non-classical” ESP were combined into one family. Most (579/852) of the “classical” ESP matched the MetazSecKB database, the hits being, e.g., mannose-binding proteins, serine proteinase, and cuticle proteins (Table S5, Extended data⁷⁵). Only 58 “non-classical” ESP matched MetazSecKB, of which 35 were “uncharacterized proteins” (Table S5, Extended data⁷⁵). All ESP were also compared to the NeuroPep database, with which only 14 “classical” ESP had hits (Table S5, Extended data⁷⁵). The latter included cerebellin-1, kininogen-1, insulin-like growth factors I and II, neuroparsin-A, nucleobindin-2, and five representatives of the serpin family.

Figure 3c shows bioprocesses enriched with genes encoding “classical” ESP. Among the body-part-specific bioprocesses were the “molting cycle” and “chitin-based cuticle development” in the externa, “positive regulation of cell communication” and “muscle organ development” in the growing interna part, and “immune response” and “regulation of apoptotic signaling pathway” in the main trunk and the thoracic part of interna, respectively. The majority (21/34) of enriched bioprocesses were common for two or more body parts considered. For example, “regulated exocytosis” (externa and growing part), “mesoderm development” (the growing part and the main trunk), “cell recognition” (same), “proteolysis” (all body parts considered), “motor neuron axon guidance” (same), “cell adhesion” (same), and “neuron recognition” (externa and thoracic part of interna). The activity of the bioprocesses associated with the involvement of the nervous system is consistent with the fact that trophic rootlets of P. reticulata were enlaced by a network of host's neurons marked by a presence of a-tubulin and serotonin (Figure 3d–f).

All ESP analysis results are presented in Table S5 (Extended data⁷⁵).

Significant differences between the TAI of body parts were revealed

Almost all (12,618 / 12,620) P. reticulata genes were distributed across 17 phylostrata, i.e., sets of genes that coalesce to founder genes having a common phylogenetic origin⁷⁶ (Table S6, Extended data⁷⁷) The three largest phylostrata were “Cellular organisms” (32.34%, 4,082 genes), “Eukaryota” (22.25%, 2,809 genes), and species-specific ones (13.7%, 1,729 genes) (Figure 4a). Less than 100 genes were assigned to “Ecdysozoa” (50 genes), “Panarthropoda” (44 genes), “Mandibulata” (63 genes), “Crustacea” (28 genes), and “Hexanauplia” (6 genes). The phylostratum “Cirripedia”, which included two barnacles, A. amphitrite and P. reticulata, consisted of 363 genes.

Figure 4. Phylostratigraphy and evolutionary transcriptomics results.

(a) Phylostratigraphic composition analysis results for P. reticulata reference gene set (“reference”), set of genes with noticeable (≥ 2 transcripts-per-million) expression in all female body parts considered (“common expr”), sets of genes over-expressed (externa/growing/main trunk/thoracic_overexpr) and sets of genes encoding potential “classical” (“classical_ESP”) and “non-classical” (“nonclassical_ESP”) excretory/secretory proteins (ESP). (b, c) Relative mean expression levels of phylostrata which occurred “before” (b) or “after” (c) division of Crustacea. (d) Transcriptome Age Indices (TAI) variation for female body parts considered. A lower TAI value describes an “older” transcriptome, whereas a higher TAI denotes a “younger” one. (e) The cumulative phylostrata contribution to the final (global) TAI profile.

The phylostratigraphic results were also used for composition analysis of various gene sets (Figure 4a). More than half of the genes with expression ≥ 2 TPM in all body parts considered belonged to “Cellular organisms” and “Eukaryota” phylostrata. The proportion of species-specific genes in this set was approximately 7% (490/6,829). Noticeable differences in the contributions of different phylostrata to the sets of over-expressed genes were found. For example, approximately 31% (88/283) of such genes in the thoracic interna part belonged to the species-specific phylostratum. In contrast, in other parts of the body, the proportion of species-specific genes from the total number of over-expressed genes did not exceed 16%. A complex phylostratigraphic composition was also revealed for genes encoding both "classical" and "non-classical" ESP. “Cellular organisms” phylostratum made the greatest contribution to the “classical” ESP (32.16%), while the species-specific phylostratum contributed most to the “non-classical” ESP (38.3%).

The phylostrata were divided into two groups: prior to the divergence of Crustacea (from “Cellular organisms” to “Crustacea”) and after this event (from “Multicrustacea” to “P. reticulata”). The relative expression patterns of the phylostrata are shown in Figure 4b, c. All phylostrata except the species-specific one had the highest relative expression in the externa, while the highest expression of the species-specific phylostratum was recorded in the thoracic part of the interna. At the same time, 14 out of 17 phylostrata had the least expression in the main trunk of the interna, the remaining three being “Bilateria”, “Pancrustacea”, and “Multicrustacea”.

One metric to quantify transcriptome conservation on a global scale is the TAI⁷⁸, which denotes the average transcriptome age throughout the biological process of interest⁶⁸. The TAI was measured for each part of the female P. reticulata body. The higher the value of the TAI, the greater the contribution of the “young” phylostrata. Significant differences between the TAI of different body parts were revealed. The TAI of the thoracic part of the interna was the highest (4.33), whereas the TAI of the growing part of the interna was the lowest (4.21) (Figure 4d). The partial contribution of the “P. reticulata” phylostratum to the TAI of the body part was about approximately three times greater than the contribution of any other phylostratum (Figure 4e).

The top-500 with the largest contribution to the TAI of body parts were extracted from the genes with the GO annotation and performed GSEA for these gene sets (Figure 5). Among the “specific” bioprocesses were “embryonic organ development” and “male sex differentiation” in the externa, “cell fate commitment involved in the formation of primary germ layer” and “positive regulation of chemotaxis” in the growing part of the interna, “formation of primary germ layer” and “gland morphogenesis” in the main trunk of the interna, and “response to external stimulus” and “cell population proliferation” in its thoracic part. Only four bioprocesses were common to all female body parts considered: “stem cell population maintenance”, “regulation of anatomical structure morphogenesis”, “chitin-based cuticle development”, and “animal organ morphogenesis”. Developmental processes were registered in each of the female body parts studied (Figure 5).

Figure 5. Gene set enrichment analysis (GSEA) results for top-500 annotated genes with the largest contribution to Transcriptome Age Index (TAI).

Clouds of parental bioprocesses enriched with top-500 genes with both Gene Ontology (GO) annotation and large contribution to TAI of the body part under consideration. The more often the bioprocess was found in the list of enriched bioprocesses, the larger the word size.

Phylostratigraphy and evolutionary transcriptomics results are presented in Tables S6 and S7 (Extended data^77,79), respectively.

Discussion

In this study, we obtained the first transcriptomes of a rhizocephalan and made a comparative analysis of the different body parts of an adult female rhizocephalan. Our results are a step towards understanding the functioning of these highly specialized parasites and the trends during their evolutionary history. The discussion below addresses 1) how the molecular signatures helped us to verify the functional role of each part of the parasite’s body; 2) potential excretory/secretory proteins and their putative role in host-parasite interactions; 3) the trends in rhizocephalan evolution derived from the phylostratigraphy and evolutionary transcriptomics results.

Contamination identification and elimination in molecular biological studies of non-model parasitic organisms is undoubtedly one of the main challenging tasks. In our case, this challenge was aggravated by the fact that both the parasite and its host were crustaceans. This means that approaches based on database comparison could be less effective than usual at separating the reads from different sources. For this reason, an MCSC algorithm was chosen, which classifies sequences based on the analysis of their properties. Based on the results of a preliminary orthogroup reconstruction (data not shown) and the reconstruction after decontamination, and considering that there was an approximate 14% reduction in the number of duplicates of single-copy metazoan orthologues, it can be assume that at least part of the signal from the host was removed. In further analyses, we focused on 12,620 protein-coding genes with a noticeable expression level. The results indicate that the reference gene set obtained in our study corresponds to those of other crustacean species in quality and completeness. The results of the analysis of orthogroups revealed that the proteome of P. reticulata was more similar to that of Amphibalanus amphitrite than to any other crustacean species involved in our study. Amphibalanus amphitrite belongs to the Thoracica, the sister group of Rhizocephala⁸⁰. Thoracican barnacles are mostly free-living but also highly transformed crustaceans. A comparative analysis involving numerous representatives of these two sister taxa may uncover evolutionarily conservative mechanisms of transformation in adult rhizocephalans. Some other parasitic crustaceans mostly from Copepoda have highly modified bodies as well^81,82. In the future, one of the promising directions of research will be comparative studies of the molecular basis for morphological simplification in different taxa.

The body of a female rhizocephalan is divided into the externa and an extensive interna, which has a different ultrastructural organization of its constituent parts^27,83. Our aim was to make a detailed record of the genome’s activity in the different parts of the female body. In order to achieve this the soft threshold of 2 TPM was used as a condition for identifying genes whose expression contributed to the molecular signature of the sample under consideration. The results of the study of gene expression in different parts of the female body showed that: 1) slightly more than half of the identified protein-coding genes worked in all examined parts of the body, 2) the lists of genes with an increased expression differed greatly between body parts, 3) the externa always clustered separately from the interna, although the differences were also found between the sites of the interna. These results suggest that the morphological heterogeneity of the female body is reflected in the spatial differences of gene expression (molecular heterogeneity).

However, it is conceivable that the molecular signature of the body part is a derivative of the transcriptomes of all cell types included in the body region analyzed. It also depends on the organism’s response to various stimuli and conditions (for example, the host's immune response, O₂ level and concentrations of different metabolites in the host’s hemolymph)^17,84. Therefore, the more similar the cellular composition of body parts or the set of factors affecting them are, the more similar their molecular signatures will be. In our study all parts of the interna clustered together and were distinct from the externa cluster. At the same time, the differences of the biological replicates of the externae could be explained by the fact that the externa contains embryos at different stages of development. The signal from the embryos was probably so strong that even the pooling of samples did not smooth out the differences.

Our results indicate that various processes are at work in different body parts of the female rhizocephalan. Active developmental processes were registered in the externa, which could be expected considering that it contains numerous embryos, while active metabolism processes were recorded in the interna (Figure 2c). These results are consistent with the classical concepts of the functional role of the individual parts of the rhizocephalan body^1,2.

However, some of our findings are at odds with the classical views. Previous morphological studies have postulated that the ovary is located in the visceral mass of the externa². However, the GSEA revealed that the female germ cells formed in the interna. Moreover, in histological sections we observed some cells floating in the central lumen of the main trunk, that looked like primary germ cells. Such cells have been described before^85,86 but were assumed to be stem cells, responsible for the formation of new buds of externae. Based on histological data and the GSEA results, it is suggested here that these cells are more likely to become female germ cells. We suppose that female germ cells begin and/or continue to form in the interna and then migrate to the externa, where they mature and are fertilized. In our opinion, the ovary of rhizocephalans is diffused within the interna, while the externa serves merely as a brooding chamber.

The discovery of a “diffused” ovary prompts a reconsideration of the phenomenon of rhizocephalan “coloniality”. As noted above, some rhizocephalans form numerous externae. It has been suggested that each externa is a separate reproductive module, and the entire animal has therefore been considered as a colony^2,85. However, if the ovary is in fact diffused and scattered across the interna and if numerous externae are merely brooding chambers, the term “coloniality” does not seem suitable for Rhizocephala. This issue calls for further research with the use of molecular and morphological methods.

Regardless of whether a rhizocephalan barnacle is a colony or an individual organism, it has to communicate with the host via special excretory molecules involved in particular bioprocesses³. We found that the composition of the potential excretome/secretome varied in different parts of the female rhizocephalan body and showed the character of the distribution of the secreted substances. For instance, in the externa there seems to be an active excretion/secretion of proteins involved in the storage of nutrients in the developing embryos as well as those involved in the moulting cycle. At the same time, proteins responsible for muscle development were also among the potential excretome/secretome of the growing part of the interna. These data confirm that the muscular system is formed in a growing tip of the main trunk²⁷.

Potential excretory/secretory proteins involved in neuron axon guidance were found in both the externa and the interna. While neuron axon guidance in the externa is probably associated with offspring development, the evidence of this process in the interna is less expected and more interesting. Direct contact between the parasite and the host’s nervous system was shown in this study and in our previous research³. We can expect the parasite to emit attracting signals, directing the host’s nervous system towards and along the interna. In addition, the excretory proteins involved in cell adhesion could also play an important role in the formation of a neural network around the rootlets. However, it cannot be ruled out that this transcriptome signal comes from the host tissues surrounding the rootlets of the parasite, since it is technically impossible to completely separate interna from the host’s tissues. Nevertheless, an intimate host-parasite interaction is an intriguing phenomenon requiring further in-depth research.

The evolution of parasitic barnacles and their interactions with the hosts remain enigmatic⁸⁷. In particular, this concerns the molecular basis of both the formation of phenotypes at different stages of the rhizocephalan life cycle and the interaction of the parasite with its host. At the same time, the principle of genomic phylostratigraphy implies that the genome of every extant species retains parts of the picture of the evolutionary epochs⁷⁶. In order to determine how the P. reticulata gene set changed during evolution, modern implementations of the phylostratigraphy were applied to identify gene groups with a common phylogenetic origin, called “phylostratum” or “phylostrata” in the plural form. The results of the phylostratigraphy analysis indicate that almost all P. reticulata genes were successfully distributed into 17 phylostrata. The third largest phylostratum was a species-specific one, which probably also includes genus- and family-specific orthologs. Given the transcriptome obtained in our study is the first to be reported for a rhizocephalan, it is difficult to determine what percentage of proteins from this phylostratum belongs to each of these categories. Nevertheless, we are confident that a more detailed analysis of this particular phylostratum will allow us to identify the molecular basis of the Rhizocephala-specific biological traits. However, it should be kept in mind that genes need a cellular environment, the combined action of multiple other genes, as well as certain conditions to have an observable effect on an organism⁸⁸.

It should be noted that we analysed the reference gene set, which was reconstructed based on the transcriptomic data, and the transcripts of poorly expressed or completely silent genes may be absent from our data. However, since transcriptomes were obtained for different regions of the female body and a soft expression level threshold was used, it is conceivable that many genes active in the adult female were represented in the reference gene set.

Our results indicate that evolutionarily younger genes make a relatively large contribution to the signatures of the externa and the thoracic part of the interna. The transcriptional “youth” of the externa could be associated with the fact that the samples contained highly modified males². Our assumption is also based on the fact that we observed "male sex differentiation" among the lists of bioprocesses enriched by genes with the largest contributions to the TAI of externa. On the contrary, the signature of the thoracic part probably does not have any additional components. Consequently, we can assume that a high TAI value for this region may be associated with the evolutionary transformation of the region itself. We have already previously identified morphological heterogeneity of the entire body of an adult rhizocephalan female^9,27. Given the differences between body regions in morphology, we also expected to find molecular and functional heterogeneity in the body of an adult female P. reticulata. The results obtained from the studies indicate that such heterogeneity manifests itself not only in the expression of individual genes and the activity of bioprocesses, but also in the contributions of various phylostrata to the molecular signatures of the interna regions studied. Further research should be directed towards a more detailed study of the identified differences in TAI between interna regions. At the same time, we assume that, for example, the revealed transcriptional “oldness” of the growing interna region is presumably due to the activity of conservative processes, including those associated with the cell cycle.

The GSEA results for genes with the greatest contribution to TAI, both in the interna and in the externa, revealed many developmental processes. One may get the impression that rhizocephalans have an incomplete and endless metamorphosis. Taking into account that the adult female body originates from a fraction of the larval body, we are inclined to agree with the hypothesis suggested by Glenner and Høeg⁸⁷. It postulates that ancestors of rhizocephalans were filter-feeding epibiotic barnacles and the interna of an adult female originates from the part of the larval body homologous to the peduncle of a Goose barnacle, whose metamorphosis went the “wrong” way. The peduncle separated from the rest of the body and gave rise to the interna^87,89. Nevertheless, a contemporary species can only serve as a proxy for an ancestral model. More transcriptomic/genomic data from other rhizocephalans are essential before a reliable reconstruction of the evolutionary history of this unique group of parasites can be achieved.

In summary, the first comparative transcriptomic results for rhizocephalans provided new exciting insights in our understanding of the molecular mechanisms underlying the biology of these extraordinary parasites. We identified the molecular and functional heterogeneity of the female rhizocephalan body and compared it to the previously documented morphological one. A similarity was found between the set of protein-coding genes of P. reticulata and that of a free-living representative of the sister taxon (Thoracica) of the Rhizocephala. Both bioinformatic data analysis and histological results indicated the presence of germ cells in the lumen of the interna, casting doubt on the previously accepted phenomenon of rhizocephalan coloniality. The molecular basis of the interaction between the nervous system of the host and the parasite's interna was determined. Differences between body parts in terms of phylostratum expression and their contribution to molecular signatures were established. Our results indicate that rhizocephalans probably “got stuck in their metamorphosis” even at the reproductive stage. Our study can serve as a basis for future research on rhizocephalan evolution.

Data availability

All Python and R scripts used in the study are publicly available: https://github.com/maxnest/From_head_to_rootlet

Underlying data

NCBI BioProject: Peltogaster reticulata Raw sequence reads, accession number: PRJNA798055.

Figshare: Table S1. Summary of paired-end read libraries preparation results, https://doi.org/10.6084/m9.figshare.19307486⁷⁰

Figshare: Table S2. Peltogaster reticulata reference sequence set annotation results, https://doi.org/10.6084/m9.figshare.19307516⁷¹

Figshare: Table S3. Gene expression quantification and analysis results https://doi.org/10.6084/m9.figshare.19307549⁷³

Extended data

Figshare: Figure S1. Phylogenetic tree of species whose data were used for phylostratigraphic analysis of the Peltogaster reticulata gene set, https://doi.org/10.6084/m9.figshare.21747554⁶⁷

Figshare: Figure S2. Phylogenetic relationships between the crustacean species based on orthologs analysis results, https://doi.org/10.6084/m9.figshare.19307444⁷²

Figshare: Table S4. Gene Set Enrichment Analysis (GSEA) results for the molecular signatures and sets of over-expressed genes, https://doi.org/10.6084/m9.figshare.19307570⁷⁴

Figshare: Table S5. Potential excretory/secretory proteins analysis results, https://doi.org/10.6084/m9.figshare.19307594⁷⁵

Figshare: Table S6. Phylostratigraphic affiliation analysis results for different set of sequences, https://doi.org/10.6084/m9.figshare.19307606⁷⁷

Figshare: Table S7. Evolutionary transcriptomics results, https://doi.org/10.6084/m9.figshare.19307621⁷⁹

Acknowledgements

We thank the staff of the Resource Centers “Microscopy and Microanalysis”, “Molecular and Cell Technologies”, and “The Bio-Bank” of the Research Park of St Petersburg State University and “Taxon” Research Resource Center (http://www.ckp-rf.ru/ckp/3038/) of the Zoological Institute of the Russian Academy of Sciences for technical assistance. Data analysis was performed at the Bioinformatics Shared Access Center of the Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences. We are grateful to Dr. Olga Korn and the diving team of the Marine Biological Station “Vostok” of National Scientific Center of Marine Biology, Far East Branch of Russian Academy of Science, for the help with the collection of specimens. We are grateful to Dr. Igor Adameyko and Natalia Lentsman for help with the manuscript preparation. We also express our gratitude to Ilya Borisenko for the help with the management of the research.

Faculty Opinions recommended

References

1. Bresciani J, Høeg JT: Comparative ultrastructure of the root system in rhizocephalan barnacles (Crustacea: Cirripedia: Rhizocephala). J Morphol. 2001; 249(1): 9–42. PubMed Abstract | Publisher Full Text
2. Høeg JT: The biology and life cycle of the Rhizocephala (Cirripedia). J Mar Biol Assoc United Kingdom. 1995; 75(3): 517–50. Publisher Full Text
3. Miroliubov A, Borisenko I, Nesterenko M, et al.: Specialized structures on the border between rhizocephalan parasites and their host’s nervous system reveal potential sites for host-parasite interactions. Sci Rep. 2020; 10(1): 1128. PubMed Abstract | Publisher Full Text | Free Full Text
4. Lianguzova AD, Ilyutkin SA, Korn OM, et al.: Specialised rootlets of Sacculina pilosella. (Rhizocephala: Sacculinidae) used for interactions with its host’s nervous system. Arthropod Struct Dev. 2021; 60: 101009. PubMed Abstract | Publisher Full Text
5. Høeg JT, Lützen J: Life cycle and reproduction in the Cirripedia, Rhizocephala. Oceanogr Mar Biol an Annu Rev. 1995; 33: 427–85. Reference Source
6. Lützen J, Du PT: Three colonial rhizocephalans from mantis shrimps and a crab in Vietnam, including Pottsia serenei, new species (Cirripedia: Rhizocephala: Thompsoniidae). J Crustac Biol. 1999; 19(4): 902–7. Publisher Full Text
7. Høeg JT, Lützen J: Comparative morphology and phylogeny of the family Thompsoniidae (Cirripedia, Rhizocephala, Akentrogonida), with descriptions of three new genera and seven new species. Zool Scr. 1993; 22(4): 363–86. Publisher Full Text
8. Glenner H: Cypris metamorphosis, injection and earliest internal development of the Rhizocephalan Loxothylacus panopaei. (Gissler). Crustacea: Cirripedia: Rhizocephala: Sacculinidae. J Morphol. 2001; 249(1): 43–75. PubMed Abstract | Publisher Full Text
9. Miroliubov AA, Borisenko IE, Nesterenko MA, et al.: Muscular system in the interna of Polyascus polygenea. and Sacculina pilosella. (Cirripedia: Rhizocephala: Sacculinidae). Invertebr Zool. 2019; 16(1): 48–56. Publisher Full Text
10. Alvarez F, Hines AH, Reaka-Kudla ML: The effects of parasitism by the barnacle Loxothylacus panopaei (Gissler) (Cirripedia: Rhizocephala) on growth and survival of the host crab Rhithropanopeus harrisii (Gould) (Brachyura: Xanthidae). J Exp Mar Bio Ecol. 1995; 192(2): 221–32. Publisher Full Text
11. Bishop RK, Cannon LRG: Morbid behaviour of the commercial sand crab, Portunus pelagicus (L.), parasitized by Sacculina granifera Boschma, 1973 (Cirripedia: Rhizocephala). J Fish Dis. 1979; 2(2): 131–44. Publisher Full Text
12. Innocenti G, Pinter N, Galil BS: Observations on the agonistic behavior of the swimming crab Charybdis longicollis Leene infected by the rhizocephalan barnacle Heterosaccus dollfusi Boschma. Can J Zool. 2003; 81(1): 173–6. Publisher Full Text
13. Vázquez-lópez H, Alvarez F, Franco J, et al.: Observations on the Behavior of the Dark Crab Callinectes rathbunae Contreras Parasitized with the Rhizocephalan Loxothylacus texanus Boschma. Int J Zool Res. 2006; 2(4): 344–53. Publisher Full Text
14. Larsen MH, Høeg JT, Mouritsen KN: Influence of infection by Sacculina carcini (Cirripedia, Rhizocephala) on consumption rate and prey size selection in the shore crab Carcinus maenas. J Exp Mar Bio Ecol. 2013; 446: 209–15. Publisher Full Text
15. Vásquez-López H: Affectation of Swimming Capacity in Callinectes rathbunae (Crustacea: Brachyura) Caused by Loxothylacus Texanus (Crustacea: Rhizocephala). Res J Fish Hydrobiol. 2010; 5(2): 76–80. Reference Source
16. Toscano BJ, Newsome B, Griffen BD: Parasite modification of predator functional response. Oecologia. 2014; 175(1): 345–52. PubMed Abstract | Publisher Full Text
17. Alvarez F, Alcaraz G, Robles R: Osmoregulatory disturbances induced by the parasitic barnacle Loxothylacus texanus (Rhizocephala) in the crab Callinectes rathbunae (Portunidae). J Exp Mar Bio Ecol. 2002; 278(2): 135–40. Publisher Full Text
18. Zacher LS, Horstmann L, Hardy SM: A field-based study of metabolites in sacculinized king crabs Paralithodes camtschaticus (Tilesius, 1815) and Lithodes aequispinus Benedict, 1895 (Decapoda: Anomura: Lithodidae). J Crustac Biol. 2018; 38(6): 794–803. Publisher Full Text
19. Takahashi T, Iwashige A, Matsuura S: Behavioral manipulation of the shore crab, Hemigrapsus sanguineus by the rhizocephalan barnacle, Sacculina polygenea. Crustacean Research. 1997; 26: 153–61. Publisher Full Text
20. Belgrad BA, Griffen BD: Rhizocephalan infection modifies host food consumption by reducing host activity levels. J Exp Mar Bio Ecol. 2015; 466: 70–5. Publisher Full Text
21. Fuchs B, Wang W, Graspeuntner S, et al.: Regulation of Polyp-to-Jellyfish Transition in Aurelia aurita. Curr Biol. 2014; 24(3): 263–73. PubMed Abstract | Publisher Full Text
22. Nesterenko M, Starunov V, Shchenkov S, et al.: Molecular signatures of the rediae, cercariae and adult stages in the complex life cycles of parasitic flatworms (Digenea: Psilostomatidae). Parasit Vectors. 2020; 13(1): 559. PubMed Abstract | Publisher Full Text | Free Full Text
23. Torruella G, Grau-Bové X, Moreira D, et al.: Global transcriptome analysis of the aphelid Paraphelidium tribonemae supports the phagotrophic origin of fungi. Commun Biol. 2018; 1(1): 231. PubMed Abstract | Publisher Full Text | Free Full Text
24. Almudi I, Vizueta J, Wyatt CDR, et al.: Genomic adaptations to aquatic and aerial life in mayflies and the origin of insect wings. Nat Commun. 2020; 11(1): 2631. PubMed Abstract | Publisher Full Text | Free Full Text
25. Wang J, Zhang L, Lian S, et al.: Evolutionary transcriptomics of metazoan biphasic life cycle supports a single intercalation origin of metazoan larvae. Nat Ecol Evol. 2020; 4(5): 725–36. PubMed Abstract | Publisher Full Text
26. Isaeva VV, Akhmadieva AV, Shukalyuk AI: The hidden coloniality at the parasitic stage in Peltogaster reticulatus (Crustacea: Rhizocephala). J Mar Biol Assoc United Kingdom. 2012; 92(3): 457–62. Publisher Full Text
27. Miroliubov AA: Muscular system in interna of Peltogaster paguri (Rhizocephala: Peltogastridae). Arthropod Struct Dev. 2017; 46(2): 230–5. PubMed Abstract | Publisher Full Text
28. Allam A, Kalnis P, Solovyev V: Karect: accurate correction of substitution, insertion and deletion errors for next-generation sequencing data. Bioinformatics. 2015; 31(21): 3421–8. PubMed Abstract | Publisher Full Text
29. Grabherr MG, Haas BJ, Yassour M, et al.: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29(7): 644–52. PubMed Abstract | Publisher Full Text | Free Full Text
30. Fu L, Niu B, Zhu Z, et al.: CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012; 28(23): 3150–2. PubMed Abstract | Publisher Full Text | Free Full Text
31. Smith-Unna R, Boursnell C, Patro R, et al.: TransRate: reference-free quality assessment of de novo transcriptome assemblies. Genome Res. 2016; 26(8): 1134–44. PubMed Abstract | Publisher Full Text | Free Full Text
32. Lagesen K, Hallin P, Rødland EA, et al.: RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007; 35(9): 3100–8. PubMed Abstract | Publisher Full Text | Free Full Text
33. Camacho C, Coulouris G, Avagyan V, et al.: BLAST+: Architecture and applications. BMC Bioinformatics. 2009; 10(1): 421. PubMed Abstract | Publisher Full Text | Free Full Text
34. Lafond-Lapalme J, Duceppe MO, Wang S, et al.: A new method for decontamination of de novo transcriptomes using a hierarchical clustering algorithm. Bioinformatics. 2017; 33(9): 1293–300. PubMed Abstract | Publisher Full Text
35. Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9(4): 357–9. PubMed Abstract | Publisher Full Text | Free Full Text
36. Patro R, Duggal G, Love MI, et al.: Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017; 14(4): 417–9. PubMed Abstract | Publisher Full Text | Free Full Text
37. Wagner P, Kin K, Lynch VJ: Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. 2012; 131(4): 281–5. PubMed Abstract | Publisher Full Text
38. Buchfink B, Xie C, Huson DH: Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015; 12(1): 59–60. PubMed Abstract | Publisher Full Text
39. El-Gebali S, Mistry J, Bateman A, et al.: The Pfam protein families database in 2019. Nucleic Acids Res. 2019; 47(D1): D427–32. PubMed Abstract | Publisher Full Text | Free Full Text
40. Simão FA, Waterhouse RM, Ioannidis P, et al.: BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015; 31(19): 3210–2. PubMed Abstract | Publisher Full Text
41. Waterhouse RM, Seppey M, Simão FA, et al.: BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. 2018; 35(3): 543–8. PubMed Abstract | Publisher Full Text | Free Full Text
42. The UniProt Consortium: UniProt: The universal protein knowledgebase. Nucleic Acids Res. 2017; 45(D1): D158–69. PubMed Abstract | Publisher Full Text | Free Full Text
43. Bateman A, Martin MJ, Orchard S, et al.: UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2021; 49(D1): D480–9. PubMed Abstract | Publisher Full Text | Free Full Text
44. Shah N, Nute MG, Warnow T, et al.: Misunderstood parameter of NCBI BLAST impacts the correctness of bioinformatics workflows. Bioinformatics. 2019; 35(9): 1613–1614. PubMed Abstract | Publisher Full Text
45. Cantalapiedra CP, Hernández-Plaza A, Letunic I, et al.: eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021; 38(12): 5825–5829. PubMed Abstract | Publisher Full Text | Free Full Text
46. Altenhoff AM, Levy J, Zarowiecki M, et al.: OMA standalone: Orthology inference among public and custom genomes and transcriptomes. Genome Res. 2019; 29(7): 1152–63. PubMed Abstract | Publisher Full Text | Free Full Text
47. Dylus D, Nevers Y, Altenhoff AM, et al.: How to build phylogenetic species trees with OMA [version 2; peer review: 2 approved]. F1000Res. 2020; 9: 511. Publisher Full Text
48. Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol. 2013; 30(4): 772–80. PubMed Abstract | Publisher Full Text | Free Full Text
49. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T: trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009; 25(15): 1972–1973. PubMed Abstract | Publisher Full Text | Free Full Text
50. Guindon S, Gascuel O: A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. Syst Biol. 2003; 52(5): 696–704. PubMed Abstract | Publisher Full Text
51. Darriba D, Taboada GL, Doallo R, et al.: ProtTest 3: Fast selection of best-fit models of protein evolution. Bioinformatics. 2011; 27(8): 1164–1165. PubMed Abstract | Publisher Full Text | Free Full Text
52. Nguyen LT, Schmidt HA, Von Haeseler A, et al.: IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015; 32(1): 268–74. PubMed Abstract | Publisher Full Text | Free Full Text
53. Minh BQ, Schmidt HA, Chernomor O, et al.: IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020; 37(5): 1530–1534. PubMed Abstract | Publisher Full Text | Free Full Text
54. Paradis E, Schliep K: ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019; 35(3): 526–8. PubMed Abstract | Publisher Full Text
55. Wagner GP, Kin K, Lynch VJ: A model based criterion for gene expression calls using RNA-seq data. Theory Biosci. 2013; 132(3): 159–64. PubMed Abstract | Publisher Full Text
56. Zambelli F, Mastropasqua F, Picardi E, et al.: RNentropy: An entropy-based tool for the detection of significant variation of gene expression across multiple RNA-Seq experiments. Nucleic Acids Res. 2018; 46(8): e46. PubMed Abstract | Publisher Full Text | Free Full Text
57. Heberle H, Meirelles GV, da Silva FR, et al.: InteractiVenn: A web-based tool for the analysis of sets through Venn diagrams. BMC Bioinformatics. 2015; 16(1): 169. PubMed Abstract | Publisher Full Text | Free Full Text
58. Garg G, Ranganathan S: In silico secretome analysis approach for next generation sequencing transcriptomic data. BMC Genomics. 2011; 12 Suppl 3(Suppl 3): S14. PubMed Abstract | Publisher Full Text | Free Full Text
59. Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, et al.: SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat Biotechnol. 2019; 37(4): 420–3. PubMed Abstract | Publisher Full Text
60. Bendtsen JD, Jensen LJ, Blom N, et al.: Feature-based prediction of non-classical and leaderless protein secretion. Protein Eng Des Sel. 2004; 17(4): 349–56. PubMed Abstract | Publisher Full Text
61. Emanuelsson O, Nielsen H, Brunak S, et al.: Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol. 2000; 300(4): 1005–16. PubMed Abstract | Publisher Full Text
62. Krogh A, Larsson B, von Heijne G, et al.: Predicting Transmembrane Protein Topology with a Hidden Markov Model: Application to Complete Genomes. J Mol Biol. 2001; 305(3): 567–80. PubMed Abstract | Publisher Full Text
63. Miele V, Penel S, Duret L: Ultra-fast sequence clustering from similarity networks with SiLiX. BMC Bioinformatics. 2011; 12(1): 116. PubMed Abstract | Publisher Full Text | Free Full Text
64. Wang Y, Wang M, Yin S, et al.: NeuroPep: A comprehensive resource of neuropeptides. Database (Oxford). 2015; 2015: bav038. PubMed Abstract | Publisher Full Text | Free Full Text
65. Meinken J, Walker G, Cooper CR, et al.: MetazSecKB: The human and animal secretome and subcellular proteome knowledgebase. Database (Oxford). 2015; 2015: bav077. PubMed Abstract | Publisher Full Text | Free Full Text
66. Arendsee Z, Li J, Singh U, et al.: Phylostratr: A framework for phylostratigraphy. Bioinformatics. 2019; 35(19): 3617–27. PubMed Abstract | Publisher Full Text
67. Nesterenko M: Figure S1. Phylogenetic tree of species whose data were used for phylostratigraphic analysis of the Peltogaster reticulata gene set. figshare. Figure. 2022. http://www.doi.org/10.6084/m9.figshare.21747554.v1
68. Drost HG, Gabel A, Liu J, et al.: MyTAI: Evolutionary transcriptomics with R. Bioinformatics. 2018; 34(9): 1589–90. PubMed Abstract | Publisher Full Text | Free Full Text
69. Schindelin J, Arganda-Carreras I, Frise E, et al.: Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012; 9(7): 676–82. PubMed Abstract | Publisher Full Text | Free Full Text
70. Nesterenko M: Table S1. Summary of paired-end read libraries preparation results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307486.v1
71. Nesterenko M: Table S2. Peltogaster reticulata reference sequence set annotation results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307516.v1
72. Nesterenko M: Figure S2. Phylogenetic relationships between the crustacean species based on orthologs analysis results. figshare. Figure. 2022. http://www.doi.org/10.6084/m9.figshare.19307444.v4
73. Nesterenko M: Table S3. Gene expression quantification and analysis results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307549.v1
74. Nesterenko M: Table S4. Gene Set Enrichment Analysis (GSEA) results for the molecular signatures and sets of over-expressed genes. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307570.v1
75. Nesterenko M: Table S5. Potential excretory/secretory proteins analysis results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307594.v1
76. Domazet-Loso T, Brajković J, Tautz D: A phylostratigraphy approach to uncover the genomic history of major adaptations in metazoan lineages. Trends Genet. 2007; 23(11): 533–9. PubMed Abstract | Publisher Full Text
77. Nesterenko M: Table S6. Phylostratigraphic affiliation analysis results for different set of sequences. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307606.v1
78. Domazet-Lošo T, Tautz D: A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns. Nature. 2010; 468(7325): 815–8. PubMed Abstract | Publisher Full Text
79. Nesterenko M: Table S7. Evolutionary transcriptomics results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307621.v1
80. Chan BKK, Dreyer N, Gale AS, et al.: The evolutionary diversity of barnacles, with an updated classification of fossil and living forms. Zool J Linn Soc. 2021; 193(3): 789–846. Publisher Full Text
81. Boxshall GA, Harrison K: New nicothoid copepods (Copepoda: Siphonostomatoida) from an amphipod and from deep-sea isopods. Bulletin of the British Museum (Natural History) Zoology. London, BM(NH); 1988; 54: 285–99. Publisher Full Text
82. Lamb EJ, Boxshall GA, Mill PJ, et al.: Nucellicolidae: A New Family of Endoparasitic Copepods (Poecilostomatoida) from the Dog Whelk Nucella lapillus (Gastropoda). J Crustac Biol. 1996; 16(1): 142-148. Publisher Full Text
83. Miroliubov AA, Lianguzova AD, Ilyutkin SA, et al.: The interna of the rhizocephalan Peltogaster reticulata: Comparative morphology and ultrastructure. Arthropod Struct Dev. 2022; 70: 101190. PubMed Abstract | Publisher Full Text
84. Shirley SM, Shirley TC, Meyers TR: Hemolymph responses of Alaskan king crabs to rhizocephalan parasitism. Can J Zool. 1986; 64(8): 1774–81. Publisher Full Text
85. Shukalyuk A, Isaeva V, Kizilova E, et al.: Stem cells in the reproductive strategy of colonial rhizocephalan crustaceans (Crustacea: Cirripedia: Rhizocephala). Invertebr Reprod Dev. 2005; 48(1–3): 41–53. Publisher Full Text
86. Isaeva VV, Shukalyuk AI, Trofimova AV, et al.: The structure of colonial interna in Sacculina polygenea (Crustacea: Cirripedia: Rhizocephala). Crustacean Research. 2001; 30: 133–46. Publisher Full Text
87. Glenner H, Høeg JT: A Scenario for the Evolution of the Rhizocephala. In: Escobar-Briones E, Alvarez F, editors. Modern Approaches to the Study of Crustacea. Springer, Boston, MA; 2002; 301–10. Publisher Full Text
88. Orgogozo V, Morizot B, Martin A: The differential view of genotype-phenotype relationships. Front Genet. 2015; 6: 179. PubMed Abstract | Publisher Full Text | Free Full Text
89. Glenner H, Hebsgaard MB: Phylogeny and evolution of life history strategies of the Parasitic Barnacles (Crustacea, Cirripedia, Rhizocephala). Mol Phylogenet Evol. 2006; 41(3): 528–38. PubMed Abstract | Publisher Full Text

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 27 May 2022

Author details Author details

Aleksei Miroliubov
Roles: Conceptualization, Funding Acquisition, Investigation, Methodology, Resources, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

This study was supported by the Grant No. 21-74-00018 of the Russian Science Foundation, by a grant of the Ministry of Science and Higher Education of the Russian Federation (No. 075-15-2021-1069) and the state laboratory theme of the Zoological Institute 122031100260-0 (Biodiversity of parasites, life cycles, biology and evolution).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Article Versions (2)

version 2

Revised

Published: 09 Jan 2023, 11:583

https://doi.org/10.12688/f1000research.110492.2

version 1

Published: 27 May 2022, 11:583

https://doi.org/10.12688/f1000research.110492.1

© 2023 Nesterenko M and Miroliubov A. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

SEE MORE DETAILS

CITE

how to cite this article

Nesterenko M and Miroliubov A. From head to rootlet: comparative transcriptomic analysis of a rhizocephalan barnacle Peltogaster reticulata (Crustacea: Rhizocephala) [version 2; peer review: 2 approved] F1000Research 2023, 11:583 (https://doi.org/10.12688/f1000research.110492.2)

NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?

Key to Reviewer Statuses VIEW HIDE

ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions

Version 1

VERSION 1

PUBLISHED 27 May 2022

Views

Reviewer Report 14 Nov 2022

Viatcheslav Ivanenko, Shenzhen MSU-BIT University, Shenzhen, China; Department of Invertebrate Zoology, Lomonosov Moscow State University, Moscow, Russian Federation; Shirshov Institute of Oceanology, Russian Academy of Sciences, Moscow, Russian Federation

Approved

https://doi.org/10.5256/f1000research.122108.r153684

This manuscript is a completed work with new and significant scientific results. In my opinion, the manuscript is well written and illustrated, and the interpretation of the results raises no questions or doubts. The main remark concerns the fact that despite the uniqueness of many features of the biology and morphology of rhizocephalans, they are not the only highly modified crustaceans that parasitize crustaceans. In the first sentences of the manuscript or discussion, it would be appropriate to mention at least some poorly studied and highly modified copepods with root-like mouthparts that also parasitize other crustaceans. These copepods are belonging to the family Nicothoidae Dana, 1852-1853 and the synonymous family Choniostomatidae Hansen, 1886 (see Boxshall and Harrison, 1988¹).

Some phrases may not be very well written, but I am not a native English speaker to make edits.

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

I cannot comment. A qualified statistician is required.
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Boxshall GA, Harrison K: New nicothoid coepods (Copepoda: Siphonostomatoida) from an amphipod and from deep-sea isopods.Bulletin of the British Museum (Natural History), Zoology. 1988; 54 (6): 285-299 Reference Source

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Morphology, development, phylogeny and ecology of symbiotic crustaceans.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 09 Jan 2023

Maksim Nesterenko, Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

09 Jan 2023

Author Response

We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of ... Continue reading We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of many features of the biology and morphology of rhizocephalans, they are not the only highly modified crustaceans that parasitize crustaceans. In the first sentences of the manuscript or discussion, it would be appropriate to mention at least some poorly studied and highly modified copepods with root-like mouthparts that also parasitize other crustaceans. These copepods are belonging to the family Nicothoidae Dana, 1852-1853 and the synonymous family Choniostomatidae Hansen, 1886 (see Boxshall and Harrison, 19881).
You are absolutely correct in pointing out that rhizocephalans are not the only ones showing amazing body transformation among Crustacea. We have added information about parasitic copepods at the end of the second paragraph of the discussion.

Some phrases may not be very well written, but I am not a native English speaker to make edits.
Even though for both authors of the article English is not their native language, we have tried to do our best to convey our thoughts as correctly and clearly as possible. However, we have tried to improve some of the wording.
We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of many features of the biology and morphology of rhizocephalans, they are not the only highly modified crustaceans that parasitize crustaceans. In the first sentences of the manuscript or discussion, it would be appropriate to mention at least some poorly studied and highly modified copepods with root-like mouthparts that also parasitize other crustaceans. These copepods are belonging to the family Nicothoidae Dana, 1852-1853 and the synonymous family Choniostomatidae Hansen, 1886 (see Boxshall and Harrison, 19881).
You are absolutely correct in pointing out that rhizocephalans are not the only ones showing amazing body transformation among Crustacea. We have added information about parasitic copepods at the end of the second paragraph of the discussion.

Some phrases may not be very well written, but I am not a native English speaker to make edits.
Even though for both authors of the article English is not their native language, we have tried to do our best to convey our thoughts as correctly and clearly as possible. However, we have tried to improve some of the wording.
Competing Interests: No competing interests were disclosed Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 09 Jan 2023

Maksim Nesterenko, Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

09 Jan 2023

Author Response

We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of ... Continue reading We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of many features of the biology and morphology of rhizocephalans, they are not the only highly modified crustaceans that parasitize crustaceans. In the first sentences of the manuscript or discussion, it would be appropriate to mention at least some poorly studied and highly modified copepods with root-like mouthparts that also parasitize other crustaceans. These copepods are belonging to the family Nicothoidae Dana, 1852-1853 and the synonymous family Choniostomatidae Hansen, 1886 (see Boxshall and Harrison, 19881).
You are absolutely correct in pointing out that rhizocephalans are not the only ones showing amazing body transformation among Crustacea. We have added information about parasitic copepods at the end of the second paragraph of the discussion.

Some phrases may not be very well written, but I am not a native English speaker to make edits.
Even though for both authors of the article English is not their native language, we have tried to do our best to convey our thoughts as correctly and clearly as possible. However, we have tried to improve some of the wording.
We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of many features of the biology and morphology of rhizocephalans, they are not the only highly modified crustaceans that parasitize crustaceans. In the first sentences of the manuscript or discussion, it would be appropriate to mention at least some poorly studied and highly modified copepods with root-like mouthparts that also parasitize other crustaceans. These copepods are belonging to the family Nicothoidae Dana, 1852-1853 and the synonymous family Choniostomatidae Hansen, 1886 (see Boxshall and Harrison, 19881).
You are absolutely correct in pointing out that rhizocephalans are not the only ones showing amazing body transformation among Crustacea. We have added information about parasitic copepods at the end of the second paragraph of the discussion.

Some phrases may not be very well written, but I am not a native English speaker to make edits.
Even though for both authors of the article English is not their native language, we have tried to do our best to convey our thoughts as correctly and clearly as possible. However, we have tried to improve some of the wording.
Competing Interests: No competing interests were disclosed Close
Report a concern

Views

Reviewer Report 27 Jun 2022

Andreas Hejnol, Sars International Centre for Marine Molecular Biology, University of Bergen, Thormøhlensgate 55, 5006, Bergen, Norway, University of Bergen, Norway; University of Jena, Jena, Germany

Ferenc Kagan, University of Bergen, Bergen, Norway

Approved

https://doi.org/10.5256/f1000research.122108.r139230

This study provides the first glimpse into the transcriptomics of a parasitic rhizocephalan crustacean. The paper reads well, its coherent structure is easy to follow. The language used is appropriate for a publication and words are mindfully used when interpreting the results.

The authors set out to provide transcriptomic data of the parasitic barnacle Peltogaster reticulata, a timely goal as this highly specialized group of crustaceans is heavily understudied. This, with the addition of the current study, appears to change as a novel chromosome level genome assembly has been recently made available for another rhizocephalan species (Sacculina carcini). In this study different regions of the organism were selected, sequenced and their transcriptomic data were contrasted to each other. I found this approach sound to decipher the different roles of specialized body regions. I missed the description of the criteria used for separating the different body regions, especially for the internal structures.

The authors successfully recognized and removed the challenge of biological contamination of the data. A thorough and careful preprocessing of the data was performed. The generation of a high-quality transcriptome assembly is pivotal for any project with similar aims. Therefore, the authors spend a tremendous amount of time assuring the quality of the assembly and annotation. I liked the attention to the details during these steps. With several filtering criteria, they arrive at a set of contigs which are plausible. From the quality assessment, I missed the report of the full BUSCO scores (if BUSCO scores are the reported scores), not just the single-copy ortholog and duplicate, but also fragmented and missing proportions. Furthermore, with such strong filtering steps, valid data is also discarded in parallel with noise. To alleviate this problem generally multiple assemblers with multiple k-mer sizes are utilized and redundancy across the assemblies is dealt with downstream of the assembly (eg. DOI: 10.7717/peerj.5428 or https://doi.org/10.111/1755-0998.13593).[ref-1]^,[ref-2] This approach in my experience will usually greatly improve the assemblies, an improvement which might marginally benefit the authors of this study.

Next, the focus was shifted to quantifying the expression of genes in the body regions and finding differentially expressed genes among them. Gene ontological enrichment was performed on different sets of genes (molecular signature set, overexpressed set and different phylostratographic sets). The output of these analyses with the addition of histological data prompted the authors to hypothesize that female germ cells are formed in the internal trunk region and migrate from here to the externa part. Indeed, such results combined with previous reports hint towards this possibility, something which could be tested with an in situ hybridization for germ cell marker genes such as for example vasa or nanos (both of which are present in the assembled transcriptome).

Using various in silico methods the authors outlined a set of potential excretory/secretory proteins. Several uncharacterized proteins are retrieved from this identification method, some of which could potentially be involved at the interface between parasite and host, as suggested by the authors. Unfortunately, without further validation, this suggestion persists as a testable hypothesis.

Results from the phylostratographic analyses emphasize further the unique biology of parasitic barnacles. The interpretation of the results by the authors was legitimate. My concern with such approaches is their sensitivity to the BLAST algorithm (see a thorough description of the issue here: https://drostlab.github.io/myTAI/articles/Phylostratigraphy.html)

Furthermore, I found the wording used for describing the datasets utilized in similarity searches confusing. How many species were used for the phylostratographic mapping and how many of them were metazoan?

A rich resource of results was provided by the authors. With the descriptions provided the additional data is easily interpretable. At the moment the analyses conducted in this study are irreproducible without the custom scripts used by the authors. These should be made available for the community who wishes to replicate the study or to further investigate these intriguing parasitic barnacles.

Overall I found this publication exciting and serving as a springboard for future studies on parasitic barnacle biology or evolutionary biology of parasitic life histories. If the authors complement the manuscript with the missing parts outlined above (criteria used for separating the body regions, phylostratography wording, missing scripts) the article would be greatly enhanced.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Partly
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: comparative transcriptomics in development and evolution

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

CITE

Report a concern

Author Response 09 Jan 2023

Maksim Nesterenko, Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

09 Jan 2023

Author Response

We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to ... Continue reading We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to provide transcriptomic data of the parasitic barnacle Peltogaster reticulata, a timely goal as this highly specialized group of crustaceans is heavily understudied. This, with the addition of the current study, appears to change as a novel chromosome level genome assembly has been recently made available for another rhizocephalan species (Sacculina carcini). In this study different regions of the organism were selected, sequenced and their transcriptomic data were contrasted to each other. I found this approach sound to decipher the different roles of specialized body regions. I missed the description of the criteria used for separating the different body regions, especially for the internal structures.
We have added a more defined description of the way of separating different body parts in the Methods section.

The authors successfully recognized and removed the challenge of biological contamination of the data. A thorough and careful preprocessing of the data was performed. The generation of a high-quality transcriptome assembly is pivotal for any project with similar aims. Therefore, the authors spend a tremendous amount of time assuring the quality of the assembly and annotation. I liked the attention to the details during these steps. With several filtering criteria, they arrive at a set of contigs which are plausible. From the quality assessment, I missed the report of the full BUSCO scores (if BUSCO scores are the reported scores), not just the single-copy ortholog and duplicate, but also fragmented and missing proportions.
We refined the results of the comparison against the database of single-copy orthologs by indicating the percentages of orthologs that were duplicated, fragmented, or missing from our dataset. For example: “The comparison with a single-copy metazoan orthologues database revealed that in the reference protein set of P. reticulata 75.6% of the orthologues were assembled completely (Single: 70.4%; Duplicated: 5.2%), whereas 3.4% and 21% of the sequences were fragmented or absent, respectively.”

Furthermore, with such strong filtering steps, valid data is also discarded in parallel with noise. To alleviate this problem generally multiple assemblers with multiple k-mer sizes are utilized and redundancy across the assemblies is dealt with downstream of the assembly (eg. DOI: 10.7717/peerj.5428 or https://doi.org/10.111/1755-0998.13593).[ref-1],[ref-2] This approach in my experience will usually greatly improve the assemblies, an improvement which might marginally benefit the authors of this study.
We fully agree that the proposed method can indeed improve assembling results. Moreover, using multiple assemblers and re-running programs with different k-mer lengths was considered as a possible strategy in the early stages of planning a general analysis scheme based on the results presented in the TransRate publication (DOI: 10.1101/gr.196469.115). However, for the initial assembly, we opted for the Trinity program, which shows excellent assembly results even with default parameters. Now we are starting work on a new article, which will include new data, with the help of which we plan to improve the assembly presented in the current manuscript and refine the results obtained on its basis. As part of the new study, we also plan to test the approach you indicated in order to obtain the most complete and high-quality assembly of the transcriptome.

Next, the focus was shifted to quantifying the expression of genes in the body regions and finding differentially expressed genes among them. Gene ontological enrichment was performed on different sets of genes (molecular signature set, overexpressed set and different phylostratographic sets). The output of these analyses with the addition of histological data prompted the authors to hypothesize that female germ cells are formed in the internal trunk region and migrate from here to the externa part. Indeed, such results combined with previous reports hint towards this possibility, something which could be tested with an in situ hybridization for germ cell marker genes such as for example vasa or nanos (both of which are present in the assembled transcriptome).
Soon we are going to test our hypothesis by an in situ hybridization for germ cell marker genes. However, we plan to publish the results obtained in a separate article.

Using various in silico methods the authors outlined a set of potential excretory/secretory proteins. Several uncharacterized proteins are retrieved from this identification method, some of which could potentially be involved at the interface between parasite and host, as suggested by the authors. Unfortunately, without further validation, this suggestion persists as a testable hypothesis.
We fully agree that the presented results are preliminary and are aimed primarily at narrowing the range of potential targets for a more detailed study. Now we are working on it with the use of a proteomic approach and hope to publish the results soon in our new publications.

Results from the phylostratographic analyses emphasize further the unique biology of parasitic barnacles. The interpretation of the results by the authors was legitimate. My concern with such approaches is their sensitivity to the BLAST algorithm (see a thorough description of the issue here: https://drostlab.github.io/myTAI/articles/Phylostratigraphy.html)
Before starting the phylostratigraphic analysis, we carefully reviewed various materials, including the ones you mentioned. So, for example, according to this publication (https://doi.org/10.1093/molbev/msw284) BLAST is an appropriate and sufficiently sensitive tool in phylostratigraphic analysis that does not appear to introduce significant biases into evolutionary pattern inferences. Nevertheless, we agree that more complex comparisons of different methods are required to determine how correctly they detect distant homologs. By the way, as one of the promising directions for improving the strategy for phylostrata identification the Leapfrog method (doi:10.1101/gr.216226.116) proposed by your scientific group and based on the inclusion of «bridge» species can be considered.

Furthermore, I found the wording used for describing the datasets utilized in similarity searches confusing. How many species were used for the phylostratographic mapping and how many of them were metazoan?
In total, in addition to the P. reticulata, 139 species were included in the analysis, of which 44 were representatives of the Metazoa. We not only have provided this information in the Methods but have also included a phylogenetic tree of all species considered in the Supplementary materials. Thank you very much for pointing out to us that we missed this important information.

A rich resource of results was provided by the authors. With the descriptions provided the additional data is easily interpretable. At the moment the analyses conducted in this study are irreproducible without the custom scripts used by the authors. These should be made available for the community who wishes to replicate the study or to further investigate these intriguing parasitic barnacles.
All custom Python and R scripts used in the study are publicly available: https://github.com/maxnest/From_head_to_rootlet
We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to provide transcriptomic data of the parasitic barnacle Peltogaster reticulata, a timely goal as this highly specialized group of crustaceans is heavily understudied. This, with the addition of the current study, appears to change as a novel chromosome level genome assembly has been recently made available for another rhizocephalan species (Sacculina carcini). In this study different regions of the organism were selected, sequenced and their transcriptomic data were contrasted to each other. I found this approach sound to decipher the different roles of specialized body regions. I missed the description of the criteria used for separating the different body regions, especially for the internal structures.
We have added a more defined description of the way of separating different body parts in the Methods section.

The authors successfully recognized and removed the challenge of biological contamination of the data. A thorough and careful preprocessing of the data was performed. The generation of a high-quality transcriptome assembly is pivotal for any project with similar aims. Therefore, the authors spend a tremendous amount of time assuring the quality of the assembly and annotation. I liked the attention to the details during these steps. With several filtering criteria, they arrive at a set of contigs which are plausible. From the quality assessment, I missed the report of the full BUSCO scores (if BUSCO scores are the reported scores), not just the single-copy ortholog and duplicate, but also fragmented and missing proportions.
We refined the results of the comparison against the database of single-copy orthologs by indicating the percentages of orthologs that were duplicated, fragmented, or missing from our dataset. For example: “The comparison with a single-copy metazoan orthologues database revealed that in the reference protein set of P. reticulata 75.6% of the orthologues were assembled completely (Single: 70.4%; Duplicated: 5.2%), whereas 3.4% and 21% of the sequences were fragmented or absent, respectively.”

Furthermore, with such strong filtering steps, valid data is also discarded in parallel with noise. To alleviate this problem generally multiple assemblers with multiple k-mer sizes are utilized and redundancy across the assemblies is dealt with downstream of the assembly (eg. DOI: 10.7717/peerj.5428 or https://doi.org/10.111/1755-0998.13593).[ref-1],[ref-2] This approach in my experience will usually greatly improve the assemblies, an improvement which might marginally benefit the authors of this study.
We fully agree that the proposed method can indeed improve assembling results. Moreover, using multiple assemblers and re-running programs with different k-mer lengths was considered as a possible strategy in the early stages of planning a general analysis scheme based on the results presented in the TransRate publication (DOI: 10.1101/gr.196469.115). However, for the initial assembly, we opted for the Trinity program, which shows excellent assembly results even with default parameters. Now we are starting work on a new article, which will include new data, with the help of which we plan to improve the assembly presented in the current manuscript and refine the results obtained on its basis. As part of the new study, we also plan to test the approach you indicated in order to obtain the most complete and high-quality assembly of the transcriptome.

Next, the focus was shifted to quantifying the expression of genes in the body regions and finding differentially expressed genes among them. Gene ontological enrichment was performed on different sets of genes (molecular signature set, overexpressed set and different phylostratographic sets). The output of these analyses with the addition of histological data prompted the authors to hypothesize that female germ cells are formed in the internal trunk region and migrate from here to the externa part. Indeed, such results combined with previous reports hint towards this possibility, something which could be tested with an in situ hybridization for germ cell marker genes such as for example vasa or nanos (both of which are present in the assembled transcriptome).
Soon we are going to test our hypothesis by an in situ hybridization for germ cell marker genes. However, we plan to publish the results obtained in a separate article.

Using various in silico methods the authors outlined a set of potential excretory/secretory proteins. Several uncharacterized proteins are retrieved from this identification method, some of which could potentially be involved at the interface between parasite and host, as suggested by the authors. Unfortunately, without further validation, this suggestion persists as a testable hypothesis.
We fully agree that the presented results are preliminary and are aimed primarily at narrowing the range of potential targets for a more detailed study. Now we are working on it with the use of a proteomic approach and hope to publish the results soon in our new publications.

Results from the phylostratographic analyses emphasize further the unique biology of parasitic barnacles. The interpretation of the results by the authors was legitimate. My concern with such approaches is their sensitivity to the BLAST algorithm (see a thorough description of the issue here: https://drostlab.github.io/myTAI/articles/Phylostratigraphy.html)
Before starting the phylostratigraphic analysis, we carefully reviewed various materials, including the ones you mentioned. So, for example, according to this publication (https://doi.org/10.1093/molbev/msw284) BLAST is an appropriate and sufficiently sensitive tool in phylostratigraphic analysis that does not appear to introduce significant biases into evolutionary pattern inferences. Nevertheless, we agree that more complex comparisons of different methods are required to determine how correctly they detect distant homologs. By the way, as one of the promising directions for improving the strategy for phylostrata identification the Leapfrog method (doi:10.1101/gr.216226.116) proposed by your scientific group and based on the inclusion of «bridge» species can be considered.

Furthermore, I found the wording used for describing the datasets utilized in similarity searches confusing. How many species were used for the phylostratographic mapping and how many of them were metazoan?
In total, in addition to the P. reticulata, 139 species were included in the analysis, of which 44 were representatives of the Metazoa. We not only have provided this information in the Methods but have also included a phylogenetic tree of all species considered in the Supplementary materials. Thank you very much for pointing out to us that we missed this important information.

A rich resource of results was provided by the authors. With the descriptions provided the additional data is easily interpretable. At the moment the analyses conducted in this study are irreproducible without the custom scripts used by the authors. These should be made available for the community who wishes to replicate the study or to further investigate these intriguing parasitic barnacles.
All custom Python and R scripts used in the study are publicly available: https://github.com/maxnest/From_head_to_rootlet
Competing Interests: No competing interests were disclosed Close
Report a concern
Respond or Comment

COMMENTS ON THIS REPORT

Author Response 09 Jan 2023

Maksim Nesterenko, Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

09 Jan 2023

Author Response

We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to ... Continue reading We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to provide transcriptomic data of the parasitic barnacle Peltogaster reticulata, a timely goal as this highly specialized group of crustaceans is heavily understudied. This, with the addition of the current study, appears to change as a novel chromosome level genome assembly has been recently made available for another rhizocephalan species (Sacculina carcini). In this study different regions of the organism were selected, sequenced and their transcriptomic data were contrasted to each other. I found this approach sound to decipher the different roles of specialized body regions. I missed the description of the criteria used for separating the different body regions, especially for the internal structures.
We have added a more defined description of the way of separating different body parts in the Methods section.

The authors successfully recognized and removed the challenge of biological contamination of the data. A thorough and careful preprocessing of the data was performed. The generation of a high-quality transcriptome assembly is pivotal for any project with similar aims. Therefore, the authors spend a tremendous amount of time assuring the quality of the assembly and annotation. I liked the attention to the details during these steps. With several filtering criteria, they arrive at a set of contigs which are plausible. From the quality assessment, I missed the report of the full BUSCO scores (if BUSCO scores are the reported scores), not just the single-copy ortholog and duplicate, but also fragmented and missing proportions.
We refined the results of the comparison against the database of single-copy orthologs by indicating the percentages of orthologs that were duplicated, fragmented, or missing from our dataset. For example: “The comparison with a single-copy metazoan orthologues database revealed that in the reference protein set of P. reticulata 75.6% of the orthologues were assembled completely (Single: 70.4%; Duplicated: 5.2%), whereas 3.4% and 21% of the sequences were fragmented or absent, respectively.”

Furthermore, with such strong filtering steps, valid data is also discarded in parallel with noise. To alleviate this problem generally multiple assemblers with multiple k-mer sizes are utilized and redundancy across the assemblies is dealt with downstream of the assembly (eg. DOI: 10.7717/peerj.5428 or https://doi.org/10.111/1755-0998.13593).[ref-1],[ref-2] This approach in my experience will usually greatly improve the assemblies, an improvement which might marginally benefit the authors of this study.
We fully agree that the proposed method can indeed improve assembling results. Moreover, using multiple assemblers and re-running programs with different k-mer lengths was considered as a possible strategy in the early stages of planning a general analysis scheme based on the results presented in the TransRate publication (DOI: 10.1101/gr.196469.115). However, for the initial assembly, we opted for the Trinity program, which shows excellent assembly results even with default parameters. Now we are starting work on a new article, which will include new data, with the help of which we plan to improve the assembly presented in the current manuscript and refine the results obtained on its basis. As part of the new study, we also plan to test the approach you indicated in order to obtain the most complete and high-quality assembly of the transcriptome.

Next, the focus was shifted to quantifying the expression of genes in the body regions and finding differentially expressed genes among them. Gene ontological enrichment was performed on different sets of genes (molecular signature set, overexpressed set and different phylostratographic sets). The output of these analyses with the addition of histological data prompted the authors to hypothesize that female germ cells are formed in the internal trunk region and migrate from here to the externa part. Indeed, such results combined with previous reports hint towards this possibility, something which could be tested with an in situ hybridization for germ cell marker genes such as for example vasa or nanos (both of which are present in the assembled transcriptome).
Soon we are going to test our hypothesis by an in situ hybridization for germ cell marker genes. However, we plan to publish the results obtained in a separate article.

Using various in silico methods the authors outlined a set of potential excretory/secretory proteins. Several uncharacterized proteins are retrieved from this identification method, some of which could potentially be involved at the interface between parasite and host, as suggested by the authors. Unfortunately, without further validation, this suggestion persists as a testable hypothesis.
We fully agree that the presented results are preliminary and are aimed primarily at narrowing the range of potential targets for a more detailed study. Now we are working on it with the use of a proteomic approach and hope to publish the results soon in our new publications.

Results from the phylostratographic analyses emphasize further the unique biology of parasitic barnacles. The interpretation of the results by the authors was legitimate. My concern with such approaches is their sensitivity to the BLAST algorithm (see a thorough description of the issue here: https://drostlab.github.io/myTAI/articles/Phylostratigraphy.html)
Before starting the phylostratigraphic analysis, we carefully reviewed various materials, including the ones you mentioned. So, for example, according to this publication (https://doi.org/10.1093/molbev/msw284) BLAST is an appropriate and sufficiently sensitive tool in phylostratigraphic analysis that does not appear to introduce significant biases into evolutionary pattern inferences. Nevertheless, we agree that more complex comparisons of different methods are required to determine how correctly they detect distant homologs. By the way, as one of the promising directions for improving the strategy for phylostrata identification the Leapfrog method (doi:10.1101/gr.216226.116) proposed by your scientific group and based on the inclusion of «bridge» species can be considered.

Furthermore, I found the wording used for describing the datasets utilized in similarity searches confusing. How many species were used for the phylostratographic mapping and how many of them were metazoan?
In total, in addition to the P. reticulata, 139 species were included in the analysis, of which 44 were representatives of the Metazoa. We not only have provided this information in the Methods but have also included a phylogenetic tree of all species considered in the Supplementary materials. Thank you very much for pointing out to us that we missed this important information.

A rich resource of results was provided by the authors. With the descriptions provided the additional data is easily interpretable. At the moment the analyses conducted in this study are irreproducible without the custom scripts used by the authors. These should be made available for the community who wishes to replicate the study or to further investigate these intriguing parasitic barnacles.
All custom Python and R scripts used in the study are publicly available: https://github.com/maxnest/From_head_to_rootlet
We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to provide transcriptomic data of the parasitic barnacle Peltogaster reticulata, a timely goal as this highly specialized group of crustaceans is heavily understudied. This, with the addition of the current study, appears to change as a novel chromosome level genome assembly has been recently made available for another rhizocephalan species (Sacculina carcini). In this study different regions of the organism were selected, sequenced and their transcriptomic data were contrasted to each other. I found this approach sound to decipher the different roles of specialized body regions. I missed the description of the criteria used for separating the different body regions, especially for the internal structures.
We have added a more defined description of the way of separating different body parts in the Methods section.

The authors successfully recognized and removed the challenge of biological contamination of the data. A thorough and careful preprocessing of the data was performed. The generation of a high-quality transcriptome assembly is pivotal for any project with similar aims. Therefore, the authors spend a tremendous amount of time assuring the quality of the assembly and annotation. I liked the attention to the details during these steps. With several filtering criteria, they arrive at a set of contigs which are plausible. From the quality assessment, I missed the report of the full BUSCO scores (if BUSCO scores are the reported scores), not just the single-copy ortholog and duplicate, but also fragmented and missing proportions.
We refined the results of the comparison against the database of single-copy orthologs by indicating the percentages of orthologs that were duplicated, fragmented, or missing from our dataset. For example: “The comparison with a single-copy metazoan orthologues database revealed that in the reference protein set of P. reticulata 75.6% of the orthologues were assembled completely (Single: 70.4%; Duplicated: 5.2%), whereas 3.4% and 21% of the sequences were fragmented or absent, respectively.”

Furthermore, with such strong filtering steps, valid data is also discarded in parallel with noise. To alleviate this problem generally multiple assemblers with multiple k-mer sizes are utilized and redundancy across the assemblies is dealt with downstream of the assembly (eg. DOI: 10.7717/peerj.5428 or https://doi.org/10.111/1755-0998.13593).[ref-1],[ref-2] This approach in my experience will usually greatly improve the assemblies, an improvement which might marginally benefit the authors of this study.
We fully agree that the proposed method can indeed improve assembling results. Moreover, using multiple assemblers and re-running programs with different k-mer lengths was considered as a possible strategy in the early stages of planning a general analysis scheme based on the results presented in the TransRate publication (DOI: 10.1101/gr.196469.115). However, for the initial assembly, we opted for the Trinity program, which shows excellent assembly results even with default parameters. Now we are starting work on a new article, which will include new data, with the help of which we plan to improve the assembly presented in the current manuscript and refine the results obtained on its basis. As part of the new study, we also plan to test the approach you indicated in order to obtain the most complete and high-quality assembly of the transcriptome.

Next, the focus was shifted to quantifying the expression of genes in the body regions and finding differentially expressed genes among them. Gene ontological enrichment was performed on different sets of genes (molecular signature set, overexpressed set and different phylostratographic sets). The output of these analyses with the addition of histological data prompted the authors to hypothesize that female germ cells are formed in the internal trunk region and migrate from here to the externa part. Indeed, such results combined with previous reports hint towards this possibility, something which could be tested with an in situ hybridization for germ cell marker genes such as for example vasa or nanos (both of which are present in the assembled transcriptome).
Soon we are going to test our hypothesis by an in situ hybridization for germ cell marker genes. However, we plan to publish the results obtained in a separate article.

Using various in silico methods the authors outlined a set of potential excretory/secretory proteins. Several uncharacterized proteins are retrieved from this identification method, some of which could potentially be involved at the interface between parasite and host, as suggested by the authors. Unfortunately, without further validation, this suggestion persists as a testable hypothesis.
We fully agree that the presented results are preliminary and are aimed primarily at narrowing the range of potential targets for a more detailed study. Now we are working on it with the use of a proteomic approach and hope to publish the results soon in our new publications.

Results from the phylostratographic analyses emphasize further the unique biology of parasitic barnacles. The interpretation of the results by the authors was legitimate. My concern with such approaches is their sensitivity to the BLAST algorithm (see a thorough description of the issue here: https://drostlab.github.io/myTAI/articles/Phylostratigraphy.html)
Before starting the phylostratigraphic analysis, we carefully reviewed various materials, including the ones you mentioned. So, for example, according to this publication (https://doi.org/10.1093/molbev/msw284) BLAST is an appropriate and sufficiently sensitive tool in phylostratigraphic analysis that does not appear to introduce significant biases into evolutionary pattern inferences. Nevertheless, we agree that more complex comparisons of different methods are required to determine how correctly they detect distant homologs. By the way, as one of the promising directions for improving the strategy for phylostrata identification the Leapfrog method (doi:10.1101/gr.216226.116) proposed by your scientific group and based on the inclusion of «bridge» species can be considered.

Furthermore, I found the wording used for describing the datasets utilized in similarity searches confusing. How many species were used for the phylostratographic mapping and how many of them were metazoan?
In total, in addition to the P. reticulata, 139 species were included in the analysis, of which 44 were representatives of the Metazoa. We not only have provided this information in the Methods but have also included a phylogenetic tree of all species considered in the Supplementary materials. Thank you very much for pointing out to us that we missed this important information.

A rich resource of results was provided by the authors. With the descriptions provided the additional data is easily interpretable. At the moment the analyses conducted in this study are irreproducible without the custom scripts used by the authors. These should be made available for the community who wishes to replicate the study or to further investigate these intriguing parasitic barnacles.
All custom Python and R scripts used in the study are publicly available: https://github.com/maxnest/From_head_to_rootlet
Competing Interests: No competing interests were disclosed Close
Report a concern

Comments on this article Comments (0)

Version 2

VERSION 2 PUBLISHED 27 May 2022

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 2 (revision) 09 Jan 23
Version 1 27 May 22	read	read

Andreas Hejnol, Sars International Centre for Marine Molecular Biology, University of Bergen, Thormøhlensgate 55, 5006, Bergen, Norway, University of Bergen, Norway; University of Jena, Jena, Germany

Ferenc Kagan, University of Bergen, Bergen, Norway
Viatcheslav Ivanenko, Shenzhen MSU-BIT University, Shenzhen, China; Lomonosov Moscow State University, Moscow, Russian Federation; Russian Academy of Sciences, Moscow, Russian Federation

Comments on this article

All Comments(0)

Add a comment

Browse by related subjects

Back to all reports

Reviewer Report

28 Views

14 Nov 2022 | for Version 1

28 Views Cite this report Responses(1)

Approved

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

I cannot comment. A qualified statistician is required.
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Morphology, development, phylogeny and ecology of symbiotic crustaceans.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

09 Jan 2023

Maksim Nesterenko, Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

We thank the reviewer for their comments, which are listed below (in italics) with our responses (in bold):

The main remark concerns the fact that despite the uniqueness of many features of the biology and morphology of rhizocephalans, they are not the only highly modified crustaceans that parasitize crustaceans. In the first sentences of the manuscript or discussion, it would be appropriate to mention at least some poorly studied and highly modified copepods with root-like mouthparts that also parasitize other crustaceans. These copepods are belonging to the family Nicothoidae Dana, 1852-1853 and the synonymous family Choniostomatidae Hansen, 1886 (see Boxshall and Harrison, 19881).
You are absolutely correct in pointing out that rhizocephalans are not the only ones showing amazing body transformation among Crustacea. We have added information about parasitic copepods at the end of the second paragraph of the discussion.

Some phrases may not be very well written, but I am not a native English speaker to make edits.
Even though for both authors of the article English is not their native language, we have tried to do our best to convey our thoughts as correctly and clearly as possible. However, we have tried to improve some of the wording.

View more View less

Competing Interests

No competing interests were disclosed

Back to all reports

Reviewer Report

29 Views

27 Jun 2022 | for Version 1

Ferenc Kagan, University of Bergen, Bergen, Norway

29 Views Cite this report Responses(1)

Approved

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Yes
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Partly
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

comparative transcriptomics in development and evolution

We confirm that we have read this submission and believe that we have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Respond to this report

Responses (1)

Author Response

09 Jan 2023

Maksim Nesterenko, Laboratory of parasitic worms and protists, Zoological Institute of Russian Academy of Sciences, St Petersburg, 199034, Russian Federation

We thank the reviewer for their very helpful, clear, and pertinent remarks. Each original comment is in italics and our responses are in bold.

The authors set out to provide transcriptomic data of the parasitic barnacle Peltogaster reticulata, a timely goal as this highly specialized group of crustaceans is heavily understudied. This, with the addition of the current study, appears to change as a novel chromosome level genome assembly has been recently made available for another rhizocephalan species (Sacculina carcini). In this study different regions of the organism were selected, sequenced and their transcriptomic data were contrasted to each other. I found this approach sound to decipher the different roles of specialized body regions. I missed the description of the criteria used for separating the different body regions, especially for the internal structures.
We have added a more defined description of the way of separating different body parts in the Methods section.

The authors successfully recognized and removed the challenge of biological contamination of the data. A thorough and careful preprocessing of the data was performed. The generation of a high-quality transcriptome assembly is pivotal for any project with similar aims. Therefore, the authors spend a tremendous amount of time assuring the quality of the assembly and annotation. I liked the attention to the details during these steps. With several filtering criteria, they arrive at a set of contigs which are plausible. From the quality assessment, I missed the report of the full BUSCO scores (if BUSCO scores are the reported scores), not just the single-copy ortholog and duplicate, but also fragmented and missing proportions.
We refined the results of the comparison against the database of single-copy orthologs by indicating the percentages of orthologs that were duplicated, fragmented, or missing from our dataset. For example: “The comparison with a single-copy metazoan orthologues database revealed that in the reference protein set of P. reticulata 75.6% of the orthologues were assembled completely (Single: 70.4%; Duplicated: 5.2%), whereas 3.4% and 21% of the sequences were fragmented or absent, respectively.”

Furthermore, with such strong filtering steps, valid data is also discarded in parallel with noise. To alleviate this problem generally multiple assemblers with multiple k-mer sizes are utilized and redundancy across the assemblies is dealt with downstream of the assembly (eg. DOI: 10.7717/peerj.5428 or https://doi.org/10.111/1755-0998.13593).[ref-1],[ref-2] This approach in my experience will usually greatly improve the assemblies, an improvement which might marginally benefit the authors of this study.
We fully agree that the proposed method can indeed improve assembling results. Moreover, using multiple assemblers and re-running programs with different k-mer lengths was considered as a possible strategy in the early stages of planning a general analysis scheme based on the results presented in the TransRate publication (DOI: 10.1101/gr.196469.115). However, for the initial assembly, we opted for the Trinity program, which shows excellent assembly results even with default parameters. Now we are starting work on a new article, which will include new data, with the help of which we plan to improve the assembly presented in the current manuscript and refine the results obtained on its basis. As part of the new study, we also plan to test the approach you indicated in order to obtain the most complete and high-quality assembly of the transcriptome.

Next, the focus was shifted to quantifying the expression of genes in the body regions and finding differentially expressed genes among them. Gene ontological enrichment was performed on different sets of genes (molecular signature set, overexpressed set and different phylostratographic sets). The output of these analyses with the addition of histological data prompted the authors to hypothesize that female germ cells are formed in the internal trunk region and migrate from here to the externa part. Indeed, such results combined with previous reports hint towards this possibility, something which could be tested with an in situ hybridization for germ cell marker genes such as for example vasa or nanos (both of which are present in the assembled transcriptome).
Soon we are going to test our hypothesis by an in situ hybridization for germ cell marker genes. However, we plan to publish the results obtained in a separate article.

Using various in silico methods the authors outlined a set of potential excretory/secretory proteins. Several uncharacterized proteins are retrieved from this identification method, some of which could potentially be involved at the interface between parasite and host, as suggested by the authors. Unfortunately, without further validation, this suggestion persists as a testable hypothesis.
We fully agree that the presented results are preliminary and are aimed primarily at narrowing the range of potential targets for a more detailed study. Now we are working on it with the use of a proteomic approach and hope to publish the results soon in our new publications.

Results from the phylostratographic analyses emphasize further the unique biology of parasitic barnacles. The interpretation of the results by the authors was legitimate. My concern with such approaches is their sensitivity to the BLAST algorithm (see a thorough description of the issue here: https://drostlab.github.io/myTAI/articles/Phylostratigraphy.html)
Before starting the phylostratigraphic analysis, we carefully reviewed various materials, including the ones you mentioned. So, for example, according to this publication (https://doi.org/10.1093/molbev/msw284) BLAST is an appropriate and sufficiently sensitive tool in phylostratigraphic analysis that does not appear to introduce significant biases into evolutionary pattern inferences. Nevertheless, we agree that more complex comparisons of different methods are required to determine how correctly they detect distant homologs. By the way, as one of the promising directions for improving the strategy for phylostrata identification the Leapfrog method (doi:10.1101/gr.216226.116) proposed by your scientific group and based on the inclusion of «bridge» species can be considered.

Furthermore, I found the wording used for describing the datasets utilized in similarity searches confusing. How many species were used for the phylostratographic mapping and how many of them were metazoan?
In total, in addition to the P. reticulata, 139 species were included in the analysis, of which 44 were representatives of the Metazoa. We not only have provided this information in the Methods but have also included a phylogenetic tree of all species considered in the Supplementary materials. Thank you very much for pointing out to us that we missed this important information.

A rich resource of results was provided by the authors. With the descriptions provided the additional data is easily interpretable. At the moment the analyses conducted in this study are irreproducible without the custom scripts used by the authors. These should be made available for the community who wishes to replicate the study or to further investigate these intriguing parasitic barnacles.
All custom Python and R scripts used in the study are publicly available: https://github.com/maxnest/From_head_to_rootlet

View more View less

Competing Interests

No competing interests were disclosed

Alongside their report, reviewers assign a status to the article:

Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested

Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.

Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions

[1] 1. Bresciani J, Høeg JT: Comparative ultrastructure of the root system in rhizocephalan barnacles (Crustacea: Cirripedia: Rhizocephala). J Morphol. 2001; 249(1): 9–42. PubMed Abstract | Publisher Full Text

[2] 2. Høeg JT: The biology and life cycle of the Rhizocephala (Cirripedia). J Mar Biol Assoc United Kingdom. 1995; 75(3): 517–50. Publisher Full Text

[3] 3. Miroliubov A, Borisenko I, Nesterenko M, et al.: Specialized structures on the border between rhizocephalan parasites and their host’s nervous system reveal potential sites for host-parasite interactions. Sci Rep. 2020; 10(1): 1128. PubMed Abstract | Publisher Full Text | Free Full Text

[4] 4. Lianguzova AD, Ilyutkin SA, Korn OM, et al.: Specialised rootlets of Sacculina pilosella. (Rhizocephala: Sacculinidae) used for interactions with its host’s nervous system. Arthropod Struct Dev. 2021; 60: 101009. PubMed Abstract | Publisher Full Text

[5] 5. Høeg JT, Lützen J: Life cycle and reproduction in the Cirripedia, Rhizocephala. Oceanogr Mar Biol an Annu Rev. 1995; 33: 427–85. Reference Source

[6] 6. Lützen J, Du PT: Three colonial rhizocephalans from mantis shrimps and a crab in Vietnam, including Pottsia serenei, new species (Cirripedia: Rhizocephala: Thompsoniidae). J Crustac Biol. 1999; 19(4): 902–7. Publisher Full Text

[7] 7. Høeg JT, Lützen J: Comparative morphology and phylogeny of the family Thompsoniidae (Cirripedia, Rhizocephala, Akentrogonida), with descriptions of three new genera and seven new species. Zool Scr. 1993; 22(4): 363–86. Publisher Full Text

[8] 8. Glenner H: Cypris metamorphosis, injection and earliest internal development of the Rhizocephalan Loxothylacus panopaei. (Gissler). Crustacea: Cirripedia: Rhizocephala: Sacculinidae. J Morphol. 2001; 249(1): 43–75. PubMed Abstract | Publisher Full Text

[9] 9. Miroliubov AA, Borisenko IE, Nesterenko MA, et al.: Muscular system in the interna of Polyascus polygenea. and Sacculina pilosella. (Cirripedia: Rhizocephala: Sacculinidae). Invertebr Zool. 2019; 16(1): 48–56. Publisher Full Text

[10] 10. Alvarez F, Hines AH, Reaka-Kudla ML: The effects of parasitism by the barnacle Loxothylacus panopaei (Gissler) (Cirripedia: Rhizocephala) on growth and survival of the host crab Rhithropanopeus harrisii (Gould) (Brachyura: Xanthidae). J Exp Mar Bio Ecol. 1995; 192(2): 221–32. Publisher Full Text

[11] 11. Bishop RK, Cannon LRG: Morbid behaviour of the commercial sand crab, Portunus pelagicus (L.), parasitized by Sacculina granifera Boschma, 1973 (Cirripedia: Rhizocephala). J Fish Dis. 1979; 2(2): 131–44. Publisher Full Text

[12] 12. Innocenti G, Pinter N, Galil BS: Observations on the agonistic behavior of the swimming crab Charybdis longicollis Leene infected by the rhizocephalan barnacle Heterosaccus dollfusi Boschma. Can J Zool. 2003; 81(1): 173–6. Publisher Full Text

[13] 13. Vázquez-lópez H, Alvarez F, Franco J, et al.: Observations on the Behavior of the Dark Crab Callinectes rathbunae Contreras Parasitized with the Rhizocephalan Loxothylacus texanus Boschma. Int J Zool Res. 2006; 2(4): 344–53. Publisher Full Text

[14] 14. Larsen MH, Høeg JT, Mouritsen KN: Influence of infection by Sacculina carcini (Cirripedia, Rhizocephala) on consumption rate and prey size selection in the shore crab Carcinus maenas. J Exp Mar Bio Ecol. 2013; 446: 209–15. Publisher Full Text

[15] 15. Vásquez-López H: Affectation of Swimming Capacity in Callinectes rathbunae (Crustacea: Brachyura) Caused by Loxothylacus Texanus (Crustacea: Rhizocephala). Res J Fish Hydrobiol. 2010; 5(2): 76–80. Reference Source

[16] 16. Toscano BJ, Newsome B, Griffen BD: Parasite modification of predator functional response. Oecologia. 2014; 175(1): 345–52. PubMed Abstract | Publisher Full Text

[17] 17. Alvarez F, Alcaraz G, Robles R: Osmoregulatory disturbances induced by the parasitic barnacle Loxothylacus texanus (Rhizocephala) in the crab Callinectes rathbunae (Portunidae). J Exp Mar Bio Ecol. 2002; 278(2): 135–40. Publisher Full Text

[18] 18. Zacher LS, Horstmann L, Hardy SM: A field-based study of metabolites in sacculinized king crabs Paralithodes camtschaticus (Tilesius, 1815) and Lithodes aequispinus Benedict, 1895 (Decapoda: Anomura: Lithodidae). J Crustac Biol. 2018; 38(6): 794–803. Publisher Full Text

[19] 19. Takahashi T, Iwashige A, Matsuura S: Behavioral manipulation of the shore crab, Hemigrapsus sanguineus by the rhizocephalan barnacle, Sacculina polygenea. Crustacean Research. 1997; 26: 153–61. Publisher Full Text

[20] 20. Belgrad BA, Griffen BD: Rhizocephalan infection modifies host food consumption by reducing host activity levels. J Exp Mar Bio Ecol. 2015; 466: 70–5. Publisher Full Text

[21] 21. Fuchs B, Wang W, Graspeuntner S, et al.: Regulation of Polyp-to-Jellyfish Transition in Aurelia aurita. Curr Biol. 2014; 24(3): 263–73. PubMed Abstract | Publisher Full Text

[22] 22. Nesterenko M, Starunov V, Shchenkov S, et al.: Molecular signatures of the rediae, cercariae and adult stages in the complex life cycles of parasitic flatworms (Digenea: Psilostomatidae). Parasit Vectors. 2020; 13(1): 559. PubMed Abstract | Publisher Full Text | Free Full Text

[23] 23. Torruella G, Grau-Bové X, Moreira D, et al.: Global transcriptome analysis of the aphelid Paraphelidium tribonemae supports the phagotrophic origin of fungi. Commun Biol. 2018; 1(1): 231. PubMed Abstract | Publisher Full Text | Free Full Text

[24] 24. Almudi I, Vizueta J, Wyatt CDR, et al.: Genomic adaptations to aquatic and aerial life in mayflies and the origin of insect wings. Nat Commun. 2020; 11(1): 2631. PubMed Abstract | Publisher Full Text | Free Full Text

[25] 25. Wang J, Zhang L, Lian S, et al.: Evolutionary transcriptomics of metazoan biphasic life cycle supports a single intercalation origin of metazoan larvae. Nat Ecol Evol. 2020; 4(5): 725–36. PubMed Abstract | Publisher Full Text

[26] 26. Isaeva VV, Akhmadieva AV, Shukalyuk AI: The hidden coloniality at the parasitic stage in Peltogaster reticulatus (Crustacea: Rhizocephala). J Mar Biol Assoc United Kingdom. 2012; 92(3): 457–62. Publisher Full Text

[27] 27. Miroliubov AA: Muscular system in interna of Peltogaster paguri (Rhizocephala: Peltogastridae). Arthropod Struct Dev. 2017; 46(2): 230–5. PubMed Abstract | Publisher Full Text

[28] 28. Allam A, Kalnis P, Solovyev V: Karect: accurate correction of substitution, insertion and deletion errors for next-generation sequencing data. Bioinformatics. 2015; 31(21): 3421–8. PubMed Abstract | Publisher Full Text

[29] 29. Grabherr MG, Haas BJ, Yassour M, et al.: Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29(7): 644–52. PubMed Abstract | Publisher Full Text | Free Full Text

[30] 30. Fu L, Niu B, Zhu Z, et al.: CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012; 28(23): 3150–2. PubMed Abstract | Publisher Full Text | Free Full Text

[31] 31. Smith-Unna R, Boursnell C, Patro R, et al.: TransRate: reference-free quality assessment of de novo transcriptome assemblies. Genome Res. 2016; 26(8): 1134–44. PubMed Abstract | Publisher Full Text | Free Full Text

[32] 32. Lagesen K, Hallin P, Rødland EA, et al.: RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007; 35(9): 3100–8. PubMed Abstract | Publisher Full Text | Free Full Text

[33] 33. Camacho C, Coulouris G, Avagyan V, et al.: BLAST+: Architecture and applications. BMC Bioinformatics. 2009; 10(1): 421. PubMed Abstract | Publisher Full Text | Free Full Text

[34] 34. Lafond-Lapalme J, Duceppe MO, Wang S, et al.: A new method for decontamination of de novo transcriptomes using a hierarchical clustering algorithm. Bioinformatics. 2017; 33(9): 1293–300. PubMed Abstract | Publisher Full Text

[35] 35. Langmead B, Salzberg SL: Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9(4): 357–9. PubMed Abstract | Publisher Full Text | Free Full Text

[36] 36. Patro R, Duggal G, Love MI, et al.: Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods. 2017; 14(4): 417–9. PubMed Abstract | Publisher Full Text | Free Full Text

[37] 37. Wagner P, Kin K, Lynch VJ: Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. 2012; 131(4): 281–5. PubMed Abstract | Publisher Full Text

[38] 38. Buchfink B, Xie C, Huson DH: Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015; 12(1): 59–60. PubMed Abstract | Publisher Full Text

[39] 39. El-Gebali S, Mistry J, Bateman A, et al.: The Pfam protein families database in 2019. Nucleic Acids Res. 2019; 47(D1): D427–32. PubMed Abstract | Publisher Full Text | Free Full Text

[40] 40. Simão FA, Waterhouse RM, Ioannidis P, et al.: BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015; 31(19): 3210–2. PubMed Abstract | Publisher Full Text

[41] 41. Waterhouse RM, Seppey M, Simão FA, et al.: BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. 2018; 35(3): 543–8. PubMed Abstract | Publisher Full Text | Free Full Text

[42] 42. The UniProt Consortium: UniProt: The universal protein knowledgebase. Nucleic Acids Res. 2017; 45(D1): D158–69. PubMed Abstract | Publisher Full Text | Free Full Text

[43] 43. Bateman A, Martin MJ, Orchard S, et al.: UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2021; 49(D1): D480–9. PubMed Abstract | Publisher Full Text | Free Full Text

[44] 44. Shah N, Nute MG, Warnow T, et al.: Misunderstood parameter of NCBI BLAST impacts the correctness of bioinformatics workflows. Bioinformatics. 2019; 35(9): 1613–1614. PubMed Abstract | Publisher Full Text

[45] 45. Cantalapiedra CP, Hernández-Plaza A, Letunic I, et al.: eggNOG-mapper v2: Functional Annotation, Orthology Assignments, and Domain Prediction at the Metagenomic Scale. Mol Biol Evol. 2021; 38(12): 5825–5829. PubMed Abstract | Publisher Full Text | Free Full Text

[46] 46. Altenhoff AM, Levy J, Zarowiecki M, et al.: OMA standalone: Orthology inference among public and custom genomes and transcriptomes. Genome Res. 2019; 29(7): 1152–63. PubMed Abstract | Publisher Full Text | Free Full Text

[47] 47. Dylus D, Nevers Y, Altenhoff AM, et al.: How to build phylogenetic species trees with OMA [version 2; peer review: 2 approved]. F1000Res. 2020; 9: 511. Publisher Full Text

[48] 48. Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol Biol Evol. 2013; 30(4): 772–80. PubMed Abstract | Publisher Full Text | Free Full Text

[49] 49. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T: trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009; 25(15): 1972–1973. PubMed Abstract | Publisher Full Text | Free Full Text

[50] 50. Guindon S, Gascuel O: A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. Syst Biol. 2003; 52(5): 696–704. PubMed Abstract | Publisher Full Text

[51] 51. Darriba D, Taboada GL, Doallo R, et al.: ProtTest 3: Fast selection of best-fit models of protein evolution. Bioinformatics. 2011; 27(8): 1164–1165. PubMed Abstract | Publisher Full Text | Free Full Text

[52] 52. Nguyen LT, Schmidt HA, Von Haeseler A, et al.: IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015; 32(1): 268–74. PubMed Abstract | Publisher Full Text | Free Full Text

[53] 53. Minh BQ, Schmidt HA, Chernomor O, et al.: IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020; 37(5): 1530–1534. PubMed Abstract | Publisher Full Text | Free Full Text

[54] 54. Paradis E, Schliep K: ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019; 35(3): 526–8. PubMed Abstract | Publisher Full Text

[55] 55. Wagner GP, Kin K, Lynch VJ: A model based criterion for gene expression calls using RNA-seq data. Theory Biosci. 2013; 132(3): 159–64. PubMed Abstract | Publisher Full Text

[56] 56. Zambelli F, Mastropasqua F, Picardi E, et al.: RNentropy: An entropy-based tool for the detection of significant variation of gene expression across multiple RNA-Seq experiments. Nucleic Acids Res. 2018; 46(8): e46. PubMed Abstract | Publisher Full Text | Free Full Text

[57] 57. Heberle H, Meirelles GV, da Silva FR, et al.: InteractiVenn: A web-based tool for the analysis of sets through Venn diagrams. BMC Bioinformatics. 2015; 16(1): 169. PubMed Abstract | Publisher Full Text | Free Full Text

[58] 58. Garg G, Ranganathan S: In silico secretome analysis approach for next generation sequencing transcriptomic data. BMC Genomics. 2011; 12 Suppl 3(Suppl 3): S14. PubMed Abstract | Publisher Full Text | Free Full Text

[59] 59. Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, et al.: SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat Biotechnol. 2019; 37(4): 420–3. PubMed Abstract | Publisher Full Text

[60] 60. Bendtsen JD, Jensen LJ, Blom N, et al.: Feature-based prediction of non-classical and leaderless protein secretion. Protein Eng Des Sel. 2004; 17(4): 349–56. PubMed Abstract | Publisher Full Text

[61] 61. Emanuelsson O, Nielsen H, Brunak S, et al.: Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol. 2000; 300(4): 1005–16. PubMed Abstract | Publisher Full Text

[62] 62. Krogh A, Larsson B, von Heijne G, et al.: Predicting Transmembrane Protein Topology with a Hidden Markov Model: Application to Complete Genomes. J Mol Biol. 2001; 305(3): 567–80. PubMed Abstract | Publisher Full Text

[63] 63. Miele V, Penel S, Duret L: Ultra-fast sequence clustering from similarity networks with SiLiX. BMC Bioinformatics. 2011; 12(1): 116. PubMed Abstract | Publisher Full Text | Free Full Text

[64] 64. Wang Y, Wang M, Yin S, et al.: NeuroPep: A comprehensive resource of neuropeptides. Database (Oxford). 2015; 2015: bav038. PubMed Abstract | Publisher Full Text | Free Full Text

[65] 65. Meinken J, Walker G, Cooper CR, et al.: MetazSecKB: The human and animal secretome and subcellular proteome knowledgebase. Database (Oxford). 2015; 2015: bav077. PubMed Abstract | Publisher Full Text | Free Full Text

[66] 66. Arendsee Z, Li J, Singh U, et al.: Phylostratr: A framework for phylostratigraphy. Bioinformatics. 2019; 35(19): 3617–27. PubMed Abstract | Publisher Full Text

[67] 67. Nesterenko M: Figure S1. Phylogenetic tree of species whose data were used for phylostratigraphic analysis of the Peltogaster reticulata gene set. figshare. Figure. 2022. http://www.doi.org/10.6084/m9.figshare.21747554.v1

[68] 68. Drost HG, Gabel A, Liu J, et al.: MyTAI: Evolutionary transcriptomics with R. Bioinformatics. 2018; 34(9): 1589–90. PubMed Abstract | Publisher Full Text | Free Full Text

[69] 69. Schindelin J, Arganda-Carreras I, Frise E, et al.: Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012; 9(7): 676–82. PubMed Abstract | Publisher Full Text | Free Full Text

[70] 70. Nesterenko M: Table S1. Summary of paired-end read libraries preparation results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307486.v1

[71] 71. Nesterenko M: Table S2. Peltogaster reticulata reference sequence set annotation results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307516.v1

[72] 72. Nesterenko M: Figure S2. Phylogenetic relationships between the crustacean species based on orthologs analysis results. figshare. Figure. 2022. http://www.doi.org/10.6084/m9.figshare.19307444.v4

[73] 73. Nesterenko M: Table S3. Gene expression quantification and analysis results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307549.v1

[74] 74. Nesterenko M: Table S4. Gene Set Enrichment Analysis (GSEA) results for the molecular signatures and sets of over-expressed genes. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307570.v1

[75] 75. Nesterenko M: Table S5. Potential excretory/secretory proteins analysis results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307594.v1

[76] 76. Domazet-Loso T, Brajković J, Tautz D: A phylostratigraphy approach to uncover the genomic history of major adaptations in metazoan lineages. Trends Genet. 2007; 23(11): 533–9. PubMed Abstract | Publisher Full Text

[77] 77. Nesterenko M: Table S6. Phylostratigraphic affiliation analysis results for different set of sequences. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307606.v1

[78] 78. Domazet-Lošo T, Tautz D: A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns. Nature. 2010; 468(7325): 815–8. PubMed Abstract | Publisher Full Text

[79] 79. Nesterenko M: Table S7. Evolutionary transcriptomics results. figshare. Journal contribution. 2022. http://www.doi.org/10.6084/m9.figshare.19307621.v1

[80] 80. Chan BKK, Dreyer N, Gale AS, et al.: The evolutionary diversity of barnacles, with an updated classification of fossil and living forms. Zool J Linn Soc. 2021; 193(3): 789–846. Publisher Full Text

[81] 81. Boxshall GA, Harrison K: New nicothoid copepods (Copepoda: Siphonostomatoida) from an amphipod and from deep-sea isopods. Bulletin of the British Museum (Natural History) Zoology. London, BM(NH); 1988; 54: 285–99. Publisher Full Text

[82] 82. Lamb EJ, Boxshall GA, Mill PJ, et al.: Nucellicolidae: A New Family of Endoparasitic Copepods (Poecilostomatoida) from the Dog Whelk Nucella lapillus (Gastropoda). J Crustac Biol. 1996; 16(1): 142-148. Publisher Full Text

[83] 83. Miroliubov AA, Lianguzova AD, Ilyutkin SA, et al.: The interna of the rhizocephalan Peltogaster reticulata: Comparative morphology and ultrastructure. Arthropod Struct Dev. 2022; 70: 101190. PubMed Abstract | Publisher Full Text

[84] 84. Shirley SM, Shirley TC, Meyers TR: Hemolymph responses of Alaskan king crabs to rhizocephalan parasitism. Can J Zool. 1986; 64(8): 1774–81. Publisher Full Text

[85] 85. Shukalyuk A, Isaeva V, Kizilova E, et al.: Stem cells in the reproductive strategy of colonial rhizocephalan crustaceans (Crustacea: Cirripedia: Rhizocephala). Invertebr Reprod Dev. 2005; 48(1–3): 41–53. Publisher Full Text

[86] 86. Isaeva VV, Shukalyuk AI, Trofimova AV, et al.: The structure of colonial interna in Sacculina polygenea (Crustacea: Cirripedia: Rhizocephala). Crustacean Research. 2001; 30: 133–46. Publisher Full Text

[87] 87. Glenner H, Høeg JT: A Scenario for the Evolution of the Rhizocephala. In: Escobar-Briones E, Alvarez F, editors. Modern Approaches to the Study of Crustacea. Springer, Boston, MA; 2002; 301–10. Publisher Full Text

[88] 88. Orgogozo V, Morizot B, Martin A: The differential view of genotype-phenotype relationships. Front Genet. 2015; 6: 179. PubMed Abstract | Publisher Full Text | Free Full Text

[89] 89. Glenner H, Hebsgaard MB: Phylogeny and evolution of life history strategies of the Parasitic Barnacles (Crustacea, Cirripedia, Rhizocephala). Mol Phylogenet Evol. 2006; 41(3): 528–38. PubMed Abstract | Publisher Full Text

From head to rootlet: comparative transcriptomic analysis of a rhizocephalan barnacle Peltogaster reticulata (Crustacea: Rhizocephala)

Abstract

Keywords

Revised Amendments from Version 1

Introduction

Methods

Sampling

Figure 1. Molecular signatures of female Peltogaster reticulata.

Preparation of reads libraries and de novo transcriptome assembly

Removal of potential contamination

Quantification of gene expression levels and identification of encoded amino acid sequences

Reference gene set preparation

Sequence annotation

Identification of orthogroups

Reference gene set expression analysis

Multidimensional scaling

Potential excretory/secretory proteins (ESP) identification and analysis

Gene set enrichment analysis (GSEA)

Phylostratigraphy and Transcriptome Age Index (TAI) measuring

Histology

Confocal laser scanning microscopy (CLSM)

Scanning electron microscope (SEM)

Results

The de novo assembled transcriptome was characterized by high quality and completeness of assembly

Peltogaster reticulata reference transcriptome included transcripts of 12,620 protein-encoding genes

Most of the sequences from the reference sets were annotated successfully

The proteins set of P. reticulata was similar to the reference proteome of another cirripede barnacle, Amphibalanus amphitrite.

The externa was clustered separately from the interna based on the gene expression analysis results

GSEA results and histological studies indicate the presence of germ-like cells in the interna

Figure 2. Gene set enrichment analysis (GSEA) results for the gene sets.

Hundreds of genes encoding potential excretory/secretory proteins (ESP) were identified

Figure 3. Gene set enrichment analysis (GSEA) results for the identified sets of potential excretory/secretory proteins (ESP).

Significant differences between the TAI of body parts were revealed

Figure 4. Phylostratigraphy and evolutionary transcriptomics results.

Figure 5. Gene set enrichment analysis (GSEA) results for top-500 annotated genes with the largest contribution to Transcriptome Age Index (TAI).

Discussion

Data availability

Underlying data

Extended data

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated