Genomic organization of the glutathione S-transferase family in insects

https://doi.org/10.1016/j.ympev.2011.08.027Get rights and content

Abstract

Cytosolic glutathione S-transferases (GSTs) are a large and diverse gene family in insects. They are classified into six major subclasses. Sigma, Omega, Zeta, and Theta have representatives across Metazoa while Delta and Epsilon are specific to Insecta and Holometabola, respectively. In this study, GSTs are assigned to a subclass by a combination of literature, phylogenetic, and genomic evidence. Moreover, it is confirmed that GSTs frequently cluster by genomic position as a result of recent gene expansions. These expansions are largely explained by the number of protein-coding genes in the genome, although life history is another contributing factor.

Highlights

► Insect cytosolic glutathione S-transferases confirmed as mainly members of six major subclasses. ► Epsilon subclass has early origin along endopterogote lineage. ► Delta subclass has an earlier origin in Insecta. ► Confirmed that many glutathione S-transferase genes cluster by genomic position. ► Pattern of gene expansions largely explained by the number of genes in genome.

Introduction

Glutathione S-transferases (GSTs) are a gene family of enzymes involved in detoxifying cells of natural and artificial molecules (Lumjuan et al., 2005, Ranson et al., 2000, Rogers et al., 1999, Tang and Tu, 1994). Examples include GST-D1 in Drosophila melanogaster and an unidentified number of GSTs in Pediculus humanus which are associated with resistance to insecticides (Barrios et al., 2010, Tang and Tu, 1994). GSTs mainly metabolize these substances by making them more water soluble, aiding in their excretion from the organism (Habig et al., 1974).

In insects, there are two major classes of GSTs, microsomal and cytosolic. The membrane-bound microsomal type is structurally and evolutionarily distinct from the cytosolic type (Enayati et al., 2005). The microsomal class contains few gene duplicates – such as D. melanogaster with only one member (Ranson et al., 2002, Toba and Aigaki, 2000).

This study is focused on the larger family of cytosolic GSTs. The cytosolic type has an active site of catalytic activity and typically forms a dimeric complex. These GSTs are characterized by two domains (Pfam domains PS50404 and PS50405; Hulo et al., 2006), a N-terminal domain of ∼80 amino acid sites and a C-terminal domain of ∼120 sites. The N-terminal domain contains a site which interacts with a glutathione molecule. The C-terminal domain contains a site which interacts with the substrate (review by Enayati et al. (2005)).

This highly diverse insect gene family is divided into six major subclasses: Delta, Epsilon, Sigma, Omega, Theta, and Zeta. Among the currently available genomes, the dipterans consistently have large expansions of these genes. For instance, Ding et al. (2003) identified 37 putative cytosolic GSTs in D. melanogaster and 28 in Anopheles gambiae in contrast to the hymenopteran Apis mellifera with only 11 GSTs (Claudianos et al., 2006, Honeybee Genome Sequence Consortium, 2006). Similar to the eusocial hymenopteran, the exopterogote parasite P. humanus has 11 GSTs which span five of the subclasses (Kirkness et al., 2010). However, this pattern is not a trend in exopterogotes as the free-living aphid Acyrthosiphon pisum contains 18 putative GSTs (Ramsey et al., 2010, Aphid Genomics Consortium, 2010). The current sample of insect genomes is currently insufficient to distinguish whether the GST expansion pattern is mainly driven by shared ancestry or by ecological adaptation to a niche (i.e., parasitism).

The bulk of the GST expansions are in the Delta and Epsilon subclasses. These are Insecta-specific while the other four subclasses have a broader taxonomic distribution (Low et al., 2007, Ranson et al., 2001, Ranson et al., 2002, Sawicki et al., 2003). The overall high rate of turnover of GST genes is especially apparent from studies of the 12 drosophilid genomes (Low et al., 2007, Drosophila 12 Genomes Consortium., 2007). The turnover does not occur abruptly over time but instead conforms to a somewhat uniform rate in the loss and gain of genes. This is not surprising since Hahn et al. (2007) estimated that ∼40% of drosophilid gene families differed in size among lineages and ∼20% of gene families were lost altogether in one or more lineages.

Moreover, Severson et al. (2004) suggested that gene order across 50 kilobases has a one-half probability of being conserved between D. melanogaster and Culicidae. Therefore, genetic linkage of GSTs was examined across each genome (Fig. 1). In this study, a diverse set of insect genomes was examined and expands upon previous studies of GSTs which generally were restricted to a genus-level analysis. This set includes: four Diptera, two Hymenoptera, one Coleopteran, one Lepidopteran, and two outgroup species of Exopterogota. The aim was to gather evidence across Insecta of gene duplication and linkage patterns in this large and diverse gene family.

Section snippets

Sequence retrieval

Insect protein-coding sequences were available from VectorBase (Lawson et al., 2009), FlyBase (Drysdale and the FlyBase Consortium, 2008), BeeBase (Munoz-Torres et al., 2010), SilkDB (Wang et al., 2005), Baylor College of Medicine (Savard et al., 2006), AphidBase (Legeai et al., 2010) and BeetleBase (Wang et al., 2007). This study utilized the following genome assemblies from the above databases: Aedes aegypti version AaegL1, A. gambiae v. AgamP3.5, Culex quinquefasciatus v. CpipJ1.2, A.

Results

The results include phylogenetic trees of the glutathione S-transferases (GSTs) in Insecta and a table of their linkage patterns. Fig. 2 has an overview of the six GST subclasses in D. melanogaster: Sigma, Theta, Omega, Zeta, Delta and Epsilon. The subclasses are defined by the vast biological literature which is supplemented by phylogenetic-, linkage-, and BLAST-based evidence (Altschul et al., 1997, Corona and Robinson, 2006, Ranson et al., 2001). Trees are shown of each of these subclasses

Discussion

The glutathione S-transferase (GST) trees show the birth-and-death process of genes (Nei and Rooney, 2005). This is particularly pronounced among the Delta and Epsilon subclasses (Fig. 3) where the turnover of genes in this family is consistent with their role in detoxifying cells from a varying set of chemical compounds. The other pattern of interest is the genomic clustering of these genes (Table 1). This clustering could be evidence of recent duplication or nature selecting for genomic

Acknowledgments

Three anonymous reviewers provided invaluable suggestions on nomenclature and phylogenetic methods to reliably classify genes into families.

References (56)

  • S. Barrios et al.

    Activity of increased specific and non-specific esterases and glutathione transferases associated with resistance to permethrin in Pediculus humanus capitis (phthiraptera: pediculidae) from Argentina

    Parasitol. Res.

    (2010)
  • C. Claudianos et al.

    A deficit of detoxification enzymes: pesticide sensitivity and environmental response in the honeybee

    Insect Mol. Biol.

    (2006)
  • M. Corona et al.

    Genes of the antioxidant system of the honey bee: annotation and phylogeny

    Insect Mol. Biol.

    (2006)
  • R.R. da Fonseca et al.

    Molecular evolution and the role of oxidative stress in the expansion and functional diversification of cytosolic glutathione transferases

    BMC Evol. Biol.

    (2010)
  • Y. Ding et al.

    The Anopheles gambiae glutathione transferase supergene family: annotation, phylogeny and expression profiles

    BMC Genom.

    (2003)
  • Drosophila 12 Genomes Consortium, 2007. Evolution of genes and genomes on the Drosophila phylogeny. Nature 450,...
  • R. Drysdale et al.

    FlyBase: a database for the Drosophila research community

    Methods Mol. Biol.

    (2008)
  • R.C. Edgar

    MUSCLE: multiple sequence alignment with high accuracy and high throughput

    Nucl. Acids Res.

    (2004)
  • A.A. Enayati et al.

    Insect glutathione transferases and insecticide resistance

    Insect Mol. Biol.

    (2005)
  • J. Felsenstein

    Confidence limits on phylogenies: an approach using the bootstrap

    Evolution

    (1985)
  • J. Felsenstein

    PHYLIP – phylogeny inference package (version 3.2)

    Cladistics

    (1989)
  • P. Flicek et al.

    Ensembl 2011

    Nucl. Acids Res.

    (2010)
  • A. Gattiker et al.

    ScanProsite: a reference implementation of a PROSITE scanning tool

    Appl. Bioinformatics

    (2002)
  • M.W. Gaunt et al.

    An insect molecular clock dates the origin of the insects and accords with palaeontological and biogeographic landmarks

    Mol. Biol. Evol.

    (2002)
  • M.W. Hahn et al.

    Gene family evolution across 12 Drosophila genomes

    PLoS Genet.

    (2007)
  • Honeybee Genome Sequence Consortium

    Insights into social insects from the genome of the honeybee Apis mellifera

    Nature

    (2006)
  • A.L. Hughes et al.

    Gene family content-based phylogeny of prokaryotes: the effect of criteria for inferring homology

    Syst. Biol.

    (2005)
  • N. Hulo et al.

    The PROSITE database

    Nucl. Acids Res.

    (2006)
  • Cited by (84)

    View all citing articles on Scopus
    View full text