A glimpse into the epigenetic landscape of gene regulation

https://doi.org/10.1016/j.gde.2007.12.005Get rights and content

Post-translational modifications to histone proteins and methylation of DNA comprise the epigenome of a cell. The epigenome, which changes through development, controls access to our genes. Recent advances in DNA sequencing technology has led to genome-wide distribution data for a limited number of histone modifications in mammalian stem cells and some differentiated lineages. These studies reveal predictive correlations between histone modifications, different classes of gene and chromosomal features. Moreover, this glimpse into our epigenome challenges current ideas about regulation of gene expression. Many genes in stem cells are poised for expression with initiated RNA polymerase II at the promoter. This state is maintained by an epigenetic mark through multiple lineages until the gene is expressed.

Introduction

Histone proteins are subject to four major post-translational modifications: methylation of arginine and lysine residues, serine phosphorylation, lysine acetylation and lysine ubiquitylation [1, 2]. Early studies in Drosophila, budding yeast and fission yeast gave the first indication, now confirmed in mammals, that lysine (K) methylation (me) and acetylation (ac) show functionally relevant and distinct associations with repressed or active chromatin. This review explores genome-wide distributions for these modifications associated with repressed or active chromatin in pluripotent and differentiated mammalian cells.

Section snippets

Techniques to study histone modifications

Two techniques are used to characterise histone modifications: mass spectrometry and chromatin immunoprecipitation (ChIP). Mass spectrometry of histone proteins can be used to detect novel modifications, to determine whether two modifications occupy the same histone tail [3], to show interdependency between modifications [4] and to compare the range of modifications in different organisms [5]. Mass spectrometry, however, gives no information on where a modified histone is found in the genome.

Lysine acetylation defines promoters, enhancers and active chromatin

Acetylation neutralises the positive charge on the ɛ amino group of lysine and acts as a binding site for proteins containing a bromodomain [6]. Although the data on genome-wide distributions for individual acetylated lysine residues lags far behind that for lysine methylation, acetylation is generally associated with active or decondensed chromatin [7, 8, 9]. In human T-cells, there are over 45 000 acetylation islands rich in H3K9acK14ac, many of which correspond to transcriptional regulatory

Lysine methylation

Patterns of lysine methylation are more complex than acetylation. Each of the three different methylated forms of any particular lysine (me1, me2 and me3) is a functionally distinct mark capable of specific interactions with domains such as PHD, chromo, MBT or Tudor on histone-binding proteins [15]. These interactions play a key role in transducing the pattern of modifications into a functional outcome [6].

Lysine methylation at enhancers and promoters

There are clear functional links between methylation and acetylation [3, 16, 17]. Acetylated nucleosomes at promoters and enhancers in mammalian cells are also methylated at several positions. H3K4me1 was the first methyl mark shown to be enriched at enhancers in HeLa cells [13]. ChIP seq in human T-cells has allowed this initial observation to be refined and reveals a general pattern of methylation at enhancers, DNAase I hypersensitive sites, CTCF insulators and some chromosome breakpoints [18

Lysine methylation and repressed chromatin

There are three modifications, K9me3, K20me3 and K27me3, which are associated with repressed chromatin in many organisms. High-resolution ChIP seq in different mammalian cells gives a genome-wide snapshot of the modifications and reveals distinct patterns that reflect chromosome organisation [18•, 19••] (Figure 2).

H3K9me3 and H4K20me3 mark silent imprinted genes, silent clustered gene repeats, non-expressed pseudogenes, centromeres and telomeres

High levels of H3K9me3 and H4K20me3 show strong enrichment at telomeric, satellite and long terminal repeats (LTRs) as well as clustered silent genes such as the ZNF repeats [18•, 19••] (Figure 2a). H3K27me3 is excluded from these regions. H3K9me3 and H3K20me3 also appear together as localised foci on the promoters of silent imprinted genes and non-expressed pseudogenes within a large cluster of imprinted genes found within a 250 kb region in mouse embryo fibroblasts [14] (Figure 2d). Thus, the

Tissue-specific and developmentally regulated silent genes are marked with H3K27me3

In mouse embryonic fibroblasts, the Igf2r imprinted gene cluster, marked with H3K9me3 and H3K20me3 is interspersed with a large domain of tissue-specific silent genes and several expressed genes [14]. The domain of tissue-specific silent genes is marked with H3K27me3 alone, while the expressed genes are marked, as expected, at promoters and regulatory sequences with H3K4me3, H3K4me2 and H3K9ac. Interestingly, there are smooth transitions between the different chromatin marks with no evidence

Non-coding RNAs are implicated in silencing within discrete domains

ncRNAs are implicated in silencing both H3K9me3/H3K20me3 marked chromatin and H3K27me3 marked chromatin [22, 23, 24•]. ncRNA-based sequence-specific interactions in cis and even in trans could demarcate silent and active chromatin [25•, 26, 27]. A ncRNA-based system could discriminate different modifying activities and limit their site of action to homologous sequences. ncRNAs in association with the PRC2 complex will deposit H3K27me3 at homologous sequences, while Ago-containing complexes such

Both active (H3K4me3) and repressive marks (H3K9me3) at gene loci define imprinted regions or regulatory non-coding transcripts

One of the most puzzling observations to appear in the literature over the past year or so is that of genes marked with both activating (H3K4me3) and repressing modifications (either H3K9me3 with K20me3 or H3K27me3). One explanation for this is allelic difference, for example, an imprinted locus showing differential expression where one allele is repressed and the other active [19••, 28] (Figure 2d). In the example illustrated in Figure 2d both repressive and active marks show discrete foci on

Both active (H3K4me3) and repressive marks (H3K27me3) define poised but silent genes with initiated RNA polymerase II at the promoter

Genes marked with H3K4me3 and H3K27me3 (the so-called ‘bivalent’ modification) have been reported by a number of groups [3, 11, 18•, 30, 31•, 32•, 33•, 34]. This type of bivalent mark is particularly associated with the pluripotency of embryonic stem cells [31•, 32•, 33•, 34] and the PRC2 H3K27me3 HMTase complex [35], but is also found in differentiated cells [11, 18•, 30] and ciliates [3]. Genes carrying bivalent marks often have complex expression patterns and include key developmental

Retention and resolution of the bivalent mark during differentiation

A detailed analysis correlating the bivalent mark with gene expression in mouse stem cells, neural progenitors, mouse embryonic fibroblasts and adult cells derived from these lineages, revealed lineage-specific resolution and retention of the bivalent mark [19••]. Genes not destined to be expressed in these cell types resolve to the silent state marked with either H3K27me3 or no mark (Figure 2c). By contrast, genes actually expressed resolve to H3K4me3 alone while those destined to be expressed

DNA sequence at promoters correlates with the nature of the epigenetic mark

The nature of the underlying DNA sequence also influences how a gene is epigenetically marked [19••]. Genes with promoters rich in CG dinucleotides (CpG-rich) are almost always marked with H3K4me3 whether they are expressed or not and are enriched in the class of genes showing the bivalent mark and control at the post-initiation stage of transcription. CpG-poor promoters in contrast are only marked with H3K4me3 when expressed, whatever the cell type, and virtually none have the bivalent mark.

Bivalent marks are resolved using H3K27me3 demethylases

This new work relating the epigenome to expression profiles predicts that genes with the bivalent mark will require an H3K27me3 demethylase for expression (Figure 2e). Several recent reports link H3K27me3 demethylases such as UTX and JMJD3 to demethylation of HOX genes during the transition from a pluripotent state to a differentiated state or activation of macrophages by inflammatory stimuli [36•, 37•, 38•, 39•]. In both examples, the H3K27me3 demethylase that resolves the bivalent state is

Conclusions and perspectives

The work reviewed here shows clearly that post-recruitment or post-initiation regulatory mechanisms, maintained epigenetically through many generations, are likely to be as important as activation-coupled recruitment of RNA polymerase II in controlling gene expression. It is difficult to gauge the generality of post-initiation control of gene transcription in other eukaryotes. There are, however, reports that many thousands of genes in Drosophila have poised polymerase at their promoters [40].

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgement

Work in the author's laboratory is supported by The Wellcome Trust.

References (44)

  • V. Azuara et al.

    Chromatin signatures of pluripotent cell lines

    Nat Cell Biol

    (2006)
  • J. Muller et al.

    Histone methyltransferase activity of a Drosophila Polycomb group repressor complex

    Cell

    (2002)
  • S.L. Berger

    The complex language of chromatin regulation during transcription

    Nature

    (2007)
  • S.D. Taverna et al.

    Long-distance combinatorial linkage between methylation and acetylation on histone H3 N termini

    Proc Natl Acad Sci U S A

    (2007)
  • B.A. Garcia et al.

    Pervasive combinatorial modification of histone H3 in human cells

    Nat Methods

    (2007)
  • S.D. Taverna et al.

    How chromatin-binding modules interpret histone modifications: lessons from professional pocket pickers

    Nat Struct Mol Biol

    (2007)
  • C.L. Liu et al.

    Single-nucleosome mapping of histone modifications in S. cerevisiae

    PLoS Biol

    (2005)
  • M. Shogren-Knaak et al.

    Histone H4-K16 acetylation controls chromatin structure and protein interactions

    Science

    (2006)
  • T.Y. Roh et al.

    Genome-wide prediction of conserved and nonconserved enhancers by histone acetylation patterns

    Genome Res

    (2007)
  • T.Y. Roh et al.

    The genomic landscape of histone modifications in human T cells

    Proc Natl Acad Sci U S A

    (2006)
  • N.D. Heintzman et al.

    Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome

    Nat Genet

    (2007)
  • K. Regha et al.

    Active and repressive chromatin are interspersed without spreading in an imprinted gene cluster in the mammalian genome

    Mol Cell

    (2007)
  • Cited by (54)

    • Epigenetic regulation of inflammation in insulin resistance

      2024, Seminars in Cell and Developmental Biology
      Citation Excerpt :

      For example, the methylation of lysine 9 in the silent DNA on histone H3 (H3K9me) is widespread in heterochromatin is known to inactivate the X chromosomes in women. Another example is the methylation of lysine 4 or 36 on the same histone, i.e. H3 (H3K4me, H3K36me, respectively), which acts as a marker of active genes [12,13]. In turn, other methylations of lysines 9, 27 and 20 of the histone H3 (H3K9me, H3K27me, H3K20me, respectively) correlate with inactive heterochromatin, and are hence regarded as repressive markers acting as indicators of "inactivation" [14–16].

    • Crosstalk between phospholipases and noncoding RNAs in cancer

      2023, Phospholipases in Physiology and Pathology: Volumes 1-7
    • Epigenetic biomarkers in diagnosis, prognosis, and treatment of hepatocellular carcinoma

      2022, Theranostics and Precision Medicine for the Management of Hepatocellular Carcinoma: Volume 3: Translational and Clinical Outcomes
    • Molecular Design of Compounds Targeting Histone Methyltransferases

      2016, Epi-Informatics: Discovery and Development of Small Molecule Epigenetic Drugs and Probes
    • Establishing the Architecture of Plant Gene Regulatory Networks

      2016, Methods in Enzymology
      Citation Excerpt :

      Protein–protein interactions and other modifications play key roles in the in vivo interaction of TFs with DNA, resulting in the specificity that TFs display in vivo. Superimposed on the activity of TFs is chromatin structure, which can affect the accessibility of the DNA to TFs, or to components of the basal transcriptional machinery, and histone modifications play a central role in regulation of gene expression (Mellor, Dudek, & Clynes, 2008; Peterson & Laniel, 2004). TFs often control the expression of genes encoding other TFs, resulting in the formation of complex interaction webs.

    View all citing articles on Scopus
    View full text