Chromatin, Histones and Histone Modifications

The complete guide to DNA regulation by histone modifications.

Ryan Hamnett, PhD | 10^th September 2024

The structure that DNA takes in the nucleus strongly influences numerous cellular processes, including gene expression, the DNA damage response, and cell division. This structure is primarily determined by histones. Histones are proteins that DNA wraps around to form chromatin, and they can be modified with epigenetic marks to dictate how loose or compact that DNA-histone interaction is. Histone modifications are tightly regulated and can respond to cellular stimuli and external cues as a dynamic method of controlling transcriptional activity.

This guide covers the basics of histone proteins, how histones are modified, and the consequences of those modifications, as well as techniques that researchers can use to study histone proteins.

Histones and Chromatin State

DNA in the nucleus exists as chromatin, which refers to DNA in a complex with evolutionarily conserved proteins called histones. The resulting DNA-protein complex is termed a nucleosome. Nucleosomes are the fundamental building block of chromatin, with nucleosomes strung on DNA like beads on a string then further packaged and coiled into higher order chromatin structures (Figure 1).

Figure 1: DNA complexes with histone proteins to form chromatin. Chromatin forms higher order structures to facilitate packaging in the nucleus.

The structure of chromatin facilitates the careful spatial organization of an organism’s genomic material in the nucleus, which is essential for transcriptional regulation, packaging DNA into a small volume to fit in nuclei, and protecting the DNA sequence from disruption. Chromatin structure also influences DNA replication, mitosis and meiosis, DNA repair and genetic recombination.

Histone Protein Expression

The transcription and translation of histone proteins themselves are tightly regulated, because excess accumulation of histones results in genomic instability, DNA damage, chromosome aggregation or loss, and cell death.¹ Core histone genes have an unusual structure at both the DNA and RNA levels: the genes are found clustered together, do not contain introns and their RNA is not poly-adenylated.²

The core histone proteins are incorporated into chromatin during DNA replication in S phase of the cell cycle, which is why they are often referred to as replication-dependent histones.³ This deposition of histones is managed by histone chaperone proteins working alongside DNA polymerase;⁴ these chaperones, like histone mRNA, tend to be abundant only during S phase.

Nucleosome Structure

There are 5 different histone proteins (plus some variants⁵): H1, H2A, H2B, H3, H4. Two copies each of H2A, H2B, H3 and H4 bind together to form a histone core, around which 147 bp of DNA wraps 1.75 times (Figure 2). While H1 is not in the core nucleosome, it acts as a linker protein and is important for stabilizing DNA-histone interactions and the assembly of higher order chromatin.⁶

Figure 2: Nucleosome composition and chromatin architecture.The core histone octamer of the nucleosome is formed of an H3-H4 tetramer and two H2A-H2B dimers. Each histone protein has an unstructured N-terminal tail (as well as an unstructured C-terminal for H2A) that protrudes from the nucleosome core and are accessible to histone-modifying proteins. This makes histone tails, particularly of H3 and H4, the main site for posttranslational modifications (PTMs), which in turn influence chromatin state and can recruit transcription factors via reader proteins. Transcriptionally active euchromatin is marked by acetylation, while methylation can be associated with either euchromatin or heterochromatin depending on the methylated residue. Ubiquitination is also associated with transcriptional silencing, and tends to occur on residues of H2A and H2B.⁷

Chromatin State

Chromatin exists in two main states (Figure 2). Euchromatin is loosely packed, which makes it accessible to transcription machinery such as RNA polymerase complexes, as well as transcription factors and cofactors. It is therefore more permissive of gene transcription.⁸ In contrast, heterochromatin is tightly packed, preventing binding of transcription factors and so associated with reduced transcription. As well as providing a method for gene silencing, heterochromatin is also thought to be important for the maintenance of genomic stability.⁹

There are 4 core mechanisms of regulating the chromatin state that work in concert to control DNA accessibility and gene transcription:¹⁰

Histone variants: Alternative histones that can be deposited into nucleosomes and possess different qualities to the core histones
Post-translational modifications: Modifications to histone tails that affect DNA-histone interactions and act as recruitment sites for chromatin remodelers and transcription factors
ATP-dependent chromatin remodelers: Proteins that move, exchange, deposit and evict histones and nucleosomes
Histone chaperones: Proteins that regulate histone expression and work with remodelers in histone deposition

Histone Variants

There are several histone protein variants that perform specific functions, respond to defined stimuli, or are present in certain tissues such as the gametes. Some histone variants differ in residue identity by as much as 50% from their core histone counterpart, while others differ by only a few amino acids, but biophysical analyses indicate that even a few amino acid differences can substantially alter the DNA-protein interface.^11,12 Substituting specific histone variants into the nucleosome can therefore alter the nucleosome structure and its interaction with DNA, resulting in changes to chromatin and higher order chromatin architecture. In this way, they represent an important tool in transcriptional regulation and have distinct functions in cell division, DNA repair, and differentiation.⁵

Some histone variants, such as the essential H3 variant H3.3, are found ubiquitously throughout the body, others are located within specific types of chromatin or DNA, such as CENP-A at the centromere, but many are specific to the DNA packaging requirements of gametes. Table 1 illustrates many of the histone variants found in mammals, where they are found in the body, and unique features of their function or expression.

Canonical Histone	Histone Variant	Function and Features
H2A	H2A.X	Distributed throughout the genome, accumulates at sites of DNA damage to recruit DNA repair proteins. DNA repair, remodeling sex chromosomes
	H2A.Z.1	Transcriptional regulation, transcription initiation heterochromatin organization, chromosome segregation, genome stability, cellular proliferation
	H2A.Z.2	Transcriptional regulation
	macroH2A	Restricted to vertebrates. Enriched in transcriptionally silenced regions of the genome. Suppresses transcription by inhibiting recruitment of transcription machinery and chromatin remodelers.
	H2A.J	Conserved in mammals. Accumulates in senescent cells and upregulates inflammatory gene pathways
	H2A.Bbd (H2A.B)	Short H2A variant. Restricted to vertebrates. Found in high levels in testis and brain. Increases transcriptional activity by reducing chromatin compaction.
	H2A.L.2	Short H2A variant. Facilitates histone-to-protamine transition during spermatogenesis
	H2A.P, H2A.Q	Short H2A variants. Found in the testis
	TH2A (H2A.1)	Found in the gametes, where it dimerizes with TH2B. Enhances reprogramming to pluripotent state.
H2B	TH2B (H2B.1)	Found in the gametes, where it dimerizes with TH2A. Enhances reprogramming to pluripotent state.
	H2B.E	Found in odour-sensing neurons of mice, associated with cell longevity
	H2B.K	Found in ovaries, oocytes and zygotes
	H2B.L	Not found in chromatin; localizes to the subacrosome in sperm
	H2B.N	Found in ovaries, oocytes and zygotes
	H2B.W	Testis-specific, localized to telomeres
	H2B.3	Enriched in mature leaves and in nucleosomes containing H3.3 and/or H2A.Z
H3	H3.3	Found at both active and silenced regions, mediated by distinct chaperones. Nucleosomes containing H3.3 are deposited at sites that have lost nucleosomes during transcription: active genes, promoters, etc. Also found at telomeres, imprinted genes and heterochromatin to maintain heterochromatin state.
	H3T (H3.4)	Found in testis, and developing spermatozoa. Essential for spermatogenesis, ensures entry into meiosis, granting an open chromatin state.
	H3.5	Found in testis in immature gametes, seminiferous tubules. Associated with actively transcribed regions
	H3.6	Weakly expressed in various tissues, forms unstable nucleosomes
	H3.7	Weakly expressed in various tissues, forms unstable nucleosomes
	H3.8	Weakly expressed in various tissues, forms unstable nucleosomes
	H3.X	Primate-specific. Accumulates at transcriptional start sites of actively transcribed genes, open chromatin state.
	H3.Y	Primate-specific. Accumulates at transcriptional start sites and coding region of actively transcribed genes, open chromatin state. H3.Y nucleosomes tend to exclude H1.
	CENP-A (CenH3)	Indicates centromere location, required for kinetochore attachment and chromosome segregation during mitosis
	H3mm6-18	Mouse specific H3.3 variants. H3mm7 is expressed in skeletal muscle satellite cells. Some form stable nucleosomes, while others are more labile.
H4	H4.7 (H4.G)	Found in nucleolus. Produces less compact chromatin on ribosomal DNA, promoting rDNA transcription
H1	H1.0	Enriched in nucleolus. May replace H1 in terminally differentiated cells.
	H1.1	Widely expressed, peaks in S phase.
	H1.2	Widely expressed, peaks in S phase. Important for chromatin compaction
	H1.3	Widely expressed, peaks in S phase.
	H1.4	Widely expressed, peaks in S phase.
	H1.5	Widely expressed, peaks in S phase.
	H1.6	Found in testis
	H1.7 (H1T2)	Found in testis
	H1.8 (H1oo)	Enriched in oocytes.
	H1.9 (HILS1)	Found in testis
	H1.10 (H1X)	Enriched in nucleolus

Table 1:Histone variants and their tissue expression and associated functions. Information sourced from references 5,11,13–17.

Histone Variant Regulation

Histone variants are not variants in the sense of splice isoforms. Instead they are separate genes, located outside of the replication-dependent histone gene clusters. While some variants, such as some H1 variants, are regulated in a similar manner to the core histones, the transcriptional and translational regulation of most variants is more typical of other genes in the genome. Their mRNA is poly-adenylated, expression is not tied to the cell cycle, and their genes contain introns, making alternative splicing a possibility for some variant genes.

Distinct chaperones tend to be involved in depositing and removing histone variants compared to the core histone proteins, and the identity of those chaperones can determine the role of the variant in gene regulation. For instance, H3.3 is incorporated into active genes by the chaperone Hira, but deposited into heterochromatin and telomeres by the chaperone dimer Atrx/Daxx.⁵

Posttranslational Modifications of Histones

While histone variants represent one way to modify chromatin state, a more common, faster and more responsive approach is to modify the core histone proteins that are already present within nucleosomes. Posttranslational modifications (PTMs), particularly of lysine residues, are major determinants of chromatin state by influencing how tightly packed together nucleosomes are. PTMs typically occur on residues of N-terminal histone tails and include phosphorylation, methylation, acetylation, ubiquitination, sumoylation, ADP ribosylation and more.

Some PTMs are strongly associated with one type of chromatin state, for example acetylation of histone tails typically results in euchromatin. Other PTMs, such as methylation, are more context dependent, with the resultant chromatin state being determined by the specific amino acids in the histones that are modified, and the combination of PTMs on neighboring histones.

Histones are modified by specific enzymes known as writers and erasers, and interpreted by proteins known as readers, which can recruit transcription factors and chromatin remodeling proteins.¹⁸ Readers, writers and erasers therefore directly influence chromatin state and transcriptional activity at specific genomic loci through their addition, removal and recognition of histone PTMs.¹⁹

Histones and Epigenetics

The “histone code hypothesis” posits that histone modifications are the driving force behind transcriptional regulation, with specific combinations of PTMs conferring specific biological functions to the associated region of the genome.

Through histone PTMs, an organism’s phenotype can change without changes to the underlying genetic sequence, leading to histone PTMs being referred to as epigenetic marks. Epigenetics refers to heritable alterations to chromatin that regulate gene expression but do not alter the base sequence of DNA. Methods of transcriptional regulation such as DNA methylation, chromatin remodeling, and non-coding RNA signaling are also considered epigenetic. Epigenetic marks can be influenced by the environment, allowing for dynamic alterations to gene expression that can be passed on to subsequent generations.

Not all histone modifications are epigenetic marks. Histone modifications are often highly dynamic, with acetylation and phosphorylation events appearing and disappearing on chromatin in response to a stimulus within minutes.^20,21 For PTMs to be heritable, they must be stable.²² Histone methylation has a much longer half-life and so is more likely to be passed on during cell division.²³

Types of Histone Modification

Different histone modifications confer different biological functions to regions of DNA, such as making it more or less transcriptionally active, or being involved in specific cellular processes. Acetylation, methylation and phosphorylation are the best understood histone PTMs, but there are now at least 18 histone modifications known, and our knowledge of how they are involved in cellular function continues to expand.²⁴

Figure 3 illustrates the diversity of histone modifications possible on the core histone proteins and the linker H1. Descriptions of the main types of histone modification can be found below, along with tables of their commonly associated biological functions. For a more comprehensive list, we recommend the review by Zhao and Garcia, 2015.²⁴

Diagram of histone proteins with long tails displaying many different modifications

Figure 3: Schematic of histone modifications sites on the four core histone proteins and the linker H1. Note that histone variants such as H2A.X, H3.3 and CENP-A may have distinct and unique modifications. Distance between amino acid residues is not to scale.

Naming Conventions

A system for naming histone modifications has been devised to enable researchers to describe them with high precision. This is necessary due the number of different histone modifications that there are (and continue to be discovered), and the importance of the identities of both the histone protein and the modified residue. Note that this is not the “histone code” that you might have heard about (see Histones and Epigenetics)! There are 4 components of this convention; we shall use trimethylation of histone H3 lysine 27, or H3K27me3, as an example.

Modified histone: In this case, H3. Histone variants can also be indicated here, e.g. H2AX
Modified residue: Using the single letter amino acid code, which is lysine (K) in this case.
Residue location within the protein: The modified lysine is the 27th residue in this example. Most histone tails are at the N-terminus, so residue number is often low.
Modification type: This lysine has been trimethylated, indicated as me3. Methylation can be me1, 2, or 3, but most modifications are singular (e.g. “ac” or “ph”). Table 2 below gives the abbreviations for modifications as well as which residues can be targeted by different modifications.

Modification	Abbreviation	Residues
2-hydroxyisobutyrylation	hib	K
Acetylation	ac	K, sometimes S, T, Y
ADP-ribosylation	ar(n)	K, E
Biotinylation	bio	K
Butyrlyation	bu(t)	K
Citrullination (deimination)	cit	R
Crotonylation	cr	K
Formylation	fo(r)	K
Glutathionylation	gt	C
Hydroxylation	oh	Y
Malonylation	ma	K
Methylation	me1, me2, me3	K, R, sometimes Q, G
O-GlcNAcylation	og	K, S, T, Y
Phosphorylation	ph	S, T, Y
Proline isomerization	iso	P
Proprionylation	pr	K
Succinylation	suc	K
SUMOylation	su	K
Ubiquitination	ub	K

Table 2:Different types of histone modifications. For more information see references 24–26.

Acetylation

Acetylation occurs on lysine residues in all of the core histone proteins, as well as histone H1.^27,28 A highly dynamic process, acetylation can appear or disappear in response to a stimulus within minutes.²⁹ Acetyl groups are transferred to the ε-amino group of lysine side chains by histone acetyltransferases (HATs; see Histone Modifying Enzymes – HATs and HDACs), which use acetyl CoA as a cofactor.¹⁹ This transfer effectively neutralizes the positive charge that lysine naturally carries, resulting in a weakened interaction between the positively charged histones and negatively charged DNA.

Acetylation of histones is therefore associated with loosely packed euchromatin and increased transcriptional activity. Acetylation of lys9 and lys27 on histone H3 (H3K9ac and H3K27ac) are canonical, strong active marks, frequently associated with active gene promoter and enhancer regions of the genome.

While acetylation may also occur at non-lysine residues, including serine and threonine (e.g. H3T22ac), the role of these modifications in chromatin structure and function is currently unknown.³⁰

Histone	Site	Histone-modifying Enzyme(s)	Proposed Function
H2A	Lys5	Tip60, p300/CBP	Transcriptional activation
H2B	Lys5	p300, ATF2	Transcriptional activation
	Lys12	p300/CBP, ATF2	Transcriptional activation
	Lys15	p300/CBP, ATF2	Transcriptional activation
	Lys20	p300	Transcriptional activation
H3	Lys9	Unknown	Histone deposition
	Lys9	Gcn5, SRC-1	Transcriptional activation
	Lys14	Unknown	Histone deposition
		Gcn5, PCAF, Esa1, Tip60, SRC-1, p300	Transcriptional activation
		Elp3, Sas3	Transcriptional elongation
		Esa1, Tip60	DNA repair
		hTFIIIC90	RNA polymerase III transcription
		TAF1	RNA polymerase II transcription
		Sas2	Euchromatin
	Lys18	Gcn5	Transcriptional activation, DNA repair
	Lys18	p300/CBP	DNA replication, transcriptional activation
	Lys23	Unknown	Histone deposition
		Gcn5	Transcriptional activation, DNA repair
		p300/CBP	Transcriptional activation
		Sas3	Transcriptional elongation
	Lys27	p300/CBP, Gcn5	Enhancer function, gene expression
	Lys36	Gcn5	Marker of active promoters
	Lys64	p300	Nucleosome dynamics and transcription
H4	Lys5	Hat1	Histone deposition
		ATF2, p300	Transcriptional activation
		Esa1, Tip60	DNA repair
	Lys8	Gcn5, PCAF, Esa1, Tip60, ATF2, p300	Transcriptional activation
		Elp3	Transcriptional elongation
		Esa1, Tip60	DNA repair
	Lys12	Hat1	Histone deposition, telomeric silencing
		Esa1, Tip60, p300	Transcriptional activation
		Esa1, Tip60	DNA repair
	Lys16	Gcn15, Esa1, Tip60, ATF2	Transcriptional activation
		Esa1, Tip60	DNA repair
		Sas2	Euchromatin

Table 3:Acetylation of histones

Methylation

Methylation primarily occurs on lysine and arginine residues of histones H3 and H4, though methylation of H1, H2A, H2B and various histone variants have been observed, including on glutamine and glycine residues.^24,31–33 The overall effects of methylation can result in either increased or decreased transcriptional activity, depending on the specific modifications and the context in which they occur.

Unlike acetylation, methylation of histones does not alter DNA-histone interactions by modifying charge, which may explain its more diverse effects compared to acetylation. Instead, histone methylation acts as recruitment sites for chromatin remodeling proteins, such as those containing chromodomains (see Histone Modification Readers). These remodeling proteins modify chromatin state to be more open (euchromatin) or closed (heterochromatin), depending on the context and the specific epigenetic mark. Methylation can also render histones bulkier and more hydrophobic, affecting nucleosome stability and how tightly DNA is wound around histones.

Multi-methylation

To further complicate the landscape of histone methylation, residues can be methylated to different extents. Lysines can be mono-, di-, or tri-methylated, while arginines can be monomethylated and asymmetrically or symmetrically demethylated.

The extent of methylation has important functional consequences. For example, monomethylation of H3K27, H3K9 and H3K79 are associated with gene activation, whereas trimethylation of those residues is linked to gene repression.³⁴ Different amounts of methylation on a given residue may indicate the same effect on transcriptional activity, but mark different genetic elements. For instance, H3K4 methylation marks active genes, but H3K4me1 typically marks transcriptional enhancers, while H3K4me3 marks gene promoters.

Methylation of H3K9 and H3K27

Some of the best-studied histone modifications are those on H3K9 and H3K27. As mentioned above, acetylation of these marks active genes, whereas trimethylation of these same residues (H3K9me3 and H3K27me3) is a well-established signal for gene silencing. These repressive marks contain further nuanced functions. H3K27me3 tends to be a temporary signal in promoter regions, whereas H3K9me3 is a permanent mark in gene-poor genomic regions of heterochromatin, such as satellite repeat regions, telomeres, and certain transposable elements.

Histone	Site	Histone-modifying Enzyme(s)	Proposed Function
H1	Lys26	Ezh2	Transcriptional silencing
H2A	Arg3	PRMT1/6, PRMT5/7	Transcriptional activation, transcriptional repression
H2A	Gln104	Fibrillarin	Ribosomal gene expression
H3	Arg2	PRMT5, PRMT6	Transcriptional repression
	Lys4	SETD1A/B, MLL, ALL-1	Transcriptional activation
		MLL3/4	Enhancer function
		Meisetz	Meiotic prophase progression
	Arg8	PRMT5	Transcriptional repression
	Lys9	Suv39h, Clr4	Transcriptional silencing
		G9a	Transcriptional repression, genomic imprinting
		SETDB1	Transcriptional repression
	Arg17	CARM1	Transcriptional activation
	Arg26	CARM1	Transcriptional activation
	Lys27	Ezh2	Transcriptional silencing, X inactivation
	Lys27	G9a	Transcriptional silencing
	Lys36	Set2	Transcriptional elongation
	Arg42	CARM1	Transcriptional activation
	Lys79	Dot1	Euchromatin, transcriptional elongation
H4	Arg3	PRMT1/5/6/7	Transcriptional activation
	Lys8	SET5	Stress response
	Lys20	SETD8, PR-Set7	Transcriptional silencing, mitotic condensation
	Lys20	Suv4-20h	Heterochromatin

Table 4:Methylation of histones

Phosphorylation

Serine, threonine and tyrosine residues are phosphorylated in all of the core histone proteins and many variants, and are associated with many cellular processes including transcription regulation, DNA repair, mitosis and apoptosis.¹⁰ Histone phosphorylation primarily alters gene activity and chromatin structure by recruiting proteins to sites of interest, including chromatin remodeling proteins and writers or erasers of other histone modifications, introducing substantial crosstalk between modifications.

Phosphorylation in the DNA Damage Response

One of the best characterized cases of phosphorylation is the phosphorylation of H2A.X (or H2A in yeast), which plays a major role in the DNA damage response. Following a double-strand break (DSB) in the DNA, phosphorylation of H2AS129 near to the DSB occurs within 30 minutes.¹⁰ This modification then spreads over several kilobases or even megabases up- and down-stream of the DSB, creating a clear platform to recruit DNA repair proteins. One of these is the NuA4 acetyltransferase complex, leading to acetylation of histone H4 and an open chromatin state to facilitate DNA repair.

Phosphorylation in Transcription Regulation

Phosphorylation of histone H3S10, H3S28 and H2BS32 have all been associated with regulating gene activity, particularly for proliferation-related genes, such as in response to epidermal growth factor (EGF)-responsive genes, and the proto-oncogenes c-fos, c-jun and c-myc.

Phosphorylation is often linked with acetylation in the regulation of genes. For example, phosphorylation of H3S10, T11 and S28 result in Gcn5-dependent acetylation of H3.¹⁰ Similarly, in cells that have been stimulated with EGF, H3S10ph is strongly associated with H3K9 and H3K14 acetylation, which are indicators of gene activity. Interactions between phosphorylation have also been shown, such as H3S28ph resulting in demethylation of the neighboring H3K27, relieving the gene silencing imparted by that mark.

Phosphorylation in Chromatin Compaction

Histone phosphorylation is also involved in chromatin compaction and chromatid separation that are necessary for mitosis and meiosis, for which histone H3 residues T3, S10, T11, and S28 are known to be important.¹⁰

Chromatin condensation also occurs during apoptosis. H2BS14ph is important for this, again acting as a platform for acetylation of a nearby lysine residue. H2A.X phosphorylation also increases upon DNA fragmentation and apoptosis, with H2AXY142 appearing to recruit pro-apoptotic factor JNK1 instead of repair factors.¹⁰

Histone	Site	Histone-modifying Enzyme(s)	Proposed Function
H1	Thr10	Unknown	Mitosis
	Ser17	Unknown	Mitosis (interphase)
	Ser27	Unknown	Transcriptional activation, chromatin decondensation
	Thr137	Unknown	Mitosis
	Thr154	Unknown	Mitosis
	Ser172	Unknown	Mitosis (interphase)
	Ser188	Unknown	Mitosis (interphase)
H2A	Ser1	Unknown	Mitosis, chromatin assembly
	Ser1	MSK1	Transcriptional repression
	Thr120	Bub1, VprBP	Mitosis, transcriptional repression
	Ser139 (H2A.X)	ATR, ATM, DNA-PK	DNA repair
	Thr142 (H2A.X)	WSTF	Apoptosis, DNA repair
H2B	Ser14	Mst1	Apoptosis
	Ser14	Unknown	DNA repair
	Ser36	AMPK	Transcriptional activation
H3	Thr3	Haspin/Gsg2	Centromere mitotic spindle function
	Thr6	PKCβ	Inhibits AR-dependent transcription
	Ser10	Aurora-B kinase	Mitosis, meiosis
		Rsk2, MSK1, MSK2	Immediate-early gene activation
		IKKα	Transcriptional activation
	Thr11	Dlk/Zip	Mitosis
	Ser28	Aurora-B kinase	Mitosis
	Ser28	MSK1, MSK2	Immediate-early activation
	Tyr41	JAK2	Transcriptional activation
	Tyr45	PKCδ	Apoptosis
H4	Ser1	CK2	DNA repair

Table 5:Phosphorylation of histones

Ubiquitination

Histone ubiquitination, also known as ubiquitylation, involves attaching ubiquitin to lysine residues. Ubiquitin is an 8.6 kDa protein. While small for a protein, this makes it considerably larger than other histone modifications, with methyl, acetyl and phosphate groups all being under 100 Da. How ubiquitin achieves its effects may be due in part to its large size, sterically hindering the interaction between histones and DNA.³⁵ It can also act as a binding site for chromatin remodelers.

Ubiquitination in the cell more generally is best known as a way of marking a protein for proteasome-dependent degradation. This is particularly true of polyubiquitination, in which lysines within ubiquitin itself are ubiquitinated to form long chains of ubiquitin. In contrast, histone ubiquitination mainly involves monoubiquitination.

All histone core proteins can be ubiquitinated, but the best studied ubiquitination events occur on H2A at lysine 119 and H2B at lysine 120.³⁶ These two events have opposing impacts, with H2AK119ub resulting in transcriptional repression and H2BK120ub associated with transcriptional activation.

Histone ubiquitination has important roles in transcription, chromatin architecture, stem cell plasticity, development, DNA replication, and DNA damage and repair.^36–38 Monoubiquitination of H2A, H2A.X and H2B (e.g. H2AK13/15ub) are found at DNA double strand breaks, while polyubiquitination of H2AK63 is an important recognition site for DNA repair proteins.³⁹ These occur in response to extensive histone phosphorylation.

Many of the functions of ubiquitination are achieved via crosstalk with other PTMs. For example, ubiquitination of H2A can recruit methyltransferases such as polycomb repressive complex 1 and 2 (PRC1 and 2), ultimately resulting in methylation of H3K9 and H3K27 for transcriptional repression.³⁷ A similar mechanism exists for H2B ubiquitination, which results in H3K4 and H3K79 methylation, markers of transcriptional activity.³⁷

Histone	Site	Histone-modifying Enzyme(s)	Proposed Function
H2A	Lys13	Rnf168	DNA damage response
	Lys15	Rnf168	DNA damage response
	Lys63	Rnf18	DNA damage response
	Lys119	Ring2	Spermatogenesis
H2B	Lys120	RNF20/40	Cell cycle progression, meiosis, DNA damage response

Table 6:Ubiquitination of histones

ADP-Ribosylation

ADP-ribosylation refers to the transfer of an ADP-ribose group from the co-factor nicotine-amide adenine dinucleotide (NAD⁺) to amino acid residues in histones. While ADP-ribosylation can occur on lysine, arginine, glutamate, aspartate, cysteine, phospho-serine and asparagine residues, lysine and glutamate are the main acceptors in histone proteins.^24,40 Proteins can be mono- or poly-ADP-ribosylated, with mono-ADP-ribosylation acting as an acceptor for further ADP-ribosylation.

All core histones can be ADP-ribosylated, and histone H1 in particular is a strong acceptor of poly-ADP ribosylation. Given the role of H1 in chromatin architecture, this suggested that ADP ribosylation may be involved in chromatin structure. Indeed, ADP-ribosylation can directly alter histone-DNA interactions to increase DNA accessibility for transcription factors to increase transcriptional activity, and for repair factors at sites of DNA damage.^41,42 ADP ribosylation is also involved in DNA replication and the cell cycle, with poly-ADP-ribosylation being highest during cell proliferation.⁴³

Histone	Site	Histone-modifying Enzyme(s)	Proposed Function
H1	Glu2	PARP-1	Neurotrophic
	Glu14	PARP-1	Neurotrophic
	Lys213	PARP-1	Neurotrophic

Table 7:ADP-ribosylation of histones

SUMOylation

Histone lysines can be modified with the small ubiquitin-like modifier (SUMO) in a similar way to histone ubiquitination. SUMOylation has been observed on residues of all of the core histones in humans, as well as H1 and the variant H2A.X.⁴⁴ SUMOylation tends to be associated with transcriptional repression, for example by recruiting histone deacetylases (HDACs), which reduce histone acetylation and result in condensed chromatin.⁴⁴

Other Modifications

Beyond the well-established histone modifications described above, there are many additional modifications to regulate DNA accessibility, replication, repair and transcription. These include:

Lysine acylations: These are typically involved in transcriptional activation, and include crotonylation, butyrylation, and propionylation.⁴⁵
O-GlcNAcylation: A nutrient-linked modification that modulates transcription factors and RNA polymerase.⁴⁶
Biotinylation: A rare histone mark that stabilizes DNA-histone associations to contribute to gene silencing.⁴⁷
Citrullination: Also known as deimination, this converts arginine in histones to citrulline, resulting in a looser chromatin structure.

Histone Modifying Enzymes

Many histone modifications are dynamically added or removed from histones in response to external factors or cellular state, such as growth factors, hormones, and cellular stress. This is achieved by enzymes referred to as histone writers and erasers. Writers and erasers therefore allow the environment to modulate chromatin architecture and DNA expression, via intracellular signaling cascades. The enzymes responsible for histone modification are specific to each type of modification, and sometimes specific to precise amino acid residues.

Because of their high specificity and their impact on chromatin structure and gene expression, imbalances between histone writers and erasers can result in in genomic instability and various diseases relating to epigenetic abnormalities, including cancer. Writers and erasers therefore make attractive targets for small molecule inhibitor therapeutics.^18,48,49

Table 8 lists writers and erasers of the most common modifications, which will be discussed further below.

Modification	Writers	Erasers
Acetylation	Histone acetyltransferases (HATs)	Histone deacetylases (HDACs)
Methylation	Histone methyltransferases (HMTs/KMTs) and protein arginine methyltransferases (PRMTs)	Lysine demethylases (KDMs)
Phosphorylation	Histone kinases	Histone phosphatases
Ubiquitination	Ubiquitin E1 activation, E2 conjugation, E3 ligation enzymes	Deubiquitinases
ADP-ribosylation	ADP-ribosyltransferases	ADP-ribosyl glycohydrolases
SUMOylation	SUMO E1 activation, E2 conjugation, E3 ligation enzymes	DeSUMOylases

Table 8:Writers and erasers of histone modifications

HATs and HDACs

Histone acetyltransferases (HATs) are enzymes that add acetyl groups to specific lysine residues on histones. By neutralizing the positive charge of these residues, acetylation promotes gene activation.

Two major classes of HATs exist: type-A and type-B. Type-A HATs tend to be categorized into GNAT, MYST and CBP/p300 families, and are often found in large, multi-protein transcriptional activator complexes. Despite limited sequence homology among type-A HAT families, a structural similarity in the core enzymatic region suggests a common requirement for Acetyl-CoA as a cofactor.⁵⁰

In contrast, type-B HATs are highly conserved. They predominantly operate in the cytoplasm, acetylating free histones but not those already incorporated into chromatin in the nucleus. Acetylation of such newly synthesized histone H4 at K5 and K12 is important for their deposition into nucleosomes, after which the marks are removed.¹⁹

Histone deacetylases (HDACs) play a contrasting role by removing acetyl groups from lysine residues on histones. This deacetylation process effectively restores the positive charge of lysine residues, leading to chromatin condensation and gene repression. 18 mammalian HDACs have been identified and grouped into four classes. Classes I, II and IV are similar in sequence to yeast deacetylases and are zinc-dependent. Class III HDACs are sirtuin proteins and require nicotinamide adenine dinucleotide (NAD) as a cofactor.

In addition to cofactor requirements, some HDACs need to be in multi-protein corepressor complexes to achieve their full catalytic function.⁵¹ This is particularly true of HDACs 1, 2 and 3 in class I, which can join several different complexes depending on cell type, available partner proteins, and environmental context.⁵¹

Methyltransferases and Demethylases

Histone methyltransferases (HMTs) add methyl groups histone proteins to regulate gene transcription. Lysine methyltransferase (KMT) activity results in the mono-, di-, or tri-methylation of lysines, while arginine methyltransferases (PRMTs) can monomethylate or dimethylate arginine residues. Dimethylation can be either asymmetrical, mediated by Type I PRMTs, or symmetrical, mediated by Type II PRMTs.

HMTs are highly substrate specific: many only methylate a specific histone residue (e.g. H3K4), though there can be several different enzymes capable of methylating a single residue.⁵² Such enzymatic redundancy can make methylation events highly context-specific.

Histone demethylases (KDMs) remove methyl groups. LSD1 was the first KDM to be identified, and the discovery of the Jumonji C (JmjC) domain being a signature of demethylases further broadened the number of known KDMs.⁵³ Similar to HMTs, KDMs exhibit high specificity for particular methyllysine or methylarginine residues.

Kinases and Phosphatases

Histone kinases add phosphate groups to serine, threonine, or tyrosine residues. Some key kinases, such as aurora B kinase, act as master regulators, coordinating the activity of several histone kinases.⁵⁴

Histone phosphorylation often defines large regions of chromatin, such as different regions of the centromere during mitosis, which can help to provide clear signals to chromatin remodeling factors. These large regions emerge as a result of positive feedback loops involving phosphorylation of both histones and kinases.⁵⁴ For instance, the kinase Haspin phosphorylates H3T3, which recruits Aurora B kinase. Aurora B in turn phosphorylates Haspin to drive more modification of histones nearby.

Kinases such as MSK1/2 that are capable of phosphorylating both histones and transcription factors can also result in rapid upregulation of target genes by activating transcription factors and opening up chromatin, making them invaluable in fast responses to external signals.⁵⁵

Histone phosphatases tend to be common cellular phosphatases that are capable of dephosphorylating a wide array of different phosphorylated proteins, including histones. PP1, PP2A and PP4 have all been identified has being able to dephosphorylate histones.⁵⁴ To achieve specificity, targeting factors such as Repo-Man are required for recruitment to chromatin.

Histone Modification Readers

Histone readers are proteins that recognize different histone modifications and respond based on the modifications present. Such responses can include chromatin remodeling and recruitment of transcription factors. While histone writers and erasers are responsible for adding and removing different functional modifications, it is likely that histone reader proteins are the core determinants of the ultimate effects of the histone code present in a given location.

Histone modifications are read by specialized domains or modules. Specificity of these modules is determined by residues in close proximity to the module’s binding pocket. Even minor sequence changes can modify recognition, including between highly similar epigenetic marks such as mono-, di- or tri-methylation.⁵⁶

Table 9 describes various histone recognition domains that are found in reader proteins, which histone modifications they recognize, and their likely roles in cellular activity.

Histone-binding or Effector Domain	Proteins Containing Domain	Known Histone Marks	Suggested Function
14-3-3	14-3-3 Family	H3S10ph H3S28ph	Recruit transcriptional machinery and enhance acetylation of neighboring lysine residues, promoting an open chromatin state and increasing gene activation at binding sites.^57,58
Bromodomain	BRD2, BRD4, BAZ1B, BRG1	H3K9ac H3K14ac H4K5ac H4K8ac H4K12ac H4K16ac	Recognize acetylated lysine residues. Recruit transcriptional machinery and co-activators to acetylated regions to promote gene expression and facilitate chromatin remodeling.⁵⁹
BRCT	53BP1, BRCA1, MDC1, NBS1	H2A.XS139ph	BRCT domains bind phosphorylated serine and threonine residues. Recognize phosphorylated histones at sites of DNA damage to recruit DNA repair machinery. Can also recruit transcriptional regulators to promote transcription.^60,61
Chromodomain	CHD1, CDY, HP1, MPP8, MRG15	H3K4me2/3 H3K9me2/3 H3K27me2/3	Recognize methylated lysine residues and can be associated with either gene activation or repression via changes to chromatin state.⁶²
MBT	L3MBTL1, L3MBTL2, MBTD1	H3K4me1 H4K20me1/2 H1K26me1	MBT domains recognize and bind to methylated lysines. Involved in chromatin compaction and gene repression.^63–65
PHD Fingers	BPTF, ING, RAG2, BHC80, DNMT3L, PYGO1, JMJD2A, UHRF, PHF20	H3K4me3 H3K9me3 H3K36me3 H3K14ac	Recruit transcription factors and chromatin remodeling proteins involved in gene activation or repression, depending on the specific mark. Play roles in chromatin remodeling and gene regulation.⁶⁶
Tudor Domains	TDRD3, TDRD7, 53BP1, SETDB1	H3K4me3 H3K9me3 H3R17me2 H4K20me2	Tudor domains bind to methylated lysines and arginines, particularly trimethylated lysines, resulting in either activation or repression of gene expression depending on the specific protein. Involved in various processes, including DNA damage response and gene regulation.^67,68
WD40 repeats	WDR5, EED, LRWD1, WDR77, RbBP4/7	H3K4me2 H3R2me2 H3K9me3 H3K27me3	Typically recognize methylated lysines, and they can recruit transcription factors and chromatin remodeling proteins involved in gene activation or repression, depending on the specific mark.^69-71

Table 9: Histone reader domains and their cellular functions.

Multivalent Histone Modifications

Combinations of histone modifications can be recognized by histone readers that contain multiple histone-binding modules within them, either as individual effector proteins or as complexes.⁵⁶ Such multivalent interactions greatly expand the nuance and complexity of the histone code, whereby histone modifications are read together rather than in isolation. Modifications to be read simultaneously can be on the same histone tail, but they can also be further apart on the same nucleosome, on neighboring nucleosomes, or on nucleosomes that are not close on a contiguous stretch of DNA but are spatially located in common chromatin territory.⁵⁶ How combinatorial modifications are read can depend on nucleosomal context.⁷²

This mechanism reinforces common markers of chromatin states, enabling histone readers to form stable interactions with modified regions of chromatin despite relatively weak binding affinity for individual modifications.⁷³

One well known multivalent modification is H3K4me3 and H3K27me3, which is present together almost exclusively in pluripotent stem cells and is thought to help maintain the pluripotency of the cell population.⁷⁴ Others have been recognized as being involved in gene silencing, X-chromosome inactivation, and responses to external signals.⁵⁶

Techniques for Studying Histone Modifications

Chromatin Immunoprecipitation (ChIP)

Chromatin immunoprecipitation (ChIP) is a biochemical technique that uses antibodies to bind to and isolate proteins of interest alongside any DNA it is bound to (Figure 4). The DNA can then be purified, sequenced, and mapped to the genome to determine where the protein of interest was bound in the genome and its abundance at different sites.

Figure 4: Schematic of chromatin immunoprecipitation (ChIP) workflow.

The experimental aims and outcomes vary depending on the identity of the protein of interest. Antibodies against histone proteins or even specific histone modifications can be used in ChIP for a number of questions, including:

Locations of higher order chromatin structures, active or silenced genes, and genetic elements, based on histone marks known to mark those elements
Genes that are active or inactive in a given biological context, based on the presence or absence of a well-established histone modification
Function of a poorly understood histone modification, based on genetic sequences where it is enriched

Mass Spectrometry

Mass spectrometry (MS) provides a comprehensive view of the histone modification landscape in an unbiased manner. For MS, histones are enzymatically digested into peptides, which are then ionized and separated within the spectrometer based on their mass-to-charge ratio. The resultant spectra can then be analyzed to determine the presence and abundance of specific histone modifications.

MS has several advantages over ChIP. It is highly quantitative, providing precise measures of abundance. MS is not limited to a specific region of the genome or to specific types of modifications as antibodies would be, meaning it is unbiased and can provide a complete picture of all modifications that change during an experimental treatment. This makes it ideal for investigating combinatorial changes, such as if both acetylation and methylation are altered. Novel PTMs can also be identified by MS.

Western Blot

Western blots use antibodies to detect proteins on a membrane that have been separated based on their size by gel electrophoresis. They are useful for gaining an understanding of the relative abundance of specific histones (e.g. histone variants) or histone modifications, which can then be compared across different experimental conditions. They are more targeted than MS, capable of investigating only a few modifications at a time with specific antibodies. However, they are still capable of investigating combinatorial modifications by sequential or parallel blotting.

Immunohistochemistry

Immunohistochemistry (IHC) uses antibodies to detect and visualize histones and their modifications in tissue samples. This allows researchers to observe the spatial distribution of histone modifications in the cell, such as in different types of chromatin or different cellular structures. IHC is particularly useful for studying the localization of modifications during stages of the cell cycle. It can also be used to investigate combinatorial modifications, with complementary antibodies targeting specific modifications and techniques such as Förster resonance energy transfer (FRET) or proximity ligation assay (Figure 5).⁷⁵

Multipanel fluorescent imge of histone modification IHC

Figure 5: Using immunohistochemistry to detect histone modifications in mouse embryonic stem cells.A-D, Individual antibodies against specific modifications can be used to reveal the localization of histone modifications within the nucleus and suggest colocalization (A-C) or not (D). E-H, Proximity ligation assay will only generate a fluorescent signal when targeted proteins or modifications are within ~30 nm of each other (E-G). No signal is seen if they are not in close proximity (H). Nuclei are counterstained with DAPI.
Edited and reproduced under Creative Commons 4.0 CC-BY from Hattori, N., Niwa, T., Kimura, K., Helin, K. & Ushijima, T. Visualization of multivalent histone modification in a single cell reveals highly concerted epigenetic changes on differentiation of embryonic stem cells. Nucleic Acids Res. 41, 7231–7239 (2013).

References

Diagrams created with BioRender.com.

Show all