Epigenetic signature of human immune aging in the GESTALT study

  1. Roshni Roy
  2. Pei-Lun Kuo
  3. Julián Candia
  4. Dimitra Sarantopoulou
  5. Ceereena Ubaida-Mohien
  6. Dena Hernandez
  7. Mary Kaileh
  8. Sampath Arepalli
  9. Amit Singh
  10. Arsun Bektas
  11. Jaekwan Kim
  12. Ann Z Moore
  13. Toshiko Tanaka
  14. Julia McKelvey
  15. Linda Zukley
  16. Cuong Nguyen
  17. Tonya Wallace
  18. Christopher Dunn
  19. William Wood
  20. Yulan Piao
  21. Christopher Coletta
  22. Supriyo De
  23. Jyoti Sen
  24. Nan-ping Weng
  25. Ranjan Sen
  26. Luigi Ferrucci  Is a corresponding author
  1. Laboratory of Molecular Biology and Immunology, National Institute on Aging, United States
  2. Translational Gerontology Branch, National Institute on Aging, United States
  3. Laboratory of Neurogenetics, National Institute on Aging, United States
  4. Clinical Research Core, National Institute on Aging, United States
  5. Flow Cytometry Unit, National Institute on Aging, United States
  6. Laboratory of Genetics and Genomics, National Institute on Aging, United States
  7. Laboratory of Clinical Investigation, National Institute on Aging, United States
5 figures and 13 additional files

Figures

Figure 1 with 1 supplement
Study design and identification of age-associated methylation probes.

(A) Study design. (B) Age-associated CpG methylation (False Discovery Rate or FDR p < 0.05) in six cell types. (C, D) SuperExactTest circular plots to show the number of age-associated hypo- and hypermethylated probes shared among different combinations of cell types (indicated by green boxes), respectively. The outermost bars show the number of probes shared among each cell-type combination (regardless of other cell types). For example, probes hypomethylated with age in B + CD4 + CD8 + gran + mono (n = 222) includes probes also hypomethylated in NK cells (n = 181) and probes not hypomethylated with age in NK cells (n = 41). Based on the exact probability distributions of multi-set intersections, all the overlaps shown are highly statistically significant (p < 10−100). (E) Graphical representation of age-associated hypomethylation in promoter region of RCAN1 in all six cell types. (F) Graphical representation of age-associated hypermethylation in promoter region of KLF14. The methylation status in peripheral blood mononuclear cell (PBMC) and buffy coat are also shown. Missing methylation data are represented in white.

Figure 1—figure supplement 1
Characteristics of entire dataset and age-associated methylation data in six primary immune cells.

(A) Principal component analysis (PCA) plot of normalized methylation data of six immune cells in the 55 healthy donors. The cell types are indicated in different colors, while the three broad age groups (20–40, 40–60, and 60–90 years) are indicated in different shapes (PC1 – principal component 1, PC2 – principal component 2). (B) Distribution of beta-regression coefficient of the age-associated hypo- and hypermethylated probes in all six immune cell types estimated this cross-sectional study. Distribution of coefficient values categorized into groups is shown in Supplementary file 3. (C) Age-associated probes from independent sample t-test analysis of young (≤35 years, 25th percentile) vs old (≥70 years, 75th percentile) donors for each cell type. The pie charts on top show the extent of overlap with results from the beta-regression analysis. (D) Distribution of age-associated probes from beta-regression into groups based on distance from CpG islands (CGI) (Island – within CGI, shore – within 2 kb of CGI, shelf – 2–4 kb of CGI, open sea – >4 kb from CGI). (E) Distribution of age-associated hypo- and hypermethylated probes with respect to location (promoter – 1500TSS to first exon, genebody – within exons, introns, and 3′UTR). (F, G) Overlap of age-associated hypo- and hypermethylated probes in the six immune cell types with those identified in peripheral blood mononuclear cells (PBMCs). The first bar indicates the number of age-associated probes identified in PBMC. The following bars show the counts in the other immune cells, the lighter portion of the bars show the number of probes that are shared with PBMCs, and the darker portion indicates non-PBMC cell-specific probes. The numbers above the bars show the log of odds ratio and its 95% confidence interval to represent the significance of the overlap between age-associated probes in each cell type with PBMC. The p-value from odds ratio analysis is represented as ** as they were all <0.01.

Figure 2 with 2 supplements
Characteristics of age-associated probes.

(A, B) Manhattan plot of age-associated hypo- and hypermethylated CpG sites in B cells, respectively. Most significant genic probes (−log padj >10) are labeled. (C) Correlation between beta-regression coefficients of age-differentially methylated CPGs in GESTALT and longitudinal InCHIANTI study. X-axis – InCHIANTI, Y-axis – B cell (C) and CD4+ T cell coefficients (D). Blue dots – age-hypomethylated CpGs, yellow triangles – age-hypermethylated CpGs. (E, F) Scatter plot of age-associated CpGs showing opposite trends in different immune cells. (E) cg27123256 (in BCL11B promoter) is hypomethylated with older age in B, monocytes, and NK while is hypermethylated with older age in CD4+ T cells. (F) cg03530364 (in FAM19A1 promoter) is hypermethylated with older age in B, granulocytes, monocytes, and NK cells while it is hypomethylated with older age in CD4+ T cells.

Figure 2—figure supplement 1
Most significant age-associated CpGs in non-B immune cells along with CpGs showing opposite age-associated trends.

(A) Manhattan plot of age-associated CpGs in CD4+ T cells. The X-axis shows the distribution of significant CpGs (FDR p < 0.05), and the Y-axis shows the associated negative log of the adjusted p-value from beta-regression. Positive axis comprises of probes hypermethylated with age while the negative axis shows age-associated hypomethylated probes. The top hits in each group with the most significant p-values are labeled where the orange dot present CpG probes in the gene promoter. (B) Manhattan plot of age-associated probes in CD8+ T cells. (C) Manhattan plot of age-associated probes in granulocytes. (D) Manhattan plot of age-associated probes in NK cells. (E) Manhattan plot of age-associated probes in monocytes. (F) Count of probes hypomethylated with age in cell type of interest but showing hypermethylation in one or more other cell types. For example, 315 probes are age-hypomethylated in B cells but are significantly hypermethylated with age in one or more other immune cell types. Maximum number of such probes are observed in CD4+ T cells followed by B cells and monocytes. (G) Count of probes hypermethylated with age in cell type of interest but showing hypomethylation in one or more other cell types. For example, 282 probes are hypermethylated in B cells but are significantly hypomethylated with age in one or more other immune cell types. Maximum number of such probes are observed in CD4+ T cells followed by B cells and monocytes.

Figure 2—figure supplement 2
Comparison of individual immune cells with InCHIANTI longitudinal study.

(A, B) Correlation between beta-regression coefficients of age-associated methylation probes in five or more cell types in study and beta-regression coefficients estimated from longitudinal data in the InCHIANTI study. On the X-axis is the data from InCHIANTI longitudinal study cohort while on the Y-axis is cell-specific coefficient values for CD8+ T cells (top left), granulocytes (top right), monocytes (bottom left), and NK cells (bottom right). Pink dots are the coefficients of the age-hypomethylated probes (A) while the blue dots are for age-hypermethylated probes (B).

Pathway analysis of methylated probes.

Enrichment analysis of genes annotated to age-associated hypo- and hypermethylated CpGs in ≥5 cell types (left-most column) and in individual cell types. Red/green shades indicate enrichment scores in hyper- (red) and hypo- (green) methylated genes. Yellow indicates ambiguous pathways associated with both hypo- and hypermethylated genes in individual cell types. Not significant pathways are shown in gray. Full results in Supplementary file 6.

Figure 4 with 1 supplement
Functional annotation of age-associated probes along with their grouping based on sharedness.

(A) ChromHMM annotation of age-associated CpGs. (B) Proportion of CpGs mapping to weak/active enhancers (left, orange box), bivalent enhancers/TSS (inset, brown box) and polycomb repressor regions (right, gray box) in age-associated hypo- (blue line), hypermethylated (red line) CpGs as compared to all MethylationEPIC CpGs (gray line). (C) DeepTools plots showing the distribution of accessible chromatin (DNase hypersensitive sites) and H3K4me1 histone mark in and around ±3 kb region of age-differentially methylated CpGs. The age-associated sites were divided into shared (blue) (common between five or more immune cells) and selective sites (green). The top row shows the pattern for age-associated hypomethylated CpGs while the bottom row is for the age-associated hypermethylated CpGs in B and CD4+ T cells.

Figure 4—figure supplement 1
Functional annotation of age-associated probes with respect to DHS and three other histone marks from ENCODE.

Rows 1 and 2 – the functional annotation of age-associated probes in CD8, monocytes, and NK cells with respect to DNase hypersensitivity sites and H3K4me1 peaks. Rows 3–6 – the functional annotation of age-associated probes in B, CD4, CD8, monocytes, and NK cells with respect to H3K4me3 and H3K27ac peaks. Age-associated hypo- or hypermethylated probes were grouped into shared (blue) and selective (green) based on whether they were common across five or more of the six cell types or not. A region of 3 kb of either side of the CpG probes of interest was examined. The DHS and histone bigwig files for the five primary cells were obtained from ENCODE. Granulocyte data were not available and hence could not be examined.

Figure 5 with 1 supplement
Association of transcription factor (TF)-binding motifs with age-differentially methylated CpGs.

(A) Top 5 TF motifs at and around (±200 bp) of CpG sites that are hypomethylated with age. All the age-hypomethylated sites were considered for the analysis in each cell type. Recurring motifs like ARNT and CTCF/BORIS are highlighted. (B) Top 5 TF motifs at and around (±200 bp) CpG sites that are hypermethylated with age. All the age-hypermethylated sites were considered for the analysis in each cell type. Recurring motifs like REST and Sp100 are highlighted. (C) Hypoxia-centric model of age-associated sites with ARNT and REST motifs. CpG sites hypomethylated with aging across six different cell types are significantly more likely to host-binding motifs for ARNT, the core hub for the hypoxia response. On the contrary, CpG sites hypermethylated with aging are significantly more likely to host-binding motifs for REST, a hypoxia-response transcriptional repressor. On the right are selected age-associated genes that carry the motifs for ARNT or REST TFs.

Figure 5—figure supplement 1
Count of age-associated hypo- or hypermethylated probes with ARNT or REST motifs within 1 kb, respectively.

ARNT and REST were the top transcription factor (TF) motifs associated, respectively, with all hypo- and hypermethylated age-associated probes in most cell types. To verify whether the same set of probes were present in each cell type with the ARNT (A) or REST (B) motifs, HOMER was used to find the probes that have an ARNT or REST motif within ±500 bp. The SuperExact test-based circular plots show the overlap between the different cell types (relevant cell types are indicated in green boxes in the inner circles for each combination). The numbers above the outermost bars indicate the count of probes that have ARNT or REST motifs across various combinations of cells while the color of the outermost bars in the plot indicates the log-transformed p-values obtained from hypergeometric test to check whether the overlap is significant or not.

Additional files

Supplementary file 1

Demographic and flow cytometry marker details of the cohort.

Details of the age and sex distribution of the healthy donors from the GESTALT study for each of the primary immune cell-type population are described. The flow cytometry markers for cell selection are also mentioned.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp1-v2.xlsx
Supplementary file 2

Pathway enrichment analysis of genes annotated to top 500 probes corresponding to PC1, PC2, and PC3 components of principal component analysis (PCA).

The top 500 CpG sites corresponding to PC1, PC2, and PC3 components were annotated to genes followed by gene enrichment analysis. Age-associated genes in each pathway are in column M.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp2-v2.xlsx
Supplementary file 3

Distribution of slope for probes significantly changing with age in the immune cells.

The age-associated probes were identified from beta-regression (FDR p < 0.05).

https://cdn.elifesciences.org/articles/86136/elife-86136-supp3-v2.xlsx
Supplementary file 4

List of age-associated probes each of the six primary immune cells.

Beta-regression coefficient, FDR p-value, and genomic annotation of the age-associated probes were identified from beta-regression (FDR p < 0.05).

https://cdn.elifesciences.org/articles/86136/elife-86136-supp4-v2.xlsx
Supplementary file 5

List of top age-associated genes in the six immune cell types.

The list of top 15 and top 50 age-associated hypo- and hypermethylated genes derived from the most significant age-associated probes in each cell type.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp5-v2.xlsx
Supplementary file 6

Detailed output of gene set enrichment analysis.

Gene set enrichment analysis was performed on genes based on annotation of age-associated hypo- and hypermethylation probes commonly changing in five or more cell types.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp6-v2.xlsx
Supplementary file 7

Top 5 transcription factor (TF) motifs within ±50 bp of age-associated methylated sites.

HOMER de novo analysis was performed to identify the top 5 TF motifs within 50 bp of age-associated hypo- and hypermethylated sites in each of the six cell types.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp7-v2.xlsx
Supplementary file 8

Average read depths and Kallisto TPM normalized read counts of ARNT, CTCF, and REST for all the donors.

RNA-Seq data were used to look into the gene expression change of three selected transcription factors (TFs; ARNT, CTCF, and REST) with age. These TF motifs are most commonly associated with the age-related methylated sites in all immune cells. The mapping rates along with the Kallisto TPM normalized values for the three TFs for each cell type in each of the donors have been shown.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp8-v2.xlsx
Supplementary file 9

Age-associated differences of transcripts for ARNT, REST, and CTCF.

FDR p-values derived from the linear regression of expression levels of the three transcription factors (TFs) with age in each of the six cell types.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp9-v2.xlsx
Supplementary file 10

List of genes with age-associated methylated CpG sites showing ARNT or REST motif within 1 kb.

The age-associated probes with ARNT or REST motifs within 1 kb region were annotated to genes and summarized into a table. For each gene, number of age-associated CpG sites with ARNT/REST motif and number of cell types in which this occurrence has been observed have been mentioned.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp10-v2.xlsx
Supplementary file 11

Output of beta-regression analysis with age and sex and the seven analytes.

Summary of two beta-regression models has been tabularized. Column C shows the number of age-associated probes from the original model CpG ~ age + sex with FDR cutoff of adjusted page < 0.05. Columns D–J show the number of age-associated probes from the model CpG ~ age + sex + analyte with FDR cutoff of adjusted page < 0.05 and adjusted panalyte < 0.05. Finally, columns K–Q represent the number of age-associated probes from the model CpG ~ age + sex + analyte with FDR cutoff of adjusted panalyte < 0.05.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp11-v2.xlsx
Supplementary file 12

List of softwares.

https://cdn.elifesciences.org/articles/86136/elife-86136-supp12-v2.xlsx
MDAR checklist
https://cdn.elifesciences.org/articles/86136/elife-86136-mdarchecklist1-v2.docx

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Roshni Roy
  2. Pei-Lun Kuo
  3. Julián Candia
  4. Dimitra Sarantopoulou
  5. Ceereena Ubaida-Mohien
  6. Dena Hernandez
  7. Mary Kaileh
  8. Sampath Arepalli
  9. Amit Singh
  10. Arsun Bektas
  11. Jaekwan Kim
  12. Ann Z Moore
  13. Toshiko Tanaka
  14. Julia McKelvey
  15. Linda Zukley
  16. Cuong Nguyen
  17. Tonya Wallace
  18. Christopher Dunn
  19. William Wood
  20. Yulan Piao
  21. Christopher Coletta
  22. Supriyo De
  23. Jyoti Sen
  24. Nan-ping Weng
  25. Ranjan Sen
  26. Luigi Ferrucci
(2023)
Epigenetic signature of human immune aging in the GESTALT study
eLife 12:e86136.
https://doi.org/10.7554/eLife.86136