Population clustering of structural brain aging and its association with brain development
eLife assessment
Duan et al analyzed brain imaging data in UKBK and divided structural brain aging into two groups, revealing that one group is more vulnerable to aging and brain-related diseases compared to the other group. Such subtyping could be valuable and utilized in predicting and diagnosing cognitive decline and neurodegenerative brain disorders in the future. This discovery, supported by solid evidence, harbors a substantial impacts in aging and brain structure and function.
https://doi.org/10.7554/eLife.94970.3.sa0Valuable: Findings that have theoretical or practical implications for a subfield
- Landmark
- Fundamental
- Important
- Valuable
- Useful
Solid: Methods, data and analyses broadly support the claims with only minor weaknesses
- Exceptional
- Compelling
- Convincing
- Solid
- Incomplete
- Inadequate
During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments
Abstract
Structural brain aging has demonstrated strong inter-individual heterogeneity and mirroring patterns with brain development. However, due to the lack of large-scale longitudinal neuroimaging studies, most of the existing research focused on the cross-sectional changes of brain aging. In this investigation, we present a data-driven approach that incorporate both cross-sectional changes and longitudinal trajectories of structural brain aging and identified two brain aging patterns among 37,013 healthy participants from UK Biobank. Participants with accelerated brain aging also demonstrated accelerated biological aging, cognitive decline and increased genetic susceptibilities to major neuropsychiatric disorders. Further, by integrating longitudinal neuroimaging studies from a multi-center adolescent cohort, we validated the ‘last in, first out’ mirroring hypothesis and identified brain regions with manifested mirroring patterns between brain aging and brain development. Genomic analyses revealed risk loci and genes contributing to accelerated brain aging and delayed brain development, providing molecular basis for elucidating the biological mechanisms underlying brain aging and related disorders.
Introduction
The structure of the brain undergoes continual changes throughout the entire lifespan, with structural brain alterations intimately linking brain development and brain aging (Fjell and Walhovd, 2010; Shaw et al., 2008). Brain aging is a progressive process that often co-occurs with biological aging and declines of cognitive functions (Elliott et al., 2021; Mattson and Arumugam, 2018; Park and Reuter-Lorenz, 2009), which contribute to the onset and acceleration of neurodegenerative (Mariani et al., 2005) and neuropsychiatric disorders (Kaufmann et al., 2019). Studies on healthy brain aging have revealed significant inter-individual heterogeneity in the patterns of neuroanatomical changes (Raz et al., 2010; Raz and Rodrigue, 2006). Therefore, examining the patterns of structural brain aging and its associations with cognitive decline is of paramount importance in understanding the diverse biological mechanisms of age-related neuropsychiatric disorders.
Despite the fact that there exist large differences between brain development and brain aging (Courchesne et al., 2000), a discernible association between these two processes remains evident. Direct comparisons of brain development and brain aging using structural MRI indicated a ‘last in, first out’ mirroring pattern, where brain regions develop relatively late during adolescence demonstrated accelerated degeneration in older ages (McGinnis et al., 2011; Tamnes et al., 2013). In addition, brain regions with strong mirroring effects showed increased vulnerability to neurodegenerative and neuropsychiatric disorders, including Alzheimer’s disease and schizophrenia (Douaud et al., 2014). However, due to the lack of large-scale longitudinal MRI studies during adolescence and mid-to-late adulthood, validation of the ‘last in, first out’ mirroring hypothesis remains unavailable.
Prior investigations have largely focused on regional and cross-sectional changes of brain aging (Raz and Rodrigue, 2006; Douaud et al., 2014; Suzuki et al., 2019), with relatively few studies exploring longitudinal trajectories of brain aging and its associations with brain development (Raz et al., 2010; Fjell et al., 2015; Nyberg et al., 2023). In this article, we present a data-driven approach to examine the population clustering of longitudinal brain aging trajectories using structure MRI data obtained from 37,013 healthy individuals during mid-to-late adulthood (44–82 years), and explore its association with biological aging, cognitive decline and susceptibilities for neuropsychiatric disorders. Further, mirroring patterns between longitudinal brain development and brain aging are investigated by comparing the region-specific aging / developmental trajectories, and manifestation of the mirroring patterns are investigated across the whole-brain and among participants with different brain aging patterns. Genomic analyses are conducted to reveal risk loci and genes associated with accelerated brain aging and delayed brain development.
Results
Longitudinal trajectories of whole-brain gray matter volume in mid-to-late adulthood define two brain aging patterns
Figure 1 provides the data sources, analytical workflow and research methodology of this study. After the sample selection process (Appendix 1—figure 1, Appendix 1—table 1 and 2), longitudinal grey matter volume (GMV) trajectories in 40 ROIs (33 cortical and 7 subcortical ROIs, see Appendix 1—table 3) were estimated for each of the 37,013 healthy participants in UK Biobank. The first 15 principal components derived from dimensionality reduction via principal component analysis were used in the clustering analysis (see Methods; Alexander-Bloch et al., 2013; Whitwell et al., 2009). Two brain aging patterns were identified, where 18,929 (51.1%) participants with the first brain aging pattern (pattern 1) had higher total GMV at baseline and a slower rate of GMV decrease over time, and the remaining participants with the second pattern (pattern 2) had lower total GMV at baseline and a faster rate of GMV decrease (Figure 2a). Comparing the region-specific rate of GMV decrease, pattern 2 showed a more rapid GMV decrease in medial occipital (lingual gyrus, cuneus, and pericalcarine cortex) and medial temporal (entorhinal cortex, parahippocampal gyrus) regions (Figure 2b and c and Appendix 1—figure 2), which had the largest loadings in the second and third principal components (Appendix 1—table 4). These two patterns can be clearly stratified by both linear and non-linear dimensionality reduction methods, indicating distinct structural differences in brain aging between patterns (Appendix 1—figure 3). Sample characteristics of these 37,013 UK Biobank participants stratified by brain aging patterns are summarized in Appendix 1—table 5. Overall, participants with different brain aging patterns had similar distributions with regard to age, sex, ethnicity, smoking status, Townsend deprivation index (TDI), body mass index (BMI) and years of schooling.
Brain aging patterns were significantly associated with biological aging
To explore the relationships between structural brain aging and biological aging, we investigated the distribution of aging biomarkers, such as telomere length and PhenoAge (Levine et al., 2018), across brain aging patterns identified above (Figure 3 and Appendix 1—table 6). Compared to pattern 1, participants in pattern 2 with more rapid GMV decrease had shorter leucocyte telomere length (p=0.009, Cohen’s D=–0.028) and this association remained consistent after adjusting for sex, age, ethnic, BMI, smoking status and alcohol intake frequency (Demanelis et al., 2020). Next, we examined PhenoAge, which was developed as an aging biomarker incorporating composite clinical and biochemical data (Levine et al., 2018), and observed higher PhenoAge among participants with brain aging pattern 2 compared to pattern 1 (p=0.019, Cohen’s D=0.027). Again, the association remained significant after adjusting for sex, age, ethnic, BMI, smoking status, alcohol intake frequency and education years (p=3.05×10–15, Cohen’s D=0.092). Group differences in terms of each individual component of PhenoAge (including albumin, creatinine, glucose, c-reactive protein, lymphocytes percentage, mean corpuscular volume, erythocyte distribution width, alkaline phosphatase and leukocyte count) were also investigated and results were consistent with PhenoAge (Appendix 1—figure 4).
Accelerated brain aging was associated with cognitive decline and increased genetic susceptibilities to attention-deficit/hyperactivity disorder and delayed brain development
Next, we conducted comprehensive comparisons of cognitive functions between participants with different brain aging patterns. In general, those with brain aging pattern 2 (lower baseline total GMV and more rapid GMV decrease) exhibited worse cognitive performances compared to pattern 1. Specifically, brain aging pattern 2 showed lower numbers of correct pairs matching (p=0.006, Cohen’s D=–0.029), worse prospective memory (OR = 0.943, 95% CI [0.891, 0.999]), lower fluid intelligence (p<1.00×10–20, Cohen’s D=–0.102), and worse numeric memory (p=5.97×10–11, Cohen’s D=–0.082). No statistically significant differences were observed in terms of the reaction time (p=0.99) and prospective memory (p=0.052) between these two brain aging patterns after FDR correction. Results were consistent when using models adjusted for sex, age, and socioeconomic status (TDI, education and income; Foster et al., 2018; Townsend et al., 2023; Figure 4). Full results demonstrating the associations between brain aging patterns and cognitive functions are presented in Appendix 1—table 7.
Having observed cognitive decline among participants with accelerated brain aging pattern, we next investigated whether brain aging patterns were associated with genetic vulnerability to major neuropsychiatric disorders. Since current GWAS are under-powered for attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD) and the difficulty in identifying genetic variants was likely due to their polygenic nature, we calculated the corresponding polygenic risk scores (PRS) using multiple p value thresholds. This approach enabled robust investigation of the association between genetic susceptibility of neuropsychiatric disorders and brain imaging phenotypes. PRS for major neuro-developmental disorders including attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorders (ASD), neurodegenerative diseases including Alzheimer’s disease (AD) and Parkinson’s disease (PD), neuropsychiatric disorders including bipolar disorder (BIP), major depressive disorder (MDD), and schizophrenia (SCZ), and delayed structural brain development (GWAS from an unpublished longitudinal neuroimaging study) (Shi et al., 2023) were calculated for each participant using multiple p value thresholds (from 0.005 to 0.5 at intervals of 0.005) and results were then averaged over all thresholds (Figure 5). The primary GWAS datasets used for calculating the PRS were listed in Appendix 1—table 8. Overall, we observed increased genetic susceptibility to ADHD (p=0.040) and delayed brain development (p=1.4810–6) among participants with brain aging pattern 2 after FDR correction, while no statistically significant differences were observed for ASD, AD, PD, BIP, MDD, and SCZ (Figure 5). Details regarding the genetic liability to other common diseases and phenotypes using enhanced PRS from UK Biobank are displayed in Appendix 1tables 9 and 10.
Genome Wide Association Studies (GWAS) identified significant genetic loci associated with accelerated brain aging
Having observed significant associations between brain aging patterns and cognitive performances / genetic liabilities to major neurodevelopmental disorders, we further investigated if there exist genetic variants contributing to individualized brain aging phenotype. We conducted genome-wide association studies (GWAS) using estimated total GMV at 60 years old as the phenotype. This phenotype was derived by adding individual specific deviations to the population averaged total GMV, thus providing additional information compared to studies using only cross-sectional neuroimaging phenotypes.
Six independent single nucleotide polymorphisms (SNPs) were identified at genome-wide significance level (p<5×10–8) (Figure 6) and were subsequently mapped to genes using NCBI, Ensembl and UCSC Genome Browser database (Appendix 1—table 11). Among them, two SNPs (rs10835187 and rs779233904) were also found to be associated with multiple brain imaging phenotypes in previous studies (Smith et al., 2021), such as regional and tissue volume, cortical area and white matter tract measurements. Compared to the GWAS using global gray matter volume as the phenotype, our GWAS revealed additional signal in chromosome 7 (rs7776725), which was mapped to the intron of FAM3C and encodes a secreted protein involved in pancreatic cancer (Grønborg et al., 2006) and Alzheimer’s disease (Liu et al., 2016). This signal was further validated to be associated with specific brain aging mode by another study using a data-driven decomposition approach (Smith et al., 2020). In addition, another significant loci (rs10835187, p=1.11×10–13) is an intergenic variant between gene LGR4-AS1 and LIN7C, and was reported to be associated with bone density and brain volume measurement (Smith et al., 2021; Estrada et al., 2012). LIN7C encodes the Lin-7C protein, which is involved in the localization and stabilization of ion channels in polarized cells, such as neurons and epithelial cells (Bohl et al., 2007; Kaech et al., 1998). Previous study has revealed the association of both allelic and haplotypic variations in the LIN7C gene with ADHD (Lanktree et al., 2008).
Mirroring patterns between brain aging and brain development
Having observed significant associations between brain aging and genetic susceptibility to neurodevelopmental disorders, we are now interested in examining the mirroring patterns between brain aging and brain development in the whole population, and whether these mirroring patterns were more pronounced in those with accelerated brain aging. Adolescents in the IMAGEN cohort showed more rapid GMV decrease in the frontal and parietal lobes, especially the frontal pole, superior frontal gyrus, rostral middle frontal gyrus, inferior parietal lobule and superior parietal lobule, while those in their mid-to-late adulthood showed more accelerated GMV decrease in the temporal lobe, including medial orbitofrontal cortex, inferior parietal lobule and lateral occipital sulcus (Figure 7a). The mirroring patterns (with slower GMV decrease during brain development and more rapid GMV decrease during brain aging) were particularly prominent in inferior temporal gyrus, caudal anterior cingulate cortex, fusiform cortex, middle temporal gyrus and rostral anterior cingulate cortex (Figure 7b). The regional mirroring patterns became weaker when we focus on late brain aging at age 75 years old, especially in the frontal lobe and cingulate cortex. Further, mirroring patterns were represented more prominently in participants with brain aging pattern 2, where stronger mirroring between brain aging and brain development was observed in frontotemporal area, including lateral occipital sulcus and lingual gyrus (Figure 7c).
Gene expression profiles were associated with delayed brain development and accelerated brain aging
The Allen Human Brain Atlas (AHBA) transcriptomic dataset (http://human.brain-map.org) were used to obtain the spatial correlation between gene expression profiles across cortex and structural brain development/aging via partial least square (PLS) regression. The first PLS component explained 24.7% and 53.6% of the GMV change during brain development (estimated at age 15y, rspearman = 0.51, Ppermutation = 0.03) and brain aging (estimated at age 55y, rspearman = 0.49, Ppermutation = 1.510–4), respectively. Seventeen of the 45 genes mapped to GWAS significant SNP were found in AHBA, with LGR4 (rspearman = 0.56, Ppermutation <0.001) significantly associated with delayed brain development and ESR1 (rspearman = 0.53, Ppermutation <0.001) and FAM3C (rspearman = –0.37, Ppermutation = 0.004) significantly associated with accelerated brain aging. BDNF-AS was positively associated with both delayed brain development and accelerated brain aging after spatial permutation test (Appendix 1—table 12 and 13).
Next, we screened the genes based on their contributions and effect directions to the first PLS components in brain development and brain aging. 990 and 2293 genes were identified to be positively associated with brain development and negatively associated with brain aging at FDR corrected p value of 0.005, respectively, representing gene expressions associated with delayed brain development and accelerated brain aging. These genes were then tested for enrichment of GO biological processes and KEGG pathways. Genes associated with delayed brain development showed significant enrichment in ‘regulation of trans-synaptic signaling’, ‘forebrain development’, ‘signal release’ and ‘cAMP signaling pathway’ (Figure 8a), and genes associated with accelerated brain aging showed significant enrichment in ‘macroautophagy’, ‘establishment of protein localization to organelle’, ‘histone modification’, and ‘pathways of neurodegeneration – multiple diseases’ (Figure 8b). Full results of the gene set enrichment analysis were provided in Appendix 1—figure 5. In summary, the analyses from using the databases of GO biological processes and KEGG Pathways indicate synaptic transmission as an important process in the common mechanisms of brain development and aging, and cellular processes (autophagy), as well as the progression of neurodegenerative diseases, are important processes in the mechanisms of brain aging.
Discussion
In this study, we adopted a data-driven approach and revealed two distinct brain aging patterns using large-scale longitudinal neuroimaging data in mid-to-late adulthood. Compared to brain aging pattern 1, brain aging pattern 2 were characterized by a faster rate of GMV decrease, accelerated biological aging, cognitive decline, and genetic susceptibility to neurodevelopmental disorders. By integrating longitudinal neuroimaging data from adult and adolescent cohorts, we demonstrated the ‘last in, first out’ mirroring patterns between structural brain aging and brain development, and showed that the mirroring pattern was manifested in the temporal lobe and among participants with accelerated brain aging. Further, genome-wide association studies identified significant genetic loci contributing to accelerated brain aging, while spatial correlation between whole-brain transcriptomic profiles and structural brain aging / development revealed important gene sets associated with both accelerated brain aging and delayed brain development.
Brain aging is closely related to the onset and progression of neurodegenerative and neuropsychiatric disorders. Both neurodegenerative and neuropsychiatric disorders demonstrate strong inter-individual heterogeneity, which prevents the comprehensive understanding of their neuropathology and neurogenetic basis. Therefore, multidimensional investigation into disease subtyping and population clustering of structural brain aging are crucial in elucidating the sources of heterogeneity and neurophysiological basis related to the disease spectrum (Habes et al., 2020). In the last decades, major developments in the subtyping of Alzheimer’s disease, dementia and Parkinson’s disease, have provided new perspectives regarding their clinical diagnosis, treatment, disease progression and prognostics (Habes et al., 2020; Berg et al., 2021; Ferreira et al., 2020). While previous studies of brain aging mostly focused on the cross-sectional differences between cases and healthy controls, we here delineated the structural brain aging patterns among healthy participants using a novel data-driven approach that captured both cross-sectional and longitudinal trajectories of the whole-brain gray matter volume (Feczko et al., 2019; Poulakis et al., 2022). The two brain aging patterns identified using the above approach showed large differences in the rate of change in medial occipitotemporal gyrus, which is involved in vision, word processing, and scene recognition (Bogousslavsky et al., 1987; Epstein et al., 1999; Mechelli et al., 2000). Significant reduction of the gray matter volume and abnormal changes of the functional connectivity in this region were found in subjects with mild cognitive impairment (MCI) and AD, respectively (Chételat et al., 2005; Yao et al., 2010). Previous research on brainAGE (Elliott et al., 2021; Christman et al., 2020) (the difference between chronological age and the age predicted by the machine learning model of brain imaging data) showed that as a biomarker of accelerated brain aging, people with older brainAGE have accelerated biological aging and early signs of cognitive decline, which is consistent with our discoveries in this study. Our results support the establishment of a network connecting brain aging patterns with biological aging profiles involving multi-organ systems throughout the body (Tian et al., 2023). Since structural brain patterns might manifest and diverge decades before cognitive decline (Aljondi et al., 2019), subtyping of brain aging patterns could aid in the early prediction of cognitive decline and severe neurodegenerative and neuropsychiatric disorders.
Mirroring pattern between brain development and brain aging has long been hypothesized by postulating that phylogenetically newer and ontogenetically less precocious brain structures degenerate relatively early (Douaud et al., 2014). Early studies have reported a positive correlation between age-related differences of cortical volumes and precedence of myelination of intracortical fibers (Raz, 2000). Large differences in the patterns of change between adolescent late development and aging in the medial temporal cortex were previously found in studies of brain development and aging patterns (Tamnes et al., 2013). Here, we compared the annual volume change of the whole-brain gray matter during brain development and early / late stages of brain aging, and found that mirroring patterns are predominantly localized to the lateral / medial temporal cortex and the cingulate cortex. These cortical regions characterized by ‘last in, first out’ mirroring patterns showed increased vulnerability to the several neuropsychiatric disorders. For example, regional deficits in the superior temporal gyrus and medial temporal lobe were observed in schizophrenia (Honea et al., 2005), along with morphological abnormalities in the medial occipitotemporal gyrus (Schultz et al., 2010). Children diagnosed with ADHD had lower brain surface area in the frontal, cingulate, and temporal regions (Hoogman et al., 2019). Douaud et al., 2014 revealed a population transmodal network with lifespan trajectories characterized by the mirroring pattern of development and aging. We investigated the genetic susceptibility to individual-level mirroring patterns based on the lasting impact of neurodevelopmental genetic factors on brain (Fjell et al., 2015), demonstrating that those with more rapidly brain aging patterns have a higher risk of delayed development.
Identifying genes contributing to structural brain aging remains a critical step in understanding the molecular changes and biological mechanisms that govern age-related cognitive decline. Several genetic loci have been reported to be associated with brain aging modes and neurocognitive decline, many of which demonstrated global overlap with neuropsychiatric disorders and their related risk factors (Smith et al., 2020; Glahn et al., 2013; Brouwer et al., 2022). Here, we focused on the individual brain aging phenotype by estimating individual deviation from the population averaged total GMV and conducted genome-wide association analysis with this phenotype. Our approach identified six risk SNPs associated with accelerated brain aging, most of which could be further validated by previous studies using population averaged brain aging phenotypes. However, our approach revealed additional genetic signals and demonstrated genetic architecture underlying brain aging patterns overlap with bone density (Estrada et al., 2012; Zheng et al., 2015). In addition, molecular profiling of the aging brain has been thoroughly investigated among patients with neurogenerative diseases, but rarely conducted to shed light on the mirroring patterns among healthy participants. Analysis of the spatial correlation between gene expression profiles and structural brain development / aging further identified genes contributing to delayed brain development and accelerated brain aging. Specifically, expression of gene BDNF-AS was significantly associated with both processes. BDNF-AS is an antisense RNA gene and plays a role in the pathoetiology of non-neoplastic conditions mainly through the mediation of BDNF (Ghafouri-Fard et al., 2021). LGR4 (associated with delayed brain development) and FAM3C (associated with accelerated brain aging) identified in the spatial genetic association analysis also validated our findings in the GWAS.
There are several limitations in the current study that need to be addressed in future research. Firstly, the UK Biobank cohort, which we leveraged to identify population clustering of brain aging patterns, had a limited number of repeated structural MRI scans. Therefore, it remains challenging to obtain robust estimation of the longitudinal whole-brain GMV trajectory at the individual level. As a robustness check, we have calculated both intra-class correlation and variance of both random intercept and age slope to ensure appropriateness of the mixed effect models. Secondly, although aging is driven by numerous hallmarks, we have only investigated the association between brain aging patterns and biological aging in terms of telomere length and blood biochemical markers due to limitations of data access. Other dimensions of aging hallmarks and their relationship with structural brain aging need to be investigated in the future. Thirdly, our genomic analyses were restricted to ‘white British’ participants of European ancestry. The diversity of genomic analyses will continue to improve as the sample sizes of GWAS of non-European ancestry increase. Further, although the gene expression maps from Allen Human Brain Atlas enabled us to gain insights into the spatial coupling between gene expression profiles and mirroring patterns of the brain, the strong inter-individual variation of whole-brain gene expression levels and large temporal span of the human brain samples may lead to the inaccurate correspondence in the observed associations. Finally, we focused on structural MRIs in deriving brain aging patterns in this analysis, future investigations could consider other brain imaging modalities from a multi-dimensional perspective. Nevertheless, our study represents a novel attempt for population clustering of structural brain aging and validated the mirroring pattern hypothesis by leveraging large-scale adolescent and adult cohorts.
Methods
Participants
T1-weighted brain MRI images were obtained from 37,013 individuals aged 44–82 years old from UK Biobank (36,914 participants at baseline visit in 2014+, 4007 participants at the first follow-up visit in 2019+). All participants from UK Biobank provided written informed consent, and ethical approval was granted by the North West Multi-Center Ethics committee (https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/about-us/ethics) with research ethics committee (REC) approval number 16/NW/0274. Participants were excluded if they were diagnosed with severe psychiatric disorders or neurological diseases using ICD-10 primary and secondary diagnostic codes or from self-reported medical conditions at UK Biobank assessment center (see Appendix 1—tables 1 and 2). Data were obtained under application number 19542. A total of 1529 adolescents with structural MRI images were drawn from the longitudinal project IMAGEN (1463 at age 14, 1377 at age 19, and 1148 at age 23), of which the average number of MRI scans was 2.61 per adolescent. The lMAGEN study was approved by local ethics research committees of King’s College London, University of Nottingham. Trinity College Dublin, University of Heidelberg, Technische Universität Dresden, Commissariat à l'Énergie Atomique et aux Énergies Alternatives, and University Medical Center at the University of Hamburg in compliance with the Declaration of Helsinki (Association, 2013). Informed consent was given by all participants and a parent/guardian of each participant.
MRI acquisition
Quality-controlled T1-weighted neuroimaging data from UK Biobank and IMAGEN were processed using FreeSurfer v6.0. Detailed imaging processing pipeline can be found online for UK Biobank (https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf) and IMAGEN (https://github.com/imagen2/imagen_mri; Schumann et al., 2010; Imagen, 2020). Briefly, cortical gray matter volume (GMV) from 33 regions in each hemisphere were generated using Desikan–Killiany Atlas (Desikan et al., 2006), and total gray matter volume (TGMV), intracranial volume (ICV) and subcortical volume were derived from ASEG atlas (Fischl et al., 2002 See Appendix 1—table 3). Regional volume was averaged across left and right hemispheres. To avoid deficient segmentation or parcellation, participants with TGMV, ICV or regional GMV beyond 4 standard deviations from the sample mean were considered as outliers and removed from the following analyses.
Identification of longitudinal brain aging patterns
Whole-brain GMV trajectory was estimated for each participant in 40 brain regions of interest (ROIs) (33 cortical regions and 7 subcortical regions), using mixed effect regression model with fixed linear and quadratic age effects, random intercept and random age slope. Covariates include sex, assessment center, handedness, ethnic, and ICV. Models with random intercept and with both random intercept and random age slope were compared using AIC, BIC and evaluation of intra-class correlation (ICC). Results suggested that random age slope model should be chosen for almost all ROIs (Appendix 1—table 14). Deviation of regional GMV from the population average was calculated for each participant at age 60 years and dimensionality reduction was conducted via principal component analysis (PCA). The first 15 principal components explaining approximately 70% of the total variations of regional GMV deviation were used in multivariate k-means clustering. Optimal number of clusters was chosen using both elbow diagram and contour coefficient (Appendix 1—figure 6). Rates of volumetric change for total gray matter and each ROI were estimated using generalized additive mixed effect models (GAMM) with fixed cubic splines of age, random intercept and random age slope, which incorporates both cross-sectional between-subject variation and longitudinal within-subject variation from 40,921 observations and 37,013 participants. Covariates include sex, assessment center, handedness, ethnic, and ICV. We also applied PCA and locally linear embedding (LLE; Roweis and Saul, 2000) to the adjusted GMV ROIs in order to map the high-dimensional imaging-derived phenotypes to a low-dimensional space for stratification visualisation. The GMV of 40 ROIs at baseline were linearly adjusted for sex, assessment center, handedness, ethnic, ICV, and second-degree polynomial in age to be consistent with the whole-brain GMV trajectory model.
Association between brain aging patterns and biological aging, cognitive decline and genetic susceptibilities of neuropsychiatric disorders
Individuals with Z-standardized leucocyte telomere length (Codd et al., 2021) and blood biochemistry (which were used to calculate PhenoAge (Levine et al., 2018) that characterizes biological aging) outside 4 standard deviations from the sample mean were excluded for better quality control. A total of 11 cognitive tests performed on the touchscreen questionnaire were included in the analysis. More information about the cognitive tests is provided in Supplementary Information. Comparisons of biological aging (leucocyte telomere length, PhenoAge) and cognitive function were conducted among participants with different brain aging patterns using both unadjusted and adjusted multivariate regression models with Bonferroni / FDR correction. Polygenic Risk Scores (PRS) were calculated for autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), Alzheimer’s disease (AD), Parkinson’s Disease (PD), bipolar disorder (BIP), major depressive disorder (MDD), schizophrenia (SCZ), and delayed brain development using GWAS summary statistics (Shi et al., 2023) at multiple p value thresholds (from 0.005 to 0.5 at intervals of 0.005, and 1), with higher p value thresholds incorporating larger number of independent SNPs. After quality control of genotype and imaging data, PRSs were generated for 25,861 participants on UK Biobank genotyping data. SNPs were pruned and clumped with a cutoff r2 ≥ 0.1 within a 250 kb window. All calculations were conducted using PRSice v2.3.5 (Choi and O’Reilly, 2019). Enhanced PRS from UK Biobank Genomics for multiple diseases were also tested. Detailed instructions for calculating enhanced PRS in UK Biobank can be found in research of Thompson et al., 2022 Comparisons of neuropsychiatric disorders were conducted among participants with different brain aging patterns using t test with FDR correction. All statistical tests were two-sided.
Genome wide association Study to identify SNPs associated with brain aging patterns
We performed Genome-wide association studies (GWAS) on individual deviations of total GMV relative to the population average at 60 years using PLINK 2.0 (Chang et al., 2015). Variants with missing call rates exceeding 5%, minor allele frequency below 0.5% and imputation INFO score less than 0.8 were filtered out after the genotyping quality control for UK Biobank Imputation V3 dataset. Among the 337,138 unrelated ‘white British’ participants of European ancestry included in our study, 25,861 with recent UK ancestry and accepted genotyping and imaging quality control were included in the GWAS. The analyses were further adjusted for age, age2, sex, assessment center, handedness, ethnic, ICV, and the first 10 genetic principal components. Genome-wide significant SNPs (p<510–8) obtained from the GWAS were clumped by linkage disequilibrium (LD) (r2 <0.1 within a 250 kb window) using UKB release2b White British as the reference panel. We subsequently performed gene-based annotation in FUMA (Watanabe et al., 2017) using genome-wide significant SNPs and SNPs in close LD (r2 ≥ 0.1) using Annotate Variation (ANNOVAR) on Ensemble v102 genes (Wang et al., 2010).
Mirroring patterns between brain aging and brain development
To validate the ‘last in, first out’ mirroring hypothesis, we evaluated the structural association between brain development and brain aging. Longitudinal neuroimaging data from 1529 adolescents in the IMGAEN cohort and 3908 mid-to-late adulthood in the UK Biobank cohort were analyzed. Annual percentage volume change (APC) for each ROI was calculated among individuals with at least two structural MRI scans by subtracting the baseline GMV from follow-up GMV and dividing by the number of years between baseline and follow-up visits. Region-specific APC was regressed on age using smoothing spline with cross validated degree of freedom. Estimated APC for each ROI was obtained at age 15y for adolescents and at age 55y (early aging) and 75y (late aging) for participants in UK Biobank. Region-specific APC during adolescence (or mid-to-late adulthood) was then standardized across all cortical regions to create the brain development (or aging) map. Finally, the brain development map and brain aging map were compared to assess the mirroring pattern for each ROI in the overall population and across different aging subgroups.
Gene expression analysis
The Allen Human Brain Atlas (AHBA) dataset (http://human.brain-map.org), which comprises gene expression measurements in six postmortem adults (age 24–57y) across 83 parcellated brain regions (Hawrylycz et al., 2012; Markello et al., 2021), were used to identify gene expressions significantly associated with structural brain development and aging. The expression profiles of 15,633 genes were averaged across donors to form a 8315,633 transcriptional matrix and partial least squares (PLS) regression was adopted for analyzing the association between regional change rate of gray matter volume and gene expression profiles. Specifically, estimated regional APC at 15 (obtained from IMAGEN cohort) and 55 years old (obtained from UK Biobank) were regressed on the high-dimensional gene expression profiles upon regularization. Associations between the first PLS component and estimated APC during brain development and brain aging were tested by spatial permutation analysis (10,000 times; Váša et al., 2018). Additionally, gene expression profiles of genes mapped to GWAS significant SNP were extracted from AHBA. The association between gene expression profiles of mapped genes and estimated APC during brain development and aging was also tested by spatial permutation analysis. Statistical significance of each gene’s contribution to the first PLS component was tested with standard error calculated using bootstrap (Li et al., 2021; Morgan et al., 2019; Romero-Garcia et al., 2020), and genes significantly associated with delayed brain development and accelerated brain aging were selected. Enrichment of Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and gene ontology (GO) of biological processes for these selected genes were analyzed using R package clusterProfiler (Yu et al., 2012). All statistical significances were corrected for multiple testing using FDR.
Code availability
R version 4.2.0 was used to perform statistical analyses. FreeSurfer version 6.0 was used to process neuroimaging data. lme4 1.1 in R version 4.2.0 was used to perform longitudinal data analyses. PRSice version 2.3.5 (https://choishingwan.github.io/PRSice/; Choi and O’Reilly, 2019) was used to calculate the PRS. PLINK 2.0 (https://www.cog-genomics.org/plink/2.0/) and FUMA version 1.5.6 (https://fuma.ctglab.nl/) were used to perform genome-wide association analysis, and ANNOVAR was used to perform gene-based annotation. AHBA microarray expression data were processed using abagen toolbox version 0.1.3 (https://doi.org/10.5281/zenodo.5129257). The rotate_parcellation code used to perform a spatial permutation test of a parcellated cortical map: https://github.com/frantisekvasa/rotate_parcellation (Váša, 2023; Váša et al., 2018). Code for PLS analysis and bootstrapping to estimate PLS weights are available at https://github.com/KirstieJane/NSPN_WhitakerVertes_PNAS2016/tree/master/SCRIPTS (Whitaker, 2016; Whitaker et al., 2016). clusterProfiler 4.6 in R version 4.2.0 was used to analyze gene-set enrichment.
Appendix 1
Population clustering of structural brain aging and its association with brain development.
Supplementary method
Cognitive assessment
Reaction time
This cognitive function test is based on 12 rounds of the card-game 'Snap'. The participant is shown two cards at a time; if both cards are the same, they press a button-box that is on the table in front of them as quickly as possible. The score used for analysis is mean time to correctly identify matches (UK Biobank data field 20023), which is the mean duration to first press of snap-button summed over rounds in which both cards matched.
Numeric memory
The participant was shown a 2-digit number to remember. The number then disappeared and after a short while they were asked to enter the number onto the screen. The number became one digit longer each time they remembered correctly (up to a maximum of 12 digits). This test is available for a subset of participants. The score used for analysis is maximum digits remembered correctly (UK Biobank data field 4282), which is longest number correctly recalled during the numeric memory test.
Fluid intelligence / reasoning
'Fluid intelligence' is defined as the capacity to solve problems that require logic and reasoning ability, independent of acquired knowledge. The participant has 2 min to complete as many questions as possible from the test. This test was incorporated into the touchscreen towards the end of recruitment. The score used for analysis is fluid intelligence score (UK Biobank data field 20016), which is a simple unweighted sum of the number of correct answers given to the 13 fluid intelligence questions. Participants who did not answer all of the questions within the allotted 2 min limit are scored as zero for each of the unattempted questions.
Trail making
The participant was presented with sets of digits/letters in circles scattered around the screen and asked to click on them sequentially according to a specific algorithm. The scores used for analysis are duration to complete numeric path (trail #1) (UK Biobank data field 6348) and duration to complete alphanumeric path (trail #2) (UK Biobank data field 6350).
Matrix pattern completion
The participant was presented with a series of matrix pattern blocks with an element missing and asked to select the element that best completed the pattern from a range of displayed choices. The score used for analysis is number of puzzles correctly solved (UK Biobank data field 6373).
Symbol digit substitution
The participant was presented with one grid linking symbols to single-digit integers and a second grid containing only the symbols. They were then asked to indicate the numbers attached to each of the symbols in the second grid using the first one as a key. The score used for analysis is number of symbol digit matches made correctly (UK Biobank data field 23324).
Tower rearranging
The participant was presented with an illustration of three pegs (towers) on which three differently-colored hoops had been placed. The were then asked to indicate how many moves it would take to re-arrange the hoops into another specific position. The score used for analysis is number of puzzles correct (UK Biobank data field 21004).
Paired associate learning
In the paired associate learning test the participants were shown 12 pairs of words (for 30 s in total) then, after an interval (in which they did a different test), presented with the first word of 10 of these pairs and asked to select the matching second word from a choice of four alternatives. The words were presented in the order: huge, happy, tattered, old, long, red, sulking, pretty, tiny and new. The score used for analysis is number of word pairs correctly associated (UK Biobank data field 21097), which is the number of word pairs correctly associated out of 10 attempts.
Prospective memory
Early in the touchscreen cognitive section, the participant is shown the message "At the end of the games we will show you four colored shapes and ask you to touch the Blue Square. However, to test your memory, we want you to actually touch the Orange Circle instead." The score used for analysis is prospective memory result (UK Biobank data field 20018), which condenses the results of the prospective memory test into 3 groups (“0”: instruction not recalled, either skipped or incorrect, “1”: correct recall on first attempt, “2”: correct recall on second attempt). We divided the test results into two groups for simplicity of analysis: “0” indicating no correct recall, and the combination of “1” and “2” indicating correct recall.
Pairs matching
Participants are asked to memorize the position of as many matching pairs of cards as possible. The cards are then turned face down on the screen and the participant is asked to touch as many pairs as possible in the fewest tries. Multiple rounds were conducted. The first round used 3 pairs of cards and the second 6 pairs of cards. In the pilot phase an additional (i.e. third) round was conducted using six pairs of cards. However this was dropped from the main study as the extra set of results were very similar to the second and not felt to add significant new information. The score used for analysis is number of incorrect matches in round 2 (UK Biobank data field 399.2).
Data availability
The summary statistics of GWAS for individual deviations of total GMV is available at https://doi.org/10.5061/dryad.jh9w0vtmn. All the UK Biobank data used in the study are available at https://www.ukbiobank.ac.uk. The IMAGEN project data are available at https://imagen-project.org (Schumann et al., 2010). GWAS summary statistics used to calculate the PRS are available in the Appendix 1—table 8. Human gene expression data are available in the Allen Human Brain Atlas dataset: https://human.brainmap.org (Hawrylycz et al., 2012). Source data files have been provided for Figures 2—5, 7 and 8, which contain the numerical data used to generate the figures. Summary statistics of the GWAS for delayed brain development by Shi et al are available at https://delayedneurodevelopment.page.link/amTC.
-
DryadThe summary statistics of GWAS for individual deviations of total GMV.https://doi.org/10.5061/dryad.jh9w0vtmn
-
GWAS CatalogID GCST009325. Parkinson’s disease or first degree relation to individual with Parkinson’s disease.
-
Vrije Universiteit Research DriveID l7aiRr1UEgdoJfZ. Summary statistics for Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer's disease risk.
References
-
Imaging structural co-variance between human brain regionsNature Reviews. Neuroscience 14:322–336.https://doi.org/10.1038/nrn3465
-
A decade of changes in brain volume and cognitionBrain Imaging and Behavior 13:554–563.https://doi.org/10.1007/s11682-018-9887-z
-
Prodromal Parkinson disease subtypes - key to understanding heterogeneityNature Reviews. Neurology 17:349–361.https://doi.org/10.1038/s41582-021-00486-9
-
Lingual and fusiform gyri in visual processing: a clinico-pathologic study of superior altitudinal hemianopiaJournal of Neurology, Neurosurgery & Psychiatry 50:607–614.https://doi.org/10.1136/jnnp.50.5.607
-
The stardust family protein MPP7 forms a tripartite complex with LIN7 and DLG1 that regulates the stability and localization of DLG1 to cell junctionsThe Journal of Biological Chemistry 282:9392–9400.https://doi.org/10.1074/jbc.M610002200
-
The heterogeneity problem: approaches to identify psychiatric subtypesTrends in Cognitive Sciences 23:584–601.https://doi.org/10.1016/j.tics.2019.03.009
-
Structural brain changes in aging: courses, causes and cognitive consequencesReviews in the Neurosciences 21:187–221.https://doi.org/10.1515/revneuro.2010.21.3.187
-
A concise review on the role of BDNF-AS in human disordersBiomedicine & Pharmacotherapy 142:112051.https://doi.org/10.1016/j.biopha.2021.112051
-
Biomarker discovery from pancreatic cancer secretome using a differential proteomic approachMolecular & Cellular Proteomics 5:157–171.https://doi.org/10.1074/mcp.M500178-MCP200
-
Regional deficits in brain volume in schizophrenia: a meta-analysis of voxel-based morphometry studiesThe American Journal of Psychiatry 162:2233–2245.https://doi.org/10.1176/appi.ajp.162.12.2233
-
Brain imaging of the cortex in adhd: a coordinated analysis of large-scale clinical and population-based samplesThe American Journal of Psychiatry 176:531–542.https://doi.org/10.1176/appi.ajp.2019.18091033
-
Association study of brain-derived neurotrophic factor (BDNF) and LIN-7 homolog (LIN-7) genes with adult attention-deficit/hyperactivity disorderAmerican Journal of Medical Genetics. Part B, Neuropsychiatric Genetics 147B:945–951.https://doi.org/10.1002/ajmg.b.30723
-
Oxidative stress in brain aging, neurodegenerative and vascular diseases: An overviewJournal of Chromatography B 827:65–75.https://doi.org/10.1016/j.jchromb.2005.04.023
-
Age-related changes in the thickness of cortical zones in humansBrain Topography 24:279–291.https://doi.org/10.1007/s10548-011-0198-6
-
Differential effects of word length and visual contrast in the fusiform and lingual gyri duringProceedings of the Royal Society of London. Series B 267:1909–1913.https://doi.org/10.1098/rspb.2000.1229
-
The adaptive brain: aging and neurocognitive scaffoldingAnnual Review of Psychology 60:173–196.https://doi.org/10.1146/annurev.psych.59.103006.093656
-
BookAging of the brain and its impact on cognitive performance: integration of structural and functional findingsUniversity of Memphis.
-
Differential aging of the brain: patterns, cognitive correlates and modifiersNeuroscience & Biobehavioral Reviews 30:730–748.https://doi.org/10.1016/j.neubiorev.2006.07.001
-
Increased parahippocampal and lingual gyrification in first-episode schizophreniaSchizophrenia Research 123:137–144.https://doi.org/10.1016/j.schres.2010.08.033
-
The IMAGEN study: reinforcement-related behaviour in normal brain function and psychopathologyMolecular Psychiatry 15:1128–1139.https://doi.org/10.1038/mp.2010.4
-
Neurodevelopmental trajectories of the human cerebral cortexThe Journal of Neuroscience 28:3586–3594.https://doi.org/10.1523/JNEUROSCI.5309-07.2008
-
Functional mapping and annotation of genetic associations with FUMANature Communications 8:1826.https://doi.org/10.1038/s41467-017-01261-5
-
Abnormal cortical networks in mild cognitive impairment and Alzheimer’s diseasePLOS Computational Biology 6:e1001006.https://doi.org/10.1371/journal.pcbi.1001006
Article and author information
Author details
Funding
National Natural Science Foundation of China (No.82304241)
- Xiaolei Lin
National Key Research and Development Program of China (No.2019YFA0709502)
- Jianfeng Feng
National Key Research and Development Program of China (No.2018YFC1312904)
- Jianfeng Feng
Shanghai Municipal Science and Technology Major Project (No.2018SHZDZX01)
- Jianfeng Feng
ZJ Lab
- Jianfeng Feng
Shanghai Center for Brain Science and Brain-Inspired Technology
- Jianfeng Feng
111 project (No.B18015)
- Jianfeng Feng
European Union FP6 Integrated Project IMAGEN (LSHM-CT- 2007-037286)
- Gunter Schumann
Horizon 2020 ERC Advanced Grant 'STRATIFY' (695313)
- Gunter Schumann
Human Brain Project (HBP SGA 2,785907)
- Gunter Schumann
Human Brain Project (HBP SGA 3,945539)
- Gunter Schumann
Medical Research Council Grant 'c-VEDA' (MR/N000390/1)
- Gunter Schumann
National Institutes of Health (R01DA049238)
- Gunter Schumann
National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King's College London
- Gunter Schumann
Deutsche Forschungsgemeinschaft (SM 80/7-2)
- Gunter Schumann
Medical Research Foundation and Medical Research Council (MR/R00465X/1)
- Sylvane Desrivières
National Institutes of Health (ENIGMA 5U54EB020403-05)
- Sylvane Desrivières
National Institutes of Health (U54 EB020403)
- Sylvane Desrivières
ANR (ANR-12-SAMA-0004 AAPG2019 - GeBra)
- Jean-Luc Martinot
Eranet Neuron (AF12-NEUR0008-01 - WM2NA)
- Jean-Luc Martinot
Fondation de France (00081242)
- Jean-Luc Martinot
Fondation pour la Recherche Médicale (DPA20140629802)
- Jean-Luc Martinot
Mission Interministérielle de Lutte-contre-les-Drogues-et-les-Conduites-Addictives (MILDECA)
- Jean-Luc Martinot
Assistance-Publique-Hôpitaux-de-Paris and INSERM (interface grant)
- Marie-Laure Paillère Martinot
Paris Sud University IDEX 2012
- Jean-Luc Martinot
Fondation de l'Avenir (AP-RM-17-013)
- Marie-Laure Paillère Martinot
Fédération pour la Recherche sur le Cerveau
- Gunter Schumann
National Institutes of Health, Science Foundation Ireland (16/ERCD/3797)
- Robert Whelan
NSFC (82150710554)
- Gunter Schumann
environMENTAL (grant no: 101057429)
- Gunter Schumann
Bundesministeriumfür Bildung und Forschung (01GS08152)
- Gunter Schumann
Bundesministeriumfür Bildung und Forschung (01EV0711)
- Gunter Schumann
Forschungsnetz AERIAL (01EE1406A)
- Gunter Schumann
Forschungsnetz AERIAL (01EE1406B)
- Gunter Schumann
Forschungsnetz IMAC-Mind (01GL1745B)
- Gunter Schumann
Deutsche Forschungsgemeinschaft (SFB 940)
- Gunter Schumann
Deutsche Forschungsgemeinschaft (TRR 265)
- Gunter Schumann
Deutsche Forschungsgemeinschaft (NE 1383/14-1)
- Gunter Schumann
Medical Research Foundation and Medical Research Council (MR/S020306/1)
- Sylvane Desrivières
National Institutes of Health (ENIGMA 1R56AG058854-01)
- Sylvane Desrivières
Eranet (ANR-18-NEUR00002-01 - ADORe)
- Jean-Luc Martinot
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
This research used the UK Biobank Resource under application number 19542. We thank all participants and researchers from the UK Biobank. We thank the IMAGEN Consortium for providing the discover data. This work received support from the following sources: the National Nature Science Foundation of China (No.82304241 [to XL]), National Key R&D Program of China (No.2019YFA0709502 [to JF], No.2018YFC1312904 [to JF]), Shanghai Municipal Science and Technology Major Project (No.2018SHZDZX01 [to JF], ZJ Lab [to JF], and Shanghai Center for Brain Science and Brain-Inspired Technology [to JF]), the 111 Project (No.B18015 [to JF]), the European Union-funded FP6 Integrated Project IMAGEN (Reinforcement-related behaviour in normal brain function and psychopathology) (LSHM-CT- 2007–037286 [to GS]), the Horizon 2020 funded ERC Advanced Grant ‘STRATIFY’ (Brain network based stratification of reinforcement-related disorders) (695313 [to GS]), Human Brain Project (HBP SGA 2, 785907, and HBP SGA 3, 945539 [to GS]), the Medical Research Council Grant 'c-VEDA’ (Consortium on Vulnerability to Externalizing Disorders and Addictions) (MR/N000390/1 [to GS]), the National Institute of Health (NIH) (R01DA049238 [to GS], A decentralized macro and micro gene-by-environment interaction analysis of substance use behavior and its brain biomarkers), the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London, the Bundesministeriumfür Bildung und Forschung (BMBF grants 01GS08152; 01EV0711 [to GS]; Forschungsnetz AERIAL 01EE1406A, 01EE1406B; Forschungsnetz IMAC-Mind 01GL1745B [to GS]), the Deutsche Forschungsgemeinschaft (DFG grants SM 80/7–2, SFB 940, TRR 265, NE 1383/14–1 [to GS]), the Medical Research Foundation and Medical Research Council (grants MR/R00465X/1 and MR/S020306/1 [to SD]), the National Institutes of Health (NIH) funded ENIGMA (grants 5U54EB020403-05 and 1R56AG058854-01 [to SD]), NSFC grant 82150710554 [to GS] and European Union funded project ‘environMENTAL’, grant no: 101057429 [to GS]. Further support was provided by grants from: - the ANR (ANR-12-SAMA-0004, AAPG2019 - GeBra [to JLM]), the Eranet Neuron (AF12-NEUR0008-01 - WM2NA; and ANR-18-NEUR00002-01 - ADORe [to JLM]), the Fondation de France (00081242 [to J.-L.M.]), the Fondation pour la Recherche Médicale (DPA20140629802 [to JLM]), the Mission Interministérielle de Lutte-contre-les-Drogues-et-les-Conduites-Addictives (MILDECA [to JLM]), the Assistance-Publique-Hôpitaux-de-Paris and INSERM (interface grant [to MLPM]), Paris Sud University IDEX 2012 [to J.-LM], the Fondation de l’Avenir (grant AP-RM-17–013 [to MLPM]), the Fédération pour la Recherche sur le Cerveau [to GS]; the National Institutes of Health, Science Foundation Ireland (16/ERCD/3797 [to RW]) and by NIH Consortium grant U54 EB020403 [to SD], supported by a cross-NIH alliance that funds Big Data to Knowledge Centres of Excellence. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Ethics
All participants from UK Biobank provided written informed consent, and ethical approval was granted by the North West Multi-Center Ethics committee (https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/about-us/ethics) with research ethics committee (REC) approval number 16/NW/0274. The lMAGEN study was approved by local ethics research committees of King's College London, University of Nottingham. Trinity College Dublin, University of Heidelberg, Technische Universität Dresden, Commissariat à l'Énergie Atomique et aux Énergies Alternatives, and University Medical Center at the University of Hamburg in compliance with the Declaration of Helsinki (https://doi.org/10.1001/jama.2013.281053). Informed consent was given by all participants and a parent/guardian of each participant.
Version history
- Sent for peer review:
- Preprint posted:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.94970. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2024, Duan et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 749
- views
-
- 61
- downloads
-
- 0
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Computational and Systems Biology
- Genetics and Genomics
Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.
-
- Developmental Biology
- Genetics and Genomics
O-GlcNAcylation is an essential intracellular protein modification mediated by O-GlcNAc transferase (OGT) and O-GlcNAcase (OGA). Recently, missense mutations in OGT have been linked to intellectual disability, indicating that this modification is important for the development and functioning of the nervous system. However, the processes that are most sensitive to perturbations in O-GlcNAcylation remain to be identified. Here, we uncover quantifiable phenotypes in the fruit fly Drosophila melanogaster carrying a patient-derived OGT mutation in the catalytic domain. Hypo-O-GlcNAcylation leads to defects in synaptogenesis and reduced sleep stability. Both these phenotypes can be partially rescued by genetically or chemically targeting OGA, suggesting that a balance of OGT/OGA activity is required for normal neuronal development and function.