Epigenetic scores for the circulating proteome as tools for disease prediction
Abstract
Protein biomarkers have been identified across many age-related morbidities. However, characterising epigenetic influences could further inform disease predictions. Here, we leverage epigenome-wide data to study links between the DNAm signatures of the circulating proteome and incident diseases. Using data from four cohorts, we trained and tested epigenetic scores (EpiScores) for 953 plasma proteins, identifying 109 scores that explained between 1% and 58% of the variance in protein levels after adjusting for known protein quantitative trait loci (pQTL) genetic effects. By projecting these EpiScores into an independent sample, (Generation Scotland; n=9,537) and relating them to incident morbidities over a follow-up of 14 years, we uncovered 137 EpiScore – disease associations. These associations were largely independent of immune cell proportions, common lifestyle and health factors and biological aging. Notably, we found that our diabetes-associated EpiScores highlighted previous top biomarker associations from proteome-wide assessments of diabetes. These EpiScores for protein levels can therefore be a valuable resource for disease prediction and risk stratification.
Data availability
Datasets generated in this study are made available in Supplementary file 1; this file includes the protein EpiScore weights for the 109 EpiScores we provide for future studies to use. All datasets used to create figures are included in Supplementary file 1 and specific locations for these are noted in figure legends.All code used in the analyses is available with open access at the following Gitlab repository: https://github.com/DanniGadd/EpiScores-for-protein-levels.The source datasets analysed during the current study are not publicly available due to them containing information that could compromise participant consent and confidentiality. Data can be obtained from the data owners. Instructions for Lothian Birth Cohort data access can be found here: https://www.lothianbirthcohort.ed.ac.uk/content/collaboration. Dr Simon Cox must be contacted to obtain a Lothian Birth Cohort 'Data Request Form' by email: simon.cox@ed.ac.uk. Instructions for accessing Generation Scotland data can be found here: https://www.ed.ac.uk/generation-scotland/for-researchers/access; the 'GS Access Request Form' can be downloaded from this site. Completed request forms must be sent to access@generationscotland.org to be approved by the Generation Scotland access committee. Data from the KORA study can be requested from KORA-gen: http://epi.helmholtz-muenchen.de/kora-gen. Requests are submitted online and are subject to approval by the KORA board.
Article and author information
Author details
Funding
Wellcome Trust (108890/Z/15/Z)
- Danni A Gadd
- Robert F Hillary
Dementias Platform UK (MR/L023784/2)
- Liu Shi
Medical Research Council (MC_UU_00007/10)
- Caroline Hayward
Wellcome Trust (104036/Z/14/Z)
- Ian J Deary
- David J Porteous
- Andrew M McIntosh
Wellcome Trust (220857/Z/20/Z)
- Andrew M McIntosh
Wellcome Trust (216767/Z/19/Z)
- Chloe Fawns-Ritchie
- Cliff Nangle
- Archie Campbell
- Robin Flaig
- Ian J Deary
- David J Porteous
- Caroline Hayward
- Andrew M McIntosh
- Riccardo E Marioni
Chief Scientist Office of the Scottish Government Health Directorates (CZD/16/6)
- David J Porteous
Scottish Funding Council (HR03006)
- David J Porteous
Australian Research Council Fellowship (FT200100837)
- Allan F McRae
Australian Research Council (DP160102400,FL180100072)
- Peter M Visscher
Australian National Health and Medical Research Council (1113400,1010374)
- Peter M Visscher
Wellcome Trust (203771/Z/16/Z)
- Anna J Stevenson
Medical Research Council and Biotechnology and Biological Sciences Research Council (MR/K026992/1)
- Ian J Deary
UK's Biotechnology and Biological Sciences Research Council
- Ian J Deary
Royal Society-Wolfson Research Merit Award
- Ian J Deary
Chief Scientist Office of the Scottish Government's Health Directorates
- Ian J Deary
Age UK ((Disconnected Mind project))
- Sarah E Harris
- Ian J Deary
- Simon R Cox
Medical Research Council (G0701120,G1001245,MR/M013111/1,MR/R024065/1)
- Ian J Deary
- Simon R Cox
Biotechnology and Biological Sciences Research Council (BB/F019394/1)
- Ian J Deary
Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (221890/Z/20/Z)
- Simon R Cox
National Institutes of Health (RF1AG073593,P2CHD042849,P30AG066614)
- Elliot M Tucker-Drob
National Institutes of Health (R01AG054628)
- Elliot M Tucker-Drob
- Ian J Deary
- Simon R Cox
Alzheimer's Research UK (ARUK-PG2017B−10)
- Daniel L McCartney
- Riccardo E Marioni
Health Data Research UK (substantive site award)
- Archie Campbell
MRC Human Genetics Unit (MRC core support)
- Caroline Hayward
Qatar Foundation (Biomedical Research Program at Weill Cornell Medicine)
- Shaza B Zaghlool
- Karsten Suhre
Qatar National Research Fund (NPRP11C-0115-180010)
- Shaza B Zaghlool
- Karsten Suhre
German Federal Ministry of Education and Research (Helmholtz Zentrum München)
- Christian Gieger
- Annette Peters
- Melanie Waldenberger
- Johannes Graumann
Munich Center of Health Sciences (LMUinnovativ)
- Christian Gieger
- Annette Peters
- Melanie Waldenberger
- Johannes Graumann
Bavarian State Ministry of Health and Care (DigiMed Bayern)
- Christian Gieger
- Annette Peters
- Melanie Waldenberger
- Johannes Graumann
NIHR Biomedical Research Centre at Oxford Health NHS Foundation Trust
- Liu Shi
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Ethics
Human subjects: All KORA participants have given written informed consent and the study was approved by the Ethics Committee of the Bavarian Medical Association.All components of GS received ethical approval from the NHS Tayside Committee on Medical Research Ethics (REC Reference Number: 05/S1401/89). GS has also been granted Research Tissue Bank status by the East of Scotland Research Ethics Service (REC Reference Number: 20/ES/0021), providing generic ethical approval for a wide range of uses within medical research.Ethical approval for the LBC1921 and LBC1936 studies was obtained from the Multi-Centre Research Ethics Committee for Scotland (MREC/01/0/56) and the Lothian Research Ethics committee (LREC/1998/4/183; LREC/2003/2/29). In both studies, all participants provided written informed consent. These studies were performed in accordance with the Helsinki declaration.
Copyright
© 2022, Gadd et al.
This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 5,893
- views
-
- 768
- downloads
-
- 56
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
- Genetics and Genomics
Alzheimer’s disease (AD) is a complex degenerative disease of the central nervous system, and elucidating its pathogenesis remains challenging. In this study, we used the inverse-variance weighted (IVW) model as the major analysis method to perform hypothesis-free Mendelian randomization (MR) analysis on the data from MRC IEU OpenGWAS (18,097 exposure traits and 16 AD outcome traits), and conducted sensitivity analysis with six models, to assess the robustness of the IVW results, to identify various classes of risk or protective factors for AD, early-onset AD, and late-onset AD. We generated 400,274 data entries in total, among which the major analysis method of the IVW model consists of 73,129 records with 4840 exposure traits, which fall into 10 categories: Disease, Medical laboratory science, Imaging, Anthropometric, Treatment, Molecular trait, Gut microbiota, Past history, Family history, and Lifestyle trait. More importantly, a freely accessed online platform called MRAD (https://gwasmrad.com/mrad/) has been developed using the Shiny package with MR analysis results. Additionally, novel potential AD therapeutic targets (CD33, TBCA, VPS29, GNAI3, PSME1) are identified, among which CD33 was positively associated with the main outcome traits of AD, as well as with both EOAD and LOAD. TBCA and VPS29 were negatively associated with the main outcome traits of AD, as well as with both EOAD and LOAD. GNAI3 and PSME1 were negatively associated with the main outcome traits of AD, as well as with LOAD, but had no significant causal association with EOAD. The findings of our research advance our understanding of the etiology of AD.