Biomarkers: Getting closer to the clinic
Note added [December 5, 2023]
This Insight article has been updated to reflect a correction made to the associated Research Article: in the sixth paragraph, the number of connections between EpiScores and 11 diseases or death has been changed from 137 to 130.
Lengthening life expectancies and decreased mortality rates have led to an unprecedented expansion of the older population. At the same time, chronic diseases – which often affect older individuals – have become more prevalent. Healthcare systems around the world are falling short of this challenge, in part because they remain focused on preventing and curing one disease at a time, even though 80% of clinical patients over 60 have multiple diseases at once.
Even when one specific disease causes most of a person’s symptoms, older patients often have co-existing conditions that affect the course, treatment and prognosis of the main disease. Pressed for time, physicians often ignore underlying illnesses until they begin to seriously affect the patient’s health or start causing frailty. There is no easy solution to this rising crisis, but the emerging field of biomarkers may soon come to the aid of clinicians.
Biomarkers are molecules, genes or characteristics that can be used to detect or predict the onset of a disease. Traditionally, biomarkers have included circulating levels of plasma proteins, lipids and other metabolites. More recently, epigenetic markers – chemical modifications of DNA that affect whether genes are turned on or off, such as addition of methyl groups at specific DNA sites – have shown promise as biomarkers for age-related conditions. Using biomarkers could allow physicians to obtain a molecular map of a patient’s health from a single drop of blood. This would allow clinicians to detect illnesses before they become symptomatic, which is particularly important in the case of serious conditions that could become chronic (Tanaka et al., 2020).
Developing algorithms that extract the relevant information from biomarkers in the blood is perhaps the most promising and potentially powerful line of research in chronic diseases. Until recently, most biomarker studies examined one layer of information (DNA modifications, protein levels or specific metabolites) at a time. However, combining information on DNA methylation with the level of a small number of circulating proteins has been shown to predict the risk of specific chronic diseases as well as global, adverse health outcomes such as having several illnesses at once, and mortality (Lu et al., 2019; Levine et al., 2018; Belsky et al., 2020). Now, in eLife, Riccardo Marioni from the University of Edinburgh and colleagues – including Danni Gadd, Robert Hillary, Daniel McCartney and Shaza Zaghlool as joint first authors – report on how to leverage the associations between DNA methylation and protein levels to predict the onset of disease earlier and more accurately (Gadd et al., 2022).
The team (who are based in the United Kingdom, the United States, Germany, Australia and Qatar) first measured the abundance of 953 proteins in the blood plasma of people in the German KORA cohort (an epidemiological study that ran from 1984 to 2001 in Augsburg and evaluated participants every five years, with an emphasis on major chronic diseases) and the Scottish Lothian Birth Cohort 1936 (the surviving participants of the Scottish Mental Survey 1947 who now live in the Lothian area of Scotland). Gadd et al. then used machine learning to identify clusters of specific DNA methylation sites that could predict the levels of each protein in the plasma. This data was used to assign an epigenetic score or ‘EpiScore’ to each protein. Using this approach, Gadd et al. found that their new algorithm could predict between 1% and 58% of the variation between different people in the plasma levels of 109 proteins.
Next, the team applied the EpiScores of the 109 proteins to data from an independent epidemiological study called Generation Scotland to test whether it was possible to predict the onset of 11 major chronic diseases, as well as death, over a follow-up period of 14 years (Figure 1). This resulted in the identification of 130 connections between EpiScores and 11 diseases or death. Some EpiScores predicted the onset of selected conditions but other were associated with multiple conditions and, perhaps unsurprisingly, the results also suggested a strong correlation between inflammation and age-related chronic disease.

Epigenetic scores of plasma proteins predict onset of major chronic diseases over 14 years.
A machine learning approach was used to find associations, called EpiScores, between DNA methylation (top left) and the abundance of 953 plasma proteins (top right). The results identified 109 proteins with EpiScores that explained between 1% and 58% of the variance in their levels. These scores were then applied to an epidemiological study that contains the medical records of 1,537 individuals over the course of 14 years. Gadd et al. found 130 connections between these EpiScores and 11 age-related conditions (represented by icons), and also between the EpiScores and mortality (represented by the survival graph).
One of the notable observations (that has also been reported in previous studies) is that these analyses on EpiScores confirmed known associations between certain proteins and diseases, even when there is only a moderate correlation between the EpiScore and the protein. This suggests that EpiScores are not a mere proxy for plasma protein levels, but may contain different information about disease risk. In the future, it is likely that biomarkers for disease will encompass multiple molecular layers, such as protein levels together with epigenetic markers or metabolite composition.
The findings of Gadd et al. offer a glimpse into a possible future of medicine. One could imagine a busy physician evaluating a 75-year-old patient complaining of sudden back pain. The physician collects a small blood sample and analyzes it using a fast robotized laboratory connected to a powerful computer that can measure molecular biomarkers and assign a ‘health score’. The computer would then provide information about the patient’s risk for potential diseases that the physician can address before they become symptomatic. The systematic use of this technology could increase awareness and understanding of co-existing, but not yet visible, medical problems.
Of course, before this can happen more research is needed. The predictivity of some EpiScores is modest and only adequate for risk prediction. Even in this context, it would be important to understand whether performing early interventions on patients with high scores is cost effective. As always in prevention, there is a trade-off between the stigmata of tagging an individual as ‘high risk’ and how this information can be used to improve health. A study in which information about proteins and DNA methylation is first compared ‘head to head’ in the same large cohort, and then combined, could reveal whether these two biomarkers provide complementary information and increase specificity. Over time, the data collected systematically using this approach and surveillance studies of electronic medical records could help identify common co-morbidities, allowing clinicians to develop more effective strategies for treating patients with complex combinations of diseases.
References
Article and author information
Author details
Publication history
Copyright
This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.
Metrics
-
- 778
- views
-
- 82
- downloads
-
- 4
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Epidemiology and Global Health
- Microbiology and Infectious Disease
Background:
In many settings, a large fraction of the population has both been vaccinated against and infected by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Hence, quantifying the protection provided by post-infection vaccination has become critical for policy. We aimed to estimate the protective effect against SARS-CoV-2 reinfection of an additional vaccine dose after an initial Omicron variant infection.
Methods:
We report a retrospective, population-based cohort study performed in Shanghai, China, using electronic databases with information on SARS-CoV-2 infections and vaccination history. We compared reinfection incidence by post-infection vaccination status in individuals initially infected during the April–May 2022 Omicron variant surge in Shanghai and who had been vaccinated before that period. Cox models were fit to estimate adjusted hazard ratios (aHRs).
Results:
275,896 individuals were diagnosed with real-time polymerase chain reaction-confirmed SARS-CoV-2 infection in April–May 2022; 199,312/275,896 were included in analyses on the effect of a post-infection vaccine dose. Post-infection vaccination provided protection against reinfection (aHR 0.82; 95% confidence interval 0.79–0.85). For patients who had received one, two, or three vaccine doses before their first infection, hazard ratios for the post-infection vaccination effect were 0.84 (0.76–0.93), 0.87 (0.83–0.90), and 0.96 (0.74–1.23), respectively. Post-infection vaccination within 30 and 90 days before the second Omicron wave provided different degrees of protection (in aHR): 0.51 (0.44–0.58) and 0.67 (0.61–0.74), respectively. Moreover, for all vaccine types, but to different extents, a post-infection dose given to individuals who were fully vaccinated before first infection was protective.
Conclusions:
In previously vaccinated and infected individuals, an additional vaccine dose provided protection against Omicron variant reinfection. These observations will inform future policy decisions on COVID-19 vaccination in China and other countries.
Funding:
This study was funded the Key Discipline Program of Pudong New Area Health System (PWZxk2022-25), the Development and Application of Intelligent Epidemic Surveillance and AI Analysis System (21002411400), the Shanghai Public Health System Construction (GWVI-11.2-XD08), the Shanghai Health Commission Key Disciplines (GWVI-11.1-02), the Shanghai Health Commission Clinical Research Program (20214Y0020), the Shanghai Natural Science Foundation (22ZR1414600), and the Shanghai Young Health Talents Program (2022YQ076).
-
- Epidemiology and Global Health
Background:
The role of circulating metabolites on child development is understudied. We investigated associations between children’s serum metabolome and early childhood development (ECD).
Methods:
Untargeted metabolomics was performed on serum samples of 5004 children aged 6–59 months, a subset of participants from the Brazilian National Survey on Child Nutrition (ENANI-2019). ECD was assessed using the Survey of Well-being of Young Children’s milestones questionnaire. The graded response model was used to estimate developmental age. Developmental quotient (DQ) was calculated as the developmental age divided by chronological age. Partial least square regression selected metabolites with a variable importance projection ≥1. The interaction between significant metabolites and the child’s age was tested.
Results:
Twenty-eight top-ranked metabolites were included in linear regression models adjusted for the child’s nutritional status, diet quality, and infant age. Cresol sulfate (β=–0.07; adjusted-p <0.001), hippuric acid (β=–0.06; adjusted-p <0.001), phenylacetylglutamine (β=–0.06; adjusted-p <0.001), and trimethylamine-N-oxide (β=–0.05; adjusted-p=0.002) showed inverse associations with DQ. We observed opposite directions in the association of DQ for creatinine (for children aged –1 SD: β=–0.05; pP=0.01;+1 SD: β=0.05; p=0.02) and methylhistidine (–1 SD: β = - 0.04; p=0.04;+1 SD: β=0.04; p=0.03).
Conclusions:
Serum biomarkers, including dietary and microbial-derived metabolites involved in the gut-brain axis, may potentially be used to track children at risk for developmental delays.
Funding:
Supported by the Brazilian Ministry of Health and the Brazilian National Research Council.