Cytokine ranking via mutual information algorithm correlates cytokine profiles with presenting disease severity in patients infected with SARS-CoV-2
Abstract
Although the range of immune responses to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is variable, cytokine storm is observed in a subset of symptomatic individuals. To further understand the disease pathogenesis and, consequently, to develop an additional tool for clinicians to evaluate patients for presumptive intervention, we sought to compare plasma cytokine levels between a range of donor and patient samples grouped by a COVID-19 Severity Score (CSS) based on the need for hospitalization and oxygen requirement. Here we utilize a mutual information algorithm that classifies the information gain for CSS prediction provided by cytokine expression levels and clinical variables. Using this methodology, we found that a small number of clinical and cytokine expression variables are predictive of presenting COVID-19 disease severity, raising questions about the mechanism by which COVID-19 creates severe illness. The variables that were the most predictive of CSS included clinical variables such as age and abnormal chest x-ray as well as cytokines such as macrophage colony-stimulating factor, interferon-inducible protein 10, and interleukin-1 receptor antagonist. Our results suggest that SARS-CoV-2 infection causes a plethora of changes in cytokine profiles and that particularly in severely ill patients, these changes are consistent with the presence of macrophage activation syndrome and could furthermore be used as a biomarker to predict disease severity.
Introduction
In December 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the origin of coronavirus disease 2019 (COVID-19), emerged in Wuhan, China (Zhu et al., 2020). Although many COVID-19 patients remain asymptomatic, there exists a subset of patients who present with severe illness. Early treatment with dexamethasone appears to improve outcomes in these patients. However, it is not always initially clear which patients would benefit from this therapy (The RECOVERY Collaborative Group, 2020). Moreover, COVID-19 infection can be accompanied by a severe inflammatory response characterized by the release of pro-inflammatory cytokines, an event known as cytokine storm (CS) (Tang et al., 2020; Ragab et al., 2020). Thus far, this COVID-19-associated CS has predominantly been characterized by the presence of IL-1β, IL-2, IL-17, IL-8, TNF, CCL2, and most notably IL-6 (Tang et al., 2020; Merad and Martin, 2020; McGonagle et al., 2020; Wan et al., 2020; Otsuka and Seino, 2020). Severe cases of CS can be life threatening, and early diagnosis as well as treatment of this condition can lead to improved outcome. We hypothesize that cytokine profiles combined with clinical information can predict disease severity, potentially giving clinicians an additional tool when evaluating patients for preemptive intervention.
Results
Analysis was performed for 36 PCR-confirmed COVID-19 (+) and 36 (−) human plasma samples (Figure 1—source data 1). The COVID-19 Severity Score (CSS) was developed to categorize patients based on their status upon presentation to the emergency department. CSS is graded as follows: 0 = COVID (−), no symptoms, healthy control (n = 24); 1 = COVID (−), symptoms (n = 12); 2 = COVID (+), discharged from emergency room (n = 15); 3 = COVID (+), admitted, but who did not require supplemental oxygen (n = 7); 4 = COVID (+), admitted and required any amount of supplemental oxygen or positive pressure ventilation (n = 8); and 5 = COVID (+), admitted to ICU/step-down (n = 6) (Figure 1). CSS was used as the outcome variable for a mutual information minimum-redundancy maximum-relevance algorithm (Kratzer and Furrer, 2018; Figure 1), with the goal of selecting a subset of variables most predictive of CSS. The algorithm confirmed the predictive value of clinical variables such as age and chest x-ray abnormality and also ranked the information gain provided by each of 15 cytokines tested. Several cytokines were able to add unique predictive value to the mutual information model in addition to what was provided by clinical factors such as age or patient comorbidities. This algorithm also deprioritized factors when their predictive value was redundant with the most predictive variables. Macrophage colony-stimulating factor (M-CSF) was ranked second after age as it was the factor that added the most predictive power to the algorithm with minimal redundancy with age. It ranked ahead of abnormalities on chest x-ray because while both were relevant in predicting COVID severity, part of the predictiveness of chest x-ray abnormality was also explained by age differences (Figure 2). The top four cytokines combined with age were predictive of the most severe CSS (4–5) and had a receiver operating characteristic (Figure 2), with an area under the curve of 0.86. Multiple cytokines, including M-CSF (p<0.01), interferon-inducible protein 10 (IP-10) (p<0.01), interleukin 18 (IL-18) (p<0.01), and interleukin-1 receptor antagonist (IL-1RA) (p<0.01), were more relevant in predicting CSS than more frequently characterized cytokines in the context of COVID-19 such as IL-6 (p<0.01). These cytokines showed a statistically significant difference in their profiles when segregated by CSS (Figure 3), yet the mutual information algorithm prioritized them differently than would be expected based on univariate analyses. This indicates that the mutual information algorithm is prioritizing cytokines whose predictive value for COVID-19 severity cannot be fully explained by other clinical variables such as age or medical comorbidities.
Discussion
We found that a small number of clinical variables when combined with cytokine expression are predictive of presenting COVID-19 disease severity. Cytokines singled out for relevance by the mutual information algorithm shared a connection to macrophage activation syndrome (MAS), raising questions about the mechanism by which SARS-CoV-2 creates severe illness in a subset of patients. First, we examined the significant contribution of IP-10 to CSS. IP-10 is secreted by monocytes, fibroblasts, and endothelial cells in response to interferon gamma (IFN-γ), which is secreted by T cells (mainly, Th1), macrophages, mucosal epithelial cells, and natural killer (NK) cells (Liu et al., 2011). This release of IFN-γ induces several cell types to produce IP-10, which consequently recruits more Th1 cells, contributing to a positive feedback loop. IP-10 is also chemoattractant to CXCR3-postitive cells such as macrophages, dendritic cells, NK cells, and T cells. It has been proposed that macrophages recruited by IP-10, in the presence of persistent IFN-γ production, can lead to MAS (Merad and Martin, 2020; McGonagle et al., 2020; Otsuka and Seino, 2020). MAS is characterized as a state of systemic hyperinflammation often accompanied by CS, which, without intervention, can lead to severe tissue damage and, in extreme cases, death (Otsuka and Seino, 2020).
Moreover, the cytokine most relevant in predicting CSS was M-CSF, which is secreted by eukaryotic cells in response to viral infection and stimulates hematopoietic stem cells to differentiate into macrophages. Currently, there are three separate immune stages that describe the progression of COVID-19. The first stage is characterized by a potent induction of interferons that marks the early activation of the immune system that is important in the viral response, and the second stage is characterized by a delayed interferon response (Merad and Martin, 2020). These stages may prime the body for a third stage comprised of detrimental hyperinflammation characterized by CS and MAS (Merad and Martin, 2020). This excessive macrophage activation could explain the increase in IL1-RA that we observed, a cytokine abundantly produced by macrophages.
Steroids have shown a survival benefit for COVID-19, likely by suppressing such detrimental hyperinflammation (The RECOVERY Collaborative Group, 2020). Our analysis identified a pattern of cytokine alterations on presentation associated with COVID-19 severity. The ability to identify a cytokine pattern less redundant with known clinical factors such as age and chest x-ray could help better identify patients in need of immunomodulatory treatment without the confounders of current models where the measured cytokines correlate as much with age as with severity (Pierce et al., 2020). Further studies should be conducted to clarify the mechanistic role that these cytokines and macrophages play in the various stages of COVID-19 and correlate them with other hematologic parameters that were not collected in this database. The results of these future studies could identify more targeted immunomodulatory strategies beyond steroid administration such as treatment with MEK inhibitors (Zhou et al., 2020), as well as the ideal timing of these interventions to maximize therapeutic efficacy. Future studies could also address the size limitations of this study, which was not powered to explore race- or ethnicity-related differences in COVID-19 severity. Finally, we present the application of this mutual information algorithm as a way to evaluate the dataset as a whole and elucidate the most important cytokines in predicting the presenting severity of COVID-19. COVID-19 severity is influenced by many clinical factors, such as age, and this algorithm is able to identify cytokines that contribute information not present in the tested clinical variables. Identifying the most important variables for severe presentation of COVID-19 within a more complete cytokine profile may help determine global immune mechanisms of disease severity.
Materials and methods
Biobank samples
Request a detailed protocolCOVID-19 (+) and (−) human plasma samples were received from the Lifespan Brown COVID-19 Biobank from Brown University at Rhode Island Hospital (Providence, RI). All biobank samples were collected on patients’ arrival in the Emergency Department at Rhode Island Hospital. All patient samples were deidentified but included the available clinical information as described in Results. It is unknown if any patients were blood relatives. The IRB study protocol ‘Pilot Study Evaluating Cytokine Profiles in COVID-19 Patient Samples’ did not meet the definition of human subjects research by either the Brown University or the Rhode Island Hospital IRBs. All samples were thawed and centrifuged at 14,000 rpm for 10 min following the manufacturer protocol included with the Luminex kit to remove cellular debris immediately before the assay was run.
Donor samples
Request a detailed protocolNormal, healthy, COVID-19 (−) samples were commercially available from Lee BioSolutions (991–58-PS-1, Lee BioSolutions, Maryland Heights, MO). All samples were thawed and centrifuged at 14,000 rpm for 10 min following the manufacturer protocol included with the Luminex kit to remove cellular debris immediately before the assay was run.
Cytokine and chemokine measurements
Request a detailed protocolA MilliPlex MILLIPLEX MAP Human Cytokine/Chemokine/Growth Factor Panel A – Immunology Multiplex Assay (HCYTA-60K-13, Millipore Sigma, Burlington, MA) was run on a Luminex 200 Instrument (LX200-XPON-RUO, Luminex Corporation, Austin, TX) according to the manufacturer’s instructions. Plasma levels of granulocyte colony-stimulating factor (G-CSF), IFN-γ, interleukin one alpha (IL-1α), interleukin-1 receptor antagonist (IL-1RA), IL-2, IL-6, IL-7, IL-12, IP-10, monocyte chemoattractant protein-1 (MCP-1), M-CSF, macrophage inflammatory protein-1 alpha (MIP-1α), and tumor necrosis factor alpha (TNF-α) were measured. Data pre-processing: values below limit of detection were re-coded as half the limit of detection. A single extreme outlier value in IFN-y levels was removed after confirming outlier status via Hampel and Grubbs outlier testing (both p<0.01).
Clinical variables
Request a detailed protocolAvailable deidentified clinical variables were collected from patients and from chart review during their time in the emergency department. Clinical variables were categorized to create combined variables such as the number of chronic conditions or the number of presenting symptoms. The full breakdown of clinical variable categorization can be found in Figure 2—source data 1.
Data analysis
Request a detailed protocolData analysis and visualization were generated using R (R Development Core Team, 2020). The varrank package (Kratzer and Furrer, 2020) was used to apply a minimum-redundancy maximum-relevance mutual information algorithm. The algorithm classifies the amount of information each cytokine and clinical variable can provide about the outcome variable, CSS. Each cytokine variable was discretized into two clusters – either high or low analyte concentration in pg/mL – using k-means clustering to minimize within-variable entropy and, thus, over-fitting. This algorithm partitions each data point into the cluster (high or low analyte concentration) with the nearest mean. Clinical variables and cytokine levels were used to predict CSS. The first variable was selected for local optimum relevance by a greedy algorithm. All subsequent variables were ordered to maximize relevancy and minimize redundancy. The ordering was robust to leave-one-out cross-validation. For each cytokine, one-way ANOVA with Tukey’s honest significant difference test and Šidák correction for multiple comparisons was used to compare plasma cytokine levels among CSS groups.
Data availability
Source data and source code files have been provided.
References
-
CXCL10/IP-10 in infectious diseases pathogenesis and potential therapeutic implicationsCytokine & Growth Factor Reviews 22:121–130.https://doi.org/10.1016/j.cytogfr.2011.06.001
-
Pathological inflammation in patients with COVID-19: a key role for monocytes and macrophagesNature Reviews Immunology 20:355–362.https://doi.org/10.1038/s41577-020-0331-4
-
Macrophage activation syndrome and COVID-19Inflammation and Regeneration 40:19.https://doi.org/10.1186/s41232-020-00131-w
-
Immune responses to SARS-CoV-2 infection in hospitalized pediatric and adult patientsScience Translational Medicine 12:eabd5487.https://doi.org/10.1126/scitranslmed.abd5487
-
SoftwareR: A Language and Environment for Statistical ComputingR Foundation for Statistical Computing, Vienna, Austria.
-
The COVID-19 cytokine storm; What we know so farFrontiers in Immunology 11:e1446.https://doi.org/10.3389/fimmu.2020.01446
-
Cytokine storm in COVID-19: the current evidence and treatment strategiesFrontiers in Immunology 11:e1708.https://doi.org/10.3389/fimmu.2020.01708
-
Dexamethasone in hospitalized patients with Covid-19 — Preliminary ReportThe New England Journal of Medicine 17:NEJMoa2021436.https://doi.org/10.1056/NEJMoa2021436
-
A novel coronavirus from patients with pneumonia in China, 2019New England Journal of Medicine 382:727–733.https://doi.org/10.1056/NEJMoa2001017
Article and author information
Author details
Funding
Brown University
- Wafik S El-Deiry
National Institute of General Medical Sciences (U54GM115677)
- Kelsey E Huntington
- Anna D Louie
- Chun Geun Lee
- Jack A Elias
- Eric A Ross
- Wafik S El-Deiry
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Acknowledgements
The work was supported by a Brown University COVID-19 Seed Grant (to WSE-D). The COVID-19 Biobank through which plasma samples were obtained was supported by Institutional Development Award Number U54GM115677 from the National Institute of General Medical Sciences of the National Institutes of Health, which funds Advance Clinical and Translational Research (Advance-CTR). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. WSE-D is an American Cancer Society Research Professor.
Copyright
© 2021, Huntington et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.
Metrics
-
- 1,761
- views
-
- 223
- downloads
-
- 24
- citations
Views, downloads and citations are aggregated across all versions of this paper published by eLife.
Download links
Downloads (link to download the article as PDF)
Open citations (links to open the citations from this article in various online reference manager services)
Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)
Further reading
-
- Immunology and Inflammation
Natural killer (NK) cells can control metastasis through cytotoxicity and IFN-γ production independently of T cells in experimental metastasis mouse models. The inverse correlation between NK activity and metastasis incidence supports a critical role for NK cells in human metastatic surveillance. However, autologous NK cell therapy has shown limited benefit in treating patients with metastatic solid tumors. Using a spontaneous metastasis mouse model of MHC-I+ breast cancer, we found that transfer of IL-15/IL-12-conditioned syngeneic NK cells after primary tumor resection promoted long-term survival of mice with low metastatic burden and induced a tumor-specific protective T cell response that is essential for the therapeutic effect. Furthermore, NK cell transfer augments activation of conventional dendritic cells (cDCs), Foxp3-CD4+ T cells and stem cell-like CD8+ T cells in metastatic lungs, to which IFN-γ of the transferred NK cells contributes significantly. These results imply direct interactions between transferred NK cells and endogenous cDCs to enhance T cell activation. We conducted an investigator-initiated clinical trial of autologous NK cell therapy in six patients with advanced cancer and observed that the NK cell therapy was safe and showed signs of effectiveness. These findings indicate that autologous NK cell therapy is effective in treating established low burden metastases of MHC-I+ tumor cells by activating the cDC-T cell axis at metastatic sites.
-
- Genetics and Genomics
- Immunology and Inflammation
PIK3R1 encodes three regulatory subunits of class IA phosphoinositide 3-kinase (PI3K), each associating with any of three catalytic subunits, namely p110α, p110β, or p110δ. Constitutional PIK3R1 mutations cause diseases with a genotype-phenotype relationship not yet fully explained: heterozygous loss-of-function mutations cause SHORT syndrome, featuring insulin resistance and short stature attributed to reduced p110α function, while heterozygous activating mutations cause immunodeficiency, attributed to p110δ activation and known as APDS2. Surprisingly, APDS2 patients do not show features of p110α hyperactivation, but do commonly have SHORT syndrome-like features, suggesting p110α hypofunction. We sought to investigate this. In dermal fibroblasts from an APDS2 patient, we found no increased PI3K signalling, with p110δ expression markedly reduced. In preadipocytes, the APDS2 variant was potently dominant negative, associating with Irs1 and Irs2 but failing to heterodimerise with p110α. This attenuation of p110α signalling by a p110δ-activating PIK3R1 variant potentially explains co-incidence of gain-of-function and loss-of-function PIK3R1 phenotypes.