Research Article

Neuroscience

Predictive models for secondary epilepsy in patients with acute ischemic stroke within one year

Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, China
Bioengineering College of Chongqing University, China
Department of Neurosurgery, Chongqing University Qianjiang Hospital, China
Department of Neurosurgery, Yubei District Hospital of Traditional Chinese Medicine, China
Department of Neurosurgery, Bishan hospital of Chongqing Medical University, China
Department of Prehospital Emergency, Chongqing University Central Hospital, Chongqing Emergency Medical Center, China
Chongqing Key Laboratory of Emergency Medicine, China
Jinfeng Laboratory, China

Nov 14, 2024

https://doi.org/10.7554/eLife.98759.3

Open access
Copyright information

eLife Assessment

This valuable study reports machine learning models derived from large-scale data to predict the risk of post-stroke epilepsy. The evidence supporting the conclusions is convincing, although there are some validation issues (lack of cross-validation, possible bias in external validation results). The study may be of interest in the field of clinical neurology

https://doi.org/10.7554/eLife.98759.3.sa0

Significance of the findings:

Valuable: Findings that have theoretical or practical implications for a subfield

Landmark
Fundamental
Important
Valuable
Useful

Strength of evidence:

Convincing: Appropriate and validated methodology in line with current state-of-the-art

Exceptional
Compelling
Convincing
Solid
Incomplete
Inadequate

During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife Assessments

Abstract
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

Background:

Post-stroke epilepsy (PSE) is a critical complication that worsens both prognosis and quality of life in patients with ischemic stroke. An interpretable machine learning model was developed to predict PSE using medical records from four hospitals in Chongqing.

Methods:

Medical records, imaging reports, and laboratory test results from 21,459 ischemic stroke patients were collected and analyzed. Univariable and multivariable statistical analyses identified key predictive factors. The dataset was split into a 70% training set and a 30% testing set. To address the class imbalance, the Synthetic Minority Oversampling Technique combined with Edited Nearest Neighbors was employed. Nine widely used machine learning algorithms were evaluated using relevant prediction metrics, with SHAP (SHapley Additive exPlanations) used to interpret the model and assess the contributions of different features.

Results:

Regression analyses revealed that complications such as hydrocephalus, cerebral hernia, and deep vein thrombosis, as well as specific brain regions (frontal, parietal, and temporal lobes), significantly contributed to PSE. Factors such as age, gender, NIH Stroke Scale (NIHSS) scores, and laboratory results like WBC count and D-dimer levels were associated with increased PSE risk. Tree-based methods like Random Forest, XGBoost, and LightGBM showed strong predictive performance, achieving an AUC of 0.99.

Conclusions:

The model accurately predicts PSE risk, with tree-based models demonstrating superior performance. NIHSS score, WBC count, and D-dimer were identified as the most crucial predictors.

Funding:

The research is funded by Central University basic research young teachers and students research ability promotion sub-projec t(2023CDJYGRH-ZD06), and by Emergency Medicine Chongqing Key Laboratory Talent Innovation and development joint fund project (2024RCCX10).

Introduction

Stroke is the second leading cause of death worldwide, with an annual mortality of about 5.5 million, and is the leading cause of disability, accounting for 50% of cases globally (Feigin et al., 2017). Ischemic strokes make up approximately 80% of all stroke cases (Galovic et al., 2018; Krishnamurthi et al., 2013). PSE is a common complication, with studies showing that 3–30% of stroke patients develop epilepsy, which worsens their prognosis and quality of life (Zhao et al., 2018). PSE can exacerbate cognitive, psychiatric, and physical impairments caused by cerebrovascular disease and related conditions (Al-Sahli et al., 2023). The highest incidence of PSE occurs within the first year after an acute stroke, representing nearly half of all cases (Galovic et al., 2018). Therefore, early prediction and intervention for PSE, especially in ischemic strokes, are essential.

Currently, most studies rely on clinical data to build statistical models using survival analysis, Cox regression (Galovic et al., 2018; Chen et al., 2018), and multiple linear regression (Merkler et al., 2018) to predict PSE. Last year, Lin et al., 2023 developed a radiomics-based model that outperformed conventional clinical models in predicting PSE related to intracerebral hemorrhage (ICH), suggesting that a combined radiomics-clinical model could improve individual risk assessment of PSE after the first ICH, facilitating early diagnosis and treatment. However, later research raised concerns about the use of radiomics, indicating the need for further investigation (Pszczolkowski and Law, 2023). Overall, research on PSE prediction remains limited, with most studies focusing on specific risk factors (Waafi et al., 2023; Herzig-Nichtweiß et al., 2023; Lin et al., 2023; Pitkänen et al., 2016) and building basic models, rather than proposing more comprehensive and scientifically robust prediction models.

Machine learning has gained recognition as a powerful tool for developing medical models, due to its ability to process large datasets and handle complex information. It has been increasingly applied in neuroscience and clinical prediction (El Naamani et al., 2024; Lu et al., 2023; Daidone et al., 2024). Previous studies have used machine learning to explore post-stroke cognitive impairments (Lee et al., 2023), predict stroke and myocardial infarction risk in patients with large artery vasculitis (Lu et al., 2023), develop post-stroke depression models using liver function tests (Gong et al., 2023), and predict hematoma expansion in traumatic brain injury (TBI) (He et al., 2024). Machine learning models can automatically capture both linear and complex nonlinear relationships between variables, providing insights into how different factors contribute to the prediction target—something traditional statistical models struggle with. However, machine learning requires large datasets and is prone to overfitting when sample sizes are small. The quality and quantity of input data are critical for the algorithm to detect patterns and make accurate predictions.

This study aims to identify key risk factors from features extracted from clinical records and test data of ischemic stroke patients. Using these features, a machine learning-based model will be developed to predict PSE. By leveraging early admission data, the goal is to automatically predict the likelihood of PSE occurrence, guiding clinical decision-making and patient care.

Results

Filling of missing data

Missing values were imputed using a Random Forest (RF) model, addressing one feature at a time. The imputed features included: Plt, WBC, RBC, HbA1c, CRP, TG, LDL, HDL, AST, ALT, bilirubin, albumin, urea, creatinine, BUA, PT, APTT, TT, INR, D-dimer, fibrinogen, CK, CK-MB, LDH, HBDH, IMA, lactate, anion gap, TCO2, and NIHSS (Figure 1).

Figure 1

Download asset Open asset

LASSO regression coefficient paths.

The image shows the LASSO regression coefficient paths for various features related to a medical or research study. The x-axis shows the log of the regularization parameter alpha, and the y-axis shows the regression coefficient values. The lines in the plot represent the coefficient paths for different features as the regularization parameter changes. The features are labeled on the right side of the plot, and the most important features selected by the LASSO model are shown at the bottom of the image.

Characteristics of study participants

A total of 21,459 patients were included in the study. The training set comprised 15,021 patients, with a PSE incidence of 4.3%. The test set consisted of 6438 patients, also with a 4.3% incidence of PSE. An external validation cohort included 536 patients from three hospitals. Detailed statistical information on the clinical characteristics is presented in (Supplementary file 1).

Analysis indicated that patients with a higher likelihood of developing PSE had complications such as uremia, a history of DVT, atrial fibrillation, hyperuricemia, cerebral hernia, and hydrocephalus. Affected brain regions included the frontal, parietal, occipital, and temporal lobes, along with the cortex, subcortex, basal ganglia, and hypothalamus. General characteristics influencing risk included age, gender, and NIHSS. Laboratory indicators associated with a higher risk of PSE included WBC, HbA1c, CRP, TG, AST, ALT, bilirubin, urea, uric acid, APTT, PT, D-dimer, CK, CK-MB, LDH, HBDH, IMA, lactate, and anion gap. Significant p-values were also observed for fatty liver, coronary heart disease, hyperlipidemia, and HDL, with low or negative values linked to higher risks of secondary complications. Details of statistical, univariate, and multivariate regression analyses are provided in Supplementary file 1, Supplementary file 2, Supplementary file 3.

Performance of machine learning models

Performance indicators for the machine learning models are summarized in Supplementary file 4. The ROC curves, calibration curve, and decision curve analysis (DCA) are shown in Figure 2. Tree-based models such as RF, XGBoost, and LightGBM demonstrated the highest AUC scores, outperforming other models. Among them, RF achieved the highest positive predictive value (PPV) at 0.864, which was the most significant metric in this study. Complex machine learning algorithms outperformed traditional logistic regression. The calibration curve showed a Brier score of 0.006, and DCA indicated strong practical value in clinical decision-making. In the external validation cohort, RF achieved a sensitivity of 0.91 and a PPV of 0.95, confirming its strong predictive capability.

Figure 2 with 3 supplements see all

Download asset Open asset

Model evaluation metrics and curves.

The figure shows model performance curves across six sections (A1, A2, A3 on the left; B1, B2, B3 on the right) for training and test sets. ROC Curve: Illustrates the trade-off between sensitivity and specificity, with the AUC indicating overall model performance. Calibration Curve: Compares predicted probabilities to actual outcomes, assessing the model’s confidence accuracy. Precision-Recall Curve: Analyzes the balance between precision and recall at various thresholds, particularly useful for imbalanced datasets.

Analysis of SHAP risk factors

Figure 3 presents SHAP values, individual decision attempts, and overall decision curves. Among general characteristics, females had a higher PSE rate. A higher NIHSS was associated with an increased incidence of PSE. Elevated WBC, D-dimer, CRP, AST, CK-MB, HbA1c, bilirubin, TCO2, and LDH levels at admission were linked to a greater likelihood of developing PSE. Conversely, lower levels of HBDH, Plt, and APTT were also associated with a higher risk of PSE. Specific brain regions did not have a significant individual effect on the outcome. Among complications, hypertension was more strongly associated with PSE development, while conditions such as coronary heart disease, diabetes, hyperlipidemia, and fatty liver were less likely to be related. A force plot for the first patient illustrated the influence of different features on the prediction. In this case, a prolonged APTT time contributed the most to PSE, followed by elevated AST levels, while a low NIHSS negatively impacted the final result. The decision plot aggregated model decisions to demonstrate how complex models generated their predictions.

Figure 3 with 3 supplements see all

Download asset Open asset

Description of the SHapley Additive exPlanations (SHAP) values and feature importance.

SHAP Value (Left): Displays the impact of each feature on the model’s predictions, with features sorted by importance. The color gradient indicates the range of feature values, from low (blue) to high (red). Force plot (upper right): Illustrates the contribution of individual features of the first sample to the final model output, highlighting how each feature value pushes the prediction away from the baseline value. Decision plot (lower right): Visualizes the cumulative impact of features on the model output for each sample, showing how the feature values combine to produce the final prediction.

Discussion

The study utilized comprehensive clinical, imaging, and laboratory data from stroke patients to develop a predictive model using machine learning algorithms. The model achieved an AUC above 0.95, indicating more accurate predictions compared to traditional statistical methods. Tree-based ensemble models demonstrated superior predictive performance, particularly when handling large datasets with high-dimensional features.

During the modeling process, the extreme imbalance between negative and positive samples was addressed using the SMOTEENN technique, which improved model performance. SHAP analysis was conducted to assess model interpretability and identify the importance of different features.

According to the statistical results, age and NIHSS were treated as continuous variables. The results show that female patients, older individuals, and those with higher NIHSS were more likely to develop PSE, consistent with recent studies. Higher NIHSS, indicating more severe strokes, significantly increased the risk of complications, ranking second only to WBC and D-dimer in predictive importance (Al-Sahli et al., 2023; Lin et al., 2021; Waafi et al., 2023; Zöllner et al., 2020). However, there are differing perspectives regarding the effect of age. Some studies (Al-Sahli et al., 2023; Yamada et al., 2020) suggest that age below 65 is a high-risk factor, which aligns with this findings, while other studies (Lidetu and Zewdu, 2023) indicate that advanced age is the key factor. Yamada et al., 2020 supported the results, showing that female patients have a higher risk of complications, while (Waafi et al., 2023) reported that male patients are 3.325 times more likely to develop complications, contradicting these results.

Previous research indicates that patients with diabetes, dyslipidemia, hypertension, depression, or dementia are at higher risk of developing vascular epilepsy (Pitkänen et al., 2016). In this study, statistical analysis and multiple ML models examined the relationship between comorbidities and complications. Patients with coronary heart disease, diabetes, fatty liver, hyperlipidemia, or large artery stenosis or plaques (CCA and ICA) were found to be less likely to develop epilepsy. According to the TOAST classification, ischemic stroke is divided into five categories: large artery atherosclerosis, cardioembolism, small vessel occlusion, other determined etiology, and undetermined etiology. Patients with multiple comorbidities often fall into the large artery atherosclerosis and cardioembolism categories, which are more clearly defined and easier to treat, resulting in a lower likelihood of epilepsy. In contrast, strokes of undetermined etiology tend to have worse prognoses and a higher likelihood of leading to epilepsy. Among patients with diabetes, higher HbA1c levels indicate poor blood sugar control and a higher risk of complications. Better blood sugar control was associated with a lower overall risk of developing complications.

Lekoubou et al., 2023 reported that cortical infarction is more likely to lead to epilepsy in patients hospitalized with anterior circulation ischemic stroke. Lin et al., 2023 also found that factors such as cortical involvement and intracerebral hemorrhage volume increase the likelihood of PSE, aligning with the statistical result findings. Al-Sahli et al. suggested that cortical brain injury and large-area lesions elevate the risk of PSE (Al-Sahli et al., 2023; Yamada et al., 2020). According to the statistical results, both cortical and subcortical involvement were associated with an increased likelihood of PSE, but these regions had a smaller influence compared to other features and were not selected in LASSO regression.

Previous studies have identified acute infection as a risk factor for ischemic stroke (Bova et al., 1996). CRP, which reflects inflammation levels, is an independent prognostic factor (Di Napoli et al., 2001). Both regression and SHAP analysis of the results indicated that WBC had a significant impact among routine blood test parameters, even surpassing NIHSS in SHAP analysis. A high WBC may indicate severe inflammation or infection, as well as increased blood viscosity, predisposing patients to secondary complications. High RBC and low Plt counts were also associated with an increased risk of complications.

A large-scale study on Chinese individuals found a negative correlation between plasma HDL and the risk of ischemic stroke, a weak positive correlation between TG levels and stroke risk, and a strong correlation between LDL and apolipoprotein B levels (Sun et al., 2019). High HDL levels are linked to better prognosis (Bandeali and Farmer, 2012). The data analysis and model interpretation results align with these findings, showing that high LDL, low HDL, and elevated TG levels are more likely to result in PSE. This can be explained by high cholesterol and TG levels increasing blood viscosity and contributing to vascular sclerosis, promoting clot formation (Pitkänen et al., 2016; Gasparini et al., 2022; Abraira et al., 2020). Higher D-dimer levels indicate more significant brain tissue damage, increasing the likelihood of PSE. In general, lower APTT and fibrinogen levels are associated with higher PSE risk, while INR, PT, and TT have smaller impacts. Among liver function indicators, AST had the greatest influence on PSE. High AST, low ALT, and low albumin levels also had some impact. Ding et al., 2023 reported that liver enzyme subgroups defined by ALT and AST were linked to higher risks of adverse outcomes, consistent with this findings.

Studies have shown that renal function biomarkers such as urinary microalbumin, cystatin C, and creatinine are associated with higher stroke recurrence rates and poorer prognosis (Ding et al., 2023). In the light of statistical results, low urea levels and high uric acid levels had a negative impact (Zhang et al., 2023; Wang et al., 2019; Wang et al., 2021). Elevated uric acid at admission was positively associated with PSE, although patients with a prior diagnosis of hyperuricemia were less likely to develop epilepsy. Since uric acid acts as a strong antioxidant and has neuroprotective properties (Ng et al., 2017), patients with normal liver and kidney function and mild hyperuricemia may exhibit greater resilience in emergencies (Amaro et al., 2011). However, excessively high uric acid levels suggest metabolic disorders and poor liver and kidney function, which are linked to a poor prognosis.

When stroke patients are admitted, cardiac enzyme tests are often performed to rule out myocardial ischemia. However, studies have shown that elevated CK-MB in stroke patients may not be solely heart-related (Ay et al., 2002). Cardiac enzymes are important prognostic indicators (Liu et al., 2014; Zeng et al., 2021) and have been incorporated into stroke scores (Hijazi et al., 2016). Some studies have reported a higher incidence of abnormal serum cardiac enzyme levels in the acute phase of stroke. Although these abnormalities are not related to stroke type, they are associated with stroke severity, with patients exhibiting consciousness disorders having a significantly higher incidence of abnormal cardiac enzymes than those without such disorders (Zheng Yuan-Hui and Jian, 2009). According to the statistical and SHAP results, CK, CK-MB, and IMA in the cardiac enzyme profile demonstrated significant impact and high predictive value, though further research is needed to understand the specific mechanisms involved (Ng et al., 2017).

Despite incorporating extensive clinical, imaging, and laboratory data to build more accurate prediction models using machine learning algorithms, surpassing traditional statistical methods, there were several limitations in the modeling process.

Although the current study offers valuable insights, the dataset may not be fully representative, and the model’s generalizability requires further evaluation. The data were collected from multiple tertiary hospitals and included over 20,000 cases, but earlier data were lost due to hospital system upgrades. Consequently, the dataset primarily reflects patients diagnosed within the past five years and is predominantly from the Chongqing region, which may limit the model’s applicability to other geographic areas.

Additionally, the retrospective nature of the study led to the absence of some important predictive indicators. Several potentially valuable features, such as hemorheology, thromboelastography, and hormone levels, were missing and had to be excluded, potentially affecting the model’s accuracy. Including these features could further enhance the model’s predictive power.

To improve the model, it would be beneficial to incorporate additional data beyond baseline patient characteristics. The current analysis mainly used results from the initial examination upon admission, without fully leveraging information from subsequent exams. Future research could employ recurrent neural networks to extract features from the entire sequence of examinations more comprehensively.

To strengthen this study further, data standardization should be improved, and the number of cases and key indicators should continue to grow. Additionally, exploring more advanced scientific methods, such as deep learning, and utilizing all available data could enhance prediction accuracy.

Materials and methods

Research patients

Request a detailed protocol

This study retrospectively included all stroke patients admitted to Chongqing Emergency Center between June 2017 and June 2022 to develop the prediction model. Data from three external validation centers—Qianjiang Central Hospital, Bishan District People’s Hospital, and Yubei District Traditional Chinese Medicine Hospital—were collected between July 2022 and July 2023 to validate and evaluate the model externally. The external validation cohort focused on collecting positive cases to accurately test the model’s ability to identify them.

Inclusion criteria included: (1) Patients aged 18–90 years at admission; (2) Diagnosed with acute ischemic stroke and hospitalized for treatment.

Exclusion criteria were: (1) History of stroke or transient ischemic attack (TIA); (2) History of conditions such as traumatic brain injury, intracranial tumors, or cerebral vascular malformations that may cause epilepsy; (3) History of epilepsy or prior antiseizure medication use for seizure prevention or for other diseases (e.g. migraine or psychiatric disorders); (4) Death within 72 hr after stroke onset.

De-identified data from relevant patients were collected to build a multi-modal stroke patient database. The study protocol was approved by the Ethics Committees of Chongqing University Center Hospital, Chongqing University Qianjiang Central Hospital, Bishan District People’s Hospital, and Yubei District Traditional Chinese Medicine Hospital.

The selection process is outlined in Figure 4. A total of 42,079 records were retrieved from the stroke database, and 24,733 patients were diagnosed with new-onset ischemic or lacunar stroke. Patients with hemorrhagic strokes (4565), a history of stroke (2154), TIA (3570), unclear-cause strokes (561), and those with missing essential data (6496) were excluded. Additionally, patients whose seizures may have been caused by other factors (such as brain tumors, intracranial vascular malformations, or traumatic brain injury) (865), those with a history of seizures (152), and those who died in the hospital (1444) were excluded. Patients lost to follow-up (those without outpatient records or unreachable by phone) or who died within three months of the stroke incident (813) were also excluded. In total, 21,459 cases were included in the study.

Figure 4

Download asset Open asset

Slection and exclusion procedure of patients.

A total of 42,079 records were retrieved from the stroke database, and 24,733 patients were diagnosed with ischemic or lacunar stroke with new onset. Hemorrhagic strokes (4565), a history of stroke (2154), TIA (3570), unclear cause strokes (561), and records with missing essential data (6496) were excluded. Patients whose seizures might have been caused by other factors (such as brain tumors, intracranial vascular malformations, or traumatic brain injury) (865), those with a seizure history (152), and patients who died in the hospital (1444) were also excluded. Additionally, patients lost to follow-up (those without outpatient records or unreachable by phone) or who died within three months of the stroke incident (813) were excluded. Finally, 21,459 cases were included in the study.

Data collection

Request a detailed protocol

Relevant records and data were extracted from hospital databases. PostgreSQL was used to manage the data, with Structured Query Language (SQL) queries organized as follows:

General Information: This included gender, age, and NIH Stroke Scale (NIHSS) score at admission.
Comorbidities and Complications: Conditions such as uremia, deep vein thrombosis (DVT), diabetes mellitus, hypertension, coronary atherosclerosis, atrial fibrillation, cerebral hernia, hydrocephalus, hypoproteinemia, hyperuricemia, hyperlipidemia, internal carotid stenosis, and common carotid stenosis were recorded.
Brain Involvement (CT or MRI records): Involvement of cortical lobes (frontal, parietal, temporal, occipital, and insular) and subcortical areas (basal ganglia, internal capsule, brain stem, cerebellum, periventricular area, centrum semiovale, and thalamus) was noted. Cortical involvement was scored with each lobe contributing 1 point, and subcortical involvement was scored similarly, with each area contributing 1 point.
Vascular Involvement (CTA, MRA, or DSA records): The presence of vascular stenosis or occlusion in the anterior cerebral artery (ACA), middle cerebral artery (MCA), posterior cerebral artery (PCA), vertebral artery (VA), and basilar artery (BA) was documented.
Key Laboratory Indicators: These included blood lipid levels such as triglycerides, high-density lipoprotein cholesterol (HDL), and low-density lipoprotein cholesterol (LDL); liver function markers such as alanine transaminase (ALT), aspartate aminotransferase (AST), bilirubin, and albumin; renal function markers such as urea, blood uric acid (BUA), and creatinine; blood gas parameters such as lactate, anion gap, and total carbon dioxide (TCO2); coagulation markers such as international normalized ratio (INR), prothrombin time (PT), activated partial thromboplastin time (APTT), thrombin time (TT), D-dimer, and fibrinogen; and myocardial enzymes such as creatine kinase (CK), creatine kinase isoenzyme (CK-MB), lactate dehydrogenase (LDH), ischemic modified albumin (IMA), and α-hydroxybutyrate dehydrogenase (HBDH).

Data processing and model building

Request a detailed protocol

Processing of Missing Data: Laboratory indicators were recorded from the first set of tests after stroke admission. Indicators with more than 10% missing data were excluded. Remaining indicators with missing values were imputed using the random forest algorithm with default parameters. Features were processed in order of increasing missing data to minimize imputation complexity. During imputation, missing values in other features were temporarily replaced with 0, and predicted values were inserted into the original feature matrix before moving to the next feature. This process continued until all features were complete.

Distribution of Characteristics: Univariate analysis was performed to compare the distribution of characteristics between PSE-negative and PSE-positive groups. The dataset was then divided into a training set and a test set in a 7:3 ratio.

Handling Imbalanced Data: Due to the low incidence of PSE and the small proportion of positive cases, positive data in the training set were augmented using the Synthetic Minority Over-sampling Technique combined with Edited Nearest Neighbors (SMOTEENN). The SMOTEENN method from the imblearn Python package was applied with default parameters, and a random seed of 42 was set for reproducibility.

Processing of Categorical Data: Categorical variables were transformed using one-hot encoding. The LASSO method was then applied to the training set to identify the most important features.

Model Building: LASSO regression was used to select the 20 most important features. Nine widely used machine learning methods were employed, including Naive Bayes, Logistic Regression, Decision Tree, Random Forest, Gradient Boosting, Multi-Layer Perceptron, XGBoost, LightGBM, and K-Nearest Neighbors. Hyperparameters were optimized through grid search to enhance model performance. Evaluation metrics included accuracy, sensitivity, specificity, F1-score, positive predictive value, and negative predictive value. Additionally, ROC curves, calibration curves, and decision curves were generated to assess model performance. An independent external validation dataset was used to evaluate the model’s generalizability. The SHAP algorithm was then applied to the best-performing model to interpret feature contributions and their clinical relevance. This approach enabled the development of a robust machine learning model with strong predictive performance and interpretability, providing valuable support for clinical decision-making.

Statistical approach

Request a detailed protocol

PostgreSQL v15 (http://www.postgresql.org/) was used to search and extract data from the local database. Statistical analysis was performed using the open-source Scipy.stats package in Python. The details of the univariate significance analysis were as follows:

The Shapiro-Wilk test was used to assess the normality of each feature’s distribution. For features not following a normal distribution, the Mann-Whitney U test was used to evaluate their significance in relation to the target variable. For features with a normal distribution, the Levene test was performed to evaluate the homogeneity of variances. Features with homogeneous variances were analyzed using the Student’s t-test, while those with heterogeneous variances were analyzed using Welch’s t-test.

Confidence intervals for AUC values and Brier scores were calculated using 1000 bootstrap resampling iterations. Binary classification thresholds for predicted probabilities were set using the maximum Youden index derived from the training cohort. A two-tailed p-value of less than 0.05 was considered statistically significant throughout the study.

All code used in this study is available at https://github.com/conanan/lasso-ml (copy archived at conanan, 2024).

Conclusion

Request a detailed protocol

An interpretable machine learning model was developed to predict the risk of PSE in hospitalized patients with ischemic stroke. Using a large dataset of medical records, the model demonstrated strong predictive performance for PSE. Key predictors identified by the model include NIHSS, D-dimer, lactate, and WBC, along with liver function and cardiac enzyme profile indicators. The model’s transparency and interpretability can build trust among clinicians and support decision-making. While the results are promising, further prospective studies are necessary to validate the clinical utility of this tool before it can be applied in real-world settings.

Data availability

The codes, models, analysis and results are uploaded at https://github.com/conanan/lasso-ml (copy archived at conanan, 2024). The full dataset and codes are also uploaded at https://doi.org/10.5061/dryad.w0vt4b92c.

The following data sets were generated

1. Deng Y
(2024) Dryad Digital Repository
Data From: Predictive Models for Secondary Epilepsy in Patients with Acute Ischemic Stroke Within One Year.

https://doi.org/10.5061/dryad.w0vt4b92c

References

1. Abraira L
2. Giannini N
3. Santamarina E
4. Cazorla S
5. Bustamante A
6. Quintana M
7. Toledo M
8. Grau-López L
9. Jiménez M
10. Ciurans J
11. Becerra JL
12. Millán M
13. Cardona P
14. Terceño M
15. Zaragoza J
16. Cánovas D
17. Gasull T
18. Ustrell X
19. Rubiera M
20. Castellanos M
21. Dávalos A
22. Montaner J
23. Álvarez-Sabin J
(2020) Correlation of blood biomarkers with early-onset seizures after an acute stroke event
Epilepsy & Behavior 104:106549.

https://doi.org/10.1016/j.yebeh.2019.106549
- Google Scholar
(2023) Post-stroke epileptic seizures: risk factors, clinical presentation, principles of diagnosis and treatment
Epilepsy and Paroxysmal Conditions 15:148–159.

https://doi.org/10.17749/2077-8333/epi.par.con.2023.135
- Google Scholar
1. Amaro S
2. Urra X
3. Gómez-Choco M
4. Obach V
5. Cervera A
6. Vargas M
7. Torres F
8. Rios J
9. Planas AM
10. Chamorro A
(2011) Uric acid levels are relevant in patients with stroke treated with thrombolysis
Stroke 42:S28–S32.

https://doi.org/10.1161/STROKEAHA.110.596528
- PubMed
- Google Scholar
(2002) Creatine kinase-MB elevation after stroke is not cardiac in origin: comparison with troponin T levels
Stroke 33:286–289.

https://doi.org/10.1161/hs0102.101544
- PubMed
- Google Scholar
1. Bandeali S
2. Farmer J
(2012) High-density lipoprotein and atherosclerosis: the role of antioxidant activity
Current Atherosclerosis Reports 14:101–107.

https://doi.org/10.1007/s11883-012-0235-2
- PubMed
- Google Scholar
(1996) Acute infection as a risk factor for ischemic stroke
Stroke 27:2204–2206.

https://doi.org/10.1161/01.str.27.12.2204
- PubMed
- Google Scholar
1. Chen Z
2. Churilov L
3. Chen Z
4. Naylor J
5. Koome M
6. Yan B
7. Kwan P
(2018) Association between implementation of a code stroke system and poststroke epilepsy
Neurology 90:e1126–e1133.

https://doi.org/10.1212/WNL.0000000000005212
- PubMed
- Google Scholar
Software
1. conanan
(2024) Lasso-ml, version swh:1:rev:8c8987778cbe3ebba267a40a9c8268556a1b262c
Software Heritage.

https://archive.softwareheritage.org/swh:1:dir:426b0a372d85d149e25b85fd3514e0ec9f08e4a2;origin=https://github.com/conanan/lasso-ml;visit=swh:1:snp:3f94ca2fe73f9ca4d7a2c8ecc82702f34a065f77;anchor=swh:1:rev:8c8987778cbe3ebba267a40a9c8268556a1b262c
(2024) Machine learning applications in stroke medicine: advancements, challenges, and future prospectives
Neural Regeneration Research 19:769–773.

https://doi.org/10.4103/1673-5374.382228
- PubMed
- Google Scholar
(2001) C-reactive protein in ischemic stroke: an independent prognostic factor
Stroke 32:917–924.

https://doi.org/10.1161/01.str.32.4.917
- PubMed
- Google Scholar
1. Ding L
2. Liu Y
3. Meng X
(2023) Biomarker and genomic analyses reveal molecular signatures of non-cardioembolic ischemic stroke
Signal Transduction and Targeted Therapy 8:.

https://doi.org/10.1038/s41392-023-01465-w
- Google Scholar
1. El Naamani K
2. Musmar B
3. Gupta N
4. Ikhdour O
5. Abdelrazeq H
6. Ghanem M
7. Wali MH
8. El-Hajj J
9. Alhussein A
10. Alhussein R
11. Tjoumakaris SI
12. Gooch MR
13. Rosenwasser RH
14. Jabbour PM
15. Herial NA
(2024) The artificial intelligence revolution in stroke care: a decade of scientific evidence in review
World Neurosurgery 184:15–22.

https://doi.org/10.1016/j.wneu.2024.01.012
- PubMed
- Google Scholar
1. Feigin VL
2. Abajobir AA
3. Abate KH
4. Abd-Allah F
5. Abdulle AM
6. Abera SF
7. Abyu GY
8. Ahmed MB
9. Aichour AN
10. Aichour I
11. Aichour MTE
12. Akinyemi RO
13. Alabed S
14. Al-Raddadi R
15. Alvis-Guzman N
16. Amare AT
17. Ansari H
18. Anwari P
19. Ärnlöv J
20. Asayesh H
21. Asgedom SW
22. Atey TM
23. Avila-Burgos L
24. Frinel E
25. Avokpaho GA
26. Azarpazhooh MR
27. Barac A
28. Barboza M
29. Barker-Collo SL
30. Bärnighausen T
31. Bedi N
32. Beghi E
33. Bennett DA
34. Bensenor IM
35. Berhane A
36. Betsu BD
37. Bhaumik S
38. Birlik SM
39. Biryukov S
40. Boneya DJ
41. Bulto LNB
42. Carabin H
43. Casey D
44. Castañeda-Orjuela CA
45. Catalá-López F
46. Chen H
47. Chitheer AA
48. Chowdhury R
49. Christensen H
50. Dandona L
51. Dandona R
52. de Veber GA
53. Dharmaratne SD
54. Do HP
55. Dokova K
56. Dorsey ER
57. Ellenbogen RG
58. Eskandarieh S
59. Farvid MS
60. Fereshtehnejad SM
61. Fischer F
62. Foreman KJ
63. Geleijnse JM
64. Gillum RF
65. Giussani G
66. Goldberg EM
67. Gona PN
68. Goulart AC
69. Gugnani HC
70. Gupta R
71. Hachinski V
72. Gupta R
73. Hamadeh RR
74. Hambisa M
75. Hankey GJ
76. Hareri HA
77. Havmoeller R
78. Hay SI
79. Heydarpour P
80. Hotez PJ
81. Javanbakht M
82. Jeemon P
83. Jonas JB
84. Kalkonde Y
85. Kandel A
86. Karch A
87. Kasaeian A
88. Kastor A
89. Keiyoro PN
90. Khader YS
91. Khalil IA
92. Khan EA
93. Khang YH
94. Tawfih A
95. Khoja A
96. Khubchandani J
97. Kulkarni C
98. Kim D
99. Kim YJ
100. Kivimaki M
101. Kokubo Y
102. Kosen S
103. Kravchenko M
104. Krishnamurthi RV
105. Defo BK
106. Kumar GA
107. Kumar R
108. Kyu HH
109. Larsson A
110. Lavados PM
111. Li Y
112. Liang X
113. Liben ML
114. Lo WD
115. Logroscino G
116. Lotufo PA
117. Loy CT
118. Mackay MT
119. El Razek HMA
120. El Razek MMA
121. Majeed A
122. Malekzadeh R
123. Manhertz T
124. Mantovani LG
125. Massano J
126. Mazidi M
127. McAlinden C
128. Mehata S
129. Mehndiratta MM
130. Memish ZA
131. Mendoza W
132. Mengistie MA
133. Mensah GA
134. Meretoja A
135. Mezgebe HB
136. Miller TR
137. Mishra SR
138. Ibrahim NM
139. Mohammadi A
140. Mohammed KE
141. Mohammed S
142. Mokdad AH
143. Moradi-Lakeh M
144. Velasquez IM
145. Musa KI
146. Naghavi M
147. Ngunjiri JW
148. Nguyen CT
149. Nguyen G
150. Le Nguyen Q
151. Nguyen TH
152. Nichols E
153. Ningrum DNA
154. Nong VM
155. Norrving B
156. Noubiap JJN
157. Ogbo FA
158. Owolabi MO
159. Pandian JD
160. Parmar PG
161. Pereira DM
162. Petzold M
163. Phillips MR
164. Piradov MA
165. Poulton RG
166. Pourmalek F
167. Qorbani M
168. Rafay A
169. Rahman M
170. Rahman MH
171. Rai RK
172. Rajsic S
173. Ranta A
174. Rawaf S
175. Renzaho AMN
176. Rezai MS
177. Roth GA
178. Roshandel G
179. Rubagotti E
180. Sachdev P
181. Safiri S
182. Sahathevan R
183. Sahraian MA
184. Samy AM
185. Santalucia P
186. Santos IS
187. Sartorius B
188. Satpathy M
189. Sawhney M
190. Saylan MI
191. Sepanlou SG
192. Shaikh MA
193. Shakir R
194. Shamsizadeh M
195. Sheth KN
196. Shigematsu M
197. Shoman H
198. Silva DAS
199. Smith M
200. Sobngwi E
201. Sposato LA
202. Stanaway JD
203. Stein DJ
204. Steiner TJ
205. Stovner LJ
206. Abdulkader RS
207. EI Szoeke C
208. Tabarés-Seisdedos R
209. Tanne D
210. Theadom AM
211. Thrift AG
212. Tirschwell DL
213. Topor-Madry R
214. Tran BX
215. Truelsen T
216. Tuem KB
217. Ukwaja KN
218. Uthman OA
219. Varakin YY
220. Vasankari T
221. Venketasubramanian N
222. Vlassov VV
223. Wadilo F
224. Wakayo T
225. Wallin MT
226. Weiderpass E
227. Westerman R
228. Wijeratne T
229. Wiysonge CS
230. Woldu MA
231. Wolfe CDA
232. Xavier D
233. Xu G
234. Yano Y
235. Yimam HH
236. Yonemoto N
237. Yu C
238. Zaidi Z
239. El Sayed Zaki M
240. Zunt JR
241. Murray CJL
242. Vos T
(2017) Global, regional, and national burden of neurological disorders during 1990–2015: a systematic analysis for the global burden of disease study 2015
The Lancet Neurology 16:877–897.

https://doi.org/10.1016/S1474-4422(17)30299-5
- Google Scholar
1. Galovic M
2. Döhler N
3. Erdélyi-Canavese B
4. Felbecker A
5. Siebel P
6. Conrad J
7. Evers S
8. Winklehner M
9. von Oertzen TJ
10. Haring HP
11. Serafini A
12. Gregoraci G
13. Valente M
14. Janes F
15. Gigli GL
16. Keezer MR
17. Duncan JS
18. Sander JW
19. Koepp MJ
20. Tettenborn B
(2018) Prediction of late seizures after ischaemic stroke with a novel prognostic model (the SeLECT score): a multivariable prediction model development and validation study
The Lancet. Neurology 17:143–152.

https://doi.org/10.1016/S1474-4422(17)30404-0
- PubMed
- Google Scholar
1. Gasparini S
2. Neri S
3. Brigo F
4. Cianci V
5. Mammì A
6. Pascarella A
7. Manzo L
8. Benna P
9. Striano P
10. Martino T
11. Romoli M
12. Muccioli L
13. Nilo A
14. Valente M
15. Cagnetti C
16. Labate A
17. Gambardella A
18. Pisani F
19. Casciato S
20. Di Gennaro G
21. Belcastro V
22. Aguglia U
23. Ferlazzo E
(2022) Late epileptic seizures following cerebral venous thrombosis: a systematic review and meta-analysis
Neurological Sciences 43:5229–5236.

https://doi.org/10.1007/s10072-022-06148-y
- PubMed
- Google Scholar
1. Gong J
2. Zhang Y
3. Zhong X
4. Zhang Y
5. Chen Y
6. Wang H
(2023) Liver function test indices-based prediction model for post-stroke depression: a multicenter, retrospective study
BMC Medical Informatics and Decision Making 23:127.

https://doi.org/10.1186/s12911-023-02241-0
- PubMed
- Google Scholar
1. He H
2. Liu J
3. Li C
4. Guo Y
5. Liang K
6. Du J
7. Xue J
8. Liang Y
9. Chen P
10. Liu L
11. Cui M
12. Wang J
13. Liu Y
14. Tian S
15. Deng Y
(2024) Predicting hematoma expansion and prognosis in cerebral contusions: a radiomics-clinical approach
Journal of Neurotrauma 41:1337–1352.

https://doi.org/10.1089/neu.2023.0410
- PubMed
- Google Scholar
(2023) Prognosis and management of acute symptomatic seizures: A prospective, multicenter, observational study
Annals of Intensive Care 13:85.

https://doi.org/10.1186/s13613-023-01183-0
- PubMed
- Google Scholar
(2016) The abc (age, biomarkers, clinical history) stroke risk score: a biomarker-based risk score for predicting stroke in atrial fibrillation
European Heart Journal 37:1582–1590.

https://doi.org/10.1093/eurheartj/ehw054
- PubMed
- Google Scholar
1. Krishnamurthi RV
2. Feigin VL
3. Forouzanfar MH
4. Mensah GA
5. Connor M
6. Bennett DA
7. Moran AE
8. Sacco RL
9. Anderson LM
10. Truelsen T
11. O’Donnell M
12. Venketasubramanian N
13. Barker-Collo S
14. Lawes CMM
15. Wang W
16. Shinohara Y
17. Witt E
18. Ezzati M
19. Naghavi M
20. Murray C
(2013) Global and regional burden of first-ever ischaemic and haemorrhagic stroke during 1990-2010: findings from the global burden of disease study 2010
The Lancet. Global Health 1:e259–e281.

https://doi.org/10.1016/S2214-109X(13)70089-5
- PubMed
- Google Scholar
1. Lee M
2. Yeo NY
3. Ahn HJ
4. Lim JS
5. Kim Y
6. Lee SH
7. Oh MS
8. Lee BC
9. Yu KH
10. Kim C
(2023) Prediction of post-stroke cognitive impairment after acute ischemic stroke using machine learning
Alzheimer’s Research & Therapy 15:147.

https://doi.org/10.1186/s13195-023-01289-4
- PubMed
- Google Scholar
1. Lekoubou A
2. Ssentongo P
3. Maffie J
4. Debroy K
5. Kwon M
6. Nguyen C
7. Pelton M
8. Watt B
9. Ceasar J
10. Dinunno N
11. Satyasi V
12. Pascal Kengne A
13. Bonilha L
14. Chinchilli VM
(2023) Associations of small vessel disease and acute symptomatic seizures in ischemic stroke patients
Epilepsy & Behavior 145:109233.

https://doi.org/10.1016/j.yebeh.2023.109233
- Google Scholar
1. Lidetu T
2. Zewdu D
(2023) Incidence and predictors of post stroke seizure among adult stroke patients admitted at Felege Hiwot compressive specialized hospital, Bahir Dar, North West Ethiopia, 2021: a retrospective follow up study
BMC Neurology 23:40.

https://doi.org/10.1186/s12883-023-03083-z
- PubMed
- Google Scholar
1. Lin R
2. Yu Y
3. Wang Y
4. Foster E
5. Kwan P
6. Lin M
7. Xia N
8. Xu H
9. Xie C
10. Yang Y
11. Wang X
(2021) Risk of post-stroke epilepsy following stroke-associated acute symptomatic seizures
Frontiers in Aging Neuroscience 13:707732.

https://doi.org/10.3389/fnagi.2021.707732
- PubMed
- Google Scholar
1. Lin R
2. Lin J
3. Xu Y
4. Yu Y
5. Foster E
6. Lin M
7. Xu H
8. Li X
9. Ye J
10. Yao F
11. Xu H
12. Chen M
13. Chen J
14. Zheng H
15. Li J
16. Zheng M
17. Huang S
18. Zhu S
19. Yang Y
20. Wang X
(2023) Development and validation of a novel radiomics-clinical model for predicting post-stroke epilepsy after first-ever intracerebral haemorrhage
European Radiology 33:4526–4536.

https://doi.org/10.1007/s00330-023-09429-y
- PubMed
- Google Scholar
1. Liu X
2. Chen X
3. Wang H
4. Pan S
(2014) Prognostic significance of admission levels of cardiac indicators in patients with acute ischaemic stroke: prospective observational study
The Journal of International Medical Research 42:1301–1310.

https://doi.org/10.1177/0300060514549217
- PubMed
- Google Scholar
1. Lu YT
2. Zhang ZL
3. Zhou XY
4. Zhang D
5. Tian T
6. Fan P
7. Zhang Y
8. Zhou XL
(2023) Predicting stroke and myocardial infarction risk in Takayasu arteritis with automated machine learning models
iScience 26:108421.

https://doi.org/10.1016/j.isci.2023.108421
- PubMed
- Google Scholar
1. Merkler AE
2. Gialdini G
3. Lerario MP
4. Parikh NS
5. Morris NA
6. Kummer B
7. Dunn L
8. Reznik ME
9. Murthy SB
10. Navi BB
11. Grinspan ZM
12. Iadecola C
13. Kamel H
(2018) Population-based assessment of the long-term risk of seizures in survivors of stroke
Stroke 49:1319–1324.

https://doi.org/10.1161/STROKEAHA.117.020178
- Google Scholar
1. Ng GJL
2. Quek AML
3. Cheung C
4. Arumugam TV
5. Seet RCS
(2017) Stroke biomarkers in clinical practice: a critical appraisal
Neurochemistry International 107:11–22.

https://doi.org/10.1016/j.neuint.2017.01.005
- PubMed
- Google Scholar
(2016) Development of epilepsy after ischaemic stroke
The Lancet. Neurology 15:185–197.

https://doi.org/10.1016/S1474-4422(15)00248-3
- PubMed
- Google Scholar
1. Pszczolkowski S
2. Law ZK
(2023) Editorial comment on “Development and validation of a novel radiomics-clinical model for predicting post-stroke epilepsy after first-ever intracerebral haemorrhage.”
European Radiology 33:4524–4525.

https://doi.org/10.1007/s00330-023-09565-5
- PubMed
- Google Scholar
1. Sun L
2. Clarke R
3. Bennett D
4. Guo Y
5. Walters RG
6. Hill M
7. Parish S
8. Millwood IY
9. Bian Z
10. Chen Y
11. Yu C
12. Lv J
13. Collins R
14. Chen J
15. Peto R
16. Li L
(2019) Causal associations of blood lipids with risk of ischemic stroke and intracerebral hemorrhage in Chinese adults
Nature Medicine 25:569–574.

https://doi.org/10.1038/s41591-019-0366-x
- PubMed
- Google Scholar
(2023) Clinical risk factors related to post-stroke epilepsy patients in Indonesia: a hospital-based study
The Egyptian Journal of Neurology, Psychiatry and Neurosurgery 59:e6373.

https://doi.org/10.1186/s41983-023-00637-3
- Google Scholar
1. Wang D
2. Hu B
3. Dai Y
4. Sun J
5. Liu Z
6. Feng Y
7. Cheng F
8. Zhang X
(2019) Serum uric acid is highly associated with epilepsy secondary to cerebral infarction
Neurotoxicity Research 35:63–70.

https://doi.org/10.1007/s12640-018-9930-2
- PubMed
- Google Scholar
1. Wang C
2. Cui T
3. Wang L
4. Zhu Q
5. Wang A
6. Yuan Y
7. Hao Z
8. Wu B
(2021) Prognostic significance of uric acid change in acute ischemic stroke patients with reperfusion therapy
European Journal of Neurology 28:1218–1224.

https://doi.org/10.1111/ene.14643
- PubMed
- Google Scholar
1. Yamada S
2. Nakagawa I
3. Tamura K
4. Nishimura F
5. Motoyama Y
6. Park YS
7. Nakase H
(2020) Investigation of poststroke epilepsy (INPOSE) study: a multicenter prospective study for prediction of poststroke epilepsy
Journal of Neurology 267:3274–3281.

https://doi.org/10.1007/s00415-020-09982-2
- PubMed
- Google Scholar
1. Zeng YY
2. Zhang WB
3. Cheng L
4. Wang L
5. Geng DD
6. Tang WJ
7. He JC
8. Deng BB
(2021) Cardiac parameters affect prognosis in patients with non-large atherosclerotic infarction
Molecular Medicine 27:2.

https://doi.org/10.1186/s10020-020-00260-5
- PubMed
- Google Scholar
1. Zhang W
2. Cheng Z
3. Fu F
4. Zhan Z
(2023) Serum uric acid and prognosis in acute ischemic stroke: a dose-response meta-analysis of cohort studies
Frontiers in Aging Neuroscience 15:1223015.

https://doi.org/10.3389/fnagi.2023.1223015
- PubMed
- Google Scholar
1. Zhao Y
2. Li X
3. Zhang K
4. Tong T
5. Cui R
(2018) The progress of epilepsy after stroke
Current Neuropharmacology 16:71–78.

https://doi.org/10.2174/1570159X15666170613083253
- PubMed
- Google Scholar
1. Zheng Yuan-Hui ZJY
2. Jian Z
(2009)
Changes of serum myocardial enzyme profile in acute stage of stroke

Chinese Journal of Advanced Medical Doctors, China Medical Journal 32:46–47.
- Google Scholar
1. Zöllner JP
2. Misselwitz B
3. Kaps M
4. Stein M
5. Konczalla J
6. Roth C
7. Krakow K
8. Steinmetz H
9. Rosenow F
10. Strzelczyk A
(2020) National Institutes of Health Stroke Scale (NIHSS) on admission predicts acute symptomatic seizure risk in ischemic stroke: a population-based study involving 135,117 cases
Scientific Reports 10:3779.

https://doi.org/10.1038/s41598-020-60628-9
- PubMed
- Google Scholar

Article and author information

Author details

Jinxin Liu

Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, Chongqing, China

Contribution
Data curation, Software, Formal analysis, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing

Contributed equally with
Haoyue He

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0009-0003-8923-2536
Haoyue He
1. Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, Chongqing, China
2. Bioengineering College of Chongqing University, Chongqing, China
Contribution
Data curation, Software, Formal analysis, Visualization, Methodology, Writing – original draft, Writing – review and editing

Contributed equally with
Jinxin Liu

Competing interests
No competing interests declared
Yanglingxi Wang

Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, Chongqing, China

Contribution
Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared
Jun Du

Department of Neurosurgery, Chongqing University Qianjiang Hospital, Chongqing, China

Contribution
Data curation, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared
Kaixin Liang

Department of Neurosurgery, Yubei District Hospital of Traditional Chinese Medicine, Chongqing, China

Contribution
Data curation, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared
Jun Xue

Department of Neurosurgery, Bishan hospital of Chongqing Medical University, Chongqing, China

Contribution
Data curation, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared
Yidan Liang

Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, Chongqing, China

Contribution
Formal analysis, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared
Peng Chen

Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, Chongqing, China

Contribution
Formal analysis, Methodology, Writing – original draft, Writing – review and editing

Competing interests
No competing interests declared
Shanshan Tian

Department of Prehospital Emergency, Chongqing University Central Hospital, Chongqing Emergency Medical Center, Chongqing, China

Contribution
Data curation, Formal analysis, Supervision, Methodology, Writing – original draft, Writing – review and editing

For correspondence
710836163@qq.com

Competing interests
No competing interests declared
Yongbing Deng
1. Department of Neurosurgery, Chongqing Emergency Medical Center, Chongqing University Central Hospital, School of Medicine, Chongqing University, Chongqing, China
2. Chongqing Key Laboratory of Emergency Medicine, Chongqing, China
3. Jinfeng Laboratory, Chongqing, China
Contribution
Conceptualization, Resources, Data curation, Formal analysis, Supervision, Funding acquisition, Validation, Visualization, Methodology, Writing – original draft, Writing – review and editing

For correspondence
dyb0913@cqu.edu.cn

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-8581-5748

Funding

Central University Basic Research Fund of China (2023CDJYGRH-ZD06)

Yongbing Deng

Emergency Medicine chongqing Key Laboratory Talent Innovation and development joint fund project (2024RCCX10)

Yongbing Deng

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

The authors would like to thank their colleagues in the information and imaging departments for their hard work contributing to the final research results.

Version history

Sent for peer review: April 28, 2024
Preprint posted: June 28, 2024
Reviewed Preprint version 1: July 8, 2024
Reviewed Preprint version 2: October 15, 2024
Version of Record published: November 14, 2024

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.98759. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.