Open Access Article
International Journal of Medicine and Data. 2025; 9: (1) ; 74-77 ; DOI: 10.12208/j.ijmd.20250015.
Mortality risk prediction model for patients with acute ischemic stroke: a machine learning method based on intrinsic interpretability
急性缺血性脑卒中患者死亡风险预测模型:基于内在可解释性机器学习方法
作者:
刘曜嘉1,
高思齐1,
张硕1,
杨树1,
纪家琪1,
刘俊杰1,2 *,
王建军2
1华北理工大学临床医学院 河北唐山
2华北理工大学附属医院重症医学科 河北唐山
*通讯作者:
刘俊杰,单位:华北理工大学临床医学院 河北唐山华北理工大学附属医院重症医学科 河北唐山;
发布时间: 2025-02-27 总浏览量: 13
PDF 全文下载
引用本文
摘要
目的 本研究基于MIMIC-IV数据库,旨在开发一种可解释的机器学习模型,用于预测卒中患者的ICU死亡风险。方法 本研究从MIMIC-IV数据库中,根据ICD-9和ICD-10编码提取急性缺血性脑卒中患者。利用LASSO回归算法进行特征筛选。通过七种机器学习算法,依据AUC、准确率以及F1分数等指标进行评估和比较,选出最优算法。按照7:3的比例划分为训练集和测试集,在训练集中进行五折交叉验证。超参数优化采用网格搜索方法,以提升算法性能。在测试集上评估最优算法的预测能力及其泛化性能。采用SHAP方法解释关键特征对ICU死亡风险的影响。结果 本研究共从MIMIC-IV数据库中提取急性缺血性脑卒中患者1998例,其中436例(占21.8%)在入住ICU后30天内死亡。通过对多种机器学习算法在验证集上的评估与比较,最终选定XGBoost算法作为最优算法。研究中将数据按7:3的比例划分为训练集和测试集,并结合五折交叉验证与网格搜索优化超参数,结果表明XGBoost算法在测试集上展现了良好的ICU死亡风险预测性能和泛化能力(AUC=0.821,95%CI:0.778~0.864;准确率=80.7%)。SHAP解释分析显示,早期有创氧疗和高龄是卒中患者ICU死亡风险增加的主要危险因素。结论 XGBoost算法在预测急性缺血性脑卒中患者ICU死亡风险方面展现出较强的潜力。此外,SHAP解释分析突显了早期有创氧疗和高龄对ICU死亡风险的重要性。
关键词: 急性缺血性脑卒中;死亡风险预测模型;机器学习;内在可解释性;MIMIC-IV数据库
Abstract
Objective This study, based on the MIMIC-IV database, aims to develop an interpretable machine learning model to predict the ICU mortality risk in stroke patients. Methods Acute ischemic stroke patients were extracted from the MIMIC-IV database based on ICD-9 and ICD-10 codes. Feature selection was performed using the LASSO regression algorithm. Seven machine learning algorithms were evaluated and compared using metrics such as AUC, accuracy, and F1 score to identify the optimal algorithm. The dataset was split into training and testing sets in a 7:3 ratio, and five-fold cross-validation was conducted on the training set. Hyperparameter optimization was performed using grid search to enhance algorithm performance. The predictive ability and generalization performance of the optimal algorithm were evaluated on the test set. SHAP analysis was used to interpret the impact of key features on ICU mortality risk. Results: A total of 1,998 acute ischemic stroke patients were extracted from the MIMIC-IV database, of which 436 (21.8%) died within 30 days of ICU admission. After evaluating and comparing the performance of multiple machine learning algorithms on the validation set, the XGBoost algorithm was selected as the optimal model. The data were divided into training and testing sets in a 7:3 ratio, and five-fold cross-validation with grid search was employed for hyperparameter optimization. The results showed that the XGBoost algorithm demonstrated excellent ICU mortality risk prediction performance and generalization ability on the test set (AUC = 0.821, 95% CI: 0.778–0.864; accuracy = 80.7%). SHAP analysis revealed that early invasive oxygen therapy and advanced age were the primary risk factors for increased ICU mortality in stroke patients. Conclusion The XGBoost algorithm shows strong potential for predicting ICU mortality risk in acute ischemic stroke patients. Moreover, SHAP analysis highlights the significant roles of early invasive oxygen therapy and advanced age in determining ICU mortality risk.
Key words: Acute ischemic stroke; Mortality risk prediction model; Machine learning; Intrinsic interpretability; MIMIC-IV database
参考文献 References
[1] 史雪,于乐,谷洁冰,等.LncRNA PINK1-AS靶向调控miR-455-3p/GAB2轴对缺氧复氧诱导的PC12细胞损伤的作用及机制[J].中国老年学杂志,2025,45(06):1386-1390.
[2] 沈骏,周仁华,徐建红.急重症脑卒中患者相关性肺炎的危险因素及外周血单核粒细胞计数对其预测价值[J].脑与神经疾病杂志,2015,23(03):211-214.
[3] 任海蓉.老年性脑卒中合并肺炎的回顾性分析[J].中国医药指南,2012,10(24):128-129.
[4] 陈玥,张慧,关纯,等.ICU肠内营养患者误吸风险预测模型构建[J].医学新知,2025,35(01):22-32.
[5] 尚媛媛,段莹,龙杰琦,等.贵州省苗岭以南地区气象因素对心脑血管疾病影响的分析与预测[J].现代预防医学,2024, 51(19): 3594-3601.
[6] 叶壮.基于机器学习方法的糖尿病预测与分析[J].数字技术与应用,2024,42(10):33-35.
引用本文
刘曜嘉, 高思齐, 张硕, 杨树, 纪家琪, 刘俊杰, 王建军, 急性缺血性脑卒中患者死亡风险预测模型:基于内在可解释性机器学习方法[J]. 国际医学与数据杂志, 2025; 9: (1) : 74-77.