基于可解释机器学习的缺血性脑卒中合并吞咽困难患者院内死亡风险预测模型比较

Comparison of the in-hospital mortality risk predictive models among patients with ischemic stroke combined by dysphagia based on interpretable machine learning

  • 摘要:
    目的 使用可解释机器学习方法预测缺血性脑卒中合并吞咽困难患者院内死亡风险,以期为缺血性脑卒中合并吞咽困难的预后预测提供更多循证依据。
    方法 回顾性分析美国重症监护医学信息数据库Ⅳ(MIMIC⁃Ⅳ)(2.0)中诊断为缺血性脑卒中合并吞咽困难 308例患者的病历信息,基于最小绝对值收缩和选择算子筛选特征,将数据集按7∶3随机划分为训练集和测试集,分别构建逻辑回归(LR)、随机森林(RF)、K最近邻(KNN)、线性判别分析(LDA)、朴素贝叶斯(NB)、神经网络(NN)、二次判别分析(QDA)、及递归分割树(RPT)、极端梯度提升树(XGBoost)、支持向量机(SVM)等10种模型。通过计算受试者工作特征曲线下面积(AUC)衡量预测效果,使用校准曲线及布里尔分数评价模型的校准程度,依据决策曲线反映临床净收益。采用沙普利值加性解释方法分析模型的可解释性,探索重要决策因素。
    结果 测试集中,NB模型相较于其他模型表现出更好的预测能力[AUC(95%CI)=0.85(0.83~0.88)]。经可解释性分析,血尿素氮、年龄、序贯器官衰竭评估、碳酸氢盐、氯化物、高血压是缺血性脑卒中合并吞咽困难患者院内死亡的重要影响因素。
    结论 在预测缺血性脑卒中合并吞咽困难患者的院内死亡风险方面,NB模型的综合表现优于其他9种模型。模型可解释性可以帮助临床医生更好地理解结果背后的原因,针对影响因素采取合理的干预措施,提高患者生存率。

     

    Abstract:
    Objective To predict the in-hospital mortality risk among patients with ischemic stroke combined by dysphagia using interpretable machine learning methods, so as to provide more evidence-based support for the prognosis prediction of patients with ischemic stroke combined by dysphagia.
    Methods Medical record of 308 patients diagnosed with ischemic stroke combined by dysphagia in the Medical Information Mart for Intensive Care Ⅳ (MIMIC-Ⅳ) (2.0) in the United States were retrospectively analyzed. Features of the research data were screened based on the least absolute shrinkage and selection operator, and which were randomly divided into a training set and a test set at a ratio of 7∶3. Then ten models, including logistic regression, random forest, K-nearest neighbor, linear discriminant analysis, naive bayes (NB), neural network, quadratic discriminant analysis, recursive partitioning tree, extreme gradient boosting tree, and support vector machine, etc. were constructed. The predictive effect was measured by calculating the area under the curve (AUC) of receiver operating characteristics. In addition, the calibration curve and Brier score were used to evaluate the calibration degree of the model, and the decision curve was drawn to reflect the clinical net benefit. The Shapley additive explanation method was used to analyze the interpretability of the black box model and explore the important decision-making factors.
    Results The NB model in the test set showed better predictive ability compared with other models (AUC=0.85, 95%CI: 0.83‒0.88). After interpretability analysis, it was found that blood urea nitrogen (BUN), age, sequential organ failure assessment, bicarbonate, chloride, and hypertension were important risk factors for in-hospital mortality in patients with ischemic stroke combined by dysphagia.
    Conclusion The comprehensive performance of the NB model is better than that of the other nine models in predicting the risk of in-hospital mortality in patients with ischemic stroke combined by dysphagia. The interpretability of the model can help clinicians better understand the reasons behind the results and take further reasonable intervention measures for risk factors to improve the survival probability of patients.

     

/

返回文章
返回