精神分裂症患者奥氮平治疗失效的XGBoost预测模型构建与临床应用验证

doi:10.3969/j.issn.1009-6574.2026.05.003

首页 > 按月查看>2026年第5月 >318-324. DOI:10.3969/j.issn.1009-6574.2026.05.003

精神分裂症患者奥氮平治疗失效的XGBoost预测模型构建与临床应用验证
DOI:
                        10.3969/j.issn.1009-6574.2026.05.003
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
基金项目:

Construction and clinical application validation of an XGBoost prediction model for olanzapine treatment failure in patients with schizophrenia

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

目的构建并验证基于极端梯度提升（XGBoost）的预测模型，用于早期识别精神分裂症患者对奥氮平治疗在8 周时的治疗失效风险，以期为个体化治疗决策提供辅助支持。方法纳入2023 年 1 月—2024 年12 月在厦门市仙岳医院受奥氮平治疗且完成8 周随访的200 例精神分裂症患者为研究对象。治疗失效定义为治疗8 周后阳性与阴性症状量表（PANSS）评分减分率＜ 30%。收集受试者人口学资料（年龄、性别、病程等）、临床资料（基线PANSS 评分、合并症、既往用药史）、实验室指标［血常规、肝肾功能、血清白细胞介素6（IL-6）等］及基因标记（DRD2 rs1076560）等候选预测因子。对缺失值采用多重插补（MICE，m=5）。将样本按7∶3 随机分为训练集（n=140）与测试集（n=60）。训练集中采用5 折交叉验证并通过网格搜索调优XGBoost 超参数；必要时对类别不平衡采用合成少数类过采样技术（SMOTE）。模型性能在独立测试集中以曲线下面积（AUC）、敏感度、特异度、准确度、校准曲线（Hosmer-Lemeshow 检验）及决策曲线分析（DCA）评估，并采用SHAP值解释变量重要性。统计分析使用Python（XGBoost、 Scikit-learn）与R 软件完成。结果训练集与测试集的患者在性别、年龄、病程、基线PANSS 评分、共病焦虑障碍比例、血清IL-6水平及DRD2基因rs1076560位点多态性方面差异均无统计学意义（均P＞0.05）。 XGBoost 模型在训练/ 验证过程中筛选出5 个重要预测因子：基线PANSS 阳性症状评分、病程、血清IL-6 水平、DRD2 基因rs1076560 位点多态性与合并焦虑障碍。测试集中模型表现为：准确度0.833，敏感度 0.794，特异度0.885，AUC=0.897（95%CI：0.808～0.986）；Hosmer-Lemeshow检验P=0.620，校准良好。DCA表明，当阈值概率＞ 0.25 时，模型相较单一预测指标具有更高的临床净获益。结论 XGBoost 预测模型在本队列中能较好地识别奥氮平治疗8 周失效的高危患者，所识别的关键因子涉及症状严重度、病程、炎症指标、基因多态性与共病。模型需在外部队列验证后方可用于临床辅助决策。

Abstract:

Objective To construct and validate a prediction model based on extreme gradient boosting （XGBoost） for early identification of the risk of treatment failure of olanzapine in schizophrenia patients at 8 weeks， so as to provide auxiliary support for individualized treatment decisions. Methods This study included 200 patients with schizophrenia who received olanzapine treatment and completed an 8-week follow-up at Xiamen Xianyue Hospital from January 2023 to December 2024. Treatment failure was defined as a Positive and Negative Syndrome Scale（ PANSS） reduction rate of＜30% after 8 weeks of treatment. Candidate predictors included subject demographic characteristics（ such as age， gender， disease duration）， clinical features（ baseline PANSS score， comorbidities， past medication history）， laboratory indicators［ such as complete blood count，liver and kidney function， serum interleukin-6（ IL-6）］， and genetic markers（ DRD2 rs1076560）. Multiple imputation by chained equations（ MICE， m=5） was used for missing values. The samples were randomly divided into a training set（ n=140） and a testing set（ n=60） in a 7∶3 ratio. The training set employed 5-fold cross-validation， with XGBoost hyperparameters tuned via grid search. When necessary， synthetic minority oversampling technique（ SMOTE） was used to address class imbalance. Model performance was evaluated in the independent testing set using area under the curve（ AUC）， sensitivity， specificity， accuracy， calibration curve （Hosmer-Lemeshow test）， and decision curve analysis（ DCA）， with variable importance interpreted using Shapley additive explanations（ SHAP） values. Statistical analysis was performed using Python（ XGBoost， scikit-learn） and R software. Results There were no statistically significant differences between patients in the training and testing sets in terms of gender， age， disease duration， baseline PANSS scores， comorbid anxiety disorder ratio， serum IL-6 levels， and genotype distribution at the DRD2 gene rs1076560 locus（ all P＞0.05）. The XGBoost model identified five important predictors during training/validation： baseline PANSS positive symptom score， disease duration， serum IL-6 levels， genotype distribution at the DRD2 gene rs1076560 locus， and comorbid anxiety disorder. The model performance in the testing set was as follows： accuracy 0.833， sensitivity 0.794， specificity 0.885， AUC of 0.897［ 95%CI（ 0.808，0.986）］， Hosmer-Lemeshow test P=0.620， with good calibration. DCA indicated that when the threshold probability exceeded 0.25， the model demonstrated greater clinical net benefit compared to a single predictor. Conclusions The XGBoost prediction model established in this retrospective study effectively identifies high-risk patients for olanzapine treatment failure at 8 weeks within this cohort. Key identified factors include symptom severity， disease duration， inflammatory indicators， genetic polymorphisms， and comorbidities. The model needs to be validated in external cohorts before it can be used for clinical decision support.

参考文献

相似文献

引证文献

文章指标

点击次数:
下载次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2026-05-19