欢迎来到《四川大学学报(医学版)》

多模态数据驱动的妇科手术时长预测研究

Multimodal Data-Driven Prediction of Gynecological Surgery Duration

  • 摘要:
    目的 本研究聚焦妇科手术,旨在通过人工智能技术提取非结构化手术规划步骤文本特征以融合多模态数据,建立妇科手术时长预测模型。
    方法 纳入2022年1月–2024年10月34614例四川大学华西第二医院的妇科手术患者信息,搭建Embedding-Transformer模型将手术规划步骤文本转化为一维数值特征(步骤特征),通过对比线性回归、随机森林、极限梯度提升算法(XGBoost)、支持向量回归、K近邻回归和人工神经网络在输入和不输入步骤特征两种场景下的性能改进以评估其预测价值。采用平均绝对误差(mean absolute error, MAE)、均方根误差(root mean squared error, RMSE)和判定系数(R-squared, R2)评估模型的预测准确度;进一步采用SHapley Additive exPlanations(SHAP)值描述模型的可解释性。
    结果 SHAP结果显示,步骤特征具有最高的预测贡献度。手术排程的时间因素亦对妇科手术时长有影响。XGBoost模型在测试集中表现最优,较未输入步骤特征的基线模型显著提升预测精度R2为40.43%,MAE和RMSE分别降低21.27%和20.13%。
    结论 本研究搭建的Embedding-Transformer模型能有效提取手术规划步骤文本特征,提高机器学习模型的预测性能。XGBoost预测模型可辅助医院管理者开展更精细化的妇科手术管理,提高手术资源利用效率。

     

    Abstract:
    Objective  Focusing on gynecological surgery, we constructed a prediction model for surgical duration by extracting features from unstructured surgical planning texts and integrating multimodal data via artificial intelligence technology.
    Methods  The clinical data of 34614 patients who underwent gynecologic surgeries at West China Second University Hospital, Sichuan University between January 2022 and October 2024 were collected. An embedding-transformer model was constructed to convert surgical planning texts into a one-dimensional numerical feature, referred to as the step feature. The predictive value of the step feature was assessed by comparing the performance improvements of linear regression, random forest, eXtreme Gradient Boosting (XGBoost), support vector regression, K-nearest neighbor regression, and artificial neural network algorithms in two scenarios—with and without the step feature as an input. The out-of-sample prediction accuracy of the models was assessed using mean absolute error (MAE), root mean squared error (RMSE), and R-squared (R2). Furthermore, the model interpretability was examined using SHapley Additive exPlanations (SHAP) values.
    Results  SHAP results showed that the step feature had the highest predictive contribution. Temporal factors in surgical scheduling also influenced gynecological surgery duration. The XGBoost model demonstrated optimal performance on the test set, significantly improving prediction accuracy with a 40.43% increase in R2, while reducing MAE and RMSE by 21.27% and 20.13%, respectively, compared to the baseline model without the step feature.
    Conclusion  The embedding-transformer model developed in this study effectively extracts features from surgical planning texts and enhances the predictive performance of machine learning models. The XGBoost prediction model can assist hospital administrators in implementing more refined management of gynecological surgeries and improving the utilization efficiency of surgical resources.

     

/

返回文章
返回