Skip to main content

A risk prediction model for venous thromboembolism in hospitalized patients with thoracic trauma: a machine learning, national multicenter retrospective study

Abstract

Background

Early treatment and prevention are the keys to reducing the mortality of VTE in patients with thoracic trauma. This study aimed to develop and validate an automatic prediction model based on machine learning for VTE risk screening in patients with thoracic trauma.

Methods

In this national multicenter retrospective study, the clinical data of chest trauma patients hospitalized in 33 hospitals in China from October 2020 to September 2021 were collected for model training and testing. The data of patients with thoracic trauma at Shanghai Sixth People’s Hospital from October 2021 to September 2022 were included for further verification. The performance of the model was measured mainly by the area under the receiver operating characteristic curve (AUROC) and the mean accuracy (mAP), and the sensitivity, specificity, positive predictive value, and negative predictive value were also measured.

Results

A total of 3116 patients were included in the training and validation of the model. External validation was performed in 408 patients. The random forest (RF) model was selected as the final model, with an AUROC of 0·879 (95% CI 0·856–0·902) in the test dataset. In the external validation, the AUROC was 0.83 (95% CI 0.794–0.866), the specificity was 0.756 (95% CI 0.713–0.799), the sensitivity was 0.821 (95% CI 0.692–0.923), the negative predictive value was 0.976 (95% CI 0.958–0.993), and the positive likelihood ratio was 3.364.

Conclusions

This model can be used to quickly screen for the risk of VTE in patients with thoracic trauma. More than 90% of unnecessary VTE tests can be avoided, which can help clinicians target interventions to high-risk groups and ensure resource optimization. Although further validation and improvement are needed, this study has considerable clinical value.

Introduction

Venous thromboembolism (VTE) includes pulmonary thromboembolism (PE) and deep venous thromboembolism (DVT). The incidence of VTE among hospitalized patients in the United States ranges from 10–40%, and up to 60% of surgical patients are at intermediate or high risk for VTE [1, 2]. The incidence of VTE ranges from 7 to 53%, depending on the type of disease, and survivors also have varying degrees of sequelae. [2,3,4] According to different reports, the mortality rate of DVT ranges from 2·6 to 19·6%, [2, 5] and the mortality rate of PE ranges from 5·2 to 19·6% [2, 6]. VTE is one of the most important fatal complications in trauma patients [7, 8], and it is one of the leading causes of death in trauma patients who survive beyond the first day [9]. The incidence of VTE in patients with trauma ranges from 11·8–65% [9]. The incidence of VTE can reach 40–80% in patients with severe or multiple traumas who do not receive thromboprophylaxis [7, 10]. It poses great clinical risk and financial burden to patients and medical institutions. [11]

Unlike the traditional view that PE is secondary to DVT in the past, PE and DVT in patients with thoracic trauma are independent clinical events [12, 13]. Therefore, identifying high-risk individuals before further examination to make personalized prevention, diagnosis, and treatment decisions is urgently needed by thoracic surgeons [14, 15]. In our previous literature search, we found that there was no targeted prediction model for VTE in patients with thoracic trauma, and there was a lack of efficient prediction methods. (eText1, eFig1).

Materials and methods

This study was approved by the Ethics Committee of the Third Hospital of Shijiazhuang (2021) Ethical Approval No. 047 and was reviewed by all participating institutions.

Subjects

The clinical data of 5774 patients with thoracic trauma treated at 33 tertiary or secondary medical institutions in China from October 2020 to September 2021 were collected. After screening according to the inclusion and exclusion criteria, 3116 patients were included in this study. In addition, we enrolled 474 patients with thoracic trauma in a single center at six hospitals in Shanghai from October 2021 to September 2022. Among these patients, 408 patients who met the inclusion criteria were used for external validation. All the participants provided written informed consent. The inclusion and exclusion criteria were as follows:

Inclusion criteria

(1) 25–80 years old, admitted within 3 days after chest trauma; (2) for patients with thoracic trauma combined with other injuries, the abbreviated injury score (AIS) was defined as AIS ≥ 3 for thoracic injury and AIS ≤ 3 for other injury sites independently.

Exclusion criteria were as follows

(1) received preventive physical or chemical anticoagulation therapy before admission; (2) VTE occurred before admission; and (3) except for thoracic injury, the other injury sites had an independent AIS score > 3. (4) Incomplete case data. (5) Written informed consent was not provided.

Main prediction results and diagnostic methods

The primary outcome was in-hospital VTE, including DVT confirmed via deep vein ultrasonography and PE confirmed via CTPA [16]. DVT was routinely detected, and PE was selectively detected.

Development of machine learning models

The development process of the model is shown in Fig. 1, which is divided into the following steps:

Fig. 1
figure 1

Flowchart of the study and data analyses

Data collection and sorting

All the data were collected and managed via the designated chest trauma cloud database. All analyses were performed via R Studio (version 4.2.2) and Python (version 3.9.0).

A total of 25 clinical data and epidemiological characteristics of patients with thoracic trauma [14, 16,17,18,19], including sex, age, body mass index (BMI), number of broken ends of rib fractures, treatment methods, surgical treatment plans, underlying diseases, and complications, were recorded. (eTable 1).

Selection of related factors

The development of the machine learning model began with the Python-based BorutaShap algorithm (eMethod 1) for feature selection [20,21,22]. Only the influencing factors screened from the feature selection were included in the final machine learning model.

Sample size calculation and data segmentation

At present, there is no perfect sample size calculation rule for the training and validation of machine learning models [23, 24]. Generally, the research data are divided into a training set, test set, and external validation set = 3:1:1. In this study, the sample size calculation method of Riley et al. [25] was used to calculate the minimum sample size of the training set and the external validation set in three steps (eMethod 2; eFig2; eFig3).

Model training and verification

Five machine learning models were trained [26, 27]: logistic regression (LR) [28, 29], random forest (RF) [30], support vector machine (SVM) [31], multilayer perceptron (MLP) [32], and gradient boosting machine (GBM) [33]. The synthetic minority oversampling technique (SMOTE) was used to address imbalanced data in the training set [34]. The test set was used to validate the five models to further screen out the best-performing machine learning model. To ensure the fairness of the comparison, algorithms that had advantages in some training processes received corresponding weakening adjustments. The logistic regression model was trained on the basis of the Akaike information criterion and reverse feature method. To adjust the hyperparameters of the RF, SVM, MLP, and GBM models, we defined a hyperparameter space for each model and performed a grid search and triple cross-validation in the training set. The hyperparameters that yielded the highest area under the receiver operating characteristic curve (AUROC) for each model during validation were selected and fixed for subsequent model testing. The outcome measures included the AUROC for each model and the mean accuracy (mAP) of the model, as well as the sensitivity, specificity, positive predictive values, and negative predictive values. The mAP was defined as the average precision over all the recall rates and was equal to the area under the precision‒recall curve in the current study. The bootstrap method was used to calculate the 95% confidence interval (CI) and compare the AUROC and average accuracy curves. The best-performing model was subjected to operation-point analysis with the highest Youden index as the operating point of the ROC curve. The AgrestiCoull method was used to calculate 95% confidence intervals (CIs) for the sensitivity, specificity, and positive and negative predictive values [35]. Differences in sensitivity and specificity were analyzed via McNemar’s chi-square test, and differences in predictive values were analyzed via generalized score statistics [36]. The Shapley additive explanatory value (SHAP value) is used to measure the contribution of each feature to the model output. [37]The output probabilities were calibrated via the Platt scale, and a calibration curve was drawn. To assess the net clinical benefit of different thresholds, we performed a decision curve analysis. [38, 39]As the actual risk of VTE in the screened population was low, the use of the polygenic risk score (PRS) instead of the actual risk of VTE is more conducive to further analysis [40]. The PRS reflects the estimated actual risk via the following formula:

$${\text{Estimated actual}} {\text{risk }} = \frac{{\exp \left( {6\cdot37 \times PRS - 6\cdot41} \right)}}{{1 + \exp \left( {6\cdot37 \times PRS - 6\cdot41} \right)}}$$

Finally, an external validation set was used to validate the performance of the optimal prediction model. In the above procedures, all tests were two-sided, and p values less than 0 \(\cdot\) 05 were considered to indicate significant differences.

Construction of the visual web program

The research team constructed a visualization procedure (eMethod 3; eFig. 4; eFig. 5) for the model on the basis of the gap value and PRS of the model.

Results

Selection of the main prediction results

Among the 3116 patients with thoracic trauma included in the analysis, there were 190 cases of VTE, with an incidence of approximately 6 \(\cdot\) 1%, including 183 cases of DVT and 17 cases of PE, seven of which were found to have PE without DVT. There were 37 VTE cases in the external validation set of 408 cases, with an incidence of 9·1%, including 41 cases of DVT and three cases of PE, of which two cases of PE were detected without DVT. Because the sample size and incidence of PE cases were much lower than those of DVT cases, VTE was used as the only main prediction result.

Feature selection

Twelve related factors were selected: age, BMI, number of broken ends of rib fracture, rib fracture surgery, multiple traumas, lower limb fracture, tracheal intubation, blood transfusion, D-D 24 h, PT 24 h, Plt 24 h, and Hb 24 h.

Machine learning model training and screening

Segmentation of data

The study population was divided into a training group (before) and a test group (after), with June 30, 2021, as the cutoff. A ratio of 3:1 was used to divide the 2337 samples before June 30, 2021, into the training set and 779 samples after June 30, 2021, into the test set. The sample size of the training set used in this study was significantly larger than the minimum calculated sample size (eMethod 2; eFig 2). eTable 2 shows the baseline data after variable screening and grouping.

Machine learning model training performance

All the models performed well in the training set, and the performances of all the parameters of the training and test sets are shown in Table 1.

Table 1 Prediction performance of the machine learning models in the training and testing sets

External verification

Data from 408 patients with chest trauma who met the inclusion criteria were included in an external validation set to further verify the performance of the RF prediction model (Table 2). Compared with the test dataset, the RF model also had similar AUROCs (0·83, 95% CI 0·794–0·866) and mAPs (0·688) in the external validation dataset.

Table 2 Comparison of the results of the test set with those of the external verification set

We analyzed the relative influence of the 12 features on the model output according to the mean absolute SHAP values (Fig. 3A). In addition, to further clarify the influence of different features on the model output, we plotted the prediction plots of negative and positive correlations (Fig. 3B).

Discussion

This study has the advantages of being a multicenter study with a large sample size and can better reflect the incidence of VTE in patients with thoracic trauma during hospitalization in most parts of China. The use of machine learning algorithms can ensure the rigor of research to the greatest extent possible and improve the accuracy and generalizability of prediction models. [26, 27]

In this model, the excluded features had little influence on the final prediction results, and if they were included together, the model would overfit [27]. Brain trauma, spinal fracture, pelvic fracture, abdominal injury, upper limb fracture, and surgery in other departments are known risk factors for VTE [18, 41], but they did not have a significant influence in this study. Only patients with simple thoracic trauma and severe thoracic trauma (independent AIS score ≥ 3) and patients with mild trauma (independent AIS score ≤ 3) were included in this study. Therefore, injuries to other parts of the body with minor injuries and surgeries had an insufficient influence on the final prediction results of the model and were not included in this model. Among the 12 features included in the model, the influence of different features on the final prediction results is shown in Fig. 3. Lower limb fracture, Hb24h, age, BMI, and the number of broken ends of the rib fracture had the highest correlation with the final prediction result. Tracheal intubation, PT24h, blood transfusion, D-D24h, multiple traumas, Plt24h, and rib surgery were also strongly correlated with the final prediction.

Lower extremity fracture, age, BMI, tracheal intubation, PT24h, D-D24h, and multiple traumas were positively correlated with VTE. Hb24h and Plt24h levels were negatively correlated with the occurrence of VTE, and blood transfusion was positively correlated with the occurrence of VTE. These results are consistent with previous findings (eText 2). [7, 18, 41,42,43,44]

The number of broken ends of rib fractures had a considerable influence on the final prediction results. In addition to the common risk factors for trauma, patients with thoracic trauma have their own characteristics [11, 14, 45]. Rib fracture, hemopneumothorax, pulmonary contusion directly caused by trauma, pneumonia, acute respiratory distress syndrome (ARDS), and respiratory failure indirectly caused by trauma are all risk factors for VTE, which may be caused by fractures, chest strap fixation, and prolonged bed rest [8, 41, 46, 47]. Patients with chest trauma, especially those with multiple rib fractures or flail chests, have limited breathing, poor sputum excretion, and an increased risk of respiratory infections. [46,47,48]

In the screening process of the machine learning models, the RF model showed obvious advantages over the other machine learning models in terms of the ROC curve, P-R curve, calibration curve, and DCA curve (Fig. 2). The highest AUC (0.879) and mAP (0.44) indicated that the RF model had the highest accuracy [27]. As shown in the calibration curve, the RF model had the highest degree of calibration. In the DCA curve, the RF model also showed the greatest net benefit, indicating that the RF model has the highest clinical value while meeting the actual needs of clinical decision-making [38]. When the incidence is low, the RF model has greater accuracy and clinical value. The overall incidence of VTE in this study (6·1%) was low and consistent with the incidence of VTE observed in clinical practice. [1, 2]

Fig. 2
figure 2

Receiver operating characteristic curve (A), precision‒recall curve (B), calibration curve (C), and decision curve (D)

Trauma patients, especially those with multiple injuries, are critically ill and have poor underlying conditions [49]. Unnecessary examinations and transportation often result in unnecessary economic burdens and clinical risks to patients. In this study, a machine learning model was used to automatically generate an objective PRS score by inputting easily available hospitalization data and epidemiological information of hospitalized patients, which effectively removed subjective factors in clinical VTE screening. The negative predictive value was 0.976 (95% CI 0.958–0.993). This means that approximately 97·6% of patients with thoracic trauma can avoid unnecessary transport and invasive tests when the model is used to predict the probability of VTE.

Notably, although rib fracture surgery was positively associated with VTE in this study, it had a relatively weak effect. Figure 3(B) shows that not undergoing rib surgery had little effect on the occurrence of VTE, but the patient population that underwent surgery had both positive and negative effects. This may be due to the large differences in the degree of rib fracture in the patients with thoracic trauma included in this study and the different indications for rib surgery at each center. Recent studies have shown that rib surgery may have a protective effect on pulmonary complications and VTE in patients with severe rib fractures [47, 48, 50]. Studies have also reported that the incidence of VTE in patients who undergo surgery as soon as possible may be lower than that in delayed surgery patients [16]. This study could not determine whether rib surgery had a protective effect on VTE in patients at this time, which requires more targeted and standardized research.

Fig. 3
figure 3

Mean SHAP value (A), SHAP value (B)

This study had several limitations. First, the incidence of VTE in patients with thoracic trauma in this study was lower than that reported in other studies [51]. Because of the differences in physicians’ clinical experience and because many asymptomatic PE cases occur without significant clinical manifestations, data from many patients with PE were not included in the study [12, 52]. In addition, only patients admitted within three days after trauma were included in this study. In a study of patients with acute trauma, up to 62% of VTE cases were reported after hospital discharge.8 This study also had problems such as insufficient sample size, imperfect inclusion and exclusion criteria, incomplete long-term follow-up, insufficient data homogeneity, and an insufficient external validation set. Further multicenter studies with larger sample sizes and improved internal and external consistency are needed. In addition, the fully automated clinical screening model developed in this study will require further regulatory review and approval to evaluate its performance, potential risks, and benefits for broad clinical applications.

In conclusion, we developed and tested a machine learning model to predict the probability of VTE in patients with thoracic trauma during the perioperative period. This convenient and automated screening method showed comparable diagnostic performance and prevented 97·6% of unnecessary lower-extremity vascular ultrasound or CTPA screening. The results of our study have the advantages of noninvasive examination, convenience, and high efficiency, which can significantly improve the efficiency of VTE prevention and treatment in patients with thoracic trauma and pave the way for better optimization of medical resources.

Data availability

The data of this multicenter study were collected from 33 medical institutions in China. The data were analyzed through the self-developed cloud database of chest trauma patients. Considering the infringement of patients’ privacy rights and the violation of national medical data confidentiality regulations, it is not convenient to share the original data to the world for the time being. The original data were obtained from the corresponding author when reasonable. In this study, Kaibin Liu, Di qian and Dongsheng Zhang have limited data sharing rights. They had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis. They can obtain the individual patient data of multi-center for data analysis, but cannot modify the data information or share the data through network channels.

References

  1. Blitzer RR, Eisenstein S. Venous thromboembolism and pulmonary embolism: strategies for prevention and management. Surg Clin North Am. 2021;101(5):925–38. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.suc.2021.06.015.

    Article  PubMed  Google Scholar 

  2. Lutsey PL, Zakai NA. Epidemiology and prevention of venous thromboembolism. Nat Rev Cardiol. 2023;20(4):248–62. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41569-022-00787-6.

    Article  PubMed  Google Scholar 

  3. Tagalakis V, Patenaude V, Kahn SR, Suissa S. Incidence of and mortality from venous thromboembolism in a real-world population: the Q-VTE Study Cohort. Am J Med. 2013;126(9):832.e813-821. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.amjmed.2013.02.024.

    Article  Google Scholar 

  4. Heit JA, Cohen AT, Anderson FA. Estimated annual number of incident and recurrent, non-fatal and fatal venous thromboembolism (VTE) events in the US. Blood. 2005;106(11):910–910. https://doiorg.publicaciones.saludcastillayleon.es/10.1182/blood.V106.11.910.910.

    Article  Google Scholar 

  5. Minges KE, Bikdeli B, Wang Y, Attaran RR, Krumholz HM. National and regional trends in deep vein thrombosis hospitalization rates, discharge disposition, and outcomes for medicare beneficiaries. Am J Med. 2018;131(10):1200–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.amjmed.2018.04.033.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Minges KE, Bikdeli B, Wang Y, et al. National trends in pulmonary embolism hospitalization rates and outcomes for adults aged ≥65 years in the United States (1999 to 2010). Am J Cardiol. 2015;116(9):1436–42. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.amjcard.2015.07.068.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Peng S, Zhang M, Jin J, MacCormick AD. The effectiveness of venous thromboembolism prophylaxis interventions in trauma patients: A systematic review and network meta-analysis. Injury. 2023;54(12): 111078. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.injury.2023.111078.

    Article  PubMed  Google Scholar 

  8. Park MS, Perkins SE, Spears GM, et al. Risk factors for venous thromboembolism after acute trauma: a population-based case-cohort study. Thromb Res. 2016;144:40–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.thromres.2016.03.026.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Barrera LM, Perel P, Ker K, Cirocchi R, Farinella E. Morales Uribe CH Thromboprophylaxis for trauma patients. Cochrane Database Syst Rev. 2013;3: Cd08303. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/14651858.CD008303.pub2.

    Article  Google Scholar 

  10. Recommendations from the ICM-VTE. Trauma. J Bone Joint Surg Am. 2022;104(1):280–308. https://doiorg.publicaciones.saludcastillayleon.es/10.2106/jbjs.21.01476.

    Article  Google Scholar 

  11. Rapp CM, Shields EJ, Wiater BP, Wiater JM. Venous thromboembolism after shoulder arthoplasty and arthroscopy. J Am Acad Orthop Surg. 2019;27(8):265–74. https://doiorg.publicaciones.saludcastillayleon.es/10.5435/jaaos-d-17-00763.

    Article  PubMed  Google Scholar 

  12. Knudson MM, Moore EE, Kornblith LZ, et al. Challenging traditional paradigms in posttraumatic pulmonary thromboembolism. JAMA Surg. 2022;157(2): 216356. https://doiorg.publicaciones.saludcastillayleon.es/10.1001/jamasurg.2021.6356.

    Article  Google Scholar 

  13. Wenger N, Sebastian T, Engelberger RP, Kucher N, Spirk D. Pulmonary embolism and deep vein thrombosis: Similar but different. Thromb Res. 2021;206:88–98. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.thromres.2021.08.015.

    Article  CAS  PubMed  Google Scholar 

  14. Shuster R, Mathew J, Olaussen A, et al. Variables associated with pulmonary thromboembolism in injured patients: A systematic review. Injury. 2018;49(1):1–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.injury.2017.08.024.

    Article  PubMed  Google Scholar 

  15. Richie CD, Castle JT, Davis GA, et al. Modes of failure in venous thromboembolism prophylaxis. Angiology. 2022;73(8):712–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/00033197221083724.

    Article  CAS  PubMed  Google Scholar 

  16. Cerbasi S, Bernasconi A, Balato G, et al. Assessment of deep vein thrombosis using routine pre- and postoperative duplex doppler ultrasound in patients with pelvic trauma a prospective study. Bone Joint J. 2022;104-b(2):283–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1302/0301-620x.104b2.Bjj-2021-0764.R2.

    Article  PubMed  Google Scholar 

  17. Pastori D, Cormaci VM, Marucci S, et al. A comprehensive review of risk factors for venous thromboembolism: from epidemiology to pathophysiology. Int J Mol Sci. 2023;24(4):3169. https://doiorg.publicaciones.saludcastillayleon.es/10.3390/ijms24043169.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Goldhaber SZ. Risk factors for venous thromboembolism. J Am Coll Cardiol. 2010;56(1):1–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jacc.2010.01.057.

    Article  PubMed  Google Scholar 

  19. Guan C, Ma F, Chang S, Zhang J. Interpretable machine learning models for predicting venous thromboembolism in the intensive care unit: an analysis based on data from 207 centers. Crit Care. 2023;27(1):406. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13054-023-04683-4.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Keany E (2021) BorutaShap 1.0.15 2020. Accessed 26 May 2021.

  21. Wang R, Shao X, Zheng J, et al. A machine-learning approach to identify a prognostic cytokine signature that is associated with nivolumab clearance in patients with advanced melanoma. Clin Pharmacol Ther. 2020;107(4):978–87. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/cpt.1724.

    Article  CAS  PubMed  Google Scholar 

  22. Lundberg SM, Erion G, Chen H, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s42256-019-0138-9.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Baeza-Delgado C, Cerdá Alberich L, Carot-Sierra JM, et al. A practical solution to estimate the sample size required for clinical prediction models generated from observational research on data. Eur Radiol Exp. 2022;6(1):22. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s41747-022-00276-y.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Jiang X, Hoffmeister M, Brenner H, et al. End-to-end prognostication in colorectal cancer by deep learning: a retrospective, multicenter study [published correction appears in Lancet Digit Health. Lancet Digit Health. 2024;6(1):e33–43. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/s2589-7500(23)00208-x.

    Article  CAS  PubMed  Google Scholar 

  25. Riley RD, Ensor J, Snell KIE, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368: m441. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/bmj.m441.

    Article  PubMed  Google Scholar 

  26. Deo RC. Machine learning in medicine. Circulation. 2015;132(20):1920–30. https://doiorg.publicaciones.saludcastillayleon.es/10.1161/circulationaha.115.001593.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med. 2018;284(6):603–19. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/joim.12822.

    Article  CAS  PubMed  Google Scholar 

  28. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019;110:12–22. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jclinepi.2019.02.004.

    Article  PubMed  Google Scholar 

  29. Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis. Int J Med Inform. 2021;151: 104484. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.ijmedinf.2021.104484.

    Article  PubMed  Google Scholar 

  30. Hu J, Szymczak S. A review on longitudinal data analysis with random forest. Brief Bioinform. 2023;24(2):002. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/bib/bbad002.

    Article  Google Scholar 

  31. Wang H, Shao Y, Zhou S, Zhang C, Xiu N. Support vector machine classifier via L(0/1) soft-margin loss. IEEE Trans Pattern Anal Mach Intell. 2022;44(10):7253–65. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/tpami.2021.3092177.

    Article  PubMed  Google Scholar 

  32. Zhang Z, Zhou D, Zhang J, et al. Multilayer perceptron-based prediction of stroke mimics in prehospital triage. Sci Rep. 2022;12(1):17994. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-022-22919-1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Atkinson EJ, Therneau TM, Melton LJ 3rd, et al. Assessing fracture risk using gradient boosting machine (GBM) models. J Bone Miner Res. 2012;27(6):1397–404. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/jbmr.1577.

    Article  PubMed  Google Scholar 

  34. Xu Z, Shen D, Kou Y, Nie T. A synthetic minority oversampling technique based on gaussian mixture model filtering for imbalanced data classification. IEEE Trans Neural Netw Learn Syst. 2024;35(3):3740–53. https://doiorg.publicaciones.saludcastillayleon.es/10.1109/tnnls.2022.3197156.

    Article  PubMed  Google Scholar 

  35. Zhou XH, Li CM, Yang Z. Improving interval estimation of binomial proportions. Philos Trans A Math Phys Eng Sci. 1874;2008(366):2405–18. https://doiorg.publicaciones.saludcastillayleon.es/10.1098/rsta.2008.0037.

    Article  Google Scholar 

  36. Fisher MJ, Marshall AP, Mitchell M. Testing differences in proportions. Aust Crit Care. 2011;24(2):133–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.aucc.2011.01.005.

    Article  PubMed  Google Scholar 

  37. Kawano K, Otaki Y, Suzuki N, et al. Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study. Sci Rep. 2022;12(1):14154. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41598-022-18276-8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Van Calster B, Wynants L, Verbeek JFM, et al. Reporting and interpreting decision curve analysis: a guide for investigators. Eur Urol. 2018;74(6):796–804. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.eururo.2018.08.038.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Vickers AJ, Holland F. Decision curve analysis to evaluate the clinical benefit of prediction models. Spine J. 2021;21(10):1643–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.spinee.2021.02.024.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Choi SW, Mak TS, O’Reilly PF. Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. 2020;15(9):2759–72. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41596-020-0353-1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Anderson FA Jr, Spencer FA. Risk factors for venous thromboembolism. Circulation. 2003;107(23 Suppl 1):I9-16. https://doiorg.publicaciones.saludcastillayleon.es/10.1161/01.Cir.0000078469.07362.E6.

    Article  PubMed  Google Scholar 

  42. Gregson J, Kaptoge S, Bolton T, et al. Cardiovascular risk factors associated with venous thromboembolism. JAMA Cardiol. 2019;4(2):163–73. https://doiorg.publicaciones.saludcastillayleon.es/10.1001/jamacardio.2018.4537.

    Article  PubMed  Google Scholar 

  43. Rahmani J, Haghighian Roudsari A, Bawadi H, et al. Relationship between body mass index, risk of venous thromboembolism and pulmonary embolism: a systematic review and dose-response meta-analysis of cohort studies among four million participants. Thromb Res. 2020;192:64–72. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.thromres.2020.05.014.

    Article  CAS  PubMed  Google Scholar 

  44. Ding D, Sekar P, Moomaw CJ, et al. Venous thromboembolism in patients with spontaneous intracerebral hemorrhage: a multicenter study. Neurosurgery. 2019;84(6):E304-e310. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/neuros/nyy333.

    Article  PubMed  Google Scholar 

  45. Whiting PS, Jahangir AA. Thromboembolic disease after orthopedic trauma. Orthop Clin North Am. 2016;47(2):335–44. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.ocl.2015.09.002.

    Article  PubMed  Google Scholar 

  46. Talbot BS, Gange CP, Jr., Chaturvedi, A., Klionsky, N., Hobbs, S. K., Chaturvedi, A. Traumatic Rib Injury: Patterns, Imaging Pitfalls, Complications, and Treatment. Radiographics. 2017;37(2):628–51. https://doiorg.publicaciones.saludcastillayleon.es/10.1148/rg.2017160100.

    Article  PubMed  Google Scholar 

  47. Zhang D, Yang Y, Yi Y, Wang D, Jiang L, Huang H, Jin L, Meng H, Xia F, Guo G. Incidence and influence factors of venous thromboembolism in traumatic rib fracture patient: a multicenter study. J Orthop Surg Res. 2024;19(1):152. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13018-024-04622-1.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Choi J, Gomez GI, Kaghazchi A, Borghi JA, Spain DA, Forrester JD. Surgical stabilization of rib fracture to mitigate pulmonary complication and mortality: a systematic review and bayesian meta-analysis. J Am Coll Surg. 2021;232(2):211-219.e212. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jamcollsurg.2020.10.022.

    Article  PubMed  Google Scholar 

  49. Neugebauer EA, Waydhas C, Lendemans S, Rixen D, Eikermann M, Pohlemann T. The treatment of patients with severe and multiple traumatic injuries. Dtsch Arztebl Int. 2012;109(6):102–8. https://doiorg.publicaciones.saludcastillayleon.es/10.3238/arztebl.2012.0102.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Kaibin Liu, Yi Yang. Analysis of risk factors and construction of risk prediction model for venous thromboembolism in 787 patients with thoracic trauma in a single center[J/OL]. Chin J Thoracic Surg (Electronic Editon). 2023;10(2):57–62. https://doiorg.publicaciones.saludcastillayleon.es/10.3877/cma.j.issn.2095-8773.2023.02.01.

    Article  Google Scholar 

  51. Heit JA. Epidemiology of venous thromboembolism. Nat Rev Cardiol. 2015;12(8):464–74. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/nrcardio.2015.83.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Rali PM, Criner GJ. Submassive pulmonary embolism. Am J Respir Crit Care Med. 2018;198(5):588–98. https://doiorg.publicaciones.saludcastillayleon.es/10.1164/rccm.201711-2302CI.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Additional Contributions: We thank the members of the CCIRS study group and patients who participated in the trial.

Author information

Authors and Affiliations

Authors

Consortia

Contributions

Kaibin Liu, Di qian, and Dongsheng Zhang had full access to all the data in the study and took responsibility for the integrity of the data and the accuracy of the data analysis. All the members of the CCIRS contributed equally to this study. Concept and design: All authors. Acquisition, analysis, or interpretation of data: All members of the CCIRS. Drafting of the manuscript: Kaibin Liu, Di Qian, Dongsheng Zhang, Zhichao Jin, Yi Yang, and Yanfang Zhao. Critical review of the manuscript for important intellectual content: Yi Yang and Yanfang Zhao, members of the CCIRS. Statistical analysis: Kaibin Liu, Di qian, Yanfang Zhao, Zhichao Jin. Obtained funding: All members of the China Chest Injury Research Society (CCIRS). Administrative, technical, or material support: Zhichao Jin, Yi Yang, Yanfang Zhao, and all members of the CCIRS. Supervision: All authors. Conflict of Interest Disclosures: None. Group Information: CCIRS members. Data Sharing Statement: See Supplement. The machine learning part of this study was performed via R Studio (version 4.2.2) and Python (version 3.9.0). Machine learning was used only for model training and validation. All the authors take responsibility for the integrity of the content generated.

Corresponding authors

Correspondence to Yi Yang or Yanfang Zhao.

Ethics declarations

Informed consent

Written informed consent was obtained from the patient for publication of this case report and accompanying images. A copy of the written consent is available for review by the Editor-in-Chief of this journal upon request.

Competing interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, K., Qian, D., Zhang, D. et al. A risk prediction model for venous thromboembolism in hospitalized patients with thoracic trauma: a machine learning, national multicenter retrospective study. World J Emerg Surg 20, 14 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13017-025-00583-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13017-025-00583-w

Keywords