Abstract

OBJECTIVES

A study of the performance of in-hospital/30-day mortality risk prediction models using an alternative machine learning algorithm (XGBoost) in adults undergoing cardiac surgery.

METHODS

Retrospective analyses of prospectively routinely collected data on adult patients undergoing cardiac surgery in the UK from January 2012 to March 2019. Data were temporally split 70:30 into training and validation subsets. Independent mortality prediction models were created using sequential backward floating selection starting with 61 variables. Assessments of discrimination, calibration, and clinical utility of the resultant XGBoost model with 23 variables were then conducted.

RESULTS

A total of 224,318 adults underwent cardiac surgery during the study period with a 2.76% (N = 6,100) mortality. In the testing cohort, there was good discrimination (area under the receiver operator curve 0.846, F1 0.277) and calibration (especially in high-risk patients). Decision curve analysis showed XGBoost-23 had a net benefit till a threshold probability of 60%. The most important variables were the type of operation, age, creatinine clearance, urgency of the procedure and the New York Heart Association score.

CONCLUSIONS

Feature-selected XGBoost showed good discrimination, calibration and clinical benefit when predicting mortality post-cardiac surgery. Prospective external validation of a XGBoost-derived model performance is warranted.

INTRODUCTION

Prediction models are used in international guidelines [1, 2] to determine the most appropriate treatments and enable clinicians to counsel patients. With reductions in mortality, the emergence of minimally invasive surgery and interventional procedures, updated and more accurate models are needed. Accurate predictions also help in benchmarking individual surgical and institutional results. Older models, such as the logistic European System for Cardiac Operative Risk Evaluation (EuroSCORE) [3] and EuroSCOREII [4], need to be updated. The primary limitation of previous models is poor calibration with over-estimation of risk in the highest risk group [5–10]. Their performance varies by era, centre [4, 11–13] and inherent procedural risk [5, 14, 15], and the application of risk-adjusted mortality ratios is recommended [4, 12].

Current models utilize logistic regression (LR). Complex interactions may exist between the input variables and would need to be accounted for by the model developer. Alternative machine learning (ML) models use algorithms to interpret these interactions using large volumes of data. We have previously shown that alternative ML-based models had statistically better discrimination and clinical utility than a retrained-LR model but similar calibration when using the EuroSCOREII [11] variables. The most consistent improvement in performance was with a XGBoost-based model.

Herein, we have built upon our previous work by utilising all available variables routinely collected in the UK National Adult Cardiac Surgery Audit (NACSA). It is the largest dataset reviewed and utilizes an exhaustive variable selection approach in order to create a novel XGBoost-based model.

METHODS

Ethics statement

This register-based cohort study is part of research approved by the Health Research Authority and Health and Care Research Wales and a need for patients consent was waived (IRAS ID: 257758, 23.7.2019). Reporting of results follows the TRIPOD statement.

Data extraction

A complete extract of prospectively collected data from the NACSA was obtained from the National Institute of Cardiovascular Outcomes Research (NICOR), containing data on all adults undergoing cardiac surgery in England and Wales between January 2012 and March 2019. Patients undergoing transplant, congenital surgery or isolated mechanical support insertion were excluded. The outcome measure was death within the same admission or 30 days of the index operation. Cases where the outcome measure was missing were excluded. Data processing and imputation of missing data were as previously described using R v4.0.2 [16]: ‘age and weight were imputed using the median and the absence of other variables was presumed to be an absence of that risk factor, a method previously established by NACSA’ [17, 18]. Assumptions and pre-processing of data were in line with previously published NACSA guidelines [11, 17, 18]. Missing data were <1%.

The variables routinely collected by NACSA were reviewed by 2 clinicians and a data analyst. Based on published literature, clinical experience and degree of missingness of the data 71 variables were initially reviewed. Those variables representing singular episodes were removed. The variables for pre-existing cardiac rhythm and the need for mechanical support were combined as shown in Supplementary Material, Fig. S1. Ultimately 61 variables were utilized (Supplementary Material, Fig. S1). These variables are defined in the NACSA database v5.1.0 [19].

Statistical analysis

Categorical variables were summarized as counts and percentages. Continuous variables were summarized as mean and standard deviation.

Model development

Both models were developed with Python [20] Scikit-learn v0.23.1 and Keras v2.4.0. Categorical variables’ values were converted to either binary values for dichotomous categories or the ordinal scale for multi-category variables. For reasons stated previously [11], we temporally split the data—70% of records (1 January 2012 to 31 December 2016) for training/validation and 30% (1 January 2017 to 31 March 2019) for testing. Data standardization was performed by subtracting variable mean and dividing by the standard deviation.

The XGBoost model is a combination of multiple short decision tree-based predictions. At each iteration, the performance of the model and the difference from the observed outcome is assessed. A weak decision tree model is then generated to predict this residual. This weaker tree is weighted by its predictive accuracy and added to the pre-existing model. Following multiple iterations, a final model is developed and used for onward assessment. Thus, it is an ensemble model that does not rely solely on one tree. It is a form of gradient boosting that incorporates a regularization term and cross-validation. The combination of these features in such a large dataset as ours reduces the risk of over-fitting. The final hyperparameters are listed in Supplementary Material, Table S2.The XGBoost-based model was further refined using a wrapper-based greedy algorithm for feature selection called sequential backward floating selection, using the mlxtend [21, 22] package in python. A model was developed using all 61 variables. Thereafter, sequentially one variable is removed at a time and the performance of the model assessed, in this case by measuring the area under the receiver operator curve (AUC). The ‘floating’ component allows the inclusion of variables even after the initial exclusion phase so long as the final combination of variables has an improved overall performance. This continues till the optimal combination of variables is reached.

Model validation and evaluation were conducted as before [11]. The primary outcomes were model discrimination (AUC, F1 score), calibration [visual inspection of a plot of observed versus expected outcomes and quantification of the observed minus expected outcomes (O-E), expected calibration error (ECE)], Brier score and clinical utility [decision curve analysis (DCA)]. 5-fold cross-validation was applied for evaluation of the training/validation set for AUC, F1, O-E and net benefit (NB) with the corresponding 95% confidence intervals (CIs) calculated using bootstrap-t sampling (n = 100) with replacement. The latter emphasizes the true positives and penalizes a high type II error rate. The calculation and interpretation of these are stated in our previous study [11]. An ideal model has a high AUC and F1 score, be on the bisector of the observed-to-expected curve, have a high adjusted ECE/Brier score (1−ECE and 1−Brier) and a high overall NB (i.e. the higher the curve on the DCA plot the better the clinical performance).

Variable importance

The SHapley Adaptive exPlanations (SHAP) method [23] was used to understand variable importance in the final model. SHAP values [24] are derived from game theory and assign an importance value to each variable. In the final model, a prediction is made for an individual in the presence or absence of a given variable. The difference between the predictions is the marginal contribution of the variable to the prediction. Differing combinations of variables are utilized and averaged to determine the final SHAP value of a given variable and hence its importance to the final model.

RESULTS

Demographics

A total of 224 318 adults underwent cardiac surgery across 42 centres during the study period. There were 6100 deaths (2.72%) (Fig. 1). Baseline differences in the variables between survivors and non-survivors are shown in Table 1 and Supplementary Material, Table S1.

Consort diagram showing flow of participants through the study. NACSA: National Adult Cardiac Surgery Audit.
Figure 1:

Consort diagram showing flow of participants through the study. NACSA: National Adult Cardiac Surgery Audit.

Table 1:

Baseline patient demographics

CharacteristicOverallSurvivorNonsurvivorDifferenceb95% CIP-valueb
N = 224 318aN = 218 218aN = 6100a
Age (years)<0.001
 Median (IQR)69.30 (61.10–76.00)69.20 (61.00–75.90)73.30 (65.60–78.80)
Weight<0.001
 Median (IQR)81.00 (71.00–92.00)81.00 (71.00–92.00)77.00 (66.00–90.00)
BMI<0.001
 Median (IQR)27.68 (25.10–31.19)27.70 (25.14–31.20)27.61 (24.22–30.84)
Female60 368/224 318 (27%)58 102/218 218 (27%)2266/6100 (37%)−11%−12% to −9.3%<0.001
Hypertension155 736/224 318 (69%)151 386/218 218 (69%)4350/6100 (71%)−1.9%−3.1% to −0.78%0.001
Diabetes0.199
 Not diabetic169 435/224 318 (76%)164 918/218 218 (76%)4517/6100 (74%)
 Diet controlled8836/224 318 (3.9%)8571/218 218 (3.9%)265/6100 (4.3%)
 Oral therapy32 844/224 318 (15%)31 969/218 218 (15%)875/6100 (14%)
 Insulin-dependent diabetes13 203/224 318 (5.9%)12 760/218 218 (5.8%)443/6100 (7.3%)
Smoker<0.001
 Never smoked94 188/224 318 (42%)91 567/218 218 (42%)2621/6100 (43%)
 Ex-smoker107 027/224 318 (48%)104 146/218 218 (48%)2881/6100 (47%)
 Current smoker23 103/224 318 (10%)22 505/218 218 (10%)598/6100 (9.8%)
History of pulmonary disease27 585/224 318 (12%)26 401/218 218 (12%)1184/6100 (19%)−7.3%−8.3% to −6.3%<0.001
History of stroke<0.001
 TIA9616/218 634 (4.4%)9237/212 748 (4.3%)379/5886 (6.4%)
 CVA with full recovery5122/218 634 (2.3%)4901/212 748 (2.3%)221/5886 (3.8%)
 CVA with residual neurological deficit3384/218 634 (1.5%)3181/212 748 (1.5%)203/5886 (3.4%)
 (Missing)56845470214
Neurologic dysfunction7141/224 318 (3.2%)6743/218 218 (3.1%)398/6100 (6.5%)−3.4%−4.1% to −2.8%<0.001
Peripheral vascular disease23 462/224 318 (10%)22 264/218 218 (10%)1198/6100 (20%)−9.4%−10% to −8.4%<0.001
Poor mobility1935/224 318 (0.9%)1798/218 218 (0.8%)137/6100 (2.2%)−1.4%−1.8% to −1.0%<0.001
Pulmonary hypertension4651/224 318 (2.1%)4249/218 218 (1.9%)402/6100 (6.6%)<0.001
LV function<0.001
 Very poor1382/224 318 (0.6%)1246/218 218 (0.6%)136/6100 (2.2%)
 Poor4512/224 318 (2.0%)4207/218 218 (1.9%)305/6100 (5.0%)
 Fair31 444/224 318 (14%)30 372/218 218 (14%)1072/6100 (18%)
 Good186 980/224 318 (83%)182 393/218 218 (84%)4587/6100 (75%)
Creatinine
 Median (IQR)85.00 (73.00–101.00)85.00 (73.00–100.00)100.00 (80.00–131.00)
 Mean (SD)93.99 (50.43)93.22 (48.76)122.49 (88.44)−29−32 to −27<0.001
 (Missing)14 26613 691575
Creatinine clearance<0.001
 Median (IQR)82.57 (62.67–100.58)82.95 (63.18–100.84)66.29 (46.85–99.04)
 (Missing)33 67832 990688
Pre-op dialysis<0.001
 None214 727/220 015 (98%)209 500/214 122 (98%)5227/5893 (89%)
 Dialysis for acute renal failure within 6 weeks of surgery838/220 015 (0.4%)687/214 122 (0.3%)151/5893 (2.6%)
 Dialysis for chronic renal failure1656/220 015 (0.8%)1484/214 122 (0.7%)172/5893 (2.9%)
 No dialysis but acute renal failure preop2794/220 015 (1.3%)2451/214 122 (1.1%)343/5893 (5.8%)
 (Missing)43034096207
CCS<0.001
 075 514/224 318 (34%)72 859/218 218 (33%)2655/6100 (44%)
 121 249/224 318 (9.5%)20 756/218 218 (9.5%)493/6100 (8.1%)
 264 613/224 318 (29%)63 575/218 218 (29%)1038/6100 (17%)
 343 717/224 318 (19%)42 715/218 218 (20%)1002/6100 (16%)
 419 225/224 318 (8.6%)18 313/218 218 (8.4%)912/6100 (15%)
NYHA<0.001
 148 602/224 318 (22%)47 578/218 218 (22%)1024/6100 (17%)
 297 609/224 318 (44%)96 017/218 218 (44%)1592/6100 (26%)
 365 752/224 318 (29%)63 556/218 218 (29%)2196/6100 (36%)
 412 355/224 318 (5.5%)11 067/218 218 (5.1%)1288/6100 (21%)
Urgency<0.001
 Elective142 456/224 318 (64%)140 024/218 218 (64%)2432/6100 (40%)
 Urgent73 575/224 318 (33%)71 455/218 218 (33%)2120/6100 (35%)
 Emergency7395/224 318 (3.3%)6234/218 218 (2.9%)1161/6100 (19%)
 Salvage892/224 318 (0.4%)505/218 218 (0.2%)387/6100 (6.3%)
Number of previous MIs<0.001
 0152 138/224 318 (68%)148 288/218 218 (68%)3850/6100 (63%)
 161 800/224 318 (28%)60 000/218 218 (27%)1800/6100 (30%)
 210 380/224 318 (4.6%)9930/218 218 (4.6%)450/6100 (7.4%)
Recent MI44 651/224 318 (20%)43 141/218 218 (20%)1510/6100 (25%)−5.0%−6.1% to −3.9%<0.001
Interval from MI to surgery<0.001
 No previous MI153 246/224 318 (68%)149 354/218 218 (68%)3892/6100 (64%)
 MI < 6 hours516/224 318 (0.2%)409/218 218 (0.2%)107/6100 (1.8%)
 MI 6–24 hours1023/224 318 (0.5%)880/218 218 (0.4%)143/6100 (2.3%)
 MI 1–30 days36 296/224 318 (16%)35 222/218 218 (16%)1074/6100 (18%)
 MI 31–90 days6816/224 318 (3.0%)6630/218 218 (3.0%)186/6100 (3.0%)
 MI >90 days26 421/224 318 (12%)25 723/218 218 (12%)698/6100 (11%)
Previous PCI<0.001
 No previous PCI198 924/224 318 (89%)193 587/218 218 (89%)5337/6100 (87%)
 PCI <24 h before surgery931/224 318 (0.4%)805/218 218 (0.4%)126/6100 (2.1%)
 PCI >24 h before surgery same admission2067/224 318 (0.9%)1958/218 218 (0.9%)109/6100 (1.8%)
 PCI >24 h before surgery previous admission22 396/224 318 (10.0%)21 868/218 218 (10%)528/6100 (8.7%)
Preoperative AF24 923/224 318 (11%)23 637/218 218 (11%)1286/6100 (21%)−10%−11% to −9.2%<0.001
Preoperative VF or VT440/224 318 (0.2%)387/218 218 (0.2%)53/6100 (0.9%)−0.69%−0.93% to −0.45%<0.001
Preoperative CHB/paced3203/224 318 (1.4%)2999/218 218 (1.4%)204/6100 (3.3%)−2.0%−2.4% to −1.5%<0.001
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Systolic PA pressure−11−12 to −9.9<0.001
 Median (IQR)24.00 (0.00–35.00)23.00 (0.00–35.00)32.00 (10.00–54.00)
 Mean (SD)23.30 (21.44)22.93 (21.20)33.84 (25.28)
 (Missing)153 722150 0703652
Critical preoperative state10 355/224 318 (4.6%)8918/218 218 (4.1%)1437/6100 (24%)−19%−21% to −18%<0.001
Cardiogenic shock2943/224 318 (1.3%)2204/218 218 (1.0%)739/6100 (12%)−11%−12% to −10%<0.001
Preoperative ventilation1838/224 318 (0.8%)1399/218 218 (0.6%)439/6100 (7.2%)−6.6%−7.2% to −5.9%<0.001
Preoperative nitrates10 208/224 318 (4.6%)9551/218 218 (4.4%)657/6100 (11%)−6.4%−7.2% to −5.6%<0.001
Preoperative inotropes2651/224,31 (1.2%)1994/218218 (0.9%)657/6100 (11%)−9.9%−11% to −9.1%<0.001
Number of previous operations<0.001
 0154 514/163 933 (94%)150 780/159 244 (95%)3734/4689 (80%)
 18372/163933 (5.1%)7555/159 244 (4.7%)817/4689 (17%)
 2877/163 933 (0.5%)773/159 244 (0.5%)104/4689 (2.2%)
 3140/163 933 (<0.1%)115/159 244 (<0.1%)25/4689 (0.5%)
 427/163 933 (<0.1%)19/159 244 (<0.1%)8/4689 (0.2%)
 51/163 933 (<0.1%)1/159 244 (<0.1%)0/4689 (0%)
 62/163 933 (<0.1%)1/159 244 (<0.1%)1/4689 (<0.1%)
 (Missing)60 38558 9741411
Previous CABG3010/224 318 (1.3%)2666/218 218 (1.2%)344/6100 (5.6%)−4.4%−5.0% to −3.8%<0.001
Previous valve surgery6360/224 318 (2.8%)5695/218 218 (2.6%)665/6100 (11%)−8.3%−9.1% to −7.5%<0.001
Previous aortic arch surgery1223/224 318 (0.5%)1050/218 218 (0.5%)173/6100 (2.8%)−2.4%−2.8% to −1.9%<0.001
Previous descending aortic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Previous thoracic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Endocarditis6309/224 318 (2.8%)5816/218 218 (2.7%)493/6100 (8.1%)−5.4%−6.1% to −4.7%<0.001
Post-infarct VSD195/224 318 (<0.1%)111/218 218 (<0.1%)84/6100 (1.4%)−1.3%−1.6% to −1.0%<0.001
Operation<0.001
 CABG only113,190 / 224,318 (50.5%)111 640/218 218 (51.2%)1550/6100 (25.4%)1,550 / 6,100 (25.4%)
 AVR only35,161 / 224,318 (15.7%)34 570/218 218 (15.8%)591/6100 (9.7%)591 / 6,100 (9.7%)
 CABG+AVR21,978 / 224,318 (9.8%)21 187/218 218 (9.7%)791/6100 (13%)791 / 6,100 (13%)
 CABG+MVR4,556 / 224,318 (2.0%)4268/218 218 (2%)288/6100 (4.7%)288 / 6,100 (4.7%)
 MVR only14,005 / 224,318 (6.2%)13 671/218 218 (6.3%)334/6100 (5.5%)334 / 6,100 (5.5%)
 Aortic ± other13,922 / 224,318 (6.2%)12 669/218 218 (5.8%)1253/6100 (20.5%)1,253 / 6,100 (20.5%)
 Other18,598 / 224,318 (8.3%)17 514/218 218 (8.0%)1084/6100 (17.8%)1,084 / 6,100 (17.8%)
 Unclassified2,878 / 224,318 (1.3%)2699/218 218 (1.2%)209/6100 (3.4%)209 / 6,100 (3.4%)
Days between angiogram and surgery0.2
 Median (IQR)62.00 (14.00–136.04)62.00 (14.00–136.04)36.00 (9.00–137.00)
 (Missing)68 44466 0122432
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Extent of CAD
 Nil > 50%61 174/200 018 (31%)59 487/195 121 (30%)1687/4897 (34%)
 One vessel >50%18 092/200 018 (9.0%)17 510/195 121 (9.0%)582/4 897 (12%)
 2 vessels >50%31 917/200 018 (16%)31 206/195 121 (16%)711/4897 (15%)
 3 vessels > 50%88 835/200 018 (44%)86 918/195 121 (45%)1917/4897 (39%)
 (Missing)24 30023 0971203
Number of grafts<0.001
 119 083/164 803 (12%)18 320/160 736 (11%)763/4067 (19%)
 235 308/164 803 (21%)34 428/160 736 (21%)880/4067 (22%)
 361 855/164 803 (38%)60 648/160 736 (38%)1207/4067 (30%)
 425 362/164 803 (15%)24 965/160 736 (16%)397/4067 (9.8%)
 53362/164 803 (2.0%)3294/160 736 (2.0%)68/4067 (1.7%)
 6339/164 803 (0.2%)331/160 736 (0.2%)8/4067 (0.2%)
 (Missing)59 51557 4822033
First operator grade<0.001
 Consultant170 813/211 897 (81%)165 411/205 977 (80%)5402/5920 (91%)
 Staff grade6745/211 897 (3.2%)6674/205 977 (3.2%)71/5920 (1.2%)
 Registrar30 167/211 897 (14%)29 840/205 977 (14%)327/5920 (5.5%)
 SHO4172/211 897 (2.0%)4052/205 977 (2.0%)120/5920 (2.0%)
 (Missing)12 42112 241180
Payer status<0.001
 NHS214 259/222 167 (96%)208 318/216 124 (96%)5941/6043 (98%)
 Private7908/222 167 (3.6%)7806/216 124 (3.6%)102/6043 (1.7%)
 (Missing)2151209457
CharacteristicOverallSurvivorNonsurvivorDifferenceb95% CIP-valueb
N = 224 318aN = 218 218aN = 6100a
Age (years)<0.001
 Median (IQR)69.30 (61.10–76.00)69.20 (61.00–75.90)73.30 (65.60–78.80)
Weight<0.001
 Median (IQR)81.00 (71.00–92.00)81.00 (71.00–92.00)77.00 (66.00–90.00)
BMI<0.001
 Median (IQR)27.68 (25.10–31.19)27.70 (25.14–31.20)27.61 (24.22–30.84)
Female60 368/224 318 (27%)58 102/218 218 (27%)2266/6100 (37%)−11%−12% to −9.3%<0.001
Hypertension155 736/224 318 (69%)151 386/218 218 (69%)4350/6100 (71%)−1.9%−3.1% to −0.78%0.001
Diabetes0.199
 Not diabetic169 435/224 318 (76%)164 918/218 218 (76%)4517/6100 (74%)
 Diet controlled8836/224 318 (3.9%)8571/218 218 (3.9%)265/6100 (4.3%)
 Oral therapy32 844/224 318 (15%)31 969/218 218 (15%)875/6100 (14%)
 Insulin-dependent diabetes13 203/224 318 (5.9%)12 760/218 218 (5.8%)443/6100 (7.3%)
Smoker<0.001
 Never smoked94 188/224 318 (42%)91 567/218 218 (42%)2621/6100 (43%)
 Ex-smoker107 027/224 318 (48%)104 146/218 218 (48%)2881/6100 (47%)
 Current smoker23 103/224 318 (10%)22 505/218 218 (10%)598/6100 (9.8%)
History of pulmonary disease27 585/224 318 (12%)26 401/218 218 (12%)1184/6100 (19%)−7.3%−8.3% to −6.3%<0.001
History of stroke<0.001
 TIA9616/218 634 (4.4%)9237/212 748 (4.3%)379/5886 (6.4%)
 CVA with full recovery5122/218 634 (2.3%)4901/212 748 (2.3%)221/5886 (3.8%)
 CVA with residual neurological deficit3384/218 634 (1.5%)3181/212 748 (1.5%)203/5886 (3.4%)
 (Missing)56845470214
Neurologic dysfunction7141/224 318 (3.2%)6743/218 218 (3.1%)398/6100 (6.5%)−3.4%−4.1% to −2.8%<0.001
Peripheral vascular disease23 462/224 318 (10%)22 264/218 218 (10%)1198/6100 (20%)−9.4%−10% to −8.4%<0.001
Poor mobility1935/224 318 (0.9%)1798/218 218 (0.8%)137/6100 (2.2%)−1.4%−1.8% to −1.0%<0.001
Pulmonary hypertension4651/224 318 (2.1%)4249/218 218 (1.9%)402/6100 (6.6%)<0.001
LV function<0.001
 Very poor1382/224 318 (0.6%)1246/218 218 (0.6%)136/6100 (2.2%)
 Poor4512/224 318 (2.0%)4207/218 218 (1.9%)305/6100 (5.0%)
 Fair31 444/224 318 (14%)30 372/218 218 (14%)1072/6100 (18%)
 Good186 980/224 318 (83%)182 393/218 218 (84%)4587/6100 (75%)
Creatinine
 Median (IQR)85.00 (73.00–101.00)85.00 (73.00–100.00)100.00 (80.00–131.00)
 Mean (SD)93.99 (50.43)93.22 (48.76)122.49 (88.44)−29−32 to −27<0.001
 (Missing)14 26613 691575
Creatinine clearance<0.001
 Median (IQR)82.57 (62.67–100.58)82.95 (63.18–100.84)66.29 (46.85–99.04)
 (Missing)33 67832 990688
Pre-op dialysis<0.001
 None214 727/220 015 (98%)209 500/214 122 (98%)5227/5893 (89%)
 Dialysis for acute renal failure within 6 weeks of surgery838/220 015 (0.4%)687/214 122 (0.3%)151/5893 (2.6%)
 Dialysis for chronic renal failure1656/220 015 (0.8%)1484/214 122 (0.7%)172/5893 (2.9%)
 No dialysis but acute renal failure preop2794/220 015 (1.3%)2451/214 122 (1.1%)343/5893 (5.8%)
 (Missing)43034096207
CCS<0.001
 075 514/224 318 (34%)72 859/218 218 (33%)2655/6100 (44%)
 121 249/224 318 (9.5%)20 756/218 218 (9.5%)493/6100 (8.1%)
 264 613/224 318 (29%)63 575/218 218 (29%)1038/6100 (17%)
 343 717/224 318 (19%)42 715/218 218 (20%)1002/6100 (16%)
 419 225/224 318 (8.6%)18 313/218 218 (8.4%)912/6100 (15%)
NYHA<0.001
 148 602/224 318 (22%)47 578/218 218 (22%)1024/6100 (17%)
 297 609/224 318 (44%)96 017/218 218 (44%)1592/6100 (26%)
 365 752/224 318 (29%)63 556/218 218 (29%)2196/6100 (36%)
 412 355/224 318 (5.5%)11 067/218 218 (5.1%)1288/6100 (21%)
Urgency<0.001
 Elective142 456/224 318 (64%)140 024/218 218 (64%)2432/6100 (40%)
 Urgent73 575/224 318 (33%)71 455/218 218 (33%)2120/6100 (35%)
 Emergency7395/224 318 (3.3%)6234/218 218 (2.9%)1161/6100 (19%)
 Salvage892/224 318 (0.4%)505/218 218 (0.2%)387/6100 (6.3%)
Number of previous MIs<0.001
 0152 138/224 318 (68%)148 288/218 218 (68%)3850/6100 (63%)
 161 800/224 318 (28%)60 000/218 218 (27%)1800/6100 (30%)
 210 380/224 318 (4.6%)9930/218 218 (4.6%)450/6100 (7.4%)
Recent MI44 651/224 318 (20%)43 141/218 218 (20%)1510/6100 (25%)−5.0%−6.1% to −3.9%<0.001
Interval from MI to surgery<0.001
 No previous MI153 246/224 318 (68%)149 354/218 218 (68%)3892/6100 (64%)
 MI < 6 hours516/224 318 (0.2%)409/218 218 (0.2%)107/6100 (1.8%)
 MI 6–24 hours1023/224 318 (0.5%)880/218 218 (0.4%)143/6100 (2.3%)
 MI 1–30 days36 296/224 318 (16%)35 222/218 218 (16%)1074/6100 (18%)
 MI 31–90 days6816/224 318 (3.0%)6630/218 218 (3.0%)186/6100 (3.0%)
 MI >90 days26 421/224 318 (12%)25 723/218 218 (12%)698/6100 (11%)
Previous PCI<0.001
 No previous PCI198 924/224 318 (89%)193 587/218 218 (89%)5337/6100 (87%)
 PCI <24 h before surgery931/224 318 (0.4%)805/218 218 (0.4%)126/6100 (2.1%)
 PCI >24 h before surgery same admission2067/224 318 (0.9%)1958/218 218 (0.9%)109/6100 (1.8%)
 PCI >24 h before surgery previous admission22 396/224 318 (10.0%)21 868/218 218 (10%)528/6100 (8.7%)
Preoperative AF24 923/224 318 (11%)23 637/218 218 (11%)1286/6100 (21%)−10%−11% to −9.2%<0.001
Preoperative VF or VT440/224 318 (0.2%)387/218 218 (0.2%)53/6100 (0.9%)−0.69%−0.93% to −0.45%<0.001
Preoperative CHB/paced3203/224 318 (1.4%)2999/218 218 (1.4%)204/6100 (3.3%)−2.0%−2.4% to −1.5%<0.001
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Systolic PA pressure−11−12 to −9.9<0.001
 Median (IQR)24.00 (0.00–35.00)23.00 (0.00–35.00)32.00 (10.00–54.00)
 Mean (SD)23.30 (21.44)22.93 (21.20)33.84 (25.28)
 (Missing)153 722150 0703652
Critical preoperative state10 355/224 318 (4.6%)8918/218 218 (4.1%)1437/6100 (24%)−19%−21% to −18%<0.001
Cardiogenic shock2943/224 318 (1.3%)2204/218 218 (1.0%)739/6100 (12%)−11%−12% to −10%<0.001
Preoperative ventilation1838/224 318 (0.8%)1399/218 218 (0.6%)439/6100 (7.2%)−6.6%−7.2% to −5.9%<0.001
Preoperative nitrates10 208/224 318 (4.6%)9551/218 218 (4.4%)657/6100 (11%)−6.4%−7.2% to −5.6%<0.001
Preoperative inotropes2651/224,31 (1.2%)1994/218218 (0.9%)657/6100 (11%)−9.9%−11% to −9.1%<0.001
Number of previous operations<0.001
 0154 514/163 933 (94%)150 780/159 244 (95%)3734/4689 (80%)
 18372/163933 (5.1%)7555/159 244 (4.7%)817/4689 (17%)
 2877/163 933 (0.5%)773/159 244 (0.5%)104/4689 (2.2%)
 3140/163 933 (<0.1%)115/159 244 (<0.1%)25/4689 (0.5%)
 427/163 933 (<0.1%)19/159 244 (<0.1%)8/4689 (0.2%)
 51/163 933 (<0.1%)1/159 244 (<0.1%)0/4689 (0%)
 62/163 933 (<0.1%)1/159 244 (<0.1%)1/4689 (<0.1%)
 (Missing)60 38558 9741411
Previous CABG3010/224 318 (1.3%)2666/218 218 (1.2%)344/6100 (5.6%)−4.4%−5.0% to −3.8%<0.001
Previous valve surgery6360/224 318 (2.8%)5695/218 218 (2.6%)665/6100 (11%)−8.3%−9.1% to −7.5%<0.001
Previous aortic arch surgery1223/224 318 (0.5%)1050/218 218 (0.5%)173/6100 (2.8%)−2.4%−2.8% to −1.9%<0.001
Previous descending aortic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Previous thoracic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Endocarditis6309/224 318 (2.8%)5816/218 218 (2.7%)493/6100 (8.1%)−5.4%−6.1% to −4.7%<0.001
Post-infarct VSD195/224 318 (<0.1%)111/218 218 (<0.1%)84/6100 (1.4%)−1.3%−1.6% to −1.0%<0.001
Operation<0.001
 CABG only113,190 / 224,318 (50.5%)111 640/218 218 (51.2%)1550/6100 (25.4%)1,550 / 6,100 (25.4%)
 AVR only35,161 / 224,318 (15.7%)34 570/218 218 (15.8%)591/6100 (9.7%)591 / 6,100 (9.7%)
 CABG+AVR21,978 / 224,318 (9.8%)21 187/218 218 (9.7%)791/6100 (13%)791 / 6,100 (13%)
 CABG+MVR4,556 / 224,318 (2.0%)4268/218 218 (2%)288/6100 (4.7%)288 / 6,100 (4.7%)
 MVR only14,005 / 224,318 (6.2%)13 671/218 218 (6.3%)334/6100 (5.5%)334 / 6,100 (5.5%)
 Aortic ± other13,922 / 224,318 (6.2%)12 669/218 218 (5.8%)1253/6100 (20.5%)1,253 / 6,100 (20.5%)
 Other18,598 / 224,318 (8.3%)17 514/218 218 (8.0%)1084/6100 (17.8%)1,084 / 6,100 (17.8%)
 Unclassified2,878 / 224,318 (1.3%)2699/218 218 (1.2%)209/6100 (3.4%)209 / 6,100 (3.4%)
Days between angiogram and surgery0.2
 Median (IQR)62.00 (14.00–136.04)62.00 (14.00–136.04)36.00 (9.00–137.00)
 (Missing)68 44466 0122432
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Extent of CAD
 Nil > 50%61 174/200 018 (31%)59 487/195 121 (30%)1687/4897 (34%)
 One vessel >50%18 092/200 018 (9.0%)17 510/195 121 (9.0%)582/4 897 (12%)
 2 vessels >50%31 917/200 018 (16%)31 206/195 121 (16%)711/4897 (15%)
 3 vessels > 50%88 835/200 018 (44%)86 918/195 121 (45%)1917/4897 (39%)
 (Missing)24 30023 0971203
Number of grafts<0.001
 119 083/164 803 (12%)18 320/160 736 (11%)763/4067 (19%)
 235 308/164 803 (21%)34 428/160 736 (21%)880/4067 (22%)
 361 855/164 803 (38%)60 648/160 736 (38%)1207/4067 (30%)
 425 362/164 803 (15%)24 965/160 736 (16%)397/4067 (9.8%)
 53362/164 803 (2.0%)3294/160 736 (2.0%)68/4067 (1.7%)
 6339/164 803 (0.2%)331/160 736 (0.2%)8/4067 (0.2%)
 (Missing)59 51557 4822033
First operator grade<0.001
 Consultant170 813/211 897 (81%)165 411/205 977 (80%)5402/5920 (91%)
 Staff grade6745/211 897 (3.2%)6674/205 977 (3.2%)71/5920 (1.2%)
 Registrar30 167/211 897 (14%)29 840/205 977 (14%)327/5920 (5.5%)
 SHO4172/211 897 (2.0%)4052/205 977 (2.0%)120/5920 (2.0%)
 (Missing)12 42112 241180
Payer status<0.001
 NHS214 259/222 167 (96%)208 318/216 124 (96%)5941/6043 (98%)
 Private7908/222 167 (3.6%)7806/216 124 (3.6%)102/6043 (1.7%)
 (Missing)2151209457
a

Mean (SD) or frequency (%).

b

Two sample test for equality of proportions; Welch 2-sample t-test.

BMI: basal metabolic index; CABG: coronary artery bypass graft; CCS: Canadian Cardiovascular score; CI: confidence interval; CVA: cerebrovascular accident, IQR: interquartile range; LV: left ventricular, MI: myocardial infarction; NYHA: New York Heart Association, TIA: transient ischaemic attack, AF: Atrial fibrillation, AVR: aortic valve repair/replacement, CAD: coronary artery disease, CHB: complete heart block, MVR: mitral valve repair/replacement, NHS: National Health Service, PA: pulmonary artery, PCI: percutaneous coronary intervention, SHO: senior house officer, VF: ventricular fibrillation, VSD: ventricular septal defect, VT: ventricular tachycardia.

Table 1:

Baseline patient demographics

CharacteristicOverallSurvivorNonsurvivorDifferenceb95% CIP-valueb
N = 224 318aN = 218 218aN = 6100a
Age (years)<0.001
 Median (IQR)69.30 (61.10–76.00)69.20 (61.00–75.90)73.30 (65.60–78.80)
Weight<0.001
 Median (IQR)81.00 (71.00–92.00)81.00 (71.00–92.00)77.00 (66.00–90.00)
BMI<0.001
 Median (IQR)27.68 (25.10–31.19)27.70 (25.14–31.20)27.61 (24.22–30.84)
Female60 368/224 318 (27%)58 102/218 218 (27%)2266/6100 (37%)−11%−12% to −9.3%<0.001
Hypertension155 736/224 318 (69%)151 386/218 218 (69%)4350/6100 (71%)−1.9%−3.1% to −0.78%0.001
Diabetes0.199
 Not diabetic169 435/224 318 (76%)164 918/218 218 (76%)4517/6100 (74%)
 Diet controlled8836/224 318 (3.9%)8571/218 218 (3.9%)265/6100 (4.3%)
 Oral therapy32 844/224 318 (15%)31 969/218 218 (15%)875/6100 (14%)
 Insulin-dependent diabetes13 203/224 318 (5.9%)12 760/218 218 (5.8%)443/6100 (7.3%)
Smoker<0.001
 Never smoked94 188/224 318 (42%)91 567/218 218 (42%)2621/6100 (43%)
 Ex-smoker107 027/224 318 (48%)104 146/218 218 (48%)2881/6100 (47%)
 Current smoker23 103/224 318 (10%)22 505/218 218 (10%)598/6100 (9.8%)
History of pulmonary disease27 585/224 318 (12%)26 401/218 218 (12%)1184/6100 (19%)−7.3%−8.3% to −6.3%<0.001
History of stroke<0.001
 TIA9616/218 634 (4.4%)9237/212 748 (4.3%)379/5886 (6.4%)
 CVA with full recovery5122/218 634 (2.3%)4901/212 748 (2.3%)221/5886 (3.8%)
 CVA with residual neurological deficit3384/218 634 (1.5%)3181/212 748 (1.5%)203/5886 (3.4%)
 (Missing)56845470214
Neurologic dysfunction7141/224 318 (3.2%)6743/218 218 (3.1%)398/6100 (6.5%)−3.4%−4.1% to −2.8%<0.001
Peripheral vascular disease23 462/224 318 (10%)22 264/218 218 (10%)1198/6100 (20%)−9.4%−10% to −8.4%<0.001
Poor mobility1935/224 318 (0.9%)1798/218 218 (0.8%)137/6100 (2.2%)−1.4%−1.8% to −1.0%<0.001
Pulmonary hypertension4651/224 318 (2.1%)4249/218 218 (1.9%)402/6100 (6.6%)<0.001
LV function<0.001
 Very poor1382/224 318 (0.6%)1246/218 218 (0.6%)136/6100 (2.2%)
 Poor4512/224 318 (2.0%)4207/218 218 (1.9%)305/6100 (5.0%)
 Fair31 444/224 318 (14%)30 372/218 218 (14%)1072/6100 (18%)
 Good186 980/224 318 (83%)182 393/218 218 (84%)4587/6100 (75%)
Creatinine
 Median (IQR)85.00 (73.00–101.00)85.00 (73.00–100.00)100.00 (80.00–131.00)
 Mean (SD)93.99 (50.43)93.22 (48.76)122.49 (88.44)−29−32 to −27<0.001
 (Missing)14 26613 691575
Creatinine clearance<0.001
 Median (IQR)82.57 (62.67–100.58)82.95 (63.18–100.84)66.29 (46.85–99.04)
 (Missing)33 67832 990688
Pre-op dialysis<0.001
 None214 727/220 015 (98%)209 500/214 122 (98%)5227/5893 (89%)
 Dialysis for acute renal failure within 6 weeks of surgery838/220 015 (0.4%)687/214 122 (0.3%)151/5893 (2.6%)
 Dialysis for chronic renal failure1656/220 015 (0.8%)1484/214 122 (0.7%)172/5893 (2.9%)
 No dialysis but acute renal failure preop2794/220 015 (1.3%)2451/214 122 (1.1%)343/5893 (5.8%)
 (Missing)43034096207
CCS<0.001
 075 514/224 318 (34%)72 859/218 218 (33%)2655/6100 (44%)
 121 249/224 318 (9.5%)20 756/218 218 (9.5%)493/6100 (8.1%)
 264 613/224 318 (29%)63 575/218 218 (29%)1038/6100 (17%)
 343 717/224 318 (19%)42 715/218 218 (20%)1002/6100 (16%)
 419 225/224 318 (8.6%)18 313/218 218 (8.4%)912/6100 (15%)
NYHA<0.001
 148 602/224 318 (22%)47 578/218 218 (22%)1024/6100 (17%)
 297 609/224 318 (44%)96 017/218 218 (44%)1592/6100 (26%)
 365 752/224 318 (29%)63 556/218 218 (29%)2196/6100 (36%)
 412 355/224 318 (5.5%)11 067/218 218 (5.1%)1288/6100 (21%)
Urgency<0.001
 Elective142 456/224 318 (64%)140 024/218 218 (64%)2432/6100 (40%)
 Urgent73 575/224 318 (33%)71 455/218 218 (33%)2120/6100 (35%)
 Emergency7395/224 318 (3.3%)6234/218 218 (2.9%)1161/6100 (19%)
 Salvage892/224 318 (0.4%)505/218 218 (0.2%)387/6100 (6.3%)
Number of previous MIs<0.001
 0152 138/224 318 (68%)148 288/218 218 (68%)3850/6100 (63%)
 161 800/224 318 (28%)60 000/218 218 (27%)1800/6100 (30%)
 210 380/224 318 (4.6%)9930/218 218 (4.6%)450/6100 (7.4%)
Recent MI44 651/224 318 (20%)43 141/218 218 (20%)1510/6100 (25%)−5.0%−6.1% to −3.9%<0.001
Interval from MI to surgery<0.001
 No previous MI153 246/224 318 (68%)149 354/218 218 (68%)3892/6100 (64%)
 MI < 6 hours516/224 318 (0.2%)409/218 218 (0.2%)107/6100 (1.8%)
 MI 6–24 hours1023/224 318 (0.5%)880/218 218 (0.4%)143/6100 (2.3%)
 MI 1–30 days36 296/224 318 (16%)35 222/218 218 (16%)1074/6100 (18%)
 MI 31–90 days6816/224 318 (3.0%)6630/218 218 (3.0%)186/6100 (3.0%)
 MI >90 days26 421/224 318 (12%)25 723/218 218 (12%)698/6100 (11%)
Previous PCI<0.001
 No previous PCI198 924/224 318 (89%)193 587/218 218 (89%)5337/6100 (87%)
 PCI <24 h before surgery931/224 318 (0.4%)805/218 218 (0.4%)126/6100 (2.1%)
 PCI >24 h before surgery same admission2067/224 318 (0.9%)1958/218 218 (0.9%)109/6100 (1.8%)
 PCI >24 h before surgery previous admission22 396/224 318 (10.0%)21 868/218 218 (10%)528/6100 (8.7%)
Preoperative AF24 923/224 318 (11%)23 637/218 218 (11%)1286/6100 (21%)−10%−11% to −9.2%<0.001
Preoperative VF or VT440/224 318 (0.2%)387/218 218 (0.2%)53/6100 (0.9%)−0.69%−0.93% to −0.45%<0.001
Preoperative CHB/paced3203/224 318 (1.4%)2999/218 218 (1.4%)204/6100 (3.3%)−2.0%−2.4% to −1.5%<0.001
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Systolic PA pressure−11−12 to −9.9<0.001
 Median (IQR)24.00 (0.00–35.00)23.00 (0.00–35.00)32.00 (10.00–54.00)
 Mean (SD)23.30 (21.44)22.93 (21.20)33.84 (25.28)
 (Missing)153 722150 0703652
Critical preoperative state10 355/224 318 (4.6%)8918/218 218 (4.1%)1437/6100 (24%)−19%−21% to −18%<0.001
Cardiogenic shock2943/224 318 (1.3%)2204/218 218 (1.0%)739/6100 (12%)−11%−12% to −10%<0.001
Preoperative ventilation1838/224 318 (0.8%)1399/218 218 (0.6%)439/6100 (7.2%)−6.6%−7.2% to −5.9%<0.001
Preoperative nitrates10 208/224 318 (4.6%)9551/218 218 (4.4%)657/6100 (11%)−6.4%−7.2% to −5.6%<0.001
Preoperative inotropes2651/224,31 (1.2%)1994/218218 (0.9%)657/6100 (11%)−9.9%−11% to −9.1%<0.001
Number of previous operations<0.001
 0154 514/163 933 (94%)150 780/159 244 (95%)3734/4689 (80%)
 18372/163933 (5.1%)7555/159 244 (4.7%)817/4689 (17%)
 2877/163 933 (0.5%)773/159 244 (0.5%)104/4689 (2.2%)
 3140/163 933 (<0.1%)115/159 244 (<0.1%)25/4689 (0.5%)
 427/163 933 (<0.1%)19/159 244 (<0.1%)8/4689 (0.2%)
 51/163 933 (<0.1%)1/159 244 (<0.1%)0/4689 (0%)
 62/163 933 (<0.1%)1/159 244 (<0.1%)1/4689 (<0.1%)
 (Missing)60 38558 9741411
Previous CABG3010/224 318 (1.3%)2666/218 218 (1.2%)344/6100 (5.6%)−4.4%−5.0% to −3.8%<0.001
Previous valve surgery6360/224 318 (2.8%)5695/218 218 (2.6%)665/6100 (11%)−8.3%−9.1% to −7.5%<0.001
Previous aortic arch surgery1223/224 318 (0.5%)1050/218 218 (0.5%)173/6100 (2.8%)−2.4%−2.8% to −1.9%<0.001
Previous descending aortic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Previous thoracic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Endocarditis6309/224 318 (2.8%)5816/218 218 (2.7%)493/6100 (8.1%)−5.4%−6.1% to −4.7%<0.001
Post-infarct VSD195/224 318 (<0.1%)111/218 218 (<0.1%)84/6100 (1.4%)−1.3%−1.6% to −1.0%<0.001
Operation<0.001
 CABG only113,190 / 224,318 (50.5%)111 640/218 218 (51.2%)1550/6100 (25.4%)1,550 / 6,100 (25.4%)
 AVR only35,161 / 224,318 (15.7%)34 570/218 218 (15.8%)591/6100 (9.7%)591 / 6,100 (9.7%)
 CABG+AVR21,978 / 224,318 (9.8%)21 187/218 218 (9.7%)791/6100 (13%)791 / 6,100 (13%)
 CABG+MVR4,556 / 224,318 (2.0%)4268/218 218 (2%)288/6100 (4.7%)288 / 6,100 (4.7%)
 MVR only14,005 / 224,318 (6.2%)13 671/218 218 (6.3%)334/6100 (5.5%)334 / 6,100 (5.5%)
 Aortic ± other13,922 / 224,318 (6.2%)12 669/218 218 (5.8%)1253/6100 (20.5%)1,253 / 6,100 (20.5%)
 Other18,598 / 224,318 (8.3%)17 514/218 218 (8.0%)1084/6100 (17.8%)1,084 / 6,100 (17.8%)
 Unclassified2,878 / 224,318 (1.3%)2699/218 218 (1.2%)209/6100 (3.4%)209 / 6,100 (3.4%)
Days between angiogram and surgery0.2
 Median (IQR)62.00 (14.00–136.04)62.00 (14.00–136.04)36.00 (9.00–137.00)
 (Missing)68 44466 0122432
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Extent of CAD
 Nil > 50%61 174/200 018 (31%)59 487/195 121 (30%)1687/4897 (34%)
 One vessel >50%18 092/200 018 (9.0%)17 510/195 121 (9.0%)582/4 897 (12%)
 2 vessels >50%31 917/200 018 (16%)31 206/195 121 (16%)711/4897 (15%)
 3 vessels > 50%88 835/200 018 (44%)86 918/195 121 (45%)1917/4897 (39%)
 (Missing)24 30023 0971203
Number of grafts<0.001
 119 083/164 803 (12%)18 320/160 736 (11%)763/4067 (19%)
 235 308/164 803 (21%)34 428/160 736 (21%)880/4067 (22%)
 361 855/164 803 (38%)60 648/160 736 (38%)1207/4067 (30%)
 425 362/164 803 (15%)24 965/160 736 (16%)397/4067 (9.8%)
 53362/164 803 (2.0%)3294/160 736 (2.0%)68/4067 (1.7%)
 6339/164 803 (0.2%)331/160 736 (0.2%)8/4067 (0.2%)
 (Missing)59 51557 4822033
First operator grade<0.001
 Consultant170 813/211 897 (81%)165 411/205 977 (80%)5402/5920 (91%)
 Staff grade6745/211 897 (3.2%)6674/205 977 (3.2%)71/5920 (1.2%)
 Registrar30 167/211 897 (14%)29 840/205 977 (14%)327/5920 (5.5%)
 SHO4172/211 897 (2.0%)4052/205 977 (2.0%)120/5920 (2.0%)
 (Missing)12 42112 241180
Payer status<0.001
 NHS214 259/222 167 (96%)208 318/216 124 (96%)5941/6043 (98%)
 Private7908/222 167 (3.6%)7806/216 124 (3.6%)102/6043 (1.7%)
 (Missing)2151209457
CharacteristicOverallSurvivorNonsurvivorDifferenceb95% CIP-valueb
N = 224 318aN = 218 218aN = 6100a
Age (years)<0.001
 Median (IQR)69.30 (61.10–76.00)69.20 (61.00–75.90)73.30 (65.60–78.80)
Weight<0.001
 Median (IQR)81.00 (71.00–92.00)81.00 (71.00–92.00)77.00 (66.00–90.00)
BMI<0.001
 Median (IQR)27.68 (25.10–31.19)27.70 (25.14–31.20)27.61 (24.22–30.84)
Female60 368/224 318 (27%)58 102/218 218 (27%)2266/6100 (37%)−11%−12% to −9.3%<0.001
Hypertension155 736/224 318 (69%)151 386/218 218 (69%)4350/6100 (71%)−1.9%−3.1% to −0.78%0.001
Diabetes0.199
 Not diabetic169 435/224 318 (76%)164 918/218 218 (76%)4517/6100 (74%)
 Diet controlled8836/224 318 (3.9%)8571/218 218 (3.9%)265/6100 (4.3%)
 Oral therapy32 844/224 318 (15%)31 969/218 218 (15%)875/6100 (14%)
 Insulin-dependent diabetes13 203/224 318 (5.9%)12 760/218 218 (5.8%)443/6100 (7.3%)
Smoker<0.001
 Never smoked94 188/224 318 (42%)91 567/218 218 (42%)2621/6100 (43%)
 Ex-smoker107 027/224 318 (48%)104 146/218 218 (48%)2881/6100 (47%)
 Current smoker23 103/224 318 (10%)22 505/218 218 (10%)598/6100 (9.8%)
History of pulmonary disease27 585/224 318 (12%)26 401/218 218 (12%)1184/6100 (19%)−7.3%−8.3% to −6.3%<0.001
History of stroke<0.001
 TIA9616/218 634 (4.4%)9237/212 748 (4.3%)379/5886 (6.4%)
 CVA with full recovery5122/218 634 (2.3%)4901/212 748 (2.3%)221/5886 (3.8%)
 CVA with residual neurological deficit3384/218 634 (1.5%)3181/212 748 (1.5%)203/5886 (3.4%)
 (Missing)56845470214
Neurologic dysfunction7141/224 318 (3.2%)6743/218 218 (3.1%)398/6100 (6.5%)−3.4%−4.1% to −2.8%<0.001
Peripheral vascular disease23 462/224 318 (10%)22 264/218 218 (10%)1198/6100 (20%)−9.4%−10% to −8.4%<0.001
Poor mobility1935/224 318 (0.9%)1798/218 218 (0.8%)137/6100 (2.2%)−1.4%−1.8% to −1.0%<0.001
Pulmonary hypertension4651/224 318 (2.1%)4249/218 218 (1.9%)402/6100 (6.6%)<0.001
LV function<0.001
 Very poor1382/224 318 (0.6%)1246/218 218 (0.6%)136/6100 (2.2%)
 Poor4512/224 318 (2.0%)4207/218 218 (1.9%)305/6100 (5.0%)
 Fair31 444/224 318 (14%)30 372/218 218 (14%)1072/6100 (18%)
 Good186 980/224 318 (83%)182 393/218 218 (84%)4587/6100 (75%)
Creatinine
 Median (IQR)85.00 (73.00–101.00)85.00 (73.00–100.00)100.00 (80.00–131.00)
 Mean (SD)93.99 (50.43)93.22 (48.76)122.49 (88.44)−29−32 to −27<0.001
 (Missing)14 26613 691575
Creatinine clearance<0.001
 Median (IQR)82.57 (62.67–100.58)82.95 (63.18–100.84)66.29 (46.85–99.04)
 (Missing)33 67832 990688
Pre-op dialysis<0.001
 None214 727/220 015 (98%)209 500/214 122 (98%)5227/5893 (89%)
 Dialysis for acute renal failure within 6 weeks of surgery838/220 015 (0.4%)687/214 122 (0.3%)151/5893 (2.6%)
 Dialysis for chronic renal failure1656/220 015 (0.8%)1484/214 122 (0.7%)172/5893 (2.9%)
 No dialysis but acute renal failure preop2794/220 015 (1.3%)2451/214 122 (1.1%)343/5893 (5.8%)
 (Missing)43034096207
CCS<0.001
 075 514/224 318 (34%)72 859/218 218 (33%)2655/6100 (44%)
 121 249/224 318 (9.5%)20 756/218 218 (9.5%)493/6100 (8.1%)
 264 613/224 318 (29%)63 575/218 218 (29%)1038/6100 (17%)
 343 717/224 318 (19%)42 715/218 218 (20%)1002/6100 (16%)
 419 225/224 318 (8.6%)18 313/218 218 (8.4%)912/6100 (15%)
NYHA<0.001
 148 602/224 318 (22%)47 578/218 218 (22%)1024/6100 (17%)
 297 609/224 318 (44%)96 017/218 218 (44%)1592/6100 (26%)
 365 752/224 318 (29%)63 556/218 218 (29%)2196/6100 (36%)
 412 355/224 318 (5.5%)11 067/218 218 (5.1%)1288/6100 (21%)
Urgency<0.001
 Elective142 456/224 318 (64%)140 024/218 218 (64%)2432/6100 (40%)
 Urgent73 575/224 318 (33%)71 455/218 218 (33%)2120/6100 (35%)
 Emergency7395/224 318 (3.3%)6234/218 218 (2.9%)1161/6100 (19%)
 Salvage892/224 318 (0.4%)505/218 218 (0.2%)387/6100 (6.3%)
Number of previous MIs<0.001
 0152 138/224 318 (68%)148 288/218 218 (68%)3850/6100 (63%)
 161 800/224 318 (28%)60 000/218 218 (27%)1800/6100 (30%)
 210 380/224 318 (4.6%)9930/218 218 (4.6%)450/6100 (7.4%)
Recent MI44 651/224 318 (20%)43 141/218 218 (20%)1510/6100 (25%)−5.0%−6.1% to −3.9%<0.001
Interval from MI to surgery<0.001
 No previous MI153 246/224 318 (68%)149 354/218 218 (68%)3892/6100 (64%)
 MI < 6 hours516/224 318 (0.2%)409/218 218 (0.2%)107/6100 (1.8%)
 MI 6–24 hours1023/224 318 (0.5%)880/218 218 (0.4%)143/6100 (2.3%)
 MI 1–30 days36 296/224 318 (16%)35 222/218 218 (16%)1074/6100 (18%)
 MI 31–90 days6816/224 318 (3.0%)6630/218 218 (3.0%)186/6100 (3.0%)
 MI >90 days26 421/224 318 (12%)25 723/218 218 (12%)698/6100 (11%)
Previous PCI<0.001
 No previous PCI198 924/224 318 (89%)193 587/218 218 (89%)5337/6100 (87%)
 PCI <24 h before surgery931/224 318 (0.4%)805/218 218 (0.4%)126/6100 (2.1%)
 PCI >24 h before surgery same admission2067/224 318 (0.9%)1958/218 218 (0.9%)109/6100 (1.8%)
 PCI >24 h before surgery previous admission22 396/224 318 (10.0%)21 868/218 218 (10%)528/6100 (8.7%)
Preoperative AF24 923/224 318 (11%)23 637/218 218 (11%)1286/6100 (21%)−10%−11% to −9.2%<0.001
Preoperative VF or VT440/224 318 (0.2%)387/218 218 (0.2%)53/6100 (0.9%)−0.69%−0.93% to −0.45%<0.001
Preoperative CHB/paced3203/224 318 (1.4%)2999/218 218 (1.4%)204/6100 (3.3%)−2.0%−2.4% to −1.5%<0.001
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Systolic PA pressure−11−12 to −9.9<0.001
 Median (IQR)24.00 (0.00–35.00)23.00 (0.00–35.00)32.00 (10.00–54.00)
 Mean (SD)23.30 (21.44)22.93 (21.20)33.84 (25.28)
 (Missing)153 722150 0703652
Critical preoperative state10 355/224 318 (4.6%)8918/218 218 (4.1%)1437/6100 (24%)−19%−21% to −18%<0.001
Cardiogenic shock2943/224 318 (1.3%)2204/218 218 (1.0%)739/6100 (12%)−11%−12% to −10%<0.001
Preoperative ventilation1838/224 318 (0.8%)1399/218 218 (0.6%)439/6100 (7.2%)−6.6%−7.2% to −5.9%<0.001
Preoperative nitrates10 208/224 318 (4.6%)9551/218 218 (4.4%)657/6100 (11%)−6.4%−7.2% to −5.6%<0.001
Preoperative inotropes2651/224,31 (1.2%)1994/218218 (0.9%)657/6100 (11%)−9.9%−11% to −9.1%<0.001
Number of previous operations<0.001
 0154 514/163 933 (94%)150 780/159 244 (95%)3734/4689 (80%)
 18372/163933 (5.1%)7555/159 244 (4.7%)817/4689 (17%)
 2877/163 933 (0.5%)773/159 244 (0.5%)104/4689 (2.2%)
 3140/163 933 (<0.1%)115/159 244 (<0.1%)25/4689 (0.5%)
 427/163 933 (<0.1%)19/159 244 (<0.1%)8/4689 (0.2%)
 51/163 933 (<0.1%)1/159 244 (<0.1%)0/4689 (0%)
 62/163 933 (<0.1%)1/159 244 (<0.1%)1/4689 (<0.1%)
 (Missing)60 38558 9741411
Previous CABG3010/224 318 (1.3%)2666/218 218 (1.2%)344/6100 (5.6%)−4.4%−5.0% to −3.8%<0.001
Previous valve surgery6360/224 318 (2.8%)5695/218 218 (2.6%)665/6100 (11%)−8.3%−9.1% to −7.5%<0.001
Previous aortic arch surgery1223/224 318 (0.5%)1050/218 218 (0.5%)173/6100 (2.8%)−2.4%−2.8% to −1.9%<0.001
Previous descending aortic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Previous thoracic surgery1/224 318 (<0.1%)1/218 218 (<0.1%)0/6100 (0%)0.00%0.00% to 0.00%>0.9
Endocarditis6309/224 318 (2.8%)5816/218 218 (2.7%)493/6100 (8.1%)−5.4%−6.1% to −4.7%<0.001
Post-infarct VSD195/224 318 (<0.1%)111/218 218 (<0.1%)84/6100 (1.4%)−1.3%−1.6% to −1.0%<0.001
Operation<0.001
 CABG only113,190 / 224,318 (50.5%)111 640/218 218 (51.2%)1550/6100 (25.4%)1,550 / 6,100 (25.4%)
 AVR only35,161 / 224,318 (15.7%)34 570/218 218 (15.8%)591/6100 (9.7%)591 / 6,100 (9.7%)
 CABG+AVR21,978 / 224,318 (9.8%)21 187/218 218 (9.7%)791/6100 (13%)791 / 6,100 (13%)
 CABG+MVR4,556 / 224,318 (2.0%)4268/218 218 (2%)288/6100 (4.7%)288 / 6,100 (4.7%)
 MVR only14,005 / 224,318 (6.2%)13 671/218 218 (6.3%)334/6100 (5.5%)334 / 6,100 (5.5%)
 Aortic ± other13,922 / 224,318 (6.2%)12 669/218 218 (5.8%)1253/6100 (20.5%)1,253 / 6,100 (20.5%)
 Other18,598 / 224,318 (8.3%)17 514/218 218 (8.0%)1084/6100 (17.8%)1,084 / 6,100 (17.8%)
 Unclassified2,878 / 224,318 (1.3%)2699/218 218 (1.2%)209/6100 (3.4%)209 / 6,100 (3.4%)
Days between angiogram and surgery0.2
 Median (IQR)62.00 (14.00–136.04)62.00 (14.00–136.04)36.00 (9.00–137.00)
 (Missing)68 44466 0122432
Left main stem disease38 474/224 318 (17%)37 529/218 218 (17%)945/6100 (15%)1.7%0.78%,2.6%<0.001
Extent of CAD
 Nil > 50%61 174/200 018 (31%)59 487/195 121 (30%)1687/4897 (34%)
 One vessel >50%18 092/200 018 (9.0%)17 510/195 121 (9.0%)582/4 897 (12%)
 2 vessels >50%31 917/200 018 (16%)31 206/195 121 (16%)711/4897 (15%)
 3 vessels > 50%88 835/200 018 (44%)86 918/195 121 (45%)1917/4897 (39%)
 (Missing)24 30023 0971203
Number of grafts<0.001
 119 083/164 803 (12%)18 320/160 736 (11%)763/4067 (19%)
 235 308/164 803 (21%)34 428/160 736 (21%)880/4067 (22%)
 361 855/164 803 (38%)60 648/160 736 (38%)1207/4067 (30%)
 425 362/164 803 (15%)24 965/160 736 (16%)397/4067 (9.8%)
 53362/164 803 (2.0%)3294/160 736 (2.0%)68/4067 (1.7%)
 6339/164 803 (0.2%)331/160 736 (0.2%)8/4067 (0.2%)
 (Missing)59 51557 4822033
First operator grade<0.001
 Consultant170 813/211 897 (81%)165 411/205 977 (80%)5402/5920 (91%)
 Staff grade6745/211 897 (3.2%)6674/205 977 (3.2%)71/5920 (1.2%)
 Registrar30 167/211 897 (14%)29 840/205 977 (14%)327/5920 (5.5%)
 SHO4172/211 897 (2.0%)4052/205 977 (2.0%)120/5920 (2.0%)
 (Missing)12 42112 241180
Payer status<0.001
 NHS214 259/222 167 (96%)208 318/216 124 (96%)5941/6043 (98%)
 Private7908/222 167 (3.6%)7806/216 124 (3.6%)102/6043 (1.7%)
 (Missing)2151209457
a

Mean (SD) or frequency (%).

b

Two sample test for equality of proportions; Welch 2-sample t-test.

BMI: basal metabolic index; CABG: coronary artery bypass graft; CCS: Canadian Cardiovascular score; CI: confidence interval; CVA: cerebrovascular accident, IQR: interquartile range; LV: left ventricular, MI: myocardial infarction; NYHA: New York Heart Association, TIA: transient ischaemic attack, AF: Atrial fibrillation, AVR: aortic valve repair/replacement, CAD: coronary artery disease, CHB: complete heart block, MVR: mitral valve repair/replacement, NHS: National Health Service, PA: pulmonary artery, PCI: percutaneous coronary intervention, SHO: senior house officer, VF: ventricular fibrillation, VSD: ventricular septal defect, VT: ventricular tachycardia.

Final model

The final model consisted of the variables in Table 2.

Table 2:

Variables used in the final model

XGBoost-23
Type of operation
Age
Creatinine clearance
Urgency
New York Heart Association Score
Critical pre-operative state
Peripheral vascular disease
Previous myocardial infarction
Pre-operative cardiac rhythm
Pulmonary disease
First operator grade
Diabetes
Basal metabolic index
Mitral valve procedure
Left ventricular function
Pre-operative stroke
Previous valve surgery
Aortic arch procedure
Previous coronary artery bypass
Need for mechanical support
Hospital Code
Previous operation
Number of grafts
XGBoost-23
Type of operation
Age
Creatinine clearance
Urgency
New York Heart Association Score
Critical pre-operative state
Peripheral vascular disease
Previous myocardial infarction
Pre-operative cardiac rhythm
Pulmonary disease
First operator grade
Diabetes
Basal metabolic index
Mitral valve procedure
Left ventricular function
Pre-operative stroke
Previous valve surgery
Aortic arch procedure
Previous coronary artery bypass
Need for mechanical support
Hospital Code
Previous operation
Number of grafts
Table 2:

Variables used in the final model

XGBoost-23
Type of operation
Age
Creatinine clearance
Urgency
New York Heart Association Score
Critical pre-operative state
Peripheral vascular disease
Previous myocardial infarction
Pre-operative cardiac rhythm
Pulmonary disease
First operator grade
Diabetes
Basal metabolic index
Mitral valve procedure
Left ventricular function
Pre-operative stroke
Previous valve surgery
Aortic arch procedure
Previous coronary artery bypass
Need for mechanical support
Hospital Code
Previous operation
Number of grafts
XGBoost-23
Type of operation
Age
Creatinine clearance
Urgency
New York Heart Association Score
Critical pre-operative state
Peripheral vascular disease
Previous myocardial infarction
Pre-operative cardiac rhythm
Pulmonary disease
First operator grade
Diabetes
Basal metabolic index
Mitral valve procedure
Left ventricular function
Pre-operative stroke
Previous valve surgery
Aortic arch procedure
Previous coronary artery bypass
Need for mechanical support
Hospital Code
Previous operation
Number of grafts

XGBoost-23

The XGBoost model with the greatest training set AUC had 27 variables—AUC 0.837 (95% CI: 0.837–0.838) and F1 score 0.277 (95% CI: 0.276–0.279) (Supplementary Material, Table S3). However, visual inspection of the AUC curve (Supplementary Material, Fig. S4) and output values (Supplementary Material, Table S3) showed that there was minimal loss in discrimination with less variables. Hence, after discussion and in the interest of parsimony, a ‘sensitivity’ analysis was performed to assess the alteration in model performance with a lower number of variables (20, 23, and 25) (Supplementary Material, Table S3). There was a marked drop in NB (treated) when going from 23 to 20 variables. There was risk over-estimation at 20 variables and under-estimation at 23 and 25 variables, with minimal differences in clinical utility between the 23 and 25 variable models. Good performance was achieved in terms of AUC, F1 score, Adjusted Brier Score, Adjusted ECE score and clinical NB when benchmarked against EuroSCORE II [25, 26] (Supplementary Material, Table S3B). On balance we opted to move forward with a XGBoost model with 23 variables (i.e. XGBoost-23)—AUC 0.846 (95% CI: 0.845–0.846) and F1 score 0.288 (95% CI: 0.287–0.290) (Fig. 2A), which showed good calibration even with those at high predicted risk (Fig. 2B). The number of patients in each risk centile, as predicted by XGBoost-23, are shown in Table 3 and Supplementary Material, Fig. S5. There are very few patients with a predicted risk above 30% that have undergone surgery, as would be expected. The DCA (Fig. 2C) showed an NB in those treated at all threshold probabilities below 60%.

Performance of the final model developed using XGBoost with feature selection. (A) Discrimination, (B) calibration, (C) decision curve analysis. AUC: Area under the receiver operator curve; CI: confidence interval; ROC: receiver operating characteristic.
Figure 2:

Performance of the final model developed using XGBoost with feature selection. (A) Discrimination, (B) calibration, (C) decision curve analysis. AUC: Area under the receiver operator curve; CI: confidence interval; ROC: receiver operating characteristic.

Table 3:

Number of patients in each decile of predicted risk, as per XGBoost-23, in the test group

Predicted risk range %Total numberSurvivedNot survivedActual % risk
0–10%65 86864 73111371.73
>10–20%2061174931215.14
>20–30%62147314823.83
>30–40%2351468937.87
>40–50%133696448.12
>50–60%84384654.76
>60–70%43152865.12
>70–80%92777.78
>80–90%0000.00
>90–100%0000.00
Predicted risk range %Total numberSurvivedNot survivedActual % risk
0–10%65 86864 73111371.73
>10–20%2061174931215.14
>20–30%62147314823.83
>30–40%2351468937.87
>40–50%133696448.12
>50–60%84384654.76
>60–70%43152865.12
>70–80%92777.78
>80–90%0000.00
>90–100%0000.00
Table 3:

Number of patients in each decile of predicted risk, as per XGBoost-23, in the test group

Predicted risk range %Total numberSurvivedNot survivedActual % risk
0–10%65 86864 73111371.73
>10–20%2061174931215.14
>20–30%62147314823.83
>30–40%2351468937.87
>40–50%133696448.12
>50–60%84384654.76
>60–70%43152865.12
>70–80%92777.78
>80–90%0000.00
>90–100%0000.00
Predicted risk range %Total numberSurvivedNot survivedActual % risk
0–10%65 86864 73111371.73
>10–20%2061174931215.14
>20–30%62147314823.83
>30–40%2351468937.87
>40–50%133696448.12
>50–60%84384654.76
>60–70%43152865.12
>70–80%92777.78
>80–90%0000.00
>90–100%0000.00

Variable importance

Variable importance of the XGBoost-23 model conducted on the training/validation set is shown in Fig. 3 and Supplementary Material, Table S4.The most influential factors are the type of operation, age, creatinine clearance, urgency and New York Heart Association scores.

Variable importance of the final XGBoost-23 model. Ao.Arch.Procedure: operation on the aortic arch; BMI: basal metabolic index; CardiacRhythm: preoperative cardiac rhythm; CPS: critical pre-operative state; CrCl: creatinine clearance; HospCode: hospital identification code; LVF2: left ventricular function using same criteria as EuroSCOREII; mechanicalSupport: need for pre-operative mechanical support. NYHA: New York Heart Association Score; PrevCABG: previous coronary artery bypass grafts; PrevMI: number of previous myocardial infarctions; PrevOp: any previous cardiac surgery; PrevValve: previous valve surgery; PVD: peripheral vascular disease; Stroke: pre-operative stroke.
Figure 3:

Variable importance of the final XGBoost-23 model. Ao.Arch.Procedure: operation on the aortic arch; BMI: basal metabolic index; CardiacRhythm: preoperative cardiac rhythm; CPS: critical pre-operative state; CrCl: creatinine clearance; HospCode: hospital identification code; LVF2: left ventricular function using same criteria as EuroSCOREII; mechanicalSupport: need for pre-operative mechanical support. NYHA: New York Heart Association Score; PrevCABG: previous coronary artery bypass grafts; PrevMI: number of previous myocardial infarctions; PrevOp: any previous cardiac surgery; PrevValve: previous valve surgery; PVD: peripheral vascular disease; Stroke: pre-operative stroke.

DISCUSSION

This study is an example of how using a feature-selected gradient boosting ML algorithm (XGBoost) can produce a mortality risk prediction model with good discrimination, calibration and clinical utility. This is the largest national study and builds on our previous work [11].

We began with 61 variables and used automated feature selection to develop the final model. It was developed with a view to optimize discrimination and indeed had not only a higher AUC and F1 score compared to EuroSCORE II and previous studies but also showed improved calibration and clinical utility [26]. It showed good calibration across all deciles of risk. Most patients had risks less than 10%, as would be expected of those undergoing surgery. However, in the higher-risk patients, XGBoost-23 continued to perform well. DCA showed that with probability thresholds under 60% there is a clear NB when using XGBoost-23 suggesting patients would be appropriately operated upon if using this model. Improved performance in the few high-risk patients becomes important when considering operations on patients with few alternatives (e.g. aortic dissections) or in units routinely operating on high-risk cases. It would be interesting to review model performance in those turned down for surgery but this data is not routinely collected.

Notably, we have not sought any information that is not already routinely gathered though many additional risk factors have been proposed previously [27, 28]. We also rationalized the final XGBoost model in the name of parsimony and to encourage its use by clinicians going forward.

No matter which modelling tool is utilized the relationship between patient, perioperative factors and death is inherently complex. Alternative ML models, such as XGBoost, offer the benefit of requiring little input from the model developer and having innate additive properties such that additional information can be used to adapt models without re-running the algorithm from the beginning. However, it is hard to depict as a standard formula and is less trusted by some. With the best performing model (XGBoost-23) we have described variable importance, which should also aid in greater understanding and acceptance in the wider surgical community. We are developing a website to host the model and allow prospective evaluation by others. LR has the advantage that it is well understood by the scientific community and publishing of coefficients allows external validation. However, modelling of interactions and non-linear relationships between the explanatory and response variables will be needed to optimize a LR model. The addition of new results would also require re-running the model development process.

Ensemble tree based, particularly gradient boosting, algorithms have been used with increasing frequency recently. Ours is the largest database studied, with only 3 other papers having more than 10 000 patients [30–32]. Most studies have shown marginal improvements in model performance with the use of alternative ML algorithms compared to commonly used models and retrained-LR models [31]. Interestingly Ong et al. [13] also showed improvements in discrimination when training models on local data. Although intuitive it reinforces the need to use either local variables or, as we have done, use hospital codes. Previous reviews have shown variable model performance on the basis of the hospital [4, 11–13] and operation undertaken [5, 14, 15]. Though the team behind the EuroSCORE advocated using hospital risk-adjusted mortality ratios, we felt that the hospital code could be incorporated into the final model from the outset and would increase ease of use. Zeng et al. [29] also performed a very thorough comparative study of different gradient boosting algorithms with a variety of feature selection methods but found only minor differences in performance between them and a retrained-LR model. Our study is unique in performing DCA in addition to discrimination and calibration. We feel this is crucial as improved clinical utility is the ultimate aim for prediction modelling.

Some variables are similar to those already taken into account in published scoring systems–age, creatinine clearance, New York Heart Association, urgency, critical pre-operative state, previous surgery, left ventricular function, peripheral vascular disease and pulmonary disease. Others have been replaced with similar variables of greater granularity, such as all diabetes management strategies, types of operations performed and distinguishing between mitral valve repair and replacement. New risk factors include the presence of a previous myocardial infarction (MI) not just recent MI, number of grafts inserted, pre-operative cardiac rhythm, first operator grade, basal metabolic index, pre-operative stroke, aortic arch procedure and need for mechanical support. Furthermore, differences in risk between those with previous valve surgery and previous coronary artery bypass surgery are also accounted for.

With the advent of more artificial intelligence-based search engines, there is increased public awareness and acceptance of these processes. We believe this should extend to the cardiac surgical community. With increasing server capacity and ultimately prospective data analysis we hope to be able to assess and fine tune this model further. The importance of risk prediction for morbidities (e.g. stroke, MI), resource utilization (e.g. length of stay, cost of intervention [33]) and interventional procedures [34] also has clear benefits for patient counselling and resource planning. Any future model utilized needs to be accurate, easy to use, avoid over-fitting and be recalibrated periodically.

Limitations and future work

The current model was created using a historic database. With the advent of more interventional strategies and the interruption of services secondary to the pandemic we should explore temporal differences in model performance. Prospective analysis is warranted before routine adoption of any model. Mortality following cardiac surgery in the UK is low [35] and strategies for managing class imbalance are needed. Future work will also include the development of an optimized LR-based model in order to perform a comparative analysis.

CONCLUSIONS

A feature-selected XGBoost-based model for 30 day or in-hospital mortality in adult cardiac surgery showed good discrimination, calibration and clinical benefit. Prospective assessment of a XGBoost-derived model in an external dataset is warranted.

SUPPLEMENTARY MATERIAL

Supplementary material is available at EJCTS online.

FUNDING

This study was supported by a British Heart Foundation-Turing research grant HF-Turing-19_2_1020.AJ was supported by the National Institute for Health and Care Research Bristol Biomedical Research Centre, University Hospitals Bristol and Weston NHS Foundation Trust and University of Bristol(NIHR203315).

Conflict of interest: none declared.

DATA AVAILABILITY

The data underlying this article were provided by NICOR/HQIP under licence/by permission. Data will be shared on request to the corresponding author with permission of NICOR/HQIP.

ETHICS STATEMENT

This register-based cohort study is part of research approved by the Health Research Authority (HRA) and Health and Care Research Wales and a need for patients consent was waived (HCRW)(IRAS ID: 257758, 23.7.2019).

Author contributions

Shubhra Sinha: Conceptualization; Data curation; Formal analysis; Methodology; Supervision; Validation; Writing—original draft; Writing—review & editing. Tim Dong: Conceptualization; Formal analysis; Methodology; Software; Visualization; Writing—review & editing. Arnaldo Dimagli: Conceptualization; Methodology; Writing—review & editing. Andrew Judge: Supervision; Writing—review & editing. Gianni D. Angelini: Funding acquisition; Methodology; Project administration; Supervision; Writing—review & editing.

Reviewer information

Reviewer information European Journal of Cardio-Thoracic Surgery thanks the anonymous reviewers for their contribution to the peer review process of this article.

REFERENCES

1

Baumgartner
H
,
Falk
V
,
Bax
JJ
,
De Bonis
M
,
Hamm
C
,
Holm
PJ
et al. ;
ESC Scientific Document Group
.
2017 ESC/EACTS guidelines for the management of valvular heart disease
.
Eur Heart J
2017
;
38
:
2739
91
.

2

Neumann
FJ
,
Sousa-Uva
M
,
Ahlsson
A
,
Alfonso
F
,
Banning
AP
,
Benedetto
U
et al.
2018 ESC/EACTS Guidelines on myocardial revascularization. The Task Force on myocardial revascularization of the European Society of Cardiology (ESC) and European Association for Cardio-Thoracic Surgery (EACTS)
.
Eur Heart J
2019
;
40
:
87
165
.

3

Nashef
SAM
,
Roques
F
,
Michel
P
,
Gauducheau
E
,
Lemeshow
S
,
Salamon
R.
European system for cardiac operative risk evaluation (EuroSCORE)
.
Eur J Cardiothorac Surg
1999
;
16
:
9
13
.

4

Nashef
SAM
,
Roques
F
,
Sharples
LD
,
Nilsson
J
,
Smith
C
,
Goldstone
AR
et al
EuroSCORE II
.
Eur J Cardiothorac Surg
2012
;
41
:
734
45
.

5

Michel
P
,
Roques
F
,
Nashef
SAM
;
EuroSCORE Project Group
.
Logistic or additive EuroSCORE for high-risk patients?
Eur J Cardiothorac Surg
2003
;
23
:
684
7
; discussion 687.

6

Karabulut
H
,
Toraman
F
,
Alhan
C
,
Camur
G
,
Evrenkaya
S
,
Dağdelen
S
et al
EuroSCORE overestimates the cardiac operative risk
.
Cardiovasc Surg
2003
;
11
:
295
8
.

7

Barmettler
H
,
Immer
FF
,
Berdat
PA
,
Eckstein
FS
,
Kipfer
B
,
Carrel
TP.
Risk-stratification in thoracic aortic surgery: should the EuroSCORE be modified?
Eur J Cardiothorac Surg
2004
;
25
:
691
4
.

8

van Straten
AHM
,
Tan
E
,
Hamad
MAS
,
Martens
EJ
,
van Zundert
AAJ.
Evaluation of the EuroSCORE risk scoring model for patients undergoing coronary artery bypass graft surgery: a word of caution
.
NHJL
2010
;
18
:
355
9
.

9

Sergeant
P
,
De Worm
E
,
Meyns
B.
Single centre, single domain validation of the EuroSCORE on a consecutive sample of primary and repeat CABG
.
Eur J Cardiothorac Surg
2001
;
20
:
1176
82
.

10

Gummert
JF
,
Funkat
A
,
Osswald
B
,
Beckmann
A
,
Schiller
W
,
Krian
A
et al
EuroSCORE overestimates the risk of cardiac surgery: results from the national registry of the German Society of Thoracic and Cardiovascular Surgery
.
Clin Res Cardiol
2009
;
98
:
363
9
.

11

Sinha
S
,
Dong
T
,
Dimagli
A
,
Vohra
HA
,
Holmes
C
,
Benedetto
U
et al.
Comparison of machine learning techniques in prediction of mortality following cardiac surgery: analysis of over 220 000 patients from a large national database
.
Eur J Cardiothorac Surg
2023
;
63
:ezad183.

12

Grant
SW
,
Grayson
AD
,
Jackson
M
,
Au
J
,
Fabri
BM
,
Grotte
G
et al.
Does the choice of risk-adjustment model influence the outcome of surgeon-specific mortality analysis? A retrospective analysis of 14 637 patients under 31 surgeons
.
Heart
2008
;
94
:
1044
9
.

13

Ong
CS
,
Reinertsen
E
,
Sun
H
,
Moonsamy
P
,
Mohan
N
,
Funamoto
M
et al
Prediction of operative mortality for patients undergoing cardiac surgical procedures without established risk scores
.
J Thorac Cardiovasc Surg
2023
;
165
:
1449
59.e15
.

14

Barili
F
,
Pacini
D
,
Capo
A
,
Ardemagni
E
,
Pellicciari
G
,
Zanobini
M
et al.
Reliability of new scores in predicting perioperative mortality after isolated aortic valve surgery: a comparison with the Society of Thoracic Surgeons score and logistic EuroSCORE
.
Ann Thorac Surg
2013
;
95
:
1539
44
.

15

Grant
SW
,
Hickey
GL
,
Dimarakis
I
,
Cooper
G
,
Jenkins
DP
,
Uppal
R
et al.
Performance of the EuroSCORE models in emergency cardiac surgery
.
Circ Cardiovasc Qual Outcomes
2013
;
6
:
178
85
.

16

Team RC
. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing,
2020
. https://www.r-project.org/ (4 April 2024, date last accessed).

17

Benedetto
U
,
Sinha
S
,
Dimagli
A
,
Cooper
G
,
Mariscalco
G
,
Uppal
R
et al.
Decade-long trends in surgery for acute type A aortic dissection in England: a retrospective cohort study
.
Lancet Reg Health Eur
2021
;
7
:
100131
.

18

Berriman
R
,
Hickey
G
, National Adult Cardiac Surgery Audit. National Institute for Cardiovascular Outcomes Research (NICOR). National Adult Cardiac Surgery Audit Registry Data Pre-processing.
2015
. www.ucl.ac.uk/nicor/audits/Adultcardiacsurgery%0Ahttps://www.ucl.ac.uk/nicor/audits/adultcardiac/documents/reports/annualreport2010-11 (4 April 2024, date last accessed).

19

NICOR
. NICOR Datasets.
2015
. https://www.nicor.org.uk/datasets/ (4 April 2024, date last accessed).

20

van Rossum
G.
Python Tutorial, Technical Report CS-R9526. Amsterdam: Centrum Wiskunde & Informatica,
1995
.

21

Raschka
S.
MLxtend : providing machine learning and data science utilities and extensions to Python ’ s scientific computing stack
.
J Open Source Softw
2018
;
3
:
24
5
.

22

Pudil
P
,
Novovičová
J
,
Kittler
J.
Floating search methods in feature selection
.
Pattern Recogn Lett
1994
;
15
:
1119
25
.

23

Lundberg
SM
,
Lee
SI.
A unified approach to interpreting model predictions
. In Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R (eds.), 'Advances in Neural Information Processing Systems 30', Curran Associates, Inc.,
2017
:
4765
74
.

24

Shapley
L.
Chapter: A Value for n-Person Games
.
In Kuhn H, Tucker A, (eds.), Contributions to the Theory of Games II
.
Princeton University Press
,
1953
,
307
17
.

25

Dong
T
,
Sinha
S
,
Zhai
B
,
Fudulu
DP
,
Chan
J
,
Narayan
P
et al.
Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: a benchmarking study
.
Digit Health
2023
;
9
:20552076231187605. doi: .

26

Dong
T
,
Sinha
S
,
Zhai
B
,
Fudulu
D
,
Chan
J
,
Narayan
P
et al.
Performance drift in machine learning models for cardiac surgery risk prediction: retrospective analysis
.
JMIRx Med
2024
;
5
:
e45973
.

27

Roques
F
,
Nashef
SAM
,
Michel
P
,
Gauducheau
E
,
De Vincentiis
C
,
Baudet
E
et al.
Risk factors and outcome in European cardiac surgery: analysis of the EuroSCORE multinational database of 19030 patients
.
Eur J Cardiothorac Surg
1999
;
15
:
816
23
.

28

Head
SJ
,
Osnabrugge
RLJ
,
Howell
NJ
,
Freemantle
N
,
Bridgewater
B
,
Pagano
D
et al.
A systematic review of risk prediction in adult cardiac surgery: considerations for future model development
.
Eur J Cardiothorac Surg
2013
;
43
:
e121
9
.

29

Zeng
J
,
Zhang
D
,
Lin
S
,
Su
X
,
Wang
P
,
Zhao
Y
et al.
Comparative analysis of machine learning vs. traditional modeling approaches for predicting in-hospital mortality after cardiac surgery: temporal and spatial external validation based on a nationwide cardiac surgery registry
.
Eur Heart J Qual Care Clin Outcomes
2024
;
10
:
121
31
.

ABBREVATIONS

     
  • AUC

    Area under the receiver operator curve

  •  
  • CI

    Confidence interval

  •  
  • DCA

    Decision curve analysis

  •  
  • ECE

    Expected calibration error

  •  
  • EuroSCORE

    European System for Cardiac Operative Risk Evaluation

  •  
  • LR

    Logistic regression

  •  
  • MI

    Myocardial infarction

  •  
  • ML

    Machine learning

  •  
  • NACSA

    National Adult Cardiac Surgery Audit

  •  
  • NB

    Net benefit

  •  
  • SHAP

    SHapley Adaptive exPlanations

  •  
  • XGBoost

    Extreme gradient boosting machine

Author notes

Shubhra Sinha and Tim Dong authors contributed equally to this study.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data