Machine Learning Based Prediction of Postoperative Acute Kidney Injury Risk in Coronary Artery Bypass Grafting Patients

Yang Zhang; Dabei Cai; Ye Deng; Zhu Wang; Zhihan Zhang; Hu Zhang; Qingjie Wang; Shoujie Feng; Ling Sun; Jun Wei

doi:10.2147/CIA.S543303

Back to Journals » Clinical Interventions in Aging » Volume 20

Original Research

Machine Learning Based Prediction of Postoperative Acute Kidney Injury Risk in Coronary Artery Bypass Grafting Patients

Authors Zhang Y, Cai D , Deng Y, Wang Z, Zhang Z, Zhang H, Wang Q , Feng S, Sun L , Wei J

Received 11 June 2025

Accepted for publication 6 November 2025

Published 15 November 2025 Volume 2025:20 Pages 2033—2048

DOI https://doi.org/10.2147/CIA.S543303

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Maddalena Illario

Download Article [PDF]

Yang Zhang,^1,^* Dabei Cai,^2,^* Ye Deng,^2,^* Zhu Wang,¹ Zhihan Zhang,¹ Hu Zhang,¹ Qingjie Wang,² Shoujie Feng,¹ Ling Sun,³ Jun Wei¹

¹Department of Cardiovascular Surgery, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, 221000, People’s Republic of China; ²Department of Cardiology, The Third Affiliated Hospital of Nanjing Medical University, Changzhou, Jiangsu, 213000, People’s Republic of China; ³Department of Cardiology, The Affiliated Wuxi People’s Hospital of Nanjing Medical University, Wuxi People’s Hospital, Wuxi Medical Center, Nanjing Medical University, Wuxi, 214023, People’s Republic of China

*These authors contributed equally to this work

Correspondence: Ling Sun, Department of Cardiology, The Affiliated Wuxi People’s Hospital of Nanjing Medical University, Wuxi People’s Hospital, Wuxi Medical Center, Nanjing Medical University, Wuxi, 214023, People’s Republic of China, Email [email protected] Jun Wei, Department of Cardiovascular Surgery, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, 221000, People’s Republic of China, Email [email protected]

Background: Coronary artery bypass grafting (CABG) is key for severe coronary artery disease, but postoperative acute kidney injury (AKI) may increase mortality and prolong hospital stays. Reliable models for early prediction of post-CABG AKI remain lacking.
Methods: Data of 520 CABG patients (September 2021–December 2024) from the Affiliated Hospital of Xuzhou Medical University were collected, and the patients were divided into a training group (70%, for model building) and a validation group (30%). Key variables were screened through Least Absolute Shrinkage and Selection Operator (LASSO) regression, followed by the construction of six machine learning models: Random Forest (RF), eXtreme Gradient Boosting (XGBoost), Logistic Regression (LR), Light Gradient Boosting Machine (LightGBM), Softmax Regression, and Support Vector Machine (SVM). The SHapley Additive exPlanations (SHAP) was used to quantify feature importance.
Results: The incidence of post-CABG AKI was 25.96%, and the median age of patients in the AKI group was significantly higher than that in the non-AKI group (66.09 ± 8.15 vs 64.32 ± 7.76, p = 0.025). In the training group, the XGBoost model using the top 5 important variables outperformed other models (Area Under the Curve [AUC] = 0.89, 95% Confidence Interval [CI]: 0.86– 0.91), followed by the LightGBM model using the top 5 important variables and the RF model using the top 5 important variables (both had an AUC of 0.88; 95% CI: 0.85– 0.90 and 0.85– 0.91, respectively). In the validation group, the LR model using the top 15 important variables and the Softmax Regression model using the top 15 important variables maintained the highest stability (both had an AUC of 0.86, 95% CI: 0.79– 0.92). SHAP analysis confirmed that estimated glomerular filtration rate (eGFR), intraoperative epinephrine use and calcium levels were the top three predictive factors.
Conclusion: The machine learning models constructed in this study can effectively predict post-CABG AKI, facilitating early identification of high-risk patients.

Keywords: coronary artery bypass grafting, acute kidney injury, machine learning, prediction model, area under the receiver operating characteristic curve

Introduction

Cardiovascular diseases, especially coronary artery disease (CAD), are the leading cause of death worldwide.¹ Coronary artery bypass grafting (CABG) is the main means of treatment for severe coronary heart disease (CHD) and is the preferred treatment for revascularization in patients with severe multivessel disease and complex coronary artery disease.^2,3 However, a variety of complications can occur after CABG, with acute kidney injury (AKI) being particularly common, occurring in 15–30%.^4–6 AKI not only affects the prognosis of patients but also increases the economic burden, which is mainly manifested by increased serum creatinine and decreased urine output,⁷ and is closely related to increased mortality, prolonged hospital stay and increased medical costs.^8,9 Although serum creatinine (SCr) levels are often used as a diagnostic marker and grading basis for AKI, early studies have found a lag.¹⁰ Given the rapid progression of AKI and the high mortality rate, early identification and intervention of such patients are essential to reduce morbidity and reduce the healthcare burden.^11–14 Unfortunately, there is a lack of effective clinical prediction models for early detection of AKI. Failure to identify and treat patients at high risk of AKI after CABG may lead to the progression of chronic renal failure or even end-stage renal disease, further increasing the risk of death. The high morbidity and mortality of AKI after cardiac surgery underscores its urgency as an important issue. However, the pathogenesis of AKI after CABG is complex and not fully understood.¹⁵ Therefore, clarifying the risk factors of AKI after CABG and exploring effective prediction models have become key issues to be solved urgently.

Machine learning, an important branch of artificial intelligence, is a scientific technology that learns patterns from complex data through algorithms to predict behavioral outcomes and trends.¹⁶ It possesses powerful capabilities in feature extraction, matching, and information integration, enabling it to learn completely driven by data without pre-assuming the relationship between input variables and output results, thus forming an efficient approach distinct from traditional rule-based programming. This characteristic has made it widely applied in medical fields such as outcome prediction, diagnosis, medical image analysis, and treatment.^17,18 However, Machine learning-based predictive models for post-CABG AKI remain scarce. This study addresses this gap by integrating preoperative baseline data (eg, renal function, diabetes history) and intraoperative time-series physiological metrics (eg, urine output, hemodynamics) to develop a robust Machine learning model for cardiac surgery-associated acute kidney injury (CSAAKI), offering a novel paradigm for early postoperative AKI warning (early warning).

Materials and Methods

Patients’ Enrollment

The preoperative and intraoperative data of 520 patients who underwent CABG in the Department of Cardiac and Vascular Surgery of the Affiliated Hospital of Xuzhou Medical University from September 1, 2021 to December 31, 2024 were collected. The inclusion criteria were: 1. Coronary artery bypass surgery in our hospital; 2. Age ≥ 18 years; The exclusion criteria are: 1. Repeat CABG surgery; 2. No perioperative medical records. The study was conducted in accordance with the Declaration of Helsinki (revised in 2013). This study was approved by the Medical Ethics Committee of the Affiliated Hospital of Xuzhou Medical University (No. XYFY2025-KL154-01) and written informed consent was obtained from all participants.

Data Collection

Demographic characteristics: age, gender, body mass index, underlying diseases (hypertension, diabetes, neurological lesions, etc.), smoking history, cardiac function grade (New York Heart Association grade (NYHA grade)), Canadian Cardiovascular Society grade (CCS grade), blood type, etc. Preoperative medication: nitrates, statins, β blockers, diuretics, antibiotics, etc. Preoperative auxiliary examinations: electrocardiogram, cardiac color ultrasound (ejection fraction, etc.) Preoperative laboratory parameters: chloride, sodium, potassium, urea nitrogen, creatinine clearance, serum creatinine, direct bilirubin, total bilirubin, albumin, total protein, Alanine Aminotransferase (ALT), Aspartate Aminotransferase (AST), platelets, hematocrit, lymphocyte count, white blood cell count, neutrophil count, red blood cell count, hemoglobin, procalcitonin (PCT), Creatine kinase isoenzyme (CK-MB), troponin T (TnT), B-type natriuretic peptide (BNP), glycosylated hemoglobin (HbA1c). Preoperative coagulation function: activated partial thromboplastin time (APTT), prothrombin time (PT), international normalized ratio (INR). Intraoperative parameters: surgery-related: duration of surgery, extracorporeal bypass management (including temperature regulation); Intraoperative management: intraoperative medication, blood loss, intraoperative urine output; Number of intraoperative grafts. Immediate postoperative indicator: postoperative creatinine value.

Model Construction and Evaluation

Six machine learning models were constructed based on the features selected by the training cohort. The models used are: random forest (RF) model, extreme gradient boosting (XGBoost) model, Logistic regression (LR) model, Light Gradient Boosting Machine (LightGBM) model, Softmax Regression model and Support Vector Machine (SVM) model. Due to the small amount of data in this study, it is avoided.

RF is a model that can be used for both regression and classification.¹⁹ It is one of the most popular integration methods and falls under the specific category of bagging methods. This approach can be described as a technique of using a group of weak learners together to create a stronger aggregate learner. XGBoost is an optimized distributed gradient boosting library designed to be efficient, flexible, and portable. It implements machine learning algorithms under the gradient boosting framework.¹⁹ Traditional LR models are also used for model building.¹⁹ The LightGBM model is an efficient gradient boosting framework, which is designed for large-scale data and high-dimensional features, and is commonly used for machine learning tasks such as classification and regression.²⁰ Softmax regression has become a core method for multi-class classification tasks by leveraging probabilistic outputs and cross-entropy optimization.²¹ SVM aims to find a hyperplane in the feature space that optimally separates data into distinct classes. For non-linear datasets, the “kernel trick” enables SVMs to map the input data into a higher-dimensional space, where a linear separation becomes feasible.²² Grid Search Cross-Validation is a method of systematically traversing parameter combinations and combining cross-validation to evaluate model performance to find the optimal hyperparameters. In view of the limited scale of the dataset in this study, in order to effectively improve the generalization ability of the model and reduce the risk of overfitting, we deeply embed the grid search cross-validation method into the above machine learning algorithm framework. By systematically traversing the hyperparameter combination space and using cross-validation to evaluate the performance of each combination for multiple rounds, a more robust model evaluation system is constructed while fully mining the data feature information, so as to obtain the optimal model configuration with high fitting degree and strong stability.

Study Endpoint

The endpoint of the study was AKI during hospitalization, and the change in renal function at admission and serum creatinine level after admission was comprehensively evaluated according to the glomerular filtration rate, and the estimated Glomerular Filtration Rate (eGFR) was estimated according to the Modification of Diet in Renal Disease (MDRD) study equation at the time of admission calculated according to the first serum creatinine level and age, and the calculation formula was as follows:²³

eGFR [mL/(min*1.73m2)] = 186* (SCr) ^−1.154* (age) ^−0.23

The definition of AKI is based on serum creatinine (SCr) criteria for the overall outcome of kidney disease improvement (KDIGO).^24,25 Due to difficulties in urine output statistics in clinical work. Therefore, the diagnostic criteria for this article are: SCr elevation ≥0.3 mg/dL or SCr ≥26.5 mmol/L within 48 hours; 2.7 days of known or presumed SCr elevation ≥1.5 times baseline. where the baseline SCr value is defined as the last SCr value detected closest to the day of surgery within one week prior to undergoing extracorporeal cardiac surgery.

Statistical Analysis

The collected data was standardized, normalized, outliers were eliminated, and the KNN algorithm (n = 5) was used to fill in the gaps. Encode and transform different types of data to make them suitable for machine learning algorithm input requirements. The study cohort consisted of 520 patients, including a heterogeneous sample of AKI and non-AKI patients, with AKI patients accounting for only 25.96% of the entire cohort and non-AKI patients accounting for 74.04% of the entire cohort. However, the proportion of these two types is quite different, which may lead to the low prediction accuracy of the prediction model, so to solve the problem of classification imbalance, we use the Synthetic Minority Oversampling Technique (SMOTE).^26,27 The SMOTE method is an effective tool to solve the problem of uneven data distribution, which preprocesses the data in the training queue before constructing the model.

In the research related to heart bypass surgery conducted at our hospital, there are two main types of surgery, namely stop-skip bypass and non-stop skip bypass. Stop-skip surgery requires the help of cardiopulmonary bypass equipment to temporarily stop the heart beating during the operation, creating a relatively static and bloodless clear vision for the surgical operation, which is conducive to the fine vascular anastomosis operation. However, during this process, cardiopulmonary bypass may trigger a systemic inflammatory response that affects hemodynamic stability, which in turn has a potential adverse effect on renal perfusion.

Non-stop bypass surgery avoids the use of cardiopulmonary bypass and completes the vascular bypass in the state of continuous beating of the heart. This procedure minimizes the risk of complications associated with cardiopulmonary bypass, but requires high skill and experience from the surgeon, who needs to accurately complete the vascular anastomosis with limited operating space and heart beating interference.

In terms of anesthesia, our hospital uses mask oxygen induction anesthesia for heart bypass surgery. Before the surgery begins, the patient is inhaled with a high concentration of oxygen through a mask to maximize the body’s oxygen reserves. Subsequently, the appropriate dose of anesthetic drug is slowly administered to gradually bring the patient into anesthesia. This induction anesthesia method can ensure that the patient can receive the operation smoothly in a painless and unconscious state, while maintaining the stability of important physiological functions such as breathing and circulation. Good anesthesia effect is essential for maintaining intraoperative hemodynamic stability, which is directly related to the blood perfusion of the kidneys, and is closely related to the occurrence and development of acute kidney injury after cardiac bypass surgery. The subsequent construction of the machine learning model for acute kidney injury after cardiac bypass surgery will fully consider the potential impact of these two surgical methods and the unified anesthesia method, and dig deep into the data characteristics and rules to improve the accuracy and reliability of the model prediction.

Importance rankings for all variables were obtained by the SHapley Additive exPlanation (SHAP) method. SHAP can interpret the output of any machine learning model. Taking its name from SHapley additive interpretation and inspired by cooperative game theory, SHAP constructs an additive explanation model in which all features are considered contributors. SHAP provides a solid theoretical foundation for achieving local and global interpretability. The advantage of the SHAP value is that it not only gives us a SHAP value to assess the importance of the feature but it also shows us the positive or negative impact of the impact.^27,28

SPSS 26.0 was used for data analysis in this study; Use Python (version 3.9.11, Python Software Foundation, Chicago, USA). Normally distributed measurement data are expressed as mean (standard deviation). Frequency and percentage were used for categorical variables, chi-square test or Fisher’s exact test was used for differences between groups, median and interquartile range (IQR) was used for continuous variables, and differences between groups were determined using the Mann–Whitney U-test.

Results

Baseline Characteristics

After applying the inclusion and exclusion criteria, 520 patients with heart bypass were extracted from the database, and entered the final analysis. The incidence of AKI after heart bypass surgery was 25.96%, there was no difference in the proportion of men and women in the AKI group and non-acute kidney injury (Non-AKI) group (p = 0.90), and the median age of the AKI group was (66.09 ± 8.15) There was a statistically significant difference in median age between the non-AKI group and the non-AKI group (64.32 ± 7.76) (p = 0.025). Other baseline characteristics of the patient are shown in Table 1.

Table 1 Baseline Characteristics

Feature Selection for Models

In the feature screening phase, we employed the Least Absolute Shrinkage and Selection Operator (LASSO) binary logistic regression model—an approach that not only filters out non-significant features but also mitigates overfitting. To determine the optimal model parameters, we conducted 10-fold cross-validation, which ultimately enabled the selection of 31 key variables (Figure 1). The 31 variables including: Age, Alcohol, Hypertension, Pulmonary Hypertension, Atrial Fibrillation, Gamma-glutamyl Transferase (GGT), Albumin, Albumin/Globulin Ratio, eGFR, Sodium, Glycated Hemoglobin A1c (HbA1c), Mean Corpus Mean Corpuscular Hemoglobin Concentration (MCHC), Percentage of Eosinophils, Calcium, Hypoglycemic Drug, Insulin, Beta-Blocker, Combined Surgery, Duration of Surgery (DOS), Intraoperative Urine Output (IOUO), Duration of Anesthesia (DOA), Epinephrine, Cell Saver, Isoprenaline, Dopamine, Potassium Chloride, Atropine, Palonosetron, Mannitol, Amiodarone, and Duration of Minimum Temperature (DMT). Subsequently, based on these 31 key variables (ranked in descending order of importance, with the full list provided for enhanced model interpretability and practical application), we developed a series of machine learning models. These models were constructed using variable subsets of different sizes, specifically incorporating the top 5, 10, 15, 20, 25, and all 31 key variables, to evaluate the impact of variable quantity on model performance.

Figure 1 The result of LASSO regression. (A) LASSO coefficient curves for 194 risk factors. (B) LASSO coefficient path plot. Selected 31 risk factors with the help of LASSO regression analysis.

Machine Learning Models in the Training Cohort

We developed six machine learning algorithms: LR, Softmax Regression, RF, LightGBM, XGBoost, and SVM. Based on the importance ranking of the 31 key variables screened by LASSO regression, we sequentially adopted the top 5, top 10, top 15, top 20, top 25 variables, and all 31 variables. By gradually increasing the number of included variables, we divided the variables of different quantities into 6 groups and input them into the 6 machine learning algorithms, respectively, to construct different machine learning models. The amount of data in this paper is small, and the generalization ability and accuracy of the model are often difficult to guarantee. In order to improve the calculation accuracy of the model, you can add grid search cross-validation to the selected algorithm. This approach, which combines the advantages of grid search and cross-validation, can effectively optimize the hyperparameters of the model, thereby improving the performance of the model under limited data. The machine learning models using the first 5 important features in the training cohort were as follows: the AUC of the LR model was 0.79 (95% CI: 0.75, 0.83), and the AUC of the Softmax model was 0.79 (95% CI: 0.75, 0.83), the AUC of the RF model was 0.88 (95% CI: 0.85, 0.91), and the AUC of the LightGBM model was 0.88 (95% CI: 0.85, 0.90). The AUC of XGBoost model was 0.89 (95% CI: 0.86, 0.91), and that of SVM model was 0.82 (95% CI: 0.78, 0.85). All machine learning models performed the best in the training cohort with XGBoost, while the LR model and Softmax model performed the worst. We gradually increase the number of included variables to develop different machine learning models. The ROC curve shows the ROC curve for each model in the training cohort (Figure 2). The performance of all models is shown in the training cohort as the number of variables increases.

Figure 2 The ROC curves for machine learning models and the performances of all models in test cohort. The X-axis in 2G-2L represents the AUC values of each model. Top 5 variables (A and G); top 10 variables (B and H); top 15 variables (C and I); top 20 variables (D and J); top 25 variables (E and K); top 31 variables (F and L).

Machine Learning Models in the Validation Cohort

Validation was carried out in a validation cohort of 156 cases, and in all developed machine learning models, the AUC performance of each dataset was synthesized, and the model performance was closely related to the characteristics of the dataset. LR and Softmax performed best on most datasets due to their stable and high AUC values, and the AUC of the LR model for the first five variables (LR-5) was 0.79 (95% CI: 0.70–0.87). The LR model for the first 10 variables (LR-10) had an AUC of 0.81, (95% CI: 0.74–0.88); For the LR model of the first 15 variables (LR-15), the AUC was 0.86, (95% CI: 0.79–0.92); The LR model for the first 20 variables (LR-20) had an AUC of 0.84, (95% CI: 0.77–0.90); The LR model for the first 25 variables (LR-25) had an AUC of 0.82, (95% CI: 0.74–0.89); The LR model for the first 31 variables (LR-31) had an AUC of 0.82, (95% CI: 0.74–0.88). The Softmax model for the first five variables (Softmax-5) had an AUC of 0.79 (95% CI: 0.70–0.87); The Softmax model for the first 10 variables (Softmax-10) had an AUC of 0.81 (95% CI: 0.74–0.88); The Softmax model for the first 15 variables (Softmax-15) had an AUC of 0.86, (95% CI: 0.79–0.92); The Softmax model for the first 20 variables (Softmax-20) had an AUC of 0.84 (95% CI: 0.77–0.90); The Softmax model for the first 25 variables (Softmax-25) had an AUC of 0.82 (95% CI: 0.74–0.88); The Softmax model for the first 31 variables (Softmax-31) had an AUC of 0.82 (95% CI: 0.74–0.88). The SVM was the worst-performing algorithm due to its low and gradually decreasing AUC value, and the SVM model for the first five variables (SVM-5) with AUC of 0.79, (95% CI: 0.67–0.86), SVM model for the first 10 variables (SVM-10), AUC 0.77, (95% CI: 0.69–0.85), SVM model for the first 15 variables (SVM-15), AUC 0.69, (95% CI: 0.59–0.78), SVM model for the first 20 variables (SVM-20), AUC of 0.69, (95% CI: 0.58–0.78), AUC of 0.69 (95% CI: 0.0.59–0.79) for SVM model for the first 25 variables (SVM-25), AUC 0.65 (95% CI: 0.55–0.75) for SVM model for the first 31 variables (SVM-31). Random forest stood out in the first 25 and 31 variables of the dataset, with AUC of 0.80 (95% CI: 0.72–0.87) and 0.81 (95% CI: 0.72–0.88), showing good adaptability to complex feature interactions; LightGBM and XGBoost have potential, but their performance fluctuates greatly. ROC curves for each model are shown (Figure 3). The performance of all models is shown in the validation queue. At the same time, as the number of variables increases, a dynamic plot of the area under the ROC curve for all machine learning models in the cohort is validated (Figure 4) and the dynamic plot of the area under the ROC curve of all machine learning models in the training cohort (Figure 4).

Figure 3 The ROC curves for machine learning models and the performances of all models in validation cohort. The X-axis in 3G-3L represents the AUC values of each model. Top 5 variables (A and G); top 10 variables (B and H); top 15 variables (C and I); top 20 variables (D and J); top 25 variables (E and K); top 31 variables (F and L).

Figure 4 Dynamic plot of the area under the ROC curve for all machine learning models; training cohort (A); validation cohort (B). The x-axis represents the number of variables included in the model, the y-axis represents different kinds of models, and the z-axis represents the AUC values of each model.

Feature Importance

To enhance the interpretability of the model’s results, we employed the SHAP algorithm. It explains the model’s output, identifies features with significant predictive impacts, and visually shows features’ positive/negative impacts. Further, we utilize the SHAP algorithm to plot variable correlation graphs (as shown in Figure 5).²⁹ These charts include single-sample feature influence plots, feature distribution heat maps in the context of sample clustering, feature importance histograms, and feature density scatter plots. The SHAP value is used to measure the importance of a feature, and its algorithm relies on the XGBoost classification algorithm and provides an intrinsic measure of importance for each feature.²⁹

Figure 5 Single-sample feature impact map (A) heat map of feature distribution under sample clustering (B) histogram of feature importance (C) scatter plot of feature density (D).

Discussion

In this study, a prediction model for AKI after CABG was successfully developed based on machine learning algorithms. Through comprehensive analysis of preoperative and intraoperative data from 520 patients, LASSO regression was used for feature selection, combined with SHAP algorithm to interpret model output, and finally a variety of machine learning models including LR, Softmax, RF, LightGBM, XGBoost and SVM were constructed and validated. These models have demonstrated good feasibility and utility in predicting AKI after CABG.

AKI is a syndrome of sudden loss of renal excretion, usually accompanied by oliguria, that lasts from hours to days.³⁰ The pathogenesis is complex, involving multiple factors such as ischemia-reperfusion injury, surgical trauma, inflammation, and oxidation.^31,32 CHD is one of the manifestations of coronary systemic atherosclerosis, many patients with CHD also have renovascular disease, and hemodynamic instability and hypoperfusion syndrome reduce renal perfusion, increasing the risk of prerenal AKI, which may progress to nephrogenic AKI if left untreated.³³ Despite advances in renal replacement therapy, AKI is associated with poor outcomes and significantly affects surgical mortality, intensive care unit resources, and length of hospital stay.³⁴ Currently, there is a lack of widely accepted prediction models for AKI after cardiac surgery in China.

Traditional risk scoring models (such as Cardiac postoperative Argi score,³⁵ Cleveland Clinic score,³⁶ Mehta score,³⁷ simplified renal index score,³⁸ etc). It has been tried in clinical practice to predict the risk of AKI after cardiac surgery, but these models do not perform well in terms of discriminant power and correction effect, and lack sufficient convincing and broad applicability. Therefore, it is urgent to develop a risk prediction model for postoperative cardiac AKI that is suitable for clinical practice and has efficient predictive performance.

With the continuous progress of medical informatics, machine learning, as an important branch of artificial intelligence, has gradually become a promising tool in the field of clinical predictive models.^39,40 Although prediction models based on traditional statistics have been widely reported, machine learning models specifically for AKI after CABG have yet to be developed. Traditional statistical methods usually first establish the key relationships between variables and specific outcomes, and then link these variables by generating equations or functions, which makes prediction models based on traditional statistics highly interpretable. In contrast, machine learning methods assume that there is some meaningful relationship between the independent and dependent variables, and directly look for the path that best connects the two variables. Due to the natural advantage of machine learning algorithms in capturing nonlinear relationships, some cardiac surgeons are advocating the use of new machine learning-based models to predict AKI associated with cardiac surgery as an alternative to traditional clinical scoring tools.⁴¹ However, the algorithms produced by machine learning methods often have varying degrees of opacity and are often referred to as “black boxes”, which makes the model difficult to interpret and hinders the widespread application of machine learning in clinical practice to a certain extent.

In this study, six machine learning algorithms that excel in structured data classification tasks are selected, including XGBoost, LR, LightGBM, RF, Softmax, and SVM. As an ensemble learning technology, XGBoost is known for its high accuracy and fast processing capabilities, especially in high-dimensional data and missing values. The LR model facilitates the understanding of the model decision-making process due to its excellent interpretability. As an efficient linear model, LightGBM performs strongly in binary classification scenarios, and its storage efficiency is better than XGBoost, making it more advantageous in processing large datasets. RF is also an important approach in the field of ensemble learning, which is highly adaptable to high-dimensional data and missing values. Softmax is a generalized linear regression model for multi-categorical problems, which can convert the predictions of multiple categories into a form of probability distribution, which is well interpretable and logical for easy understanding and analysis. As a supervised learning algorithm, SVM has strong classification ability by finding the optimal hyperplane to separate data, especially when processing high-dimensional data, and can also effectively process nonlinear data through kernel techniques. These algorithms have been extensively validated in many practical applications, and although there are other algorithms to choose from, these six algorithms are ideal for building predictive models in this study due to their excellent performance and reliability.

In this study, we first applied the LASSO regression model to simplify 194 sets of clinical data. Subsequently, we determined the optimal parameters of the model through 10-fold cross-validation, and based on this, further screened out 31 key features with non-zero coefficients. This feature selection process not only helps machine learning models effectively eliminate invalid variables but also accurately captures the correlation between features and clinical outcomes, thereby significantly improving the predictive performance of the model. Moreover, the value of this method is not limited to model optimization, it also provides a new perspective for research on disease mechanisms, assisting clinicians in gaining a deeper understanding of the dynamic changes in patients’ conditions. LASSO regression is a penalty term-based variable selection method that reduces model complexity and improves prediction accuracy by compressing the coefficients of non-significant variables to filter features.³⁵ It performs both variable selection and complexity regularization when fitting generalized linear models, and is suitable for modeling and prediction of continuous, binary or multivariate discrete variables.³⁹ In addition, LASSO regression can effectively avoid overfitting by adjusting parameters to control model complexity.⁴⁰

Although LASSO regression can screen for characteristic variables, it cannot directly calculate the effect of characteristic variables on predicting clinical outcomes.³⁹ The SHAP algorithm, on the other hand, explains its contribution to the model prediction by calculating the SHapley value of each eigenvariable, thereby revealing how the eigenvariable affects the prediction outcome.⁴¹ In this study, the SHAP algorithm was used to analyze the key factors influencing the prediction of CAS-AKI by the prediction model, and it was found that eGFR, intraoperative epinephrine application, calcium ion, history of hypertension, and preoperative glucose-lowering drug use were the top five important variables. Several studies have confirmed that preoperative eGFR is an independent risk factor for AKI after CABG surgery.^6,42,43 Charat et al listed eGFR as an important risk factor for predicting AKI after cardiac surgery in their Machine learning model.⁴⁴ In this study, the eGFR value of the AKI group was 98.89 ± 23.14, while the eGFR of the non-AKI group was 108.59 ± 17.11, which was significantly different (p < 0.001). This highlights the importance of close preoperative monitoring of eGFR for the prevention of postoperative AKI in patients undergoing cardiopulmonary bypass cardiac surgery.^45–47 In addition, the SHAP algorithm also showed that the longer the duration of anesthesia, the more common postoperative AKI became. This may be because prolonged duration of general anesthesia can lead to poor blood supply to the kidneys, which increases the risk of impaired kidney function. Zhang et al also found that the risk of postoperative AKI increased significantly with prolonged anesthesia and decreased intraoperative urine output during liver transplantation.⁴⁸ This suggests that prolonged anesthesia and increased anesthetic medication may prolong renal artery constriction, which in turn exacerbates renal injury leading to AKI. However, the specific effect of anesthesia timing and intraoperative urine output on other procedure-related AKI requires further study.

One of the great strengths of this study is that the use of SHAP values unravels the “black box” of machine learning models. Although traditional risk scoring models have identified risk factors such as preoperative HGB, preoperative renal function, age, duration of surgery, left ventricular ejection fraction, body mass index, and hypertension,^{36–38,49–52} insufficient attention has been paid to important intraoperative factors such as intraoperative urine output, intravenous fluid volume, blood product transfusion, and dynamic changes in hemodynamic features. Notably, pulmonary hypertension (PAH) is listed as a risk factor, PAH is an important risk factor for AKI after transcatheter aortic valve implantation,⁵³ and is strongly associated with right heart function,⁵⁴ and right heart failure has been reported to be associated with severe AKI after cardiac surgery,⁵⁵ with PAH being one of the top five risk factors, providing a new perspective for clinical decision-making. And in our study, in addition to some traditional risk factors (such as diabetes, etc.), we also screened a number of intraoperative related risk factors (such as intraoperative urine output, intraoperative epinephrine application, intraoperative mechanical blood transfusion, etc). Hemodynamic changes, blood product transfusion, intraoperative fluid replacement, and intraoperative urine output all reflect acute responses to renal hypoperfusion and corresponding management measures. These factors, combined with the patient’s preoperative health condition susceptibility to acute stress and the large intraoperative dynamic physiological response, determine the patient’s overall response to surgery. Based on these findings, software can be developed in the future that can identify high-risk patients and thus optimize treatment strategies after cardiac surgery.

The model constructed in this study focuses on the early risk assessment of postoperative AKI in patients undergoing CABG. It can quantitatively predict the risk of AKI in patients based on key preoperative and intraoperative variables before typical clinical manifestations of renal dysfunction appear. The early risk warning capability of the model can provide a reference for the clinical formulation of individualized prevention strategies. It should also be noted that this model still has certain limitations: limited by the scope of variable selection and the model’s prediction logic, the current model can only assess the risk of AKI occurrence and cannot further distinguish the specific etiological types of AKI (eg, renal hypoperfusion caused by prerenal factors, renal parenchymal injury induced by intrinsic renal factors, etc). In clinical practice, there are significant differences in treatment protocols for AKI of different etiologies (for example, prerenal AKI requires prioritized improvement of circulatory perfusion, while intrinsic renal AKI may require adjustment of nephrotoxic drug use or targeted intervention for renal parenchymal injury). The lack of this etiology-distinguishing capability may to a certain extent affect the application value of the model in guiding clinical precise treatment. Future studies may consider incorporating more specific variables related to AKI etiology (such as biomarkers like kidney injury molecule-1 and neutrophil gelatinase-associated lipocalin) and optimizing the model structure by combining multi-modal data (eg, imaging examinations, pathological indicators) to improve the model’s ability to identify AKI etiologies and further expand its clinical application scenarios.

In the field of risk stratification for patients undergoing coronary artery surgery, several risk models have been developed for identifying high-risk individuals and guiding individualized treatment in patients after CABG, with the PRECISE-HBR score being a typical example.⁵⁶ Originally created to address the need for bleeding risk assessment after percutaneous coronary intervention (PCI), this score quantifies the risk of major bleeding events in PCI patients within 1 year using 7 easily accessible clinical indicators, providing a reference for formulating postoperative antiplatelet therapy regimens and monitoring bleeding.⁵⁷ Its advantages lie in being developed based on multicenter real-world data and clinical trials (with a derivation cohort of nearly 30,000 patients) and validated in multiple external cohorts, resulting in a large sample size, broad population coverage, higher generalizability and reliability. Additionally, its simplified set of variables eliminates the need for complex testing, making it convenient for clinical use. By contrast, the model constructed in this study for predicting AKI after CABG adopts machine learning algorithms combined with LASSO variable selection. Compared with the linear regression relied on by traditional scoring systems, it can more accurately capture non-linear relationships and interactions between variables, leading to superior predictive performance (eg, in terms of AUC value). However, the current model is based on single-center retrospective data, with limited sample size and population diversity, which may affect its external validity. In the future, we should draw on the development experience of the PRECISE-HBR score to expand the sample size, incorporate multicenter prospective data, supplement AKI-related specific indicators, and improve the model’s interpretability. This will allow us to balance predictive accuracy with clinical convenience, ultimately achieving precise assessment of AKI after CABG and facilitating individualized treatment in the future.

The current study has several limitations. Firstly, the sample size is relatively small. This may weaken the statistical power of results and limit the model’s generalizability to broader CABG populations. As a retrospective study, it also carries potential selection bias (eg, excluding patients with incomplete records or lost to follow-up, leading to a sample that may not fully represent the overall cohort). Secondly, the machine learning model has weak interpretability and a typical “black box” characteristic: while it can provide accurate risk predictions, its internal logic (eg, how variables interact to affect outcomes or the weight of key factors) remains unclear, which may reduce clinicians’ trust in its recommendations and hinder exploration of AKI-related mechanisms. Thirdly, this model is currently unable to distinguish between different etiologies of AKI. Future studies will optimize the model by incorporating etiology-specific variables and multimodal data to enhance its ability to identify AKI etiologies.

Conclusion

The machine learning models constructed in this study can effectively predict post-CABG AKI, facilitating early identification of high-risk patients, which may further support clinicians in formulating personalized perioperative management strategies to reduce the risk of AKI.

Funding

This study was supported by Key Project of Jiangsu Provincial Health Commission (K2024014), Construction Project of High-Level Hospital of Jiangsu Province (GSPJS202404, GSPJS202518), Changzhou High Level Medical Talents Training Project (2022CZBJ054), Postgraduate Practice Innovation Program of Jiangsu Province (SJCX24_0748), Young Talents Program of Wuxi Medical Center to Nanjing Medical University (WMCQ202403), General Projects of Wuxi Medical Center to Nanjing Medical University (WMCG202408), Clinical Research Project of Wuxi Medical Center to Nanjing Medical University (WMCC202505) and Changzhou Talent Program (CQ20240067).

Disclosure

The authors report no conflicts of interest in this work.

References

1. Stefanos T, Li S, Abbafati C, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet. 2020;396(10258):1204–1222. doi:10.1016/s0140-6736(20)30925-9

2. Rodriguez F, Mahaffey KW. Management of patients with NSTE-ACS: a comparison of the recent AHA/ACC and ESC guidelines. J Am College Cardiol. 2016;68(3):313–321. doi:10.1016/j.jacc.2016.03.599

3. Baumgartner H, Falk V, Bax JJ, et al. 2017 ESC/EACTS guidelines for the management of valvular heart disease. Eur Heart J. 2017;38(36):2739–2791. doi:10.1093/eurheartj/ehx391

4. Bell J, Sartipy U, Holzmann MJ, Hertzberg D. The association between acute kidney injury and mortality after coronary artery bypass grafting was similar in women and men. J Cardiothoracic Vascular Anesthesia. 2022;36(4):962–970. doi:10.1053/j.jvca.2021.11.036

5. Chan MJ, Lee CC, Chen SW, et al. Effect of different surgical type of coronary artery bypass grafting on kidney injury: a propensity score analysis. Medicine. 2017;96(45):e8395. doi:10.1097/md.0000000000008395

6. Yue Z, Yan-Meng G, Ji-Zhuang L. Prediction model for acute kidney injury after coronary artery bypass grafting: a retrospective study. Int Urol Nephrol. 2019;51(9):1605–1611. doi:10.1007/s11255-019-02173-7

7. Eckardt KU, Kasiske BL. Kidney disease: improving global outcomes. Nat Rev Nephrol. 2009;5(11):650–657. doi:10.1038/nrneph.2009.153

8. Hu J, Chen R, Liu S, Yu X, Zou J, Ding X. Global incidence and outcomes of adult patients with acute kidney injury after cardiac surgery: a systematic review and meta-analysis. J Cardiothorac Vasc Anesth. 2016;30(1):82–89. doi:10.1053/j.jvca.2015.06.017

9. Vandenberghe W, Gevaert S, Kellum JA, et al. Acute kidney injury in cardiorenal syndrome type 1 patients: a systematic review and meta-analysis. Cardiorenal Med. 2016;6(2):116–128. doi:10.1159/000442300

10. Wang Y, Bellomo R. Cardiac surgery-associated acute kidney injury: risk factors, pathophysiology and treatment. Nat Rev Nephrol. 2017;13(11):697–711. doi:10.1038/nrneph.2017.119

11. Cummings JJ, Shaw AD, Shi J, Lopez MG, O’Neal JB, Billings FTT. Intraoperative prediction of cardiac surgery-associated acute kidney injury using urinary biomarkers of cell cycle arrest. J Thoracic Cardiovasc Surg. 2019;157(4):1545–1553.e5. doi:10.1016/j.jtcvs.2018.08.090

12. Ortega-Loubon C, Fernández-Molina M, Carrascal-Hinojal Y, Fulquet-Carreras E. Cardiac surgery-associated acute kidney injury. Ann Card Anaesth. 2016;19(4):687–698. doi:10.4103/0971-9784.191578

13. Hobson CE, Yavas S, Segal MS, et al. Acute kidney injury is associated with increased long-term mortality after cardiothoracic surgery. Circulation. 2009;119(18):2444–2453. doi:10.1161/circulationaha.108.800011

14. Bove T, Monaco F, Covello RD, Zangrillo A. Acute renal failure and cardiac surgery. HSR Proc Intensive Care Cardiovasc Anesth. 2009;1(3):13–21.

15. Zhou B, Ao Q, Zhao H, Ye P. Rosuvastatin alleviates renal injury in cardiorenal syndrome model rats through anti-inflammatory and antioxidant pathways. Emergency Critical Care Med. 2022;2(4):203–208. doi:10.1097/ec9.0000000000000049

16. Jordan MI, Mitchell TM. Machine learning: trends, perspectives, and prospects. Science. 2015;349(6245):255–260. doi:10.1126/science.aaa8415

17. Jiang F, Jiang Y, Zhi H, et al. Artificial intelligence in healthcare: past, present and future. Stroke Vasc Neurol. 2017;2(4):230–243. doi:10.1136/svn-2017-000101

18. Rajkomar A, Dean J, Kohane I. Machine learning in medicine. N Engl J Med. 2019;380(14):1347–1358. doi:10.1056/NEJMra1814259

19. Kamińska JA. A random forest partition model for predicting NO(2) concentrations from traffic flow and meteorological conditions. Sci Total Environ. 2019;651(Pt 1):475–483. doi:10.1016/j.scitotenv.2018.09.196

20. Rufo DD, Debelee TG, Ibenthal A, Negera WG. Diagnosis of diabetes mellitus using gradient boosting machine (LightGBM). Diagnostics. 2021;11(9). doi:10.3390/diagnostics11091714

21. She X, Zhu Y. Text classification research based on improved softmax regression algorithm. 2018;273–276.

22. Bhosale H, Ramakrishnan V, Jayaraman VK. Support vector machine-based prediction of pore-forming toxins (PFT) using distributed representation of reduced alphabets. J Bioinform Comput Biol. 2021;19(5):2150028. doi:10.1142/s0219720021500281

23. Delanaye P, Mariat C. The applicability of eGFR equations to different populations. Nat Rev Nephrol. 2013;9(9):513–522. doi:10.1038/nrneph.2013.143

24. Khwaja A. KDIGO clinical practice guidelines for acute kidney injury. Nephron Clin Pract. 2012;120(4):c179–84. doi:10.1159/000339789

25. Thomas ME, Blaine C, Dawnay A, et al. The definition of acute kidney injury and its use in practice. Kidney Int. 2015;87(1):62–73. doi:10.1038/ki.2014.328

26. Sun L, Zhu W, Chen X, et al. Machine learning to predict contrast-induced acute kidney injury in patients with acute myocardial infarction. Front Med Lausanne. 2020;7:592007. doi:10.3389/fmed.2020.592007

27. Kim D, Antariksa G, Handayani MP, Lee S, Lee J. Explainable anomaly detection framework for maritime main engine sensor data. Sensors. 2021;21(15). doi:10.3390/s21155200

28. Wojtuch A, Jankowski R, Podlewska S. How can SHAP values help to shape metabolic stability of chemical compounds? J Cheminform. 2021;13(1):74. doi:10.1186/s13321-021-00542-y

29. Yang M, Liu C, Wang X, et al. An explainable artificial intelligence predictor for early detection of sepsis. Crit Care Med. 2020;48(11):e1091–e1096. doi:10.1097/ccm.0000000000004550

30. Goyal A, Daneshpajouhnejad P, Hashmi MF, Bashir K. Acute Kidney Injury. In: StatPearls. StatPearls Publishing LLC; 2025.

31. Kramer RS, Herron CR, Groom RC, Brown JR. Acute kidney injury subsequent to cardiac surgery. J Extra Corpor Technol. 2015;47(1):16–28.

32. Takaki S, Shehabi Y, Pickering JW, Endre Z, Miyashita T, Goto T. Perioperative change in creatinine following cardiac surgery with cardiopulmonary bypass is useful in predicting acute kidney injury: a single-centre retrospective cohort study. Interact Cardiovasc Thorac Surg. 2015;21(4):465–469. doi:10.1093/icvts/ivv184

33. Warren J, Mehran R, Baber U, et al. Incidence and impact of acute kidney injury in patients with acute coronary syndromes treated with coronary artery bypass grafting: insights from the harmonizing outcomes with revascularization and stents in acute myocardial infarction (HORIZONS-AMI) and acute catheterization and urgent intervention triage strategy (ACUITY) trials. Am Heart J. 2016;171(1):40–47. doi:10.1016/j.ahj.2015.07.001

34. Oh TK, Song IA. Postoperative acute kidney injury requiring continuous renal replacement therapy and outcomes after coronary artery bypass grafting: a nationwide cohort study. J Cardiothorac Surg. 2021;16(1):315. doi:10.1186/s13019-021-01704-7

35. Jiang W, Xu J, Shen B, Wang C, Teng J, Ding X. Validation of four prediction scores for cardiac surgery-associated acute kidney injury in chinese patients. Braz J Cardiovasc Surg. 2017;32(6):481–486. doi:10.21470/1678-9741-2017-0116

36. Thakar CV, Arrigain S, Worley S, Yared JP, Paganini EP. A clinical score to predict acute renal failure after cardiac surgery. J Am Soc Nephrol. 2005;16(1):162–168. doi:10.1681/asn.2004040331

37. Mehta RH, Grab JD, O’Brien SM, et al. Bedside tool for predicting the risk of postoperative dialysis in patients undergoing cardiac surgery. Circulation. 2006;114(21):2208–16;quiz2208. doi:10.1161/circulationaha.106.635573

38. Wijeysundera DN, Karkouti K, Dupuis JY, et al. Derivation and validation of a simplified predictive index for renal replacement therapy after cardiac surgery. JAMA. 2007;297(16):1801–1809. doi:10.1001/jama.297.16.1801

39. Kresoja KP, Unterhuber M, Wachter R, Thiele H, Lurz P. A cardiologist’s guide to machine learning in cardiovascular disease prognosis prediction. Basic Res Cardiol. 2023;118(1):10. doi:10.1007/s00395-023-00982-7

40. Douglas MJ, Callcut R, Celi LA, Merchant N. Interpretation and use of applied/operational machine learning and artificial intelligence in surgery. Surg Clin North Am. 2023;103(2):317–333. doi:10.1016/j.suc.2022.11.004

41. Penny-Dimri JC, Bergmeir C, Reid CM, Williams-Spence J, Cochrane AD, Smith JA. Machine learning algorithms for predicting and risk profiling of cardiac surgery-associated acute kidney injury. Semin Thorac Cardiovasc Surg. 2021;33(3):735–745. doi:10.1053/j.semtcvs.2020.09.028

42. Seelhammer TG, Maile MD, Heung M, Haft JW, Jewell ES, Engoren M. Kinetic estimated glomerular filtration rate and acute kidney injury in cardiac surgery patients. J Crit Care. 2016;31(1):249–254. doi:10.1016/j.jcrc.2015.11.006

43. Ortega-Loubon C, Fernández-Molina M, Pañeda-Delgado L, Jorge-Monjas P, Carrascal Y. Predictors of postoperative acute kidney injury after coronary artery bypass graft surgery. Braz J Cardiovasc Surg. 2018;33(4):323–329. doi:10.21470/1678-9741-2017-0251

44. Thongprayoon C, Pattharanitima P, Kattah AG, et al. Explainable preoperative automated machine learning prediction model for cardiac surgery-associated acute kidney injury. J Clin Med. 2022;11(21). doi:10.3390/jcm11216264

45. Bai L, Jin Y, Zhang P, et al. Risk factors and outcomes associated with acute kidney injury following extracardiac total cavopulmonary connection: a retrospective observational study. Transl Pediatr. 2022;11(6):848–858. doi:10.21037/tp-21-474

46. Wang M, Xu X, Wu S, et al. Risk factors for ventilator-associated pneumonia due to multi-drug resistant organisms after cardiac surgery in adults. BMC Cardiovasc Disord. 2022;22(1):465. doi:10.1186/s12872-022-02890-5

47. Meersch M, Zarbock A. Prevention of cardiac surgery-associated acute kidney injury. Curr Opinion Anaesthesiol. 2017;30(1):76–83. doi:10.1097/aco.0000000000000392

48. Zhang Y, Yang D, Liu Z, et al. An explainable supervised machine learning predictor of acute kidney injury after adult deceased donor liver transplantation. J Transl Med. 2021;19(1):321. doi:10.1186/s12967-021-02990-4

49. Chertow GM, Lazarus JM, Christiansen CL, et al. Preoperative renal risk stratification. Circulation. 1997;95(4):878–884. doi:10.1161/01.cir.95.4.878

50. Aronson S, Fontes ML, Miao Y, Mangano DT. Risk index for perioperative renal dysfunction/failure: critical dependence on pulse pressure hypertension. Circulation. 2007;115(6):733–742. doi:10.1161/circulationaha.106.623538

51. Palomba H, de Castro I, Neto AL, Lage S, Yu L. Acute kidney injury prediction following elective cardiac surgery: AKICS score. Kidney Int. 2007;72(5):624–631. doi:10.1038/sj.ki.5002419

52. Brown JR, Cochran RP, Leavitt BJ, et al. Multivariable prediction of renal insufficiency developing after cardiac surgery. Circulation. 2007;116(11 Suppl):I139–43. doi:10.1161/circulationaha.106.677070

53. Tang M, Liu X, Lin C, et al. Meta-analysis of outcomes and evolution of pulmonary hypertension before and after transcatheter aortic valve implantation. Am J Cardiol. 2017;119(1):91–99. doi:10.1016/j.amjcard.2016.09.015

54. Gudsoorkar PS, Thakar CV. Acute kidney injury, heart failure, and health outcomes. Cardiol Clin. 2019;37(3):297–305. doi:10.1016/j.ccl.2019.04.005

55. Bianco JC, Stang MV, Denault AY, Marenchino RG, Belziti CA, Musso CG. acute kidney injury after heart transplant: the importance of pulmonary hypertension. J Cardiothorac Vasc Anesth. 2021;35(7):2052–2062. doi:10.1053/j.jvca.2020.12.010

56. Gragnano F, van Klaveren D, Heg D, et al. Derivation and validation of the PRECISE-HBR score to predict bleeding after percutaneous coronary intervention. Circulation. 2025;151(6):343–355. doi:10.1161/circulationaha.124.072009

57. Chuliang Hong, Sicheng Chen and Tianyu Hu, et al. Prognostic Value of Optical Flow Ratio among Patients with Coronary Artery Disease after Percutaneous Coronary Treatment: A Hospital-Based Retrospective Cohort Investigation. CVIA. 2024. Vol. 9(1). DOI: 10.15212/CVIA.2024.0012

Creative Commons License © 2025 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms and incorporate the Creative Commons Attribution - Non Commercial (unported, 4.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.