Development and Validation of a Machine Learning-Based Individualized Model to Predict TKI Benefit in Postoperative Recurrent HCC

Meng Li; Yumin Jiang; Lin Gong; Ruitao Sun; Aoyun Hao; Kun Liu; Kunye Luan; Zheng Jia; Shimei Zhang; Kui Liu; Bin Tan; Weiyu Hu; Chao Qu; Jingyu Cao

doi:10.2147/JHC.S599142

Back to Journals » Journal of Hepatocellular Carcinoma » Volume 13

Original Research

Development and Validation of a Machine Learning-Based Individualized Model to Predict TKI Benefit in Postoperative Recurrent HCC

Authors Li M , Jiang Y, Gong L, Sun R, Hao A, Liu K, Luan K, Jia Z, Zhang S, Liu K, Tan B, Hu W, Qu C, Cao J

Received 20 February 2026

Accepted for publication 3 June 2026

Published 16 June 2026 Volume 2026:13 599142

DOI https://doi.org/10.2147/JHC.S599142

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Prof. Dr. Imam Waked

Download Article [PDF]

Meng Li,^1,² Yumin Jiang,¹ Lin Gong,² Ruitao Sun,¹ Aoyun Hao,¹ Kun Liu,² Kunye Luan,² Zheng Jia,¹ Shimei Zhang,¹ Kui Liu,¹ Bin Tan,¹ Weiyu Hu,¹ Chao Qu,¹ Jingyu Cao¹

¹Department of Hepatobiliary and Pancreatic Surgery, Affiliated Hospital of Qingdao University, Qingdao, Shandong, People’s Republic of China; ²Department of Hepatobiliary and Pancreatic Surgery, No. 971 Hospital of the People’s Liberation Army Navy, Qingdao, Shandong, People’s Republic of China

Correspondence: Chao Qu; Jingyu Cao, Department of Hepatobiliary and Pancreatic Surgery, Affiliated Hospital of Qingdao University, Qingdao, Shandong, People’s Republic of China, Email [email protected]; [email protected]

Background: Postoperative recurrence is a major contributor to poor prognosis in hepatocellular carcinoma (HCC) after curative resection. Tyrosine kinase inhibitors (TKIs) are commonly used for recurrent HCC, yet substantial interpatient heterogeneity limits their universal benefit, and reliable tools to individualize post-recurrence therapy are lacking.
Methods: We retrospectively analyzed 454 patients with recurrent HCC following curative resection. Overlap weighting based on propensity scores was applied to balance baseline characteristics between TKI-treated and non-TKI-treated groups and to assess the survival benefit of TKI therapy. Machine learning-based survival models were independently developed for each treatment cohort using multiple algorithms and internally validated. A counterfactual inference framework was implemented to estimate individualized survival outcomes under alternative treatment strategies and to identify the optimal therapy for each patient.
Results: After overlap weighting, TKI therapy was associated with improved survival compared with non-TKI treatment, including longer recurrence-free survival (median, 24 vs. 12 months; HR = 0.355, P < 0.001) and overall survival (median, 36 vs. 18 months; HR = 0.486, P = 0.001). In the non-TKI cohort, an eight-feature survival support vector machine (Surv-SVM) model achieved strong predictive performance (C-index: 0.796 in training, 0.766 in validation). In the TKI cohort, a four-feature random survival forest (RSF) model showed the highest discrimination (C-index: 0.856 and 0.838). Both models demonstrated good calibration and stable time-dependent performance. Counterfactual analysis revealed potential treatment mismatch in 23.4% of patients, including 20.7% who received non-TKI therapy but were predicted to benefit more from TKI treatment.
Conclusion: TKI therapy provides significant survival benefit in recurrent HCC after curative resection. Our machine learning-based, counterfactual framework enables individualized estimation of treatment benefit, supporting personalized post-recurrence therapy and facilitating precision management in this high-risk population.

Keywords: hepatocellular carcinoma, postoperative recurrence, tyrosine kinase inhibitor, machine learning, counterfactual reasoning

Introduction

HCC remains a leading cause of cancer-related mortality worldwide.¹ Curative resection is the primary treatment for early-stage HCC; however, postoperative recurrence is common and represents a major determinant of long-term survival.² Prognosis after recurrence is generally poor, making the selection of optimal post-recurrence therapy a critical and complex clinical decision.

TKIs, including sorafenib and lenvatinib, have demonstrated significant survival benefits in advanced HCC and are established as standard systemic therapies.^3–5By targeting key pathways involved in angiogenesis and tumor proliferation, these agents delay disease progression and improve overall survival. Their efficacy has prompted increasing use in the management of recurrent HCC following curative resection. Nevertheless, evidence regarding the survival benefit of TKIs specifically in the post-recurrence setting remains inconclusive, with retrospective studies reporting inconsistent outcomes and no definitive consensus.^6,7

Importantly, real-world data indicate that not all patients with recurrent HCC derive substantial benefit from TKI therapy.^8,9 Heterogeneity in tumor biology, liver function, and treatment tolerance contributes to variable responses. Recent studies have identified multiple molecular markers and gene signatures associated with HCC prognosis, reflecting the complex biological heterogeneity of this disease.^10,11 Consequently, indiscriminate use of TKIs may expose some patients to unnecessary toxicity and financial burden without improving survival. These observations highlight the urgent need for strategies to reliably identify individuals most likely to benefit from TKI therapy after recurrence.

A central challenge is the absence of robust predictive frameworks to guide individualized therapy selection. In current practice, post-recurrence treatment decisions are often empirical, based on physician experience or resource availability rather than evidence-based estimates of personalized benefit, which may lead to suboptimal outcomes. To address this gap, our study was designed with sequential aims: first, to evaluate the survival benefit of TKI therapy in recurrent HCC after curative resection using propensity score-based methods to control for confounding; second, to develop treatment-specific prognostic models for TKI and non-TKI cohorts employing multiple machine learning algorithms; and third, to integrate counterfactual inference to quantify individualized survival outcomes under alternative treatment strategies, thereby identifying the optimal therapeutic approach for each patient. This framework aims to provide objective, data-driven support for post-recurrence decision-making and to advance precision medicine for this high-risk population.

Material and Methods

Patients

This retrospective cohort study included patients with HCC who underwent curative resection and subsequently developed postoperative recurrence at Qingdao University Affiliated Hospital between January 2015 and December 2023.

Inclusion criteria were as follows: (1) age ≥ 18 years with no severe comorbidities or major organ dysfunction; (2) initial diagnosis of primary HCC confirmed by histopathology or non-invasive imaging criteria (multiphase CT or MRI); (3) imaging-confirmed postoperative recurrence, including local recurrence or distant metastasis; (4) availability of complete follow-up records, including detailed clinical and laboratory data; (5) receipt of TKI therapy following recurrence, with documented medication regimen, treatment duration, and response assessment.

Exclusion criteria included: (1) history of other concurrent malignancies; (2) receipt of palliative or non-radical surgery; (3) severe coagulopathy (International Normalized Ratio > 2.0) or active bleeding tendency; (4) Child-Pugh grade C liver failure or dysfunction of other vital organs; (5) loss to follow-up or interruption of planned therapy; (6) incomplete medical records precluding comprehensive analysis.

This study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Qingdao University Affiliated Hospital. Written informed consent was obtained from all participants. Based on these criteria, a total of 454 eligible patients were included in the final analysis.

Data Collection

Baseline demographic, clinical, pathological, and laboratory data were collected. Demographic and clinical variables included age, sex, and hepatitis B virus (HBV) infection status. Primary tumor characteristics encompassed tumor size, number, histological differentiation, and the presence of microvascular or macrovascular invasion. Recurrent tumor characteristics, including the size and number of recurrent lesions, were also recorded.

Laboratory parameters included preoperative complete blood counts and liver function tests. Systemic inflammation-nutrition indices—such as neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio (PLR), and lymphocyte-to-monocyte ratio (LMR)—were derived from these data.

Continuous clinical variables were handled according to their distribution and clinical interpretability. Specifically, several continuous variables were categorized based on clinically meaningful thresholds and are presented as categorical variables in Tables 1 and 2. This approach follows standard practices in clinical survival modeling and ensures consistency between model inputs and the tabulated clinical data. The primary endpoint was overall survival (OS) after recurrence, defined as the interval from the date of first confirmed recurrence to death from any cause or last follow-up. The final follow-up date was August 31, 2025.

Table 1 Baseline Characteristics of the Targeted Therapy and Non-Targeted Therapy Groups

Table 2 Baseline Characteristics Before and After Propensity Score Overlap Weighting

Statistical Analysis

All statistical analyses were conducted using R software (version 4.5.0; R Foundation for Statistical Computing). Continuous variables are presented as mean ± standard deviation or median (interquartile range), and categorical variables as counts (percentages). Group comparisons were performed using Student’s t-test, Mann–Whitney U-test, chi-square test, or Fisher’s exact test, as appropriate. Missing data were minimal in our dataset. For variables with occasional missing values, we applied multiple imputation based on chained equations, following established protocols in previous clinical studies. This approach ensured data completeness and maintained the statistical integrity of the analyses. Variables with a P-value < 0.05 were considered statistically significant.

Overlap Weighting Based on Propensity Scores Matching

To mitigate potential confounding bias, overlap weighting based on propensity scores was applied. Propensity scores, representing the probability of receiving TKI therapy, were estimated for each patient using a logistic regression model incorporating all baseline covariates. Overlap weights were then assigned to create a balanced pseudo-population.

Balance between weighted groups was assessed using standardized mean differences (SMDs), with SMD < 0.1 indicating adequate balance. Survival outcomes in the weighted cohort were compared using the weighted Kaplan-Meier method and weighted Cox proportional hazards models.

Development and Validation of Machine Learning Prognostic Models

Prognostic models were developed separately for TKI-treated and non-TKI cohorts. Although overlap weighting was applied to assess the causal effect of TKI therapy on post-recurrence outcomes, the prognostic models for individualized survival prediction were trained on the original, unweighted cohort. This approach was chosen to capture the full variability of baseline characteristics and treatment heterogeneity, and to ensure that the models can be applied to real-world patients receiving diverse post-recurrence treatments.

Feature Selection

Within a 5-fold cross-validation framework, the concordance index (C-index) was used to evaluate and compare the performance of three feature selection methods: stepwise Cox regression, RSF, and LASSO-Cox regression. The optimal feature set for each cohort was then selected.

Model Construction and Training

Using the selected features, multiple survival prediction models were constructed, including RSF, elastic net Cox regression, gradient boosting machine (GBM), and Surv-SVM. Predictive performance and stability were evaluated and compared based on the C-index distribution across cross-validation folds.

RSF model was constructed using 300 survival trees (n_estimators=300). The minimum number of samples required in terminal nodes was set to 5 (min_samples_leaf=5). At each split, the square root of the total number of variables was randomly selected as candidate features (max_features=“sqrt”). Parallel computation was enabled using all available CPU cores (n_jobs=−1). Surv-SVM was performed using a regularization parameter of alpha=1×10⁻⁴ and a hybrid ranking–regression setting with rank_ratio=0.5. The relatively small regularization strength was selected to preserve predictive information from high-dimensional features, while the hybrid optimization strategy allowed simultaneous consideration of survival risk ranking and survival time prediction under censored survival conditions. This parameter setting was considered suitable for the characteristics of the present study, including relatively small sample size, high-dimensional feature space, and survival outcomes with censoring. Random seed was fixed at 42 to ensure reproducibility.

Model Evaluation

Validated models were assessed using multiple approaches: (1) risk stratification ability via Kaplan-Meier analysis with Log rank test; (2) predictive discrimination over time using time-dependent area under the receiver operating characteristic curve (AUC); (3) prediction accuracy measured by time-dependent Brier scores and calibration curves; (4) feature importance interpreted at the individual level using SHapley Additive exPlanations (SHAP).

Counterfactual Framework for Treatment Strategy Matching Analysis

A counterfactual prediction framework was implemented to analyze personalized treatment matching. For each patient, the two validated cohort-specific models were used to predict 24-month recurrence risk under two counterfactual scenarios: receiving TKI therapy or not. The absolute difference in predicted risk between scenarios was calculated, with a risk difference > 0.05 predefined as clinically significant for treatment recommendation.¹²

Patients were subsequently classified into four decision quadrants based on actual treatment received versus model-recommended treatment (lower predicted risk). This analysis quantified alignment between real-world treatment and model-based recommendations, identifying potential mismatches and providing evidence for individualized clinical decision-making.

Results

Baseline Characteristics of the Study Cohort

A total of 454 patients with recurrent HCC treated at Qingdao University Affiliated Hospital between January 2015 and December 2023 were included (Figure 1). The cohort comprised 387 males (85.2%), and 253 patients (55.7%) were younger than 60 years at the time of surgery. Based on post-recurrence treatment, patients were categorized into a non- TKI group (n = 307) and a TKI therapy group (n = 147). Detailed baseline characteristics are summarized in Table 1. Intergroup comparisons revealed significant differences in lymphocyte count (P = 0.048), LMR (P = 0.015), history of preoperative interventional therapy, HBV infection status, maximum preoperative tumor diameter, tumor differentiation, and site of first recurrence (all P < 0.05). Other variables showed no significant differences (all P > 0.05).

Figure 1 Flowchart of patient selection, matching, and modeling procedures.

Survival Outcomes After Overlap Weighting Adjustment

After applying overlap weighting, propensity score distributions were well-balanced, resulting in a weighted cohort of 54 patients in the TKI therapy group and 105 in the non- TKI group (approximately 1:2 ratio). Covariate balance assessment (Table 2) confirmed that previously imbalanced variables were adequately balanced after weighting (all P > 0.05). Overlapping probability density curves (Figure 2A) and standardized mean differences below 0.1 for all covariates (Figure 2B) demonstrated successful covariate adjustment.

Figure 2 Evaluation of cohort balance and survival outcomes after propensity score overlap weighting. (A) Density distribution of propensity scores in the TKI-TKI therapy and non-TKI therapy groups after weighting. (B) Standardized mean differences (SMD) for all baseline covariates between the TKI-TKI therapy and non-TKI therapy groups after weighting. (C) Kaplan–Meier curves for recurrence-free survival (RFS) in the TKI-TKI therapy and non-TKI therapy groups. (D) Kaplan–Meier curves for overall survival (OS) in the TKI-TKI therapy and non-TKI therapy groups.

Survival analysis showed significantly improved outcomes in the TKI therapy group. Median recurrence-free survival (RFS) was 24 months versus 12 months in the non-targeted group (HR = 0.355; 95% CI: 0.238–0.528; P < 0.001; Figure 2C). Median OS was 36 months versus 18 months (HR = 0.486; 95% CI: 0.311–0.759; P = 0.001; Figure 2D), indicating that, after adjustment for confounding, TKI therapy significantly prolongs both RFS and OS.

Prognostic Model for the Non- TKI Subgroup

Feature Selection

Patients in the non-TKI group were randomly split into training (n = 214) and validation (n = 93) sets in a 7:3 ratio. The results of univariate and multivariate Cox regression analyses for the non-TKI subgroups are summarized in Table 3. Among stepwise Cox regression, RSF, and LASSO-Cox methods, stepwise Cox regression yielded the optimal eight-feature set: surgery/ablation after recurrence, number of recurrent tumors, interventional therapy after recurrence, maximum preoperative tumor diameter, AFP level at recurrence, satellite nodules, HBV infection status, and nerve invasion.

Table 3 Univariate and Multivariate Cox Regression Analyses of Clinical Factors Associated with Recurrence in No Targeted Therapy Patients with HCC

Algorithm Comparison and Model Construction

Using this feature set, four algorithms—RSF, Elastic Net Cox, GBM, and Surv-SVM —were compared (Figure 3A). Five-fold cross-validation (Figure 3B) identified Surv-SVM as having the highest and most stable median C-index (0.79). Comprehensive evaluation (Figure 3C) confirmed its superior average C-index (0.783) and 24-month time-dependent AUC (0.875), establishing it as the final model. Feature importance ranking (Figure 3D) highlighted surgery/ablation after recurrence, number of recurrent tumors, and interventional therapy after recurrence as the top predictors.

Figure 3 Development and evaluation of the prognostic prediction model for the non-TKI therapy cohort. (A) Comparison of three feature selection methods: stepwise Cox regression, random survival forest (RSF), and LASSO-Cox regression. (B) Distribution of the concordance index (C-index) across five-fold cross-validation for the four candidate prediction algorithms. (C) Performance comparison of the four machine learning algorithms based on comprehensive evaluation metrics. (D) Ranking of the importance of the selected predictive features in the final model.

Model Validation and Evaluation

Kaplan-Meier survival analysis (Figure 4A and B) demonstrated significant OS separation between low- and high-risk groups in both training (C-index = 0.796, P < 0.001) and validation sets (C-index = 0.766, P < 0.001). Time-dependent AUC remained above 0.82 throughout the 70-month follow-up (Figure 4C), indicating stable discrimination. Time-dependent Brier scores (Figure 4D) suggested optimal calibration during mid-follow-up, with overlapping curves for training and validation sets. The 24-month calibration curve (Figure 4E and F) further validated predictive accuracy. SHAP analysis (Figure 4G) identified surgery/ablation or interventional therapy post-recurrence as the strongest protective factors, whereas an increased number of recurrent tumors was the primary risk factor.

Figure 4 Validation and interpretation of the prognostic model for the non-TKI therapy cohort. (A) Kaplan-Meier overall survival (OS) curves for the low- and high-risk groups in the training set. (B) Kaplan-Meier overall survival (OS) curves for the low- and high-risk groups in the validation set. (C) Time-dependent area under the curve (AUC) for the model over the 70-month follow-up period. (D) Time-dependent Brier score for assessing the calibration of the model. (E-F) Calibration plots comparing predicted and observed survival probability at 24 months. (G) SHapley Additive exPlanations (SHAP) summary plot showing the impact of key features on model output.

Prognostic Model for the TKI Therapy Subgroup

Feature Selection

The TKI therapy cohort was split into training (n = 102) and validation (n = 45) sets in a 7:3 ratio. The results of univariate and multivariate Cox regression analyses for the TKI therapy subgroups are summarized in Table 4. Stepwise Cox regression, RSF, and LASSO-Cox methods were used for feature selection. RSF identified four key predictors: maximum tumor diameter at recurrence, surgery/ablation after recurrence, platelet count, and number of recurrent tumors (Figure 5A).

Table 4 Univariate and Multivariate Cox Regression Analyses of Clinical Factors Associated with Recurrence in Targeted Therapy Patients with HCC

Figure 5 Development and evaluation of the prognostic prediction model for the TKI therapy cohort. (A) Comparison of three feature selection methods: stepwise Cox regression, random survival forest (RSF), and LASSO-Cox regression. (B) Distribution of the concordance index (C-index) across five-fold cross-validation for the four candidate prediction algorithms. (C) Performance comparison of the four machine learning algorithms based on comprehensive evaluation metrics. (D) Ranking of the importance of the selected predictive features in the final model.

Algorithm Comparison and Model Construction

Model comparison using these variables in five-fold cross-validation (Figure 5B and C) identified RSF as the optimal model, demonstrating superior predictive discrimination (average C-index: 0.778), stability, and 24-month time-dependent AUC (0.851). Feature importance ranking (Figure 5D) indicated maximum tumor diameter at recurrence as the most critical predictor, followed by platelet count and surgery/ablation after recurrence.

Model Validation and Evaluation

The RSF model effectively stratified patients into high- and low-risk groups (training C-index = 0.856, validation C-index = 0.838, both P < 0.001; Figure 6A and B). Predictive performance remained stable within 48 months (Figure 6C). Time-dependent Brier scores stabilized between 0.48–0.49 (Figure 6D), and the 24-month calibration curve demonstrated good accuracy (Figure 6E and F). SHAP analysis (Figure 6G) confirmed maximum tumor diameter at recurrence as the primary risk factor and surgery/ablation after recurrence as a key protective factor.

Figure 6 Validation and interpretation of the prognostic model for the TKI therapy cohort. (A) Kaplan-Meier overall survival (OS) curves for the low- and high-risk groups in the training set. (B) Kaplan-Meier overall survival (OS) curves for the low- and high-risk groups in the validation set. (C) Time-dependent area under the curve (AUC) for the model over the 48-month follow-up period. (D) Time-dependent Brier score for assessing the calibration of the model. (E and F) Calibration plots comparing predicted and observed survival probability at 24 months. (G) SHapley Additive exPlanations (SHAP) summary plot showing the impact of key features on model output.

Treatment Strategy Matching Analysis Based on Counterfactual Predictions

Using the two validated models, counterfactual analysis of the entire cohort (n = 454) yielded the following strategy matching: 213 patients (46.9%) were aligned with the model’s recommendation for non-TKI therapy; 135 patients (29.7%) were aligned for TKI therapy; 94 patients (20.7%) receiving non-TKI therapy were recommended to switch to TKI therapy; and 12 patients (2.6%) receiving TKI therapy were recommended to switch to non-TKI therapy. Overall, 348 patients (76.6%) were in the “treatment-concordant” group, whereas 106 patients (23.4%) comprised the “potential treatment-mismatch” group (Figure 7).

Figure 7 Distribution of treatment strategy alignment following counterfactual analysis.The chart categorizes patients into four groups based on the comparison of their actual treatment and the optimal treatment predicted by the validated models.

Discussion

The optimal management of HCC recurrence after curative resection remains insufficiently defined, largely due to the lack of high-level evidence guiding post-recurrence treatment selection. Although TKIs are widely used in advanced HCC, their survival benefit in the post-resection recurrent setting remains controversial.¹³ In this study, by applying overlap weighting based on propensity scores to minimize confounding, we demonstrated that TKI therapy was associated with significantly improved recurrence-free survival (HR = 0.355) and overall survival (HR = 0.486) compared with non-TKI therapy. These findings provide robust real-world evidence supporting the effectiveness of TKI therapy in patients with recurrent HCC after curative resection.⁶

Beyond confirming population-level treatment benefit, an important strength of this study lies in its focus on individualized therapeutic decision-making. Recognizing the marked heterogeneity in recurrent HCC, we developed separate machine learning–based prognostic models for TKI-treated and non-TKI-treated patients. By integrating these treatment-specific models within a counterfactual inference framework, we were able to quantify individualized survival outcomes under alternative treatment scenarios. This dual-scenario modeling approach moves beyond traditional risk stratification and enables personalized estimation of treatment benefit, thereby facilitating data-driven optimization of post-recurrence treatment strategies for patients with HCC.

The treatment-specific models revealed distinct prognostic architectures. In the TKI-treated cohort, key predictors emphasized the dominant roles of tumor burden and the immune–inflammatory milieu. Maximum tumor diameter at recurrence emerged as the most influential risk factor, consistent with prior evidence indicating that baseline tumor size critically affects responsiveness to TKIs such as sorafenib and lenvatinib.¹⁴ Larger recurrent tumors may reflect greater intratumoral heterogeneity and a more complex tumor microenvironment, thereby attenuating the therapeutic efficacy of angiogenesis- and signaling pathway–targeted agents. The inclusion of platelet count further underscores its dual significance as an indicator of hepatic functional reserve and an active mediator of systemic inflammation and tumor progression. Platelets have been shown to facilitate angiogenesis, immune evasion, and metastatic niche formation through the release of pro-angiogenic and immunomodulatory factors, mechanisms that may directly or indirectly modulate sensitivity to TKI therapy.^15,16 Notably, the TKI-specific model incorporated fewer prognostic features overall. This observation may reflect a relative “simplification” of disease progression pathways once key oncogenic signaling axes (eg., VEGFR and FGFR) are pharmacologically inhibited. Under such conditions, patient prognosis may become more strongly determined by the agent’s capacity to control dominant tumor burden characteristics and by the host’s ability to sustain an immune microenvironment conducive to effective treatment response.¹⁷

Conversely, the prognostic model for the non-TKI cohort incorporated a broader array of variables, reflecting the multifactorial nature of disease progression in the absence of targeted intervention. In addition to number of recurrent tumors, a well-established surrogate for intrahepatic dissemination and multicentric recurrence,¹⁸ the model retained primary tumor pathological features, underscoring the persistent influence of intrinsic tumor biology on post-recurrence outcomes. The inclusion of alpha-fetoprotein level at recurrence further highlights its role as a canonical biomarker of tumor aggressiveness and proliferative activity, as well as a predictor of therapeutic responsiveness in advanced HCC.¹⁹ Importantly, HBV infection status emerged as a clinically relevant determinant, consistent with evidence that effective antiviral therapy confers a survival advantage in virus-related HCC following curative resection.²⁰ Ongoing viral replication and the associated chronic inflammatory milieu may accelerate hepatocarcinogenesis and promote tumor progression when systemic anticancer therapy is absent. Taken together, the differential feature composition between the two models suggests that, without TKI treatment, patient prognosis is more heavily governed by the tumor’s inherent invasive potential and the host’s sustained inflammatory state, whereas TKI therapy may partially override these drivers by constraining key oncogenic signaling pathways.

In summary, although the independent prognostic factors selected by the two models demonstrated both overlap and divergence, one shared finding carries particularly important clinical implications. The therapeutic paradigm of systemic therapy combined with local treatment constitutes the cornerstone of management for advanced hepatocellular carcinoma and is consistently recommended by major international guidelines.^21–23Recent evidence further indicates that integrated locoregional and systemic strategies can improve survival outcomes in patients with recurrent hepatocellular carcinoma by synergistically controlling macroscopic disease and modulating the tumor microenvironment.²⁴ In the present study, receipt of radical local treatment (surgical resection or ablation) after recurrence emerged as the most influential protective factor in both subgroup-specific models. This consistent finding strongly suggests that the principle of combining systemic therapy with aggressive local control remains applicable—and prognostically advantageous—even in patients who experience recurrence following curative resection. These results are in line with contemporary retrospective analyses demonstrating that surgical or ablative control of recurrent lesions is independently associated with prolonged overall survival and delayed disease progression in recurrent HCC.²⁵ Importantly, this observation underscores that, irrespective of the subsequent systemic treatment strategy, effective eradication of macroscopic recurrent lesions represents a universal foundation for improving post-recurrence outcomes.

The principal distinction between the two models lies in the composition and complexity of their feature sets. The TKI-treated model was comparatively parsimonious, potentially reflecting a “simplification” of dominant prognostic drivers under TKI therapy, with outcomes being governed primarily by residual tumor burden and the host immune microenvironment. In contrast, the non-TKI model captured a broader spectrum of variables, more comprehensively characterizing the tumor’s intrinsic invasive behavior and the patient’s chronic inflammatory state in the absence of targeted oncogenic pathway inhibition. This divergence is consistent with emerging evidence suggesting that targeted systemic therapies may partially override tumor-intrinsic heterogeneity by suppressing key oncogenic signaling pathways, thereby reshaping the prognostic landscape.²⁶ This critical divergence highlights the methodological and clinical value of our treatment-stratified modeling strategy, suggesting that a single unified prognostic model may be insufficient to accurately reflect the heterogeneous biological mechanisms influencing prognosis across different therapeutic pathways.²⁷

The central advancement of our framework lies in its transition from conventional prognostic estimation to clinically actionable decision support. This innovation is supported by two key strengths. First, model development and reporting rigorously adhered to international standards for clinical prediction research, ensuring methodological transparency, robustness, and reproducibility.²⁸ Moreover, all predictive variables were derived from routinely available clinical data, obviating the need for specialized or costly testing and thereby enhancing the feasibility of real-world implementation. Second, and most notably, this study pioneers a dual-scenario modeling strategy integrated with counterfactual reasoning.²⁹ Consistent with prior applications of counterfactual frameworks in clinical research,^12,30 this approach enables estimation of individualized survival outcomes under alternative treatment strategies, allowing objective identification of potential treatment mismatch at the patient level. Consequently, our treatment-matching simulation revealed that approximately 23.4% of patients within the cohort may derive substantial benefit from optimization of their current therapeutic strategy.

Several recent studies have developed radiomics and deep learning models to predict HCC recurrence or treatment response. Ren et al applied CT-based machine learning to predict outcomes in HCC patients receiving TACE plus TKI therapy;³¹ Wang et al integrated MR radiomics with deep learning to predict post-ablation recurrence;³² Liu et al constructed a pathology-based deep learning model for postoperative recurrence risk and sorafenib response;³³ Chen et al combined conventional radiomics and deep learning features to predict early recurrence after curative ablation.³⁴ While bioinformatics approaches have been employed to identify hub genes and predictive signatures for HCC treatment response.³⁵ These studies collectively demonstrate the potential of artificial intelligence to personalize HCC management. Compared with these approaches, our model offers distinct advantages: it directly compares survival outcomes between TKI and non-TKI therapies in the specific setting of postoperative recurrence, and provides individualized treatment benefit estimates through a counterfactual framework. Moreover, our model relies solely on routinely available clinical and laboratory variables, making it more readily generalizable across institutions without requiring specialized imaging or pathology pipelines. However, a trade-off exists: our model may not capture tumor morphological heterogeneity as effectively as image-based deep learning models. Therefore, the two approaches are complementary. Future work could integrate clinical variables with radiomics and pathological features to further improve predictive accuracy for post-recurrence TKI benefit.

This study has several limitations. First, as a single-center retrospective analysis, residual or unmeasured confounding cannot be fully excluded, despite the use of advanced overlap weighting based on propensity scores to mitigate measured confounders. Second, the TKI-treated group included both sorafenib and lenvatinib without agent-specific subgroup analysis due to limited sample size, precluding assessment of differential survival benefits between individual TKIs. Moreover, because the TKI and non-TKI models were trained on separate subgroups with potentially different feature distributions, applying a patient’s baseline characteristics to both models may introduce prediction errors for underrepresented profiles; thus, individualized treatment benefit estimates should be interpreted with caution. Third, our models focus solely on survival outcomes and do not incorporate treatment toxicity, cost, patient preferences, or quality of life, which are critical for clinical decision-making. Finally, although the models demonstrated robust internal validation and favorable performance, their generalizability remains to be established through external validation. Future research should include prospective, multi-center data collection to assess the real-world clinical utility and further refine the treatment-matching framework.

Conclusion

This study demonstrates that tyrosine kinase inhibitor therapy provides a significant survival benefit in patients with postoperative recurrent hepatocellular carcinoma, as shown through rigorous propensity score-based adjustment. By integrating machine learning-based prognostic models with counterfactual inference, we enable individualized estimation of treatment benefit and predictive survival outcomes. This framework represents a step toward data-driven, precision management of recurrent HCC and may inform personalized post-recurrence therapeutic decisions. Prospective validation and evaluation of clinical implementation are warranted before routine application.

Abbreviations

HCC, hepatocellular carcinoma; TKI, tyrosine kinase inhibitor; HBV, hepatitis B virus; NLR, neutrophil-to-lymphocyte ratio; PLR, platelet-to-lymphocyte ratio; LMR, lymphocyte-to-monocyte ratio; OS, overall survival; RFS, recurrence-free survival; SMDs, standardized mean differences; C-index, concordance index; RSF, random survival forest; GBM, gradient boosting machine; Surv-SVM, Survival support vector machine; AUC, area under the curve; SHAP, the Shapley additive explanations.

Data Sharing Statement

The data supporting the findings of this study are available from the corresponding author upon reasonable request (Jingyu Cao, E-mail [email protected]).

Ethics Approval and Consent to Participate

The protocol of this study was approved by the ethical review board of the Affiliated Hospital of Qingdao University (No. QYFY-WZLL-42256).

Acknowledgments

Thank all the staff authors for their contributions to this study.

Author Contributions

All authors made a significant contribution to the work reported, whether that is in the conception, study design, execution, acquisition of data, analysis and interpretation, or in all these areas; took part in drafting, revising or critically reviewing the article; gave final approval of the version to be published; have agreed on the journal to which the article has been submitted; and agree to be accountable for all aspects of the work.

Funding

This study was supported by the Taishan Scholar Foundation of Shandong Province, the Shandong Provincial Natural Science Foundation (ZR2024QH265 and ZR2025MS1290), the Shandong Provincial Medical and Health Science and Technology Development Plan (202425020784), the Shandong Higher Education Young Science and Technology Support Program (2024KJJ041), the Qingdao Natural Science Foundation (24-4-4-zrjj-103-jch), and the Medical and Health Scientific Research Project of Qingdao (2024-WJKY173).

Disclosure

All authors declare no competing interests.

References

1. Siegel RL, Giaquinto AN, Jemal A. Cancer statistics, 2024. CA Cancer J Clin. 2024;74(1):12–21. doi:10.3322/caac.21820

2. Llovet JM, Kelley RK, Villanueva A, et al. Hepatocellular carcinoma. Nat Rev Dis Primer. 2021;7(1):6. doi:10.1038/s41572-020-00240-3

3. Sensi B, Angelico R, Toti L, et al. Mechanism, potential, and concerns of immunotherapy for hepatocellular carcinoma and liver transplantation. Curr Mol Pharmacol. 2024;17:e1012161775. doi:10.2174/0118761429310703240823045808

4. Finn RS, Qin S, Ikeda M, et al. Atezolizumab plus bevacizumab in unresectable hepatocellular carcinoma. New Engl J Med. 2020;382(20):1894–1905. doi:10.1056/NEJMoa1915745

5. Ren Z, Xu J, Bai Y, et al. Sintilimab plus a bevacizumab biosimilar (ibi305) versus sorafenib in unresectable hepatocellular carcinoma (orient-32): a randomised, open-label, Phase 2–3 study. Lancet Oncol. 2021;22(7):977–990. doi:10.1016/S1470-2045(21)00252-7

6. Li J, Wang W, Zhu R, et al. Postoperative adjuvant tyrosine kinase inhibitors combined with anti-pd-1 antibodies improves surgical outcomes for hepatocellular carcinoma with high-risk recurrent factors. Front Immunol. 2023;14:1202039. doi:10.3389/fimmu.2023.1202039

7. Payo-Serafín T, Méndez-Blanco C, Fernández-Palanca P, et al. Risk versus benefit of tyrosine kinase inhibitors for hepatocellular carcinoma: a systematic review and meta-analysis of randomized controlled trials. Clin Pharmacol Ther. 2024;116(2):328–345. doi:10.1002/cpt.3312

8. Haak F, Di Tommaso L, Viganò L, et al. A transcriptomic signature of intrahepatic cholangiocellular carcinoma with very early recurrence: identifying patients at risk and finding a personalized adjuvant therapy. Eur J Surg Oncol. 2023;49(2):e214. doi:10.1016/j.ejso.2022.11.586

9. Sun B, Chen L, Lei Y, et al. Sorafenib plus transcatheter arterial chemoembolization with or without camrelizumab for the treatment of intermediate and advanced hepatocellular carcinoma. Br J Radiol. 2024;97(1159):1320–1327. doi:10.1093/bjr/tqae087

10. Xu W, Liao S, Hu Y, Huang Y, Zhou J. Upregulation of mir-3130-5p enhances hepatocellular carcinoma growth by suppressing ferredoxin 1: mir-3130-5p enhances hcc growth via inhibiting fdx1. Curr Mol Pharmacol. 2024;17:e1012209080. doi:10.2174/0118761429358008250305070518

11. Ye W, Wang J, Zheng J, Jiang M, Zhou Y, Wu Z. Association between higher expression of vav1 in hepatocellular carcinoma and unfavourable clinicopathological features and prognosis. Protein Pept Lett. 2024;31(9):706–713. doi:10.2174/0109298665330781240830042601

12. Famularo S, Donadon M, Cipriani F, et al. Machine learning predictive model to guide treatment allocation for recurrent hepatocellular carcinoma after surgery. JAMA Surg. 2023;158(2):192–202. doi:10.1001/jamasurg.2022.6697

13. Wang Z, Fan J, Zhou S, et al. Perioperative camrelizumab plus rivoceranib versus surgery alone in patients with resectable hepatocellular carcinoma at intermediate or high risk of recurrence (cares-009): a randomised phase 2/3 trial. Lancet. 2025;406(10515):2089–2099. doi:10.1016/S0140-6736(25)01720-9

14. Llovet JM, Montal R, Sia D, Finn RS. Molecular therapies and precision medicine for hepatocellular carcinoma. Nat Rev Clin Oncol. 2018;15(10):599–616. doi:10.1038/s41571-018-0073-4

15. Tian X, Liu X, Zeng F, Chen Z, Wu D. Platelet-to-lymphocyte ratio acts as an independent risk factor for patients with hepatitis b virus-related hepatocellular carcinoma who received transarterial chemoembolization. Eur Rev Med Pharmacol. 2016;20(11):2302.

16. Sierko E, Wojtukiewicz MZ. Platelets and angiogenesis in malignancy. Semin Thromb Hemost. 2004;30(1):95–108. doi:10.1055/s-2004-822974

17. Sia D, Villanueva A, Friedman SL, Llovet JM. Liver cancer cell of origin, molecular class, and effects on?patient prognosis. Gastroenterology. 2017;152(4):745–761. doi:10.1053/j.gastro.2016.11.048

18. Zeng J, Zeng J, Lin K, et al. Development of a machine learning model to predict early recurrence for hepatocellular carcinoma after curative resection. Hepatobil Surg Nutr. 2022;11(2):176–187. doi:10.21037/hbsn-20-466

19. Personeni N, Bozzarelli S, Pressiani T, et al. Usefulness of alpha-fetoprotein response in patients treated with sorafenib for advanced hepatocellular carcinoma. J Hepatol. 2012;57(1):101–107. doi:10.1016/j.jhep.2012.02.016

20. Huang DQ, Hoang JK, Kamal R, et al. Antiviral therapy utilization and 10-year outcomes in resected hepatitis b virus- and hepatitis c virus-related hepatocellular carcinoma. J Clin Oncol. 2024;42(7):790–799. doi:10.1200/JCO.23.00757

21. Reig M, Forner A, Rimola J, et al. BCLC strategy for prognosis prediction and treatment recommendation: the 2022 update. J Hepatol. 2022;76(3):681–693. doi:10.1016/j.jhep.2021.11.018

22. European AFTS. EASL clinical practice guidelines on the management of hepatocellular carcinoma. J Hepatol. 2025;82(2):315–374. doi:10.1016/j.jhep.2024.08.028

23. Heimbach JK, Kulik LM, Finn RS, et al. Aasld guidelines for the treatment of hepatocellular carcinoma. Hepatology. 2018;67(1):358–380. doi:10.1002/hep.29086

24. Liang J, Bai Y, Ha F, Luo Y, Deng H, Gao Y. Combining local regional therapy and systemic therapy: expected changes in the treatment landscape of recurrent hepatocellular carcinoma. World J Gastro Oncol. 2023;15(1):1–18. doi:10.4251/wjgo.v15.i1.1

25. Ohama H, Hiraoka A, Tada F, et al. Clinical usefulness of surgical resection including the complementary use of radiofrequency ablation for intermediate-stage hepatocellular carcinoma. Cancers. 2022;15(1):236. doi:10.3390/cancers15010236

26. Li L, Li Z, Pan L, et al. Adjuvant therapy for hepatocellular carcinoma after curative treatment: several unanswered questions. J Clin Transl Hepatol. 2024;12(5):525–533. doi:10.14218/JCTH.2024.00030

27. Kent DM, van Klaveren D, Paulus JK, et al. The predictive approaches to treatment effect heterogeneity (path) statement: explanation and elaboration. Ann Intern Med. 2020;172(1):W1–W25. doi:10.7326/M18-3668

28. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (tripod). Ann Intern Med. 2015;162(10):735–736. doi:10.7326/L15-5093-2

29. Wei X, Wang F, Liu Y, et al. A machine learning model based on counterfactual theory for treatment decision of hepatocellular carcinoma patients. J Hepatocell Carcinoma. 2024;11:1675–1687. doi:10.2147/JHC.S470550

30. Li X, Liang X, Chen S, et al. Development and validation of a ct-based online calculator for prognosis and postoperative adjuvant transcatheter arterial chemoembolization benefit prediction in resected hepatocellular carcinoma: a multicenter retrospective cohort study. Int J Surg. 2025;2025:10–1097. doi:10.1097/JS9.0000000000004456.

31. Ren Q, Zhu P, Li C, et al. Pretreatment computed tomography-based machine learning models to predict outcomes in hepatocellular carcinoma patients who received combined treatment of trans-arterial chemoembolization and tyrosine kinase inhibitor. Front Bioeng Biotechnol. 2022;10:872044. doi:10.3389/fbioe.2022.872044

32. Wang Y, Zhang Y, Xiao J, Geng X, Han L, Luo J. Multicenter integration of mr radiomics, deep learning, and clinical indicators for predicting hepatocellular carcinoma recurrence after thermal ablation. J Hepatocell Carcinoma. 2024;11:1861–1874. doi:10.2147/JHC.S482760

33. Li Y, Xiong J, Hu Z, et al. Denoised recurrence label-based deep learning for prediction of postoperative recurrence risk and sorafenib response in hcc. Bmc Med. 2025;23(1):162. doi:10.1186/s12916-025-03977-4

34. Li Y, Qian G, Zhu Y, et al. An integrated model combined conventional radiomics and deep learning features to predict early recurrence of hepatocellular carcinoma eligible for curative ablation: a multicenter cohort study. J Comput Assist Tomo. 2025;49(6):860–871. doi:10.1097/RCT.0000000000001764

35. Gudivada IP, Amajala KC. Integrative bioinformatics analysis for targeting hub genes in hepatocellular carcinoma treatment. Curr Genomics. 2025;26(1):48–80. doi:10.2174/0113892029308243240709073945

Creative Commons License © 2026 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms and incorporate the Creative Commons Attribution - Non Commercial (unported, 4.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.