Back to Journals » International Journal of General Medicine » Volume 18

Development of a Predictive Risk Model for Recurrence of Chronic Pulmonary Aspergillosis in Post-Tuberculosis Patients: A Retrospective Observational Study

Authors Wu M ORCID logo, Yang YN, Wang F ORCID logo, Yan JR, Yang R, Yang C, Ren Y

Received 2 June 2025

Accepted for publication 21 November 2025

Published 3 December 2025 Volume 2025:18 Pages 7243—7254

DOI https://doi.org/10.2147/IJGM.S544265

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 5

Editor who approved publication: Prof. Dr. Héctor Mora-Montes



Ming Wu,1 Yan Na Yang,1 Fei Wang,1 Ju Rong Yan,1 Rui Yang,2 ChengQing Yang,3 Yi Ren1

1Department of Clinical Laboratory, Wuhan Pulmonary Hospital, Wuhan, Hubei Province, 430030, People’s Republic of China; 2Department of Orthopedics, People’s Hospital of Dongxihu District, Wuhan, Hubei Province, 430040, People’s Republic of China; 3Respiratory Ward Two, Wuhan Pulmonary Hospital, Wuhan, Hubei Province, 430030, People’s Republic of China

Correspondence: ChengQing Yang, Respiratory Ward Two, Wuhan Pulmonary Hospital, Wuhan, Hubei Province, 430030, People’s Republic of China, Email [email protected] Yi Ren, Department of Clinical Laboratory, Wuhan Pulmonary Hospital, Wuhan, Hubei Province, 430030, People’s Republic of China, Tel +86-13807196283, Email [email protected]

Objective: The recurrence rate of post-tuberculosis chronic pulmonary aspergillosis (post-TB CPA) is alarmingly high. This study aims to establish a risk prediction model utilizing machine learning algorithms to forecast the one-year recurrence risk of post-TB CPA.
Methods: This retrospective study included all patients diagnosed with pulmonary tuberculosis complicated by chronic pulmonary aspergillosis at Wuhan Pulmonary Hospital in 2022. Ultimately, 220 patients were included for the significance analysis.The Least Absolute Shrinkage and Selection Operator LASSO regression analysis was utilized to select 8 variables associated with the recurrence of tuberculosis complicated by chronic pulmonary aspergillosis. Four machine learning algorithms were compared to predict the recurrence risk in patients with this complication, with their performance evaluated using the receiver operating characteristic curve, area under the curve (AUC), calibration curve analysis, and decision curve analysis.
Results: LASSO regression analysis identified chronic obstructive pulmonary disease (COPD), chronic fibrotic pulmonary aspergillosis (CFPA), progressive pleural hypertrophy, fungal culture results, age, disease duration, emphysema and treatment duration as factors related to the recurrence risk of tuberculosis complicated by chronic pulmonary aspergillosis. The logistic regression model demonstrated the best performance, it outperformed the other three models by achieving the highest AUC of 0.779 on the internal validation set and 0.819 in the test cohort. The calibration curve indicated a strong correlation between the actual and predicted probabilities, while the decision curve analysis revealed significant clinical benefits.
Discussion: In this study, we developed a disease recurrence prediction model using machine learning techniques. This model aims to assist clinicians in identifying the most relevant risk factors associated with the recurrence of tuberculosis complicated by chronic pulmonary aspergillus. It facilitates the formulation of targeted and effective re-examination plans for discharged patients, ultimately reducing the recurrence rate after discharge and enhancing the quality of life for these patients.

Keywords: post-TB CPA, machine learning, predictive model, risk factors, recurrence risk, AUC

Introduction

Tuberculosis (TB) is a significant public health concern. It is estimated that in 2025, the number of new TB cases in China will reach Nearly 800,000, with a death toll of 30,000.1 Pulmonary tuberculosis (PTB) is caused by the invasion of Mycobacterium tuberculosis into the lungs. It is the most prevalent form of TB, accounting for over 80% of all cases.2,3 More than two-thirds of PTB patients continue to experience extensive lung structural damage following treatment.4,5 Post-tuberculosis lung disease (PTLD) refers to a group of lung diseases characterized by chronic respiratory system abnormalities, either partially or entirely resulting from PTB, with or without clinical symptoms.6 The lung structural damage caused by tuberculosis can be further complicated by infections such as chronic pulmonary aspergillosis (CPA).7 As a complication of PTB, CPA affects approximately 3 million individuals worldwide, with a global prevalence rate of 42 per 100,000.8,9 Research indicates that the 5-year mortality rate for CPA ranges from 50% to 85%,10 contributing to its status as a global burden. This complication is referred to as post-tuberculosis chronic pulmonary aspergillosis (post-TB CPA). Notably, China continues to bear a high burden of tuberculosis on a global scale. Post-TB CPA patients are not uncommon; however, there is a notable lack of comprehensive literature on this topic. The coexistence of these two conditions complicates both diagnosis and treatment. Currently, there are no established diagnostic and treatment guidelines for post-TB CPA in China. In recent years, as the survival rate of tuberculosis patients has improved, the incidence of CPA among those with pulmonary tuberculosis complications has also significantly increased.11,12

Factors influencing the development of chronic pulmonary aspergillosis in patients with pulmonary tuberculosis may include the patient’s immune status, history of pulmonary diseases, environmental exposures, and genetic susceptibility.13,14 These factors are complex and severely impact the quality of life of affected individuals. Preventing recurrence is a key objective in the management of chronic pulmonary aspergillosis. Recurrence can lead to significant distress for both patients and their families.15 This study aims to analyze the clinical characteristics of CPA patients following tuberculosis, identify the most relevant factors associated with recurrence, and ultimately enhance the quality of life for discharged patients.

Traditional research methods exhibit certain limitations in identifying the interactions among complex factors; however, the application of machine learning technology has introduced new opportunities in this field. Machine learning is adept at handling large-scale and multi-dimensional data.16,17 By establishing intricate models to identify potential influencing factors and their interrelationships, it can enhance the accuracy of disease prediction and early diagnosis and also identify potential recurrence factors of the disease.17–19 In recent years, the use of machine learning in medical research has gained significant traction. Through the construction and optimization of machine learning models, researchers can effectively identify recurrence factors related to pulmonary tuberculosis complicated by chronic pulmonary aspergillosis, thereby supporting clinical decision-making.20 This study retrospectively analyzed the clinical characteristics and treatment data of 220 patients with pulmonary tuberculosis complicated by chronic pulmonary aspergillosis who were admitted to Wuhan Pulmonary Hospital in 2022. Machine learning was employed to analyze the factors influencing the recurrence of post-TB CPA patients within one year, aiming to enhance clinical management and post-discharge follow-up for these patients. This review provides valuable insights.

Materials and Methods

Study Population

A retrospective study method was employed in this research. From January to December 2022, the inpatient electronic medical record system of Wuhan Pulmonary Hospital was utilized to identify patients diagnosed with pulmonary fungal infections, specifically Aspergillus pneumonia, invasive pulmonary aspergillosis, allergic bronchopulmonary aspergillosis, and disseminated aspergillosis. This selection encompassed patients with potential diagnoses of pulmonary aspergillosis. After applying the weight removal process, the diagnoses of all patients were re-evaluated based on predefined inclusion and exclusion criteria. The inclusion criteria were as follows:1.Chest CT scans must exhibit imaging characteristics consistent with chronic pulmonary aspergillosis (CPA); 2. There must be direct or immunological evidence of Aspergillus infection, meeting at least one of the following criteria: ① Positive Aspergillus culture from sputum or bronchoalveolar lavage fluid (BALF); ② Detection of Aspergillus nucleic acid in sputum or BALF; ③Positive galactomannan antigen test (GM) in BALF; ④ Presence of Aspergillus-specific IgG antibodies in serum; ⑤ Pathological confirmation of Aspergillus infection in lung tissue; 3. The disease duration must be at least three months. The exclusion criteria included: 1. Invasive pulmonary aspergillosis;2.Allergic bronchopulmonary aspergillosis;3. Absence of Aspergillus infection; 4.Suspected Aspergillus infections that remain undiagnosed.21,22

Data Collection

The clinical data of 220 patients diagnosed with pulmonary tuberculosis (PTB) combined with CPA were collected. This dataset includes various parameters such as gender, age, previous medical history, presenting symptoms, disease duration, imaging findings, laboratory test classifications, and follow-up outcomes. The follow-up assessments focused on patient stability and recurrence of the disease. Informed consent was obtained from all participants, and the study adhered to the ethical guidelines established by Wuhan Pulmonary Hospital.

Model Construction

The Beckman Coulter Dx AI platform (https://www.xsmartanalysis.com/beckman/login/) was utilized for statistical analysis. Least Absolute Shrinkage and Selection Operator (LASSO) regression analysis was employed to identify factors associated with the recurrence of post-TB CPA. This study implemented four machine learning classification algorithms, namely XGBoost, Random Forest, logistic regression, and Support Vector Machine (SVM), to construct the machine learning model. The training dataset was used for optimal model selection, while the test dataset was utilized to validate the best-performing model. Evaluation metrics, including accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and area under the receiver operating characteristic curve (AUC), were employed to compare the efficacy of the models.

Statistical Analysis

Continuous variables were standardized, while categorical variables underwent exclusive encoding. Continuous variables were represented using the median (M) and interquartile range (IQR), whereas categorical variables were expressed as counts and percentages. The differences between groups for continuous variables were analyzed using either Student’s t-test or the Mann–Whitney test, while categorical variables were examined using Fisher’s exact test or the chi-squared test.

Key variables for constructing the prediction model were selected using Lasso regression. The optimal algorithm was then identified by comparing multiple modeling algorithms and was used to build the prediction model individually. Discriminative ability was quantified by the area under the receiver operating characteristic curve (AUC).

All statistical analyses were conducted using Python version 3.11.4, R version 4.2.3 and glmnet version 4.1.8. Results were deemed statistically significant at a threshold of P < 0.05.

Result

Basic Characteristics of the Patients

A total of 220 patients were enrolled in this study, comprising 141 males (64%) and 79 females (36%). The median age of the patients was 60 years (Table S1). Continuous variables were presented as mean±standard deviation (SD) for normally distributed variables or median with interquartile range (OR) for non-normally distributed variables, and categorical variables were presented as percentage frequencies. Demographic and laboratory tests were compared using a t-testor Mann–Whitney U-test for continuous and chi-square test for categorical variables. We applied Shapiro–Wilk Nornality test to test for normality, and Levene’s-test to test homogeneity of variances.Then, nomally distributed variables were compared by the Student’s t-test, and non-normally distributed variables were compared by MannWhitney U-test. In the significant analysis and comparison, it was observed that the combination of bronchiectasis, Non-tuberculous mycobacteria (NTM), and chronic obstructive pulmonary disease (COPD), along with chronic fibrotic pulmonary aspergillosis (CFPA), showed P values for CFPA, number of cavities, progressive pleural hypertrophy, emphysema, fungal culture, age, disease duration, Erythrocyte Sedimentation Rate(ESR), Serum Fungal Galactomannan Test (Serum GM), and treatment course all less than 0.05 in Table 1, indicating statistical significance.

Table 1 The Results of the Significance Analysis

Feature Screening

To identify the most relevant predictors for recurrence and to avoid overfitting, feature selection was performed using the Least Absolute Shrinkage and Selection Operator (Lasso) regression with a binomial deviance penalty. The optimal regularization parameter (λ) was determined via 10-fold cross-validation.The cross-validation results are presented in Figure 1A, which plots the binomial deviance against Log(λ). The left vertical dashed line indicates the value of λ (λ_min = 0.035) that achieved the minimum mean cross-validated error. The right vertical dashed line corresponds to the value of λ (λ_1se = 0.067), which represents the most parsimonious model within one standard error of the minimum error.The coefficient profiles for all candidate variables across the range of λ values are illustrated in Figure 1B. Each path represents the trajectory of a variable’s coefficient as the penalty increases (moving rightward on the Log(λ) axis). The two vertical lines from the cross-validation plot are superimposed, showing the exact set of variables retained at each critical λ value. Variables whose coefficients shrink to zero are effectively excluded from the model.At λ_min = 0.035, the Lasso model selected 13 variables with non-zero coefficients. To enhance model simplicity and generalizability, the model based on λ_1se = 0.067 was adopted for further analysis. At this penalty level, the coefficients of eight variables remained non-zero, identifying them as key predictors. As detailed in Table 2 (under the “SEM of minimal distance” column), these factors and their coefficients are: chronic obstructive pulmonary disease (COPD, 0.610), chronic pulmonary aspergillosis (CFPA, 0.458), progressive pleural thickening (0.778), emphysema (0.045), fungal culture (0.089), age (0.010), disease duration (0.001), and treatment course (−0.082). The magnitude of these coefficients reflects the strength of association with recurrence, while the sign indicates the direction (positive for risk factors, negative for protective factors). The negative coefficient for “treatment course” suggests a protective effect against recurrence.

Table 2 The Coefficients of Lasso Regression

Figure 1 Least absolute shrinkage and selection operator (LASSO) regression analysis and 5-fold cross-validation for selecting factors associated with the recurrent patients of post-TB CPA patients.(A) A joint plot was created based on the log-likelihood.The minimum standard is on the left line and the 1-SE standard is on the right line. In the current study, we selected 7 non-zero predictors according to the 1-SE standard. SE, the standard error.(B) Bias selection of the tuning parameter (lambda) in LASSO regression based on the minimum standard (left dashed line) and 1-SE (standard error) standard (right dashed line).

Identification of the Optimal Model

Four machine learning models—namely XGBoost, Random Forest, Logistic Regression, and Support Vector Machine (SVM)—were employed to analyze and compare the evaluation indicators among the models in order to identify the optimal predictive model. To mitigate the risk of overfitting and to select the best model, a 5-fold cross-validation was performed using the training dataset. Figure 2 illustrates the performance of the four machine learning models in both the training and validation cohorts. Subsequently, the average values of accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and area under the curve (AUC) for the four machine learning models were calculated. The performance metrics of the logistic model in the validation set, including accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and AUC (95% CI), are presented in Table 3. Among the four comparative models, logistic regression ranked third in AUC on the training set, yet its training performance remained notably high with an AUC of 0.819. More importantly, logistic regression outperformed the other three models by achieving the highest AUC of 0.779 on the internal validation set. As the internal validation set is widely recognized as a far more critical indicator for evaluating a model’s generalization potential,since it simulates the model’s performance on unseen data more accurately than the training set and its stronger robustness and adaptability to non-training data. Thus, we rigorously confirm that logistic regression is the optimal model for this study.

Table 3 Comparison of Multiple Machine Learning Evaluation Indexes Between Training Set and Validation Set

Figure 2 Multiple model comparison results. (A). ROC curve comparison of training set in multiple machine algorithms. (B). ROC curve comparison of validation set in multiple machine algorithms.

Analysis and Assessment of the Logistic Model

We employ the logistic model as the final classification method for binary classification.To ensure the objectivity and reliability of model performance evaluation, this study adopted a dual validation strategy combining “independent test set + cross-validation. Specifically, 15% of the total data was randomly extracted as the Independent Test Set; this part of the data was not involved in model training throughout the process and was only used for External Validation under simulated real-world scenarios to verify the model’s generalization ability.The remaining 85% of the dataset was used for model construction and internal evaluation, implemented through 5-Fold Cross-Validation: this part of the data was evenly divided into 5 subsets. In each iteration, 4 subsets served as the training set and 1 subset as the internal validation set. This process was repeated 5 times to achieve full sample coverage. Finally, the mean and standard deviation of the 5 validation results were used to comprehensively evaluate the model’s stability and internal consistency. Calibration curvesd for each set in Figure 3A–C. The accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and AUC (95% CI) of the model in the test, training, and validation sets are presented in Table 4. The AUC of the test set (0.842), which is very close to the training set’s 0.844. Notably, the test set (0.842) was significantly higher than the validation set (0.817). These results collectively demonstrate that our model exhibits excellent performance—it not only avoids overfitting but also achieves superior generalization ability in real-world-like scenarios compared to internal validation, fully confirming the model’s reliability and effectiveness.

Table 4 Diagnostic Efficacy of the Logistic Model in the Testing and Validation Cohorts

Figure 3 Performance of the prediction model (A) Receiver operating characteristic (ROC) curve of the training cohort; (B) ROC curve of the validation cohort; (C) ROC curve of the testing cohort.

The SHAP Summary Graph

As illustrated in Figure 4A and B, the SHAP algorithm was used to interpret feature importance in the logistiregression model. COPD, CFPA, progressive pleural thickening, emphysema, fungal culture, age, disease duration, and treatment course were identified as the most influential features fopredicting recurrence of chronic pulmonary aspergillosis in post-tuberculosis patients.A larger mean absolute Shapley value indicated a greater impact on the model’predictions.To clarify, these figures decode the risk factors for recurrent post-tuberculosis CPA via SHAP: Figure 4A (SHAP summary scatter plot):Displays how features influence recurrence risk—red/blue colors denote high/low feature values, and SHAP values show their push/pull on risk. Figure 4B (SHAP feature importance bar plot): Ranks features by their average impact on risk, with Treatment course being the most critical and Emphysema the least. Figure 4C (SHAP force plot): Illustrates how features collectively determine an individual’s recurrence risk (f(x)=0.39). Together, they comprehensively unravel the model’s reasoning for predicting CPA recurrence risk.

Figure 4 Summary plots of visualize SHAP values. In Figure (A) the position of the point along the X-axis represents the actual SHAP value, indicating the influence of a specific feature on the model output of that specific patient. Mathematically speaking, this corresponds to the logarithm of the recurrence risk among patients, which means that a higher SHAP value indicates a higher recurrence risk compared to patients with a lower SHAP value. Features are organized along the Y-axis according to their importance and are determined by the average of their absolute Shapley values. The higher the position of the feature in the graph is, the more significant the influence on the model will be. Figure (B) shows the SHAP importance graph of the logistic algorithm. (C) Contribution level of features in logistic model.

Discussion

Chronic Pulmonary Aspergillosis (CPA), as a complication of pulmonary tuberculosis (PTB), imposes a considerable burden worldwide.20 Epidemiological data indicate that 5% to 35% of patients develop CPA as a consequence of tuberculosis, with half of the global CPA burden attributable to this disease.13,23 Although there have been several studies on CPA following tuberculosis in international contexts, relevant research reports from China are scarce. Patients suffering from post-TB CPA frequently exhibit symptoms such as chronic cough, expectoration, and hemoptysis. Additionally, some patients may present with tuberculosis-related systemic symptoms, including fever, night sweats, and weight loss,24 though these clinical manifestations tend to be more severe. The clinical presentation of tuberculosis combined with CPA poses a complex clinical challenge.24,25 In addition to diagnosis and treatment, preventing recurrence is also of paramount importance.

In recent years, the widespread application of machine learning technology in the medical field has resulted in a growing number of studies aimed at utilizing machine learning models to predict the likelihood of diseases and clinical outcomes in patients with TB and CPA.13,26 Research indicates that machine learning models, which incorporate clinical features and biomarkers, hold significant promise for predicting disease risk and prognosis in these patient populations.27,28 For instance, Yan et al employed logistic regression to estimate the probability of tuberculosis in conjunction with CPA, taking into account the patient’s disease history, lung cavitation, and surgical background. Their model achieved an AUC of 0.86.28 Additionally, Ren et al utilized serum cytokine biomarkers, including IL-6 and IL-8, as diagnostic criteria for pulmonary tuberculosis and chronic pulmonary aspergillosis.29

This study retrospectively analyzed the clinical data of 72 patients with tuberculosis complicated by CPA at Wuhan Pulmonary Hospital in 2022, including various laboratory examination indicators. A one-year recurrence risk prediction model for the disease was constructed using machine learning algorithms. Four different machine learning models were compared to identify the most relevant risk factors for recurrence in patients with tuberculosis complicated by CPA. Seven significant influencing factors were identified: COPD, CPA, progressive pleural hypertrophy, fungal culture, age, disease duration, and treatment duration. By analyzing SHAP values, we quantified the contribution of each feature to the model output, thus determining the most relevant predictor.30 This analytical method provides a deeper understanding of how each factor influences the predictive results of the model. A comprehensive evaluation of these factors is crucial for clinical decision-making and patient management.

The results of the SHAP analysis indicate that the treatment course is the most significant factor. A full treatment course can reduce the risk of recurrence, which aligns with previous studies.22 The duration of the treatment course directly impacts the treatment outcome and the risk of recurrence. An appropriate treatment duration is essential for controlling the condition and preventing recurrence. Progressive pleural hypertrophy is the second most relevant factor. Studies have demonstrated that the absence of cavitation on chest X-rays and the presence of 100% pleural thickening serve as negative predictive values for CPA.31 Therefore, this study recommends that discharged patients undergo chest X-ray examinations every three months, with particular attention to pleural hypertrophy, as this can help prevent recurrence.COPD and chronic fibrotic pulmonary aspergillus disease are the third risk factors for recurrence and warrant significant attention. Pneumothorax is a common complication of post-tuberculosis CPA.Tuberculosis-induced lung damage (cavities, fibrosis) plus Aspergillus-induced inflammation weakens lung tissue, easily causing rupture and pneumothorax.22 Conversely, pneumothorax worsens PT-CPA patients’ respiratory dysfunction and indicates poor disease control, with high recurrence risk.It is advised that such patients complete the full course of treatment during their hospital stay, maintain good compliance, and increase the frequency of follow-ups and regular reexaminations after discharge. Patients with a longer disease duration and those with positive fungal cultures face an elevated risk of recurrence and should use medications judiciously. Age is also a significant factor; it is one of the critical determinants influencing the development of chronic pulmonary aspergillosis, as confirmed by numerous studies.32 As individuals age, the immune system’s functionality declines, rendering elderly patients more susceptible to infections.

While this research has yielded significant results, it is not without limitations. The study was conducted at a single center, which may limit the generalizability of the model to other regions or populations. Additionally, the absence of long-term follow-up precludes an evaluation of the model’s predictive capabilities regarding long-term prognosis. However, this study also presents notable strengths: the application of multiple machine learning algorithms for model construction and comparison enhances the stability and reliability of the models. By employing 5-fold cross-validation and a comprehensive evaluation using multiple indices, the optimal logistic regression model was identified, effectively mitigating issues of overfitting and model selection bias.33 The model demonstrates commendable predictive performance on both the training and validation sets, suggesting a degree of generalization ability.34 The contribution of each feature to the model was analyzed using SHAP values, providing valuable insights for clinical decision-making. Despite the identified limitations, through rigorous methodological design and multi-faceted model evaluation, a relatively stable and reliable one-year recurrence risk prediction model for pulmonary tuberculosis combined with chronic pulmonary aspergillosis has been established. Future research should aim to expand the sample size, conduct multi-center validations, and investigate the predictive effects and long-term prognoses of this model across diverse populations.

Conclusion

This study constructed a machine learning-based one-year recurrence prediction model for post-TB CPA using 220 retrospective cases from Wuhan Pulmonary Hospital; LASSO regression identified eight key risk factors (COPD, CFPA, progressive pleural hypertrophy, etc)., and among four algorithms, the logistic regression model performed best (AUC: 0.779 internal validation, 0.819 test cohort) with good calibration and clinical benefits. In conclusion, this model effectively identifies high-risk patients and quantifies recurrence risk, providing a practical tool for personalized follow-up to reduce recurrence and improve prognosis, while future multi-center prospective studies are needed to validate its generalizability.

Data Sharing Statement

All data supporting the results of this study are available upon request from the corresponding author.

Ethics Statement

This study was approved by the Ethics Committee of [Wuhan Pulmonary Hospital] (Approval No.[Wuhan Ethics (2022) No. 7]). The Ethics Committee waived the requirement for informed consent because of the retrospective nature of the study and the anonymization of all participant data.

Funding

This work was financially supported by the Wuhan Science and Technology Innovation Bureau Project (No. 2024040801020381). Wuhan Municipal Health and Family Planning Commission Scientific research project (No:WX20D75).

Disclosure

The authors declare that they have no competing interests.

References

1. Trajman A, Campbell JR, Kunor T, et al. Tuberculosis. Lancet. 2025;405(10481):850–866. doi:10.1016/S0140-6736(24)02479-6

2. Chihota V, Gombe M, Gupta A, et al. Tuberculosis preventive treatment in high TB-burden settings: a state-of-the-art review. Drugs. 2025;85(2):127–147. doi:10.1007/s40265-024-02131-3

3. Tayal A, Kabra SK. Tuberculosis preventive treatment. Indian J Pediatr. 2023;91(8):823–829. doi:10.1007/s12098-023-04969-z

4. Dheda K, Mirzayev F, Cirillo DM, et al. Multidrug-resistant tuberculosis. Nat Rev Dis Primers. 2024;10(1):22. doi:10.1038/s41572-024-00504-2

5. Janssen S, Murphy M, Upton C, Allwood B, Diacon AH. Tuberculosis: an update for the clinician. Respirology. 2025;30(3):196–205. doi:10.1111/resp.14887

6. Bongomin F, Denning DW. Chronic pulmonary aspergillosis: a neglected post-TB lung disease. Int J Tuberc Lung Dis. 2024;28(6):314–315. doi:10.5588/ijtld.24.0165

7. Denning DW. Global incidence and mortality of severe fungal disease. Lancet Infect Dis. 2024;24(7):e428–e438. doi:10.1016/S1473-3099(23)00692-8

8. Wichmann D, Hoenigl M, Koehler P, et al. Diagnosis and treatment of invasive pulmonary aspergillosis in critically ill intensive care patients: executive summary of the German national guideline (AWMF 113-005). Infection. 2025;53(4):1299–1310. doi:10.1007/s15010-025-02572-2

9. Ledford DK, Kim T-B, Ortega VE, Cardet JC. Asthma and respiratory comorbidities. J Allergy Clin Immunol. 2024;155(2):316–326. doi:10.1016/j.jaci.2024.11.006

10. Sengupta A, Ray A, Upadhyay AD, et al. Mortality in chronic pulmonary aspergillosis: a systematic review and individual patient data meta-analysis. Lancet Infect Dis. 2025;25(3):312–324. doi:10.1016/S1473-3099(24)00567-X

11. Rozaliyani A, Setianingrum F, Isbaniah F, et al. A silent threat in post-tuberculosis patients: chronic pulmonary aspergillosis survey in multiple regions of Indonesia (I-CHROME study). J Fungi. 2025;11(5). doi:10.3390/jof11050329

12. Jaggi TK, Agarwal R, Tiew PY, et al. Fungal lung disease. Eur Respir J. 2024;64(5):2400803. doi:10.1183/13993003.00803-2024

13. Madden A, Ofori SK, Budu M, Sisay E, Dooley B, Murray MB. A systematic review of chronic pulmonary aspergillosis among patients treated for pulmonary tuberculosis. Clin Infect Dis. 2025;81(4):e163–e171. doi:10.1093/cid/ciaf150

14. Li Y, Ren X, Wang Q, et al. A predictive model for pulmonary aspergillosis in ICU Patients: a multicenter retrospective cohort study. Infect Drug Resist. 2025;18:441–454. doi:10.2147/IDR.S493019

15. Bongomin F. Post-tuberculosis chronic pulmonary aspergillosis: an emerging public health concern. PLoS Pathog. 2020;16(8):e1008742. doi:10.1371/journal.ppat.1008742

16. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2022;23(1):40–55. doi:10.1038/s41580-021-00407-0

17. Theodosiou AA, Read RC. Artificial intelligence, machine learning and deep learning: potential resources for the infection clinician. J Infect. 2023;87(4):287–294. doi:10.1016/j.jinf.2023.07.006

18. Guo Q-H, Xie F-C, Zhong F-M, et al. Application of interpretable machine learning algorithms to predict distant metastasis in ovarian clear cell carcinoma. Cancer Med. 2024;13(7):e7161. doi:10.1002/cam4.7161

19. Gu W, Chen Y, Zhu H, et al. Development and validation of CT-based radiomics deep learning signatures to predict lymph node metastasis in non-functional pancreatic neuroendocrine tumors: a multicohort study. EClinicalMedicine. 2023;65:102269. doi:10.1016/j.eclinm.2023.102269

20. Neuböck MJ, Günther G, Barac A, et al. Chronic pulmonary aspergillosis as a considerable complication in post-tuberculosis lung disease. Semin Respir Crit Care Med. 2024;45(1):102–113. doi:10.1055/s-0043-1776913

21. Denning DW, Cadranel J, Beigelman-Aubry C, et al. Chronic pulmonary aspergillosis: rationale and clinical guidelines for diagnosis and management. Eur Respir J. 2016;47(1):45–68. doi:10.1183/13993003.00583-2015

22. Zhong H, Wang Y, Gu Y, et al. Clinical features, diagnostic test performance, and prognosis in different subtypes of chronic pulmonary aspergillosis. Front Med Lausanne. 2022;9:811807. doi:10.3389/fmed.2022.811807

23. Rayens E. Estimating mortality in chronic pulmonary aspergillosis. Lancet Infect Dis. 2025;25(3):250–251. doi:10.1016/S1473-3099(24)00659-5

24. Nguyen NTB, Le Ngoc H, Nguyen NV, et al. Chronic pulmonary aspergillosis situation among post tuberculosis patients in vietnam: an observational study. J Fungi. 2021;7(7). doi:10.3390/jof7070532

25. Tobin EH, Gilotra TS, Baradhi KM. Aspergilloma. Treasure Island (FL): StatPearls Publishing; 2024.

26. Soeroso NN, Siahaan L, Khairunnisa S, et al. The association of chronic pulmonary aspergillosis and chronic pulmonary histoplasmosis with MDR-TB patients in indonesia. J Fungi. 2024;10(8). doi:10.3390/jof10080529

27. Vithayathil M, Koku D, Campani C, et al. Machine learning based radiomic models outperform clinical biomarkers in predicting outcomes after immunotherapy for hepatocellular carcinoma. J Hepatol. 2025;83(4):959–970. doi:10.1016/j.jhep.2025.04.017

28. Karlsson L, Vogel J, Arvidsson I, et al. Machine learning prediction of tau-PET in Alzheimer’s disease using plasma, MRI, and clinical data. Alzheimers Dement. 2025;21(2):e14600. doi:10.1002/alz.14600

29. Ren W, Li H, Guo C, et al. Serum cytokine biomarkers for use in diagnosing pulmonary tuberculosis versus chronic pulmonary aspergillosis. Infect Drug Resist. 2023;16:2217–2226. doi:10.2147/IDR.S403401

30. Bai Q, Chen H, Gao Z, et al. Advanced prediction of heart failure risk in elderly diabetic and hypertensive patients using nine machine learning models and novel composite indices: insights from NHANES 2003–2016. Eur J Prev Cardiol. 2025. doi:10.1093/eurjpc/zwaf081

31. Garg M, Bhatia H, Sehgal I, et al. The conundrum of computed tomography findings in chronic pulmonary aspergillosis: insights from 103 cases. J Thorac Imaging. 2025;40(5). doi:10.1097/RTI.0000000000000828

32. Chirumamilla NK, Arora K, Kaur M, et al. Innate and adaptive immune responses in subjects with CPA secondary to post-pulmonary tuberculosis lung abnormalities. Mycoses. 2024;67(5):e13746. doi:10.1111/myc.13746

33. Zhang X, Xu Z, Yao Y, et al. Validation status of electronic sphygmomanometers in China: a national survey. Hypertension. 2025;82(3):532–541. doi:10.1161/HYPERTENSIONAHA.124.24203

34. Zou X, Gomez ZW, Reddy TE, Allen AS, Majoros WH. Bayesian estimation of allele-specific expression in the presence of phasing uncertainty. Bioinformatics. 2025;41(6). doi:10.1093/bioinformatics/btaf283

Creative Commons License © 2025 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms and incorporate the Creative Commons Attribution - Non Commercial (unported, 4.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.