Modeling risk assessment for breast cancer in symptomatic women: a Saudi Arabian study
Received 5 October 2018
Accepted for publication 1 January 2019
Published 4 February 2019 Volume 2019:11 Pages 1125—1132
Checked for plagiarism Yes
Review by Single-blind
Peer reviewer comments 3
Editor who approved publication: Dr Kenan Onel
Anwar E Ahmed,1,2 Donna K McClish,3 Thamer Alghamdi,4 Abdulmajeed Alshehri,4 Yasser Aljahdali,4 Khalid Aburayah,4 Abdulrahman Almaymoni,4 Monirah Albaijan,1 Hamdan Al-Jahdali,1,4–6 Abdul Rahman Jazieh4–6
1King Abdullah International Medical Research Center (KAIMRC), Riyadh, Saudi Arabia; 2College of Public Health and Health Informatics, King Saud Bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia; 3Department of Biostatistics, School of Medicine, Virginia Commonwealth University, Richmond, VA, USA; 4College of Medicine, King Saud Bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia; 5King Abdulaziz Medical City, Riyadh, Saudi Arabia; 6Ministry of the National Guard - Health Affairs, Riyadh, Saudi Arabia
Background: Despite the continuing increase in the breast cancer incidence rate among Saudi Arabian women, no breast cancer risk-prediction model is available in this population. The aim of this research was to develop a risk-assessment tool to distinguish between high risk and low risk of breast cancer in a sample of Saudi women who were screened for breast cancer.
Methods: A retrospective chart review was conducted on symptomatic women who underwent breast mass biopsies between September 8, 2015 and November 8, 2017 at King Abdulaziz Medical City, Riyadh, Saudi Arabia.
Results: A total of 404 (63.8%) malignant breast biopsies and 229 (36.2%) benign breast biopsies were analyzed. Women ≥40 years old (aOR: 6.202, CI 3.497–11.001, P=0.001), hormone-replacement therapy (aOR 24.365, 95% CI 8.606–68.987, P=0.001), postmenopausal (aOR 3.058, 95% CI 1.861–5.024, P=0.001), and with a family history of breast cancer (aOR 2.307, 95% CI 1.142–4.658, P=0.020) were independently associated with an increased risk of breast cancer. This model showed an acceptable fit and had area under the receiver-operating characteristic curve of 0.877 (95% CI 0.851–0.903), with optimism-corrected area under the curve of 0.865.
Conclusion: The prediction model developed in this study has a high ability in predicting increased breast cancer risk in our facility. Combining information on age, use of hormone therapy, postmenopausal status, and family history of breast cancer improved the degree of discriminatory accuracy of breast cancer prediction. Our risk model may assist in initiating population-screening programs and prompt clinical decision making to manage cases and prevent unfavorable outcomes.
Keywords: breast cancer management, risk assessment, modeling, patient stratification, predictive tool
The incidence of breast cancer is increasing in Saudi Arabia and is currently the most common cancer among women,1 accounting for 28.7% of all cancers reported among females in the country.2 According to the Saudi Cancer Registry, the age-standardized incidence in 2001 was 11.8 per 100,000 females,3 which increased to 22.7 in 2014.2 A Saudi Arabian study in 2008 reported that the incidence of breast cancer was expected to increase by 350% in 2025.4 The apparent increase in breast cancer incidence is possibly attributable to genetic, environmental, and dietary factors.1
Breast cancer remains the major cause of death among women in Saudi Arabia.1 This poor prognosis could be due to diagnosis at an advanced stage.5 Early detection of breast cancer is known to improve patients’ survival and clinical outcomes.6 Also, early detection can be aided by applying risk-assessment tools to stratify patients with greater risk of breast cancer.7 This may aid the screening process to improve individual clinical decision making. Numerous breast cancer risk-prediction models have been developed and validated in various female populations to aid diagnosis decision making and improve accuracy.8–10
Gene expressions for breast cancer in Saudi Arabia are different from those in the North American population.11 As such, the generalizability of the current existing models may be suitable for specific populations where they have been validated, but may not be suitable to project the risk of breast cancer in women in Saudi Arabia. To our knowledge, there is a great need to develop and validate a risk-prediction model in Saudi Arabia to stratify patients at high risk effectively and achieve better understanding of existing risk-prediction models.
The aim of this study was to develop a risk-assessment tool to distinguish between high risk and low risk of breast cancer in a sample of women screened for breast cancer at King Abdulaziz Medical City, Riyadh (KAMC-RD), Ministry of National Guard – Health Affairs (MNG-HA), Saudi Arabia. The model assessed the contribution of demographic, clinical, and reproductive factors and breast-imaging findings in predicting breast cancer. We hypothesized that a number of factors would be associated with increased risk of breast cancer in women who had undergone a biopsy of breast mass at KAMC-RD, MNG-HA.
A single center retrospective chart review was conducted on 637 symptomatic women who underwent a biopsy of breast mass between September 8, 2015, and November 8, 2017, at KAMC-RD, MNG-HA. The study received ethical approval from the institutional review board at the MNG-HA (SP17/027/R). Due to the study design, patient consent to review their medical records was not required as per the institutional review board.
Data were retrieved on women’s age at biopsy of breast mass, body-mass index (kg/m2), diabetes, hypertension, asthma, lung disease, hyperlipidemia, use of oral contraceptives, use of hormone-replacement therapy (HRT) to treat signs and symptoms, change in size or skin of breast, family history of breast cancer, and history of genital disease. These data were recorded as yes/no. Breast cancer in Saudi Arabia predominantly impacts women who are over 40 years of age.12 Women’s age at biopsy was classified into two groups: <40 years and 40 years or more. This cutoff value was defined by the Youden index, as it produced sensitivity of 93% and specificity of 57.2%. In accordance with the World Health Organization, a body-mass index ≥30 kg/m2 was used to define obesity.13
Data were extracted on postmenopausal status (yes/no) from medical charts and age ≥50 years was used as a cutoff to define postmenopausal status in cases of unavailable data on postmenopausal status. This cutoff was used because two previous reports estimated mean age at menopause to be around 50 years among women in Saudi Arabia.14,15 Data were retrieved on imaging tests used to screen for breast cancer, including breast ultrasound, mammography, and magnetic resonance imaging (MRI). A positive screening for breast cancer on mammography, ultrasound, or MRI was classified as abnormal diagnostic imaging. A total of 637 symptomatic women underwent breast biopsy during the study period. Data were excluded for four women, because they had no available breast-biopsy findings on their records.
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. Due to the study design, patient consent to review their medical records was not required as per the institutional review board at the Ministry of National Guard – Health Affairs.
Data were analyzed using STATA version 12 (StataCorp, College Station, TX, USA). An overall summary of sample characteristics and across-group analyses is presented in Table 1.
Table 1 Women’s characteristics
P-values of χ2/independent-sample t-tests and unadjusted OR were reported to test whether specific sample characteristics were associated with breast cancer. Accuracy was evaluated for each predictor alone in identifying breast cancer by area under the receiver-operating characteristic (ROC) curve (AUC) analysis and 95% CI (Table 2).
Table 2 Bivariate analysis: factors associated with increased breast cancer risk
Abbreviation: HRT, hormone-replacement therapy.
Development of a risk-prediction model to classify breast cancer risk was undertaken using the stepwise logistic regression model (Table 3). The model evaluated 14 potential factors that could be associated with increased risk of breast cancer, and were evaluated at α≤0.05. The Hosmer–Lemeshow test was used to assess the goodness of fit of the final model, with P>0.05, suggesting that the final model fit the data well.
Table 3 ROC analysis: accuracy of individual factors
Abbreviations: HRT, hormone-replacement therapy; ROC, receiver-operating characteristic; SE, standard error.
The overall performance of the final model was evaluated by AUC and compared with the individual predictors in the final model (Figure 1). The AUC of the final model was internally corrected using a bootstrap resampling approach by generating 100 random samples drawn with replacements from the original sample (n=633). This was performed to validate the AUC result on 100 replicates. Average optimism and SD were calculated. A risk-probability calculator was developed to predict the risk of developing breast cancer, which is a function of the important selected factors.
Figure 1 ROC curves of individual factors and breast cancer risk prediction model.
A total of 633 women who underwent breast biopsy were analyzed, which included 404 (63.8%) who had had a malignant breast biopsy and 229 (36.2%) who had had a benign breast biopsy. The mean age was 49.3±15.6 years, ranging 16–97 years. The majority (94.5%) of our sample had at least one abnormal imaging examination. Table 1 presents the sample characteristics.
The unadjusted risk (Table 2) of being diagnosed with breast cancer tended to increase with age ≥40 years, obesity, diabetes, hypertension, asthma, lung disease, hyperlipidemia, postmenopause, use of HRT, changes in size or skin of breast, family history of breast cancer, and history of genital disease (P≤0.05). ROC-curve analyses were used to quantify discriminatory accuracy of individual predictors for classifying the risk of breast cancer (Table 3). AUC ranged between 0.509 (abnormal diagnostic imaging) to 0.751 (age ≥40 years). Age ≥40 years, postmenopause, and use of menopausal hormone therapy showed higher discriminatory accuracy (AUC >0.7) compared with other individual predictors (Table 3). A prediction model that combines various patients’ information to increase the discriminatory accuracy for classifying the risk of breast cancer was used.
From a total of 14 predictive factors, four significant independent risk factors (Table 4) were identified to predict high risk of breast cancer. These were age ≥40 years, HRT, postmenopause, and family history of breast cancer. We noted that women >40 years had higher odds of developing breast cancer (aOR 6.202, CI 3.497–11.001; P=0.001). Women who used menopausal hormone therapy had 24.365 times the odds of breast cancer than women who did not use menopausal hormone therapy (aOR 24.365, 95% CI 8.606–68.987; P=0.001). Women who had a family history of breast cancer were twice as likely to develop breast cancer as those who had no had family history of breast cancer (aOR 2.307, 95% CI 1.142–4.658, P=0.020). Postmenopausal women were associated with higher odds of developing breast cancer (aOR 3.058, 95% CI 1.861–5.024, P=0.001). According to the goodness-of-fit test, the model showed an acceptable fit (Hosmer–Lemeshow, P=0.053). The model showed substantial discriminatory accuracy for classifying the risk of breast cancer (AUC 0.877, 95% CI 0.851–0.903). In 100 bootstrap resamples, the overall accuracy of these samples was also high, with an AUC of 0.870 with mean optimism of 0.002 and SD 0.012. The corrected estimate of discriminatory accuracy was 0.877–0.012=0.865. From these results, a breast cancer risk calculator was developed (Supplementary material).
Table 4 Multivariate analysis: factors associated with increased breast cancer risk
Abbreviation: HRT, hormone-replacement therapy; SE, standard error.
A probability of 0.72 was defined as the optimal operating point to discriminate between high risk and low risk of breast cancer, with sensitivity, specificity, and Youden index of 80.7%, 78.5%, and 0.592, respectively (Table S1 and S2).
This study proposes a breast cancer prediction model based on a retrospective cohort of 633 symptomatic women who underwent breast biopsy at KAMC-RD, MNG-HA, Saudi Arabia. The predictive accuracy of breast cancer risk classification into high risk and low risk has not been previously reported in the Saudi population. The developed model integrates four risk factors: age ≥40 years, postmenopausal status, HRT, and family history of breast cancer. The discriminatory accuracy of each factor alone was: 0.751, 0.744, 0.707, and 0.541 (AUC), respectively. However, combining information on age, HRT, postmenopausal status, and family history of breast cancer increased the degree of discriminatory accuracy of breast cancer prediction (0.877) with optimism-corrected AUC of 0.865.
Age ≥40 years was associated with increased risk of breast cancer. This is comparable to several previous existing risk-prediction models, where older women are much more likely than younger women to be diagnosed with breast cancer.7,16–20 Our findings are also in agreement with annual data reported by the Saudi Health Council, where the incidence of breast cancer was higher in women aged >40 years.2,3 Targeting and monitoring women aged >40 years may improve early detection,7,21 and improve the survival rate from breast cancer.22
HRT use was the strongest predictor and associated with the highest risk of breast cancer. This association was in agreement with a number of earlier studies.23–26 Barlow et al reported that the model could be improved by adding the use of HRT.23 The use and duration of hormone therapy must be justified and monitored, as it can be used as part of preventive health strategies or as a modifiable factor to reduce the risk of developing breast cancer.
In agreement with several earlier studies,25–29 postmenopausal women have a higher risk of breast cancer. The results also show agreement with previous models,23,27,29 wherein family history of breast cancer was significantly associated with breast cancer. Breast cancer-screening strategies should target postmenopausal women and women with a family history of breast cancer to reduce risk. These study findings might become a prerequisite for a large prospective study, and then for the development of preventive health programs to reduce the high risk of breast cancer in these groups.
Strength and limitations
We identified several limitations in the present study. Association does not necessarily indicate causation. Due to the nature of the study design, some potential risk factors for breast cancer were not available to be included in the model (eg, breast density, age at menarche, age at first pregnancy, number of children). Although it was a large retrospective cohort, validation was performed internally, rather than in a separate prospective cohort. The results of this prediction model may be suitable only for symptomatic women who were treated at KAMC-RD, MNG-HA and underwent breast-mass biopsy.
Despite these limitations, it appears promising that combining information on demographic, clinical, and reproductive factors, ie, age, hormone use, postmenopausal status, and family history of breast cancer, produced substantial improvement in the discriminative ability of the model compared to individual factors. The model confirmed findings in other published risk-prediction models and may contribute to clinical decision making on the diagnostic pathway and prompt a screening program.
The prediction model developed in this study had a high ability to predict increased breast cancer risk in our facility. Risk assessment was improved by combining information on age, menopausal hormone therapy, being postmenopausal, and family history of breast cancer. This model can be used in risk assessment for identifying high-risk groups and improving the early-diagnosis process. Our risk model may assist in initiating population screening programs and prompt clinical decision-making to manage cases and prevent unfavorable outcomes. External validation is required to assess the effectiveness of the model in a prospective cohort and then develop primary preventive strategies for breast cancer in Saudi Arabia.
MNG-HA, Ministry of National Guard-Health Affairs; KAMC-RD, King Abdulaziz Medical City, Riyadh; CI, confidence interval; ROC, receiver-operating characteristic; AUC, area under the ROC curve; MRI, magnetic resonance imaging; BMI, body-mass index.
The authors would like to thank King Abdullah International Medical Research Center, Riyadh, Saudi Arabia, for approving this study.
All authors contributed toward data analysis, drafting and revising the paper, gave final approval of the version to be published and agree to be accountable for all aspects of the work.
The authors report no conflicts of interest in this work.
Al-Rikabi A, Husain S. Increasing prevalence of breast cancer among Saudi patients attending a tertiary referral hospital: a retrospective epidemiologic study. Croat Med J. 2012;53(3):239–243.
Cancer Incidence Report Saudi Arabia 2014. Saudi Cancer Registry. Available from: https://nhic.gov.sa/eServices/Documents/2014.pdf. Accessed 18 February, 2018.
National Cancer Registry (Saudi Arabia). Saudi Arabia Cancer Incidence Report 2001. Riyadh, Saudi Arabia: National Cancer Registry (Saudi Arabia), 2005. Available from: http://ghdx.healthdata.org/record/saudi-arabia-cancer-incidence-report-2001. Accessed 18 February, 2018.
Ibrahim EM, Zeeneldin AA, Sadiq BB, Ezzat AA. The present and the future of breast cancer burden in the Kingdom of Saudi Arabia. Med Oncol. 2008;25(4):387–393.
Al Tamimi DM, Shawarby MA, Ahmed A, Hassan AK, Alodaini AA. Protein expression profile and prevalence pattern of the molecular classes of breast cancer - a Saudi population based study. BMC Cancer. 2010;10(1):223.
Balabram D, Turra CM, Gobbi H. Survival of patients with operable breast cancer (Stages I-III) at a Brazilian public hospital - a closer look into cause-specific mortality. BMC Cancer. 2013;13(1):1.
Gail MH, Brinton LA, Byar DP, et al. Projecting individualized probabilities of developing breast cancer for WHITE females who are being examined annually. J Natl Cancer Inst. 1989;81(24):1879–1886.
Lee SM, Park JH, Park HJ. Implications of systematic review for breast cancer prediction. Cancer Nurs. 2008;31(5):E40–E46.
Anothaisintawee T, Teerawattananon Y, Wiratkapun C, Kasamesup V, Thakkinstian A. Risk prediction models of breast cancer: a systematic review of model performances. Breast Cancer Res Treat. 2012;133(1):1–10.
Meads C, Ahmed I, Riley RD. A systematic review of breast cancer incidence risk prediction models with meta-analysis of their performance. Breast Cancer Res Treat. 2012;132(2):365–377.
Amemiya Y, Bacopulos S, Al-Shawarby M, et al. A comparative analysis of breast and ovarian cancer-related gene mutations in Canadian and Saudi Arabian patients with breast cancer. Anticancer Res. 2015;35(5):2601–2610.
Ezzat AA, Ibrahim EM, Raja MA, Al-Sobhi S, Rostom A, Stuart RK. Locally advanced breast cancer in Saudi Arabia: high frequency of stage III in a young population. Med Oncol. 1999;16(2):95–103.
World Health Organization. Obesity: preventing and managing a global epidemic. WHO Technical Report Series No. 894. Geneva: World Health Organization; 2000.
Greer W, Sandridge AL, Chehabeddine RS. The frequency distribution of age at natural menopause among Saudi Arabian women. Maturitas. 2003;46(4):263–272.
Aldughaither A, Almutairy H, Alateeq M. Menopausal symptoms and quality of life among Saudi women visiting primary care clinics in Riyadh, Saudi Arabia. Int J Womens Health. 2015;7:645.
Costantino JP, Gail MH, Pee D, et al. Validation studies for models projecting the risk of invasive and total breast cancer incidence. J Natl Cancer Inst. 1999;91(18):1541–1548.
Anderson SJ, Ahnn S, Duff K. NSABP Breast Cancer Prevention Trial risk assessment program, version 2. NSABP Biostatistical Center Technical Report. August 14, 1992;12.
Dite GS, Mahmoodi M, Bickerstaffe A, et al. Using SNP genotypes to improve the discrimination of a simple breast cancer risk prediction model. Breast Cancer Res Treat. 2013;139(3):887–896.
Chay WY, Ong WS, Tan PH, et al. Validation of the Gail model for predicting individual breast cancer risk in a prospective nationwide study of 28,104 Singapore women. Breast Cancer Res. 2012;14(1): R19.
Breyer JZ, Wendland EM, Kops NL, Caleffi M, Hammes LS. Assessment of potential risk factors for breast cancer in a population in southern Brazil. Breast Cancer Res Treat. 2018;169(1):125–131.
Pitman JA, Mcginty GB, Soman RR, Drotman MB, Reichman MB, Arleo EK. Screening mammography for women in their 40S: the potential impact of the American cancer Society and U.S. Preventive Services Task Force breast cancer screening recommendations. AJR Am J Roentgenol. 2017;209(3):697–702.
Fontein DB, de Glas NA, Duijm M, et al. Age and the effect of physical activity on breast cancer survival: a systematic review. Cancer Treat Rev. 2013;39(8):958–965.
Barlow WE, White E, Ballard-Barbash R, et al. Prospective breast cancer risk prediction model for women undergoing screening mammography. J Natl Cancer Inst. 2006;98(17):1204–1214.
Green LE, Dinh TA, Smith RA. An estrogen model: the relationship between body mass index, menopausal status, estrogen replacement therapy, and breast cancer risk. Comput Math Methods Med. 2012;2012:1–8.
Wang F, Dai J, Li M, et al. Risk assessment model for invasive breast cancer in Hong Kong women. Medicine. 2016;95(32):e4515.
Leon Guerrero RT, Novotny R, Wilkens LR, et al. Risk factors for breast cancer in the breast cancer risk model study of Guam and Saipan. Cancer Epidemiol. 2017;50(Pt B):221–233.
Eriksson M, Czene K, Pawitan Y, Leifland K, Darabi H, Hall P. A clinical model for identifying the short-term risk of breast cancer. Breast Cancer Res. 2017;19(1):29.
Crooke PS, Justenhoven C, Brauch H, et al. Estrogen metabolism and exposure in a genotypic-phenotypic model for breast cancer risk prediction. Cancer Epidemiol Biomarkers Prev. 2011;20(7):1502–1515.
Park B, Ma SH, Shin A, et al. Korean risk assessment model for breast cancer risk prediction. PLoS One. 2013;8(10):e76736.
Table S1 Breast cancer risk calculator
Note: *Randomly selected symptomatic women.
Table S2 Optimal operating point to discriminate between high risk and low risk of breast cancer
This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.Download Article [PDF]