Validation of a clinical risk-scoring algorithm for severe scrub typhus
Authors Sriwongpan P, Patumanond J, Krittigamas P, Tantipong H, Tawichasri C, Namwongprom S
Received 1 November 2013
Accepted for publication 23 December 2013
Published 18 February 2014 Volume 2014:7 Pages 29—34
Checked for plagiarism Yes
Review by Single anonymous peer review
Peer reviewer comments 3
Pamornsri Sriwongpan,1,2 Jayanton Patumanond,3 Pornsuda Krittigamas,4 Hutsaya Tantipong,5 Chamaiporn Tawichasri,6 Sirianong Namwongprom1,7
1Clinical Epidemiology Program, Faculty of Medicine, Chiang Mai University, Chiang Mai, 2Department of Social Medicine, Chiangrai Prachanukroh Hospital, Chiang Rai, 3Clinical Epidemiology Program, Faculty of Medicine, Thammasat University, Bangkok, 4Department of General Pediatrics, Nakornping Hospital, Chiang Mai, 5Department of Medicine, Chonburi Hospital, Chonburi, 6Clinical Epidemiology Society at Chiang Mai, Chiang Mai, 7Department of Radiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
Objective: The aim of the study reported here was to validate the risk-scoring algorithm for prognostication of scrub typhus severity.
Methods: The risk-scoring algorithm for prognostication of scrub typhus severity developed earlier from two general hospitals in Thailand was validated using an independent dataset of scrub typhus patients in one of the hospitals from a few years later. The predictive performances of the two datasets were compared by analysis of the area under the receiver-operating characteristic curve (AuROC). Classification of patients into non-severe, severe, and fatal cases was also compared.
Results: The proportions of non-severe, severe, and fatal patients by operational definition were similar between the development and validation datasets. Patient, clinical, and laboratory profiles were also similar. Scores were similar in both datasets, both in terms of discriminating non-severe from severe and fatal patients (AuROC =88.74% versus 91.48%, P=0.324), and in discriminating fatal from severe and non-severe patients (AuROC =88.66% versus 91.22%, P=0.407). Over- and under-estimations were similar and were clinically acceptable.
Conclusion: The previously developed risk-scoring algorithm for prognostication of scrub typhus severity performed similarly with the validation data and the first dataset. The scoring algorithm may help in the prognostication of patients according to their severity in routine clinical practice. Clinicians may use this scoring system to help make decisions about more intensive investigations and appropriate treatments.
Keywords: severity, clinical prediction rule, algorithm, prognosis, Thailand
“Scrub typhus,” one of several potentially fatal tropical rickettsial infections, is caused by Orientia tsutsugamushi and presents as a systemic, vasculitis-like infection. Clinical manifestations are typically fever, eschar, generalized or regional lymphadenopathy, maculopapular rash, severe headache, and myalgia. Nausea and vomiting, diarrhea, and conjunctival injection can also occur.1,2 Without appropriate treatment, infected patients usually experience complications in the second week. Those especially at risk of infection are the elderly, those without previous immunity,2–4 and those with risk factors such as hypoalbuminemia, hyperbilirubinemia, high levels of aspartate aminotransferase (AST) and/or creatinine, and those with lung crepitations.3,5–10 Patients with complications who are admitted to intensive care units with a high Acute Physiology and Chronic Health Evaluation II (APACHE II) score are at high risk of mortality.5 In rural areas where effective treatment is unavailable or delayed, mortality rates have been reported as high as 35%.2
The cost of treatment for scrub typhus patients with complications is much higher due to hospital admissions and more intensive treatments.11 Prognostication of disease severity may help with the identification of disease severity and prompt initiation of treatments.
Prognostication of disease severity has been considered by researchers of various conditions. Examples include a clinical prediction rule for severe community-acquired pneumonia,12 Mortality in Emergency Department Sepsis score,13 a prediction rule for risk of mortality in Clostridium difficile infection,14 and decision-tree algorithms for dengue fever15 and dengue hemorrhagic fever.16 Some of these tools were found to have both advantages and disadvantages when validated.
Previously, we developed a clinical risk-scoring algorithm for scrub typhus severity comprising six predictors from routine practice.17 The objective of the study reported here was to validate the risk-scoring algorithm developed earlier with an independent dataset obtained from similar patients from a few years later.
Materials and methods
Data were retrieved retrospectively from the medical files of scrub typhus patients. The criteria for inclusion were similar to those used in the previous study17 and were based on history of disease exposure and presentation with acute fever and at least one of the following signs and symptoms: myalgia, headache, conjunctival injection, cough, profuse sweating, maculopapular rash, and/or lymphadenopathy accompanied by the presence of eschar, and/or a positive immuno-chromatographic test for scrub typhus.18
Patients were classified into three groups:
- non-severe – patients without any complications
- severe – patients with complications involving at least one of the following organ systems:
- cardiovascular system – presence of any of any of the following: systolic blood pressure <90 mmHg, myocarditis (defined as elevated creatine kinase-myocardial band isoenzyme above baseline), or abnormal cardiac arrhythmia with no previous history of atrial fibrillation, supra ventricular tachycardia, or frequent premature ventricular tachycardia
- respiratory system – presence of acute respiratory distress syndrome, defined as the ratio of partial pressure arterial oxygen and fraction of inspired oxygen (PaO2/FiO2) <200 mmHg, with bilateral interstitial infiltration on chest film with normal cardio/thoracic ratio, or no volume overload of central venous pressure from central venous catheter
- central nervous system – presence of any of the following: Glasgow Coma Scale ≤12 without other causes, seizure without other causes, or meningoencephalitis
- hematological system – platelet count ≤20,000/mm3
- urinary system – presence of acute renal failure, defined as creatinine ≥2mg/dL or creatinine change of >0.5 mg/dL/day
- gastrointestinal and hepatobiliary system – presence of hepatitis, defined as elevated AST or alanine aminotransferase (ALT) more than fivefold normal levels.
Patients in any intervention trials during the same period were excluded from the data analysis.
The development data were for patients admitted to two university-affiliated referral hospitals – Nakornping Hospital, Chiang Mai and Chiangrai Prachanukroh Hospital – in the north of Thailand from 2004 to 2010 (n=526).
The validation data were for patients admitted to Chiangrai Prachanukroh Hospital from 2011 to 2012 (n=257).
The characteristics of the development data and the validation data were compared using an exact probability test, Student’s t-test, or Wilcoxon rank-sum test as appropriate. The patient risk scores were assigned following the scoring scheme previously developed (Table 1).17 Patients were categorized into three severity groups: 1) non-severe, 2) severe, and 3) fatal, as described. The predictive performance of the scores was compared by analysis of the area under the receiver-operating characteristic curve (AuROC). Over- and underestimated proportions were calculated and compared.
Table 1 Significant predictors and assigned item scores
The study was approved by the Ethics Committee for Research in Patients, Faculty of Medicine, Chiang Mai University, and the research ethical committees of the two hospitals.
Data for 257 patients comprised the validation dataset. Of those, 160 cases were defined as non-severe, 62 as severe, and 35 as fatal. The distribution of patients into the three severity groups of the development and the validation datasets was similar. The clinical characteristics of the patients were also similar except in terms of 1) sex, 2) symptoms or signs of conjunctival injection, maculopapular rash, lymphadenopathy, eschar, abdominal pain, stiff neck, and crepitation, 3) vital signs of systolic blood pressure, and diastolic blood pressure (Table 2). The laboratory investigation data also showed similar profiles for both datasets, except in terms of platelet count and AST, alanine aminotransferase, alkaline phosphatase, albumin, and potassium levels (Table 3).
Table 2 Demographic and clinical characteristics of scrub typhus patients in the development and validation datasets
The risk score was assigned to patients with six complete clinical parameters (n=167, 65%). The mean risk scores of patients in the validation data were different to those of patients in the development dataset (mean ± sd 7.1±3.9 and 6.0±3.9, respectively). The proportion of patients classified into the three severity levels was also different (P=0.004; Table 4).
Table 4 Score-classified scrub typhus severity levels in the development and validation datasets
In the validation dataset, the score correctly predicted 31.7% (53 out of 78) of patients into the non-severe group (score ≤5) with one-level underestimation of 6.6% (eleven patients) and no two-level underestimation, while 14.4% (24 out of 55) of patients were correctly classified into the severe group (score 6–9), with an underestimation of 5.4% (nine patients) and an overestimation of 13.8% (23 patients). Finally, for the fatal severity group (score ≥10), the score was predicted correctly for 14.9% (25 of 34) of patients, with one-level overestimation of 12.0% (20 patients) and two-level overestimation of 1.2% (2 patients), giving a total overestimation of 13.2% (Table 5).
Table 5 Score-classified severity levels, criterion-classified severity levels, and risk estimation validity
The overall predictive performance (as assessed by the AuROCs) of the validation dataset was insignificantly lower than that of the development dataset, both in the discrimination between non-severe versus (vs) severe and fatal (AuROC =88.74% vs 91.48%, P=0.324) and between fatal versus non-severe and severe (AuROC =88.66% vs 91.22%, P=0.407; Table 6 and Figure 1).
Table 6 Discriminative performance of scrub typhus severity score in the development and validation datasets
Validation is needed for any clinical prediction rules before they can be adopted in clinical practice, either internally or externally. The Mortality in Emergency Department Sepsis score, for example, was developed to evaluate the risk of death in patients in emergency departments, in which validation in one-third of the sample yielded similar discrimination (AuROC =0.82 vs 0.78).13 Further, a clinical prediction rule for severe community-acquired pneumonia was developed to screen patients for initial treatments in emergency departments. The rule performed similarly to the development data when prospectively validated in patients from the same hospitals (AuROC =0.92). However, the performance was poorer when externally validated in patients from other hospitals (AuROC =0.80).12 External validation of a prediction rule for risk of mortality in Clostridium difficile infection also yielded poorer performance (AuROC =0.704 vs 0.653).14 It is very common for the performance of the validation data to be poorer than that of the development data.
Our clinical risk-scoring algorithm for severe scrub typhus was validated against patients admitted in only one hospital some years after the patients used in the development dataset. When only patients with complete information on the six predictors (n=167) were selected from the validation dataset, the performance of the algorithm was somewhat poorer than with the development data, both in terms of differentiating non-severe from severe and fatal (AuROC =88.74% vs 91.48%), and fatal from non-severe and severe (AuROC =88.66% vs 91.22%). However, these differences were not statistically or clinically significant. The overall proportion of correct classifications was somewhat lower (61.0% vs 68.3%), but overestimation was higher (27.0% vs 25.8%), which may be considered more beneficial to the patients.17
There were incomplete data on important predictors for 90 of 257 patients (35%). The missing data were AST (n=42, 24.1%), albumin (n=68, 26.5%), and creatinine (n=48, 18.7%). These patients were in the non-severe group, which did not require organ-specific investigations. We did the sensitivity analysis by replacing the missing data by the mean values calculated from existing nonmissing data for each of the corresponding variables. When this was done, the discrimination between non-severe and severe and fatal was reduced (AuROC =84.41%) but was still considered excellent,19 while the discrimination between fatal and non-severe and severe was unchanged. However, sensitivity analysis by multiple imputation of the missing data yielded almost identical performances (data not shown).
These sensitivity analyses imply that the derived risk-scoring algorithm might be successfully and safely applied to classify future patients, especially those with severe and fatal risk of scrub typhus. The score should help physicians in charge determine the need for further investigations and start the necessary treatments for those at these risk levels.
The main advantage of the risk-scoring algorithm is that it requires routinely available patient information, comprising demographic, physical examination, and laboratory test data.17 In endemic areas with restricted health resources,20–22 the score may help inform clinical decision making with regard to both investigations and treatments.
In the present study, data on the validation patients were collected retrospectively and were based on provisional diagnosis made as part of routine clinical practice, in which scrub typhus cases may reflect “suspected” rather than “confirmed” cases, by definition. Therefore, prospective validation to patients with a theoretically defined diagnosis of scrub typhus18 would be helpful to confirm the algorithm’s feasibility for future use in routine practice.
A risk-scoring algorithm developed previously to classify scrub typhus patients into the three severity levels was validated using a different set of patients. The algorithm demonstrated good performance in this subsequent set of patients. The algorithm was clinically acceptable and could be applied in routine clinical practice to help identify patients with severe prognosis in order to judge the need for more intensive investigation and treatments.
The authors wish to thank the authorities of Chiangrai Prachanukroh Hospital and Nakornping Hospital, Chiang Mai for their permission to collect data and wish to thank Chiang Mai University for its financial support.
The study was partially granted by the Faculty of Graduate, Chiang Mai University. The authors declare no other conflicts of interest in this work.
Lee N, Ip M, Wong B, et al. Risk factors associated with life-threatening rickettsial infections. Am J Trop Med Hyg. 2008;78(6):973–978.
Day NP, Newton P, Parola P, Raoult D. Scrub typhus and other tropical rickettsioses. In: Cohen J, Powderly WG, Opal SM, editors. Infectious Diseases. 3rd ed. United States. Elsevier; 2010;2:1231–1237.
Kim DM, Kim SW, Choi SH, Yun NR. Clinical and laboratory findings associated with severe scrub typhus. BMC Infect Dis. 2010;10:108.
Wu KM, Wu ZW, Peng GQ, Wu JL, Lee SY. Radiologic pulmonary findings, clinical manifestations and serious complications in scrub typhus: experiences from a teaching hospital in eastern Taiwan. Int J Gerontol. 2009;3(4):223–232.
Lee CS, Hwang JH, Lee HB, Kwon KS. Risk factors leading to fatal outcome in scrub typhus patients. Am J Trop Med Hyg. 2009;81(3):484–488.
Sriwongpan P, Krittigamas P, Kantipong P, Kunyanone N, Patumanond J, Namwongprom S. Clinical indicators for severe prognosis of scrub typhus. Risk Manag Healthc Policy. 2013;6:43–49.
Varghese GM, Abraham OC, Mathai D, et al. Scrub typhus among hospitalised patients with febrile illness in South India: magnitude and clinical predictors. J Infect. 2006;52(1):56–60.
Chanta C, Triratanapa K, Ratanasirichup P, Mahaprom W. Hepatic dysfunction in pediatric scrub typhus: role of liver function test in diagnosis and marker of disease severity. J Med Assoc Thai. 2007;90(11):2366–2369.
Thap LC, Supanaranond W, Treeprasertsuk S, Kitvatanachai S, Chinprasatsak S, Phonrat B. Septic shock secondary to scrub typhus: characteristics and complications. Southeast Asian J Trop Med Public Health. 2002;33(4):780–786.
Hu ML, Liu JW, Wu KL, et al. Short report: Abnormal liver function in scrub typhus. Am J Trop Med Hyg. 2005;73(4):667–668.
Lee CS, Min IS, Hwang JH, Kwon KS, Lee HB. Clinical significance of hypoalbuminemia in outcome of patients with scrub typhus. BMC Infect Dis. 2010;10(1):216.
España PP, Capelastegui A, Gorordo I, et al. Development and validation of a clinical prediction rule for severe community-acquired pneumonia. Am J Respir Crit Care Med. 2006;174(11):1249–1256.
Shapiro NI, Wolfe RE, Moore RB, Smith E, Burdick E, Bates DW. Mortality in Emergency Department Sepsis (MEDS) score: a prospectively derived and validated clinical prediction rule. Crit Care Med. 2003;31(3):670–675.
Butt E, Foster JA, Keedwell E, et al. Derivation and validation of a simple, accurate and robust prediction rule for risk of mortality in patients with Clostridium difficile infection. BMC Infect Dis. 2013; 13(1):316.
Tanner L, Schreiber M, Low JG, et al. Decision tree algorithms predict the diagnosis and outcome of dengue fever in the early phase of illness. PLoS Negl Trop Dis. 2008;2(3):e196.
Lee VJ, Lye DC, Sun Y, Leo YS. Decision tree algorithm in deciding hospitalization for adult patients with dengue haemorrhagic fever in Singapore. Trop Med Int Health. 2009;14(9):1154–1159.
Sriwongpan P, Krittigamas P, Tantipong H, Patumanond J, Tawichasri C, Namwongprom S. Clinical risk scoring algorithm to forecast scrub typhus severity. Risk Manag Healthc Policy. 2014;7:11–17.
World Health Organization (WHO). WHO Recommended Surveillance Standards. 2nd ed. WHO/CDS/CSR/ISR/99.2. Geneva: WHO; nd. Available from: http://www.who.int/csr/resources/publications/surveillance/whocdscsrisr992.pdf. Accessed October 5, 2011.
Hosmer DW Jr, Lemeshow S, Sturdivant RX. Applied Logistic Regression. 3rd ed. New York, NY: Willey; 2013.
Abrahamsen SK, Haugen CN, Rupali P, et al. Fever in the tropics: aetiology and case-fatality – a prospective observational study in a tertiary care hospital in South India. BMC Infect Dis. 2013;30(13):355.
Lee JH, Chung KM, Kim ES, et al. Dynamics of clinical symptoms in patients with scrub typhus. Jpn J Infect Dis. 2013;66(2):155–157.
Vikrant S, Dheer SK, Parashar A, et al. Scrub typhus associated acute kidney injury – a study from a tertiary care hospital from western Himalayan state of India. Ren Fail. 2013;35(10):1338–1343.
This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.Download Article [PDF]