Back to Journals » Clinical Epidemiology » Volume 11

Sensitivity, specificity, positive and negative predictive values of identifying atrial fibrillation using administrative data: a systematic review and meta-analysis

Authors Yao RJR, Andrade JG , Deyell MW , Jackson H, McAlister FA , Hawkins NM 

Received 21 February 2019

Accepted for publication 29 April 2019

Published 23 August 2019 Volume 2019:11 Pages 753—767

DOI https://doi.org/10.2147/CLEP.S206267

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Professor Henrik Sørensen



Ren Jie Robert Yao,1 Jason G Andrade,1 Marc W Deyell,1 Heather Jackson,2 Finlay A McAlister,3 Nathaniel M Hawkins1

1Division of Cardiology, University of British Columbia, Vancouver, Canada; 2Cardiac Services British Columbia, Vancouver, Canada; 3Division of General Internal Medicine, University of Alberta, Edmonton, Canada

Correspondence: Nathaniel M Hawkins
University of British Columbia, BC Centre for Improved Cardiovascular Health, St. Paul’s Hospital, 1081 Burrard Street, Vancouver, BC V6Z 1Y6, Canada
Tel +1 604 875 4111
Fax +1 604 875 5504
Email [email protected]

Introduction: Atrial fibrillation (AF) is the commonest arrhythmia and a major cause of stroke and health care utilization. Researchers and administrators use electronic health data to assess disease burden, quality and variance in care, value of interventions and prognosis. We performed a systematic review and meta-analysis to assess the validity of AF case definitions in administrative databases.
Methods: Medline was searched from 2000 to 2018. Extracted information included sensitivity, specificity, positive and negative predictive values (PPV and NPV) for various AF case definitions. Estimates were pooled using random-effects models due to significant heterogeneity between studies.
Results: We identified 24 studies, including 21 from North America or Scandinavia. Hospital, ambulatory and mixed data sources were assessed in 10, 4 and 10 studies, respectively. Nine different AF case definitions were evaluated, most based on ICD-9 or 10 codes. Twenty-two studies assessed case definitions in patients diagnosed with AF and thus could generate PPV alone. Half the studies sampled unrestricted populations including a mix of those with and without AF to assess sensitivity. Only 13 studies included ECG confirmation as a gold standard. The pooled random effects estimates were: sensitivity 80% (95% CI 72–86%); specificity 98% (96–99%); PPV 88% (82–94%); NPV 97% (94–99%). Only 3 studies reported all accuracy parameters and included rhythm monitoring in the gold standard definition.
Conclusion: Relatively few studies examined sensitivity, and fewer still included rhythm monitoring in the gold standard comparison. Administrative data may fail to identify a significant proportion of patients with AF. This, in turn, may bias estimates of quality of care and prognosis.

Keywords: atrial fibrillation, registries, validation studies, accuracy, sensitivity, specificity

Introduction

Atrial fibrillation (AF) increases risk of stroke, heart failure and death, and is one of the few cardiac conditions whose prevalence continues to rise.1,2 Most developed health systems collect reasons for hospital and ambulatory encounters for administration, service planning, quality improvement and reimbursement. Health services researchers use these administrative electronic databases to monitor the burden of disease, quality of care, and ascertain exposures or outcomes. The accuracy of AF identification is central to these applications. Sensitivity and specificity, though theoretically independent, typically trade-off and are inversely related.3 The “optimal” approach to identifying AF depends on the purpose. High sensitivity more completely captures a population, improves generalizability and is important when defining AF as an exposure. By contrast, high specificity ensures persons identified truly have AF and is central to adjudicating treatment uptake, which appears inappropriately low if patients with sinus rhythm are misclassified as having AF.4

Conceptually, the AF patient journey involves ambulatory and acute contacts dissociated in time and space, between which information flows by varying amounts and rates. Interrogating data sources over short time intervals or single environments may miss infrequent encounters. A previous systematic review examined the accuracy of AF detection, but was limited to ICD-9 codes only, non-contemporary electronic sources, North American cohorts and narrative synthesis without consideration of the impact of different health care settings (indeed the focus was largely on hospitalization data).5 We, therefore, undertook a systematic review to address these evidence gaps.

Methods

Participants, outcomes and study designs

Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines were followed (Table S1). We examined the accuracy of AF case definitions in electronic administrative health data, namely sensitivity (SN), specificity (SP), positive and negative predictive values (PPV and NPV). Inpatient, outpatient and mixed populations were included. All study designs were accepted. The study protocol was not published.

Search strategy and data collection

MEDLINE was searched from January 2000 to February 2018, limited to adult humans and English language, excluding case studies, reviews and conference abstracts. Search terms were determined by literature review and database query. The search strategy combined Medical Subject Headings (MeSH) terms and keywords in title and abstract to define three groups: atrial fibrillation (including atrial flutter (AFL) if not differentiated); administrative and electronic medical databases; and studies examining accuracy of AF identification within these records (Table S2). The search returned 1007 unique records. Manual bibliography searches identified an additional 31 publications (Figure S1). Titles and abstracts were screened for inclusion, and 302 full-text articles reviewed. Studies fulfilling the participant, outcomes and study design criteria were included. Variables of interest were decided a priori, expanded iteratively after pilot and collected in Microsoft Excel. The following information was extracted: bibliographic details, sample size, population characteristics, inclusion and exclusion criteria, codes and algorithms, AF confirmation gold standard and accuracy parameter outcomes.

Data synthesis

Weighted averages of sensitivity, specificity, positive and negative predictive values were calculated using the DerSimonian-Laird random effects model.6 Forest plots of estimates with 95% confidence intervals (CI) were generated. Publication bias was assessed through visual inspection of funnel plots and the Begg-Mazumdar rank correlation test for asymmetry.7,8 Heterogeneity was tested with visual forest plot inspection, Cochrane Q, I2 and Tau2 statistics.9 Estimates with significant heterogeneity (I2>90%) were examined manually and formally for moderating effects including country, publication year and reference standard, none of which were significant. The leave-one-out method was used to determine if the results were sensitive to the inclusion of extreme values from specific studies.10

Results

Study characteristics

Twenty-four studies were identified (Table 1). Most originated from countries with established administrative databases that are often interrogated by health services researchers, including 10 from the United States, 3 from Canada and 8 from Sweden or Denmark. The populations were heterogeneous, including general unselected, stroke and post-operative cohorts. Hospital, ambulatory and mixed data sources were assessed in 10, 4 and 10 studies, respectively. Only 3 studies outside Scandinavia examined mixed populations.4,11,12 One Canadian study included administrative data from emergency departments separate from hospitalizations.11

Table 1 Study characteristics

Coding and case definition algorithms

Most reports investigated International Classification of Diseases codes: ICD8 (427.93 AF, 427.94 AFL), ICD9 (4 digit code 427.3 and more explicit 427.31 AF and 427.32 AFL), ICD10 (I48). Overall, 9 different combinations of codes were studied (Table 2). The impact of coding position (primary versus secondary diagnosis in hospitalization data) was never examined. Four studies compared the accuracy of two versus one encounter coded as AF in ambulatory data sources within a single year.4,1113 This consistently increased specificity but decreased sensitivity. A single study compared 2 versus 1 year for case ascertainment in Veterans Affairs outpatient records, finding greater sensitivity with only slightly reduced specificity.13 Overall in that study, 2 diagnoses over 2 years were optimal for detecting AF.13 Only one study from Canada examined more complex algorithms including cardioversion codes and pharmacy dispensations for antiarrhythmic drugs.11

Table 2 Characteristic and validation of atrial fibrillation in studies

Characteristics of AF

Prevalent and incident AF were assessed in two-thirds and one-thirds of studies, respectively (Table 1). Incident cases were typically defined by exclusion of prior AF diagnoses since the records began, or methods were not specified. The incidence and prevalence of AF varied markedly depending on the population studied, from 0.3% to 55%.14,15 The incidence and prevalence was highest in studies following cardiac surgery (32–36%) and stroke (10–28%), lower in general hospitalizations (7–9%) and lowest in unselected outpatients (0.3–1%). No study distinguished between persistent and paroxysmal AF. Three studies reported from 7.0% to 20.4% of the coded AF to be transient.4,16,17 Two defined transient as a single episode without recurrence,4,16 while two added precipitants including cardiac surgery and/or hyperthyroidism.16,17

Gold standard for diagnosis of AF

With the exception of two studies, medical chart review was considered the gold standard by which history of AF was classified (Table 2). Of these, 13 studies specifically included ECG review, of which 2 employed ECG alone for confirmation of AF.18,19 No prospective protocols or frequencies for ECG were reported. A median of 11 ECGs per patient with AF was noted in a Swedish outpatient setting.15 Only 4 studies mentioned use of longer term rhythm monitoring such as Holter, although these results may also have been available in medical record review.

Sensitivity and specificity

Half the studies (n=12) sampled an unrestricted population including those without AF to assess sensitivity of case definitions and these ranged from 57% to 93%, median 81% (Table 3). The pooled random effects estimate for sensitivity was 80% (95% CI 72–86%) with significant heterogeneity (Q 439, I2 97.7%, Tau2 0.08). One-third of the studies (n=8) reported specificity. Estimates were consistently high in ambulatory, hospitalized and mixed populations, ranging from 91% to 99%, median 99% (Table 3). The pooled random effects estimate for specificity was 98% (95% CI 96–99%).

Table 3 Codes and accuracy for identifying atrial fibrillation

Positive and negative predictive value

Positive predictive value was reported in nearly all studies (n=22), and was the only parameter reported in half the studies (n=12). The PPV ranged from 71% to 99%, median 93% (Table 3). The pooled random effects estimate was 88% (95% CI 82–94%) with significant heterogeneity (Q 4997, I2 99.6%, Tau2 0.02) (Figure 1). The pooled estimate was similar in ambulatory, hospitalized and mixed populations (respectively: 87% (78–96%); 87% (79–95%); 90% (85–94%)). Negative predictive value was reported in 8 studies. Estimates were consistently high, ranging from 86% to 99%, median 98% (Table 3). The pooled random effects estimate was 97% (95% CI 94–99%).

Figure 1 Positive predictive values of atrial fibrillation (AF) algorithms stratified by population type.

Discussion

This analysis reports several key findings. The overall specificity and NPV of an AF diagnosis using the ICD case definitions was high, 98% and 97%, respectively. The sensitivity and PPV were lower though reasonable, 80% and 88%, respectively. Only half the studies sampled patients with and without an assigned diagnosis of AF to determine the sensitivity of the case definitions and thus the proportion potentially missed by using administrative data. Half the studies confirmed AF using electrocardiography as the gold standard, while the remainder employed medical record review, alternative databases (like primary care EMRs) and/or patient questionnaires. Only 3 studies reported all accuracy parameters and included rhythm monitoring in the gold standard definition.15,17,20

Sensitivity

High sensitivity improves case finding as it more completely captures a population, increases the estimated incidence and prevalence and enhances generalizability. This is particularly relevant when estimating the burden of disease and to reduce bias when studying health inequalities. Sensitivity is also important when defining AF as an exposure. Misclassification of exposure (eg, AF) as non-exposure (eg, no AF) attenuates the association with outcomes such as stroke.21 By contrast, sensitivity is less of a concern when defining AF as an outcome, for example in pharmacovigilance studies. In these circumstances, estimates of relative risk are not biased providing misclassification occurs to the same degree in exposed and non-exposed patients.

Sensitivity is reduced when cases are missed and AF is misclassified as normal (ie, false negatives). This occurs in two circumstances. First, when recording or coding is incorrect. Second, when correctly recorded and coded diagnoses are missed in time or space. Examining shorter time frames may miss infrequent encounters, as evidenced in the Veterans Affairs study where sensitivity increased using a 2 versus 1 year period for case ascertainment.13 Information also flows by varying amounts and rates through health systems. Although the median time from AF on ECG to diagnosis in the Swedish Patient Register was 16 days, this time lapse exceeded 6 months in one-third of patients.15

Sensitivity may be viewed from different perspectives: local, horizontal level of care (eg, primary care), vertical (eg, health maintenance organization) or global (entire health care system). Examining a single health care setting may miss encounters meeting the AF case definition in another. For example, hospitalization data alone misses patients managed entirely in the community, causing under-estimates of prevalence rates and over-estimates of adverse outcome rates. There were insufficient studies to accurately compare sensitivity between ambulatory, hospital and mixed populations. However, one of the mixed population studies did compare the accuracy of coding between primary care, secondary care or both together. In that study from Ontario, the sensitivity was 45%, 39% and 75% for hospitalization, emergency department or outpatient data sources alone, respectively, and 83% combining the three sources.11

The true population incidence and prevalence may also be influenced by access to rhythm monitoring (ECG, Holter, event or loop recorder), reporting standards (eg, training, quality assurance) and information transfer (eg, interface to electronic medical record). These factors are potentially more challenging in community than in hospital settings, particularly relevant to measuring inequalities, and difficult to quantify. None of the included studies described these aspects of access.

Positive predictive value

Since sensitivity and specificity are typically inversely related, higher sensitivity reduces specificity, which increases false positives and lowers PPV. The impact on PPV is magnified for diseases with a relatively low prevalence such as AF. A high PPV ensures persons identified truly have AF (fewer false positives). This is central to adjudicating treatment uptake, which will appear inappropriately withheld if patients with sinus rhythm are misclassified as having AF, unless OAC is prescribed for an alternate reason.4

A PPV value exceeding 85–90% suggested adequate for research purposes.19,20 The reasons for false positives were rarely explored.22 Potential scenarios include: 1) miscoding eg, allergic rhinitis was written as “AR” and coded as AF;22 2) rhythm misinterpretation such as atrial tachycardia; 3) misreporting if based on medical history alone; and 4) AF defined by an intervention shared with other conditions eg, cardioversion. PPV is also highly dependent on disease prevalence: as many studies focused on older or high-risk individuals they may overestimate the true PPV for that case definition if applied in a younger population.

Oral anticoagulation is the only treatment to improve survival in patients with AF, and thus a key quality indicator. Although the overall PPV was high (88%), the specificity and PPV to identify AF requiring anticoagulation (as opposed to any AF) could be lower for several reasons. First, up to 10% of incident AF is isolated with a defined precipitant, low recurrence, and may not require anticoagulation.4,16,23 Only three studies reported or excluded such patients.4,16,17 Second, anticoagulation adjudication requires accurate coding of embolic and bleeding risk factors, which like AF exhibit high specificity but are under-reported.12,22 More subjective bleeding risks such as frailty and falls are particularly difficult to quantify, although a recently described frailty score based on administrative data (the Hospital Frailty Risk Score) has been described.24 Finally, patient preferences are major drivers of anticoagulation decisions but are never captured in administrative databases.

Atrial fibrillation phenotype and coding considerations

The disease spectrum (permanent, persistent, paroxysmal, isolated unprovoked or provoked episodes) was rarely reported yet also impacts accuracy of AF detection. Permanent or persistent AF is associated with greater comorbidity and hence health care encounters during which arrhythmia is continuously present. By contrast, isolated or paroxysmal AF may be under-represented by health care encounters. Treatment including rate versus rhythm control and anticoagulation also varies based on symptoms, AF duration, risk of recurrence and thromboembolism.25 The accuracy of administrative data to identify AF requiring anticoagulation is thus further lessened by limited phenotypic characterization.4 The AF phenotype may also impact the “gold standard” for diagnosing AF, whereby paroxysmal AF is missed by ECG alone but detected by chart review. In the only study examining this issue, ECG review did not improve sensitivity of AF detection over diagnosis codes alone.4

Most developed health systems collect reasons for hospital and ambulatory encounters for administration, service planning, quality improvement and reimbursement. A single primary or most responsible diagnosis is typically assigned, while conditions complicating or prolonging stay are coded in multiple secondary positions, sometimes further categorized as pre-existing or de novo disease. Differences in coding accuracy, treatment and prognosis are reported between primary and secondary positions for conditions such as heart failure.26,27 To our knowledge, such differences have not been explored in patients with AF, and no study identified by our search compared coding positions.

Strengths and limitations

Several strengths and limitations merit consideration. Our analysis is contemporary, included varied health systems, ICD-8 to ICD-10 codes, and both ambulatory and hospital populations. However, most studies originated from North America or Scandinavia, and examined ICD codes in administrative data sources. This potentially limits generalizability to other health care systems. There was significant heterogeneity in terms of population, prevalence of AF and reported accuracy parameters. Most studies assessed accuracy in restricted cohorts as opposed to the broader population.

Directions for future research

Health service researchers and administrators may interpret administrative data using either our pooled estimates or locally relevant studies from among those identified. Jurisdictions would ideally conduct nationally representative validation studies to provide estimates specific to their populations and data sources. These should examine existing codes and test new case definition algorithms in all data sources with differences in coding practices and diagnostic accuracy (eg, hospitalization, emergency department, ambulatory primary and secondary care), and in scenarios with varying disease prevalence. Though challenging and costly, random sampling of representative populations is essential to define sensitivity, enhance generalizability and reduce bias when studying inequality. To understand the true disease burden, algorithms should combine primary and secondary care data sources.

More complex algorithms utilizing advanced analytics such as natural language processing and machine learning to mine free-text medical records merit investigation. Potential avenues include integrating corroboratory data such as medications and procedures, and temporal and spatial coding patterns. Future work should investigate the optimal gold standard including rhythm monitoring, electronic data sources and chart review. The reasons for false positives and negatives need to be explored in detail, as does the impact of AF phenotype and coding position. Finally, the accuracy of embolic and bleeding risk factor case definitions requires further validation in order to adjudicate appropriateness of anticoagulation management choices.

Conclusion

The overall accuracy of AF identification was reasonable for system planning and surveillance of prevalence, quality and outcomes. However, there is a marked disconnect between the volume of publications in these domains, and those examining the underpinning data. Sensitivity and PPV were the least accurate parameters with greatest uncertainty in terms of evidence and interpretation. This potentially underestimates the burden of disease and may bias estimates of outcomes and treatment quality. The optimal AF case definition should consider the purpose of the study and the data sources available. Health service administrators, researchers and clinicians should be mindful of these factors, and work together to refine our use of electronic data.

Abbreviations

AF, atrial fibrillation; AFL, atrial flutter; ICD, International Classification of Diseases; NPV, negative predictive value; PPV, positive predictive value; Sn, sensitivity; Sp, specificity.

Author contributions

All authors contributed toward data analysis, drafting and revising the paper, gave final approval of the version to be published and agree to be accountable for all aspects of the work.

Disclosure

Drs Deyell and Andrade are recipients of Career Scholar awards from the Michael Smith Foundation for Health Research. Dr Hawkins is supported by a Vancouver Coastal Health Clinician Scientist Award, and is the UBC Dr Charles Kerr Distinguished Scholar in Heart Rhythm Management. Dr McAlister is supported by the Alberta Health Services Chair in Cardiovascular Outcomes Research. The authors report no other conflicts of interest in this work.

References

1. Andrade JG, Verma A, Mitchell LB, et al. 2018 focused update of the Canadian Cardiovascular Society guidelines for the management of atrial fibrillation. Can J Cardiol. 2018;34:1371–1392. doi:10.1016/j.cjca.2018.08.026

2. Andrade J, Khairy P, Dobrev D, Nattel S. The clinical profile and pathophysiology of atrial fibrillation: relationships among clinical features, epidemiology, and mechanisms. Circ Res. 2014;114:1453–1468. doi:10.1161/CIRCRESAHA.114.303211

3. Chubak J, Pocobelli G, Weiss NS. Tradeoffs between accuracy measures for electronic health care data algorithms. J Clin Epidemiol. 2012;65:343–349 e342. doi:10.1016/j.jclinepi.2011.09.002

4. Navar-Boggan AM, Rymer JA, Piccini JP, et al. Accuracy and validation of an automated electronic algorithm to identify patients with atrial fibrillation at risk for stroke. Am Heart J. 2015;169:39–44 e32. doi:10.1016/j.ahj.2014.09.014

5. Jensen PN, Johnson K, Floyd J, et al. A systematic review of validated methods for identifying atrial fibrillation using administrative data. Pharmacoepidemiol Drug Saf. 2012;21 Suppl 1:141–147. doi:10.1002/pds.2317

6. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials. 1986;7:177–188.

7. Egger M, Davey Smith G, Schneider M, Minder C. Bias in meta-analysis detected by a simple, graphical test. BMJ. 1997;315:629–634. doi:10.1136/bmj.315.7109.629

8. Begg CB, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics. 1994;50:1088–1101.

9. Higgins JP, Thompson SG, Deeks JJ, Altman DG. Measuring inconsistency in meta-analyses. BMJ. 2003;327:557–560. doi:10.1136/bmj.327.7414.557

10. Viechtbauer W. Conducting meta-analyses in R with the metafor package. J Stat Softw. 2010;36:1–48. doi:10.18637/jss.v036.i03

11. Tu K, Nieuwlaat R, Cheng SY, et al. Identifying patients with atrial fibrillation in administrative data. Can J Cardiol. 2016;32:1561–1565. doi:10.1016/j.cjca.2016.06.006

12. Sung SF, Hsieh CY, Lin HJ, et al. Validation of algorithms to identify stroke risk factors in patients with acute ischemic stroke, transient ischemic attack, or intracerebral hemorrhage in an administrative claims database. Int J Cardiol. 2016;215:277–282. doi:10.1016/j.ijcard.2016.04.069

13. Borzecki AM, Wong AT, Hickey EC, Ash AS, Berlowitz DR. Identifying hypertension-related comorbidities from administrative data: what’s the optimal approach? Am J Med Qual. 2004;19:201–206. doi:10.1177/106286060401900504

14. Ruigomez A, Johansson S, Wallander MA, Rodriguez LA. Incidence of chronic atrial fibrillation in general practice and its treatment pattern. J Clin Epidemiol. 2002;55:358–363.

15. Baturova MA, Lindgren A, Carlson J, Shubik YV, Bertil Olsson S, Platonov PG. Atrial fibrillation in patients with ischaemic stroke in the Swedish national patient registers: how much do we miss? Europace. 2014;16:1714–1719. doi:10.1093/europace/euu165

16. Go AS, Hylek EM, Phillips KA, et al. Prevalence of diagnosed atrial fibrillation in adults: national implications for rhythm management and stroke prevention: the AnTicoagulation and Risk Factors in Atrial Fibrillation (ATRIA) study. JAMA. 2001;285:2370–2375. doi:10.1001/jama.285.18.2370

17. Alonso A, Agarwal SK, Soliman EZ, et al. Incidence of atrial fibrillation in whites and African-Americans: the Atherosclerosis Risk in Communities (ARIC) study. Am Heart J. 2009;158:111–117. doi:10.1016/j.ahj.2009.05.010

18. Go AS, Hylek EM, Phillips KA, et al. Implications of stroke risk criteria on the anticoagulation decision in nonvalvular atrial fibrillation: the Anticoagulation and Risk Factors in Atrial Fibrillation (ATRIA) study. Circulation. 2000;102:11–13. doi:10.1161/01.cir.102.1.11

19. Thigpen JL, Dillon C, Forster KB, et al. Validity of international classification of disease codes to identify ischemic stroke and intracranial hemorrhage among individuals with associated diagnosis of atrial fibrillation. Circ Cardiovasc Qual Outcomes. 2015;8:8–14. doi:10.1161/CIRCOUTCOMES.113.000371

20. Kokotailo RA, Hill MD. Coding of stroke and stroke risk factors using international classification of diseases, revisions 9 and 10. Stroke. 2005;36:1776–1781. doi:10.1161/01.STR.0000174293.17959.a1

21. White E. The effect of misclassification of disease status in follow-up studies: implications for selecting disease classification criteria. Am J Epidemiol. 1986;124:816–825. doi:10.1093/oxfordjournals.aje.a114458

22. Chotchaisuwatana S, Jedsadayanmata A, Chaiyakunapruk N, Jampachaisri K. Validation of electronic medical database in patients with atrial fibrillation in community hospitals. J Med Assoc Thai. 2011;94:686–692.

23. Glazer NL, Dublin S, Smith NL, et al. Newly detected atrial fibrillation and compliance with antithrombotic guidelines. Arch Intern Med. 2007;167:246–252. doi:10.1001/archinte.167.3.246

24. Gilbert T, Neuburger J, Kraindler J, et al. Development and validation of a Hospital Frailty Risk Score focusing on older people in acute care settings using electronic hospital records: an observational study. Lancet. 2018;391:1775–1782. doi:10.1016/S0140-6736(18)30668-8

25. Macle L, Cairns J, Leblanc K, et al. 2016 focused update of the Canadian Cardiovascular Society guidelines for the management of atrial fibrillation. Can J Cardiol. 2016;32:1170–1185. doi:10.1016/j.cjca.2016.07.591

26. Kucharska-Newton AM, Heiss G, Ni H, et al. Identification of heart failure events in medicare claims: the Atherosclerosis Risk in Communities (ARIC) study. J Card Fail. 2016;22:48–55. doi:10.1016/j.cardfail.2015.07.013

27. Shoaib A, Farag M, Nasir M, et al. Is the diagnostic coding position of acute heart failure related to mortality? A report from the Euro Heart Failure Survey-1. Eur J Heart Fail. 2016;18:556–563. doi:10.1002/ejhf.505

28. Frost L, Andersen LV, Vestergaard P, Husted S, Mortensen LS. Trend in mortality after stroke with atrial fibrillation. Am J Med. 2007;120:47–53. doi:10.1016/j.amjmed.2005.12.027

29. Norberg J, Backstrom S, Jansson JH, Johansson L. Estimating the prevalence of atrial fibrillation in a general population using validated electronic health data. Clin Epidemiol. 2013;5:475–481. doi:10.2147/CLEP.S53420

30. Sundboll J, Adelborg K, Munch T, et al. Positive predictive value of cardiovascular diagnoses in the Danish National Patient Registry: a validation study. BMJ Open. 2016;6:e012832. doi:10.1136/bmjopen-2016-012832

31. Rix TA, Riahi S, Overvad K, et al. Validity of the diagnoses atrial fibrillation and atrial flutter in a Danish patient registry. Scand Cardiovasc J. 2012;46:149–153. doi:10.3109/14017431.2012.673728

32. Frost L, Vestergaard P. Caffeine and risk of atrial fibrillation or flutter: the Danish diet, cancer, and health study. Am J Clin Nutr. 2005;81:578–582. doi:10.1093/ajcn/81.3.578

33. Smith JG, Platonov PG, Hedblad B, Engstrom G, Melander O. Atrial fibrillation in the Malmo Diet and Cancer study: a study of occurrence, risk factors and diagnostic validity. Eur J Epidemiol. 2010;25:95–102. doi:10.1007/s10654-009-9404-1

34. Brophy MT, Snyder KE, Gaehde S, et al. Anticoagulant use for atrial fibrillation in the elderly. J Am Geriatr Soc. 2004;52:1151–1156. doi:10.1111/j.1532-5415.2004.52314.x

35. Ruigomez A, Johansson S, Wallander MA, Garcia Rodriguez LA. Predictors and prognosis of paroxysmal atrial fibrillation in general practice in the UK. BMC Cardiovasc Disord. 2005;5:20. doi:10.1186/1471-2261-5-17

36. Quan H, Parsons GA, Ghali WA. Assessing accuracy of diagnosis-type indicators for flagging complications in administrative data. J Clin Epidemiol. 2004;57:366–372. doi:10.1016/j.jclinepi.2003.01.002

37. Shen AY, Yao JF, Brar SS, Jorgensen MB, Wang X, Chen W. Racial/Ethnic differences in ischemic stroke rates and the efficacy of warfarin among patients with atrial fibrillation. Stroke. 2008;39:2736–2743. doi:10.1161/STROKEAHA.107.508580

38. Shireman TI, Howard PA, Kresowik TF, Ellerbeck EF. Combined anticoagulant-antiplatelet use and major bleeding events in elderly atrial fibrillation patients. Stroke. 2004;35:2362–2367. doi:10.1161/01.STR.0000141933.75462.c2

39. Munkholm SB, Jakobsen CJ, Mortensen PE, Lundbye-Christensen S, Andreasen JJ. Validation of post-operative atrial fibrillation in the Western Denmark Heart Registry. Dan Med J. 2015;62:A5162.

40. Walkey AJ, Wiener RS, Ghobrial JM, Curtis LH, Benjamin EJ. Incident stroke and mortality associated with new-onset atrial fibrillation in patients hospitalized with severe sepsis. JAMA. 2011;306:2248–2254. doi:10.1001/jama.2011.1615

41. Hravnak M, Hoffman LA, Saul MI, et al. Atrial fibrillation: prevalence after minimally invasive direct and standard coronary artery bypass. Ann Thorac Surg. 2001;71:1491–1495. doi:10.1016/s0003-4975(01)02477-8

Supplementary materials

Table S1 PRISMA checklist

Table S2 Search strategy

Figure S1 Flow diagram of study selection.

Creative Commons License © 2019 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.