Back to Journals » Clinical Epidemiology » Volume 14

Presence of Codes for Indication for Use in Clinical Practice Research Datalink Aurum: An Assessment of Benign Prostatic Hyperplasia Treatments

Authors Persson R , Hagberg KW , Vasilakis-Scaramozza C , Yelland E, Williams T , Myles P , Jick SS 

Received 2 February 2022

Accepted for publication 25 April 2022

Published 3 May 2022 Volume 2022:14 Pages 641—652

DOI https://doi.org/10.2147/CLEP.S360843

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Eyal Cohen



Rebecca Persson,1 Katrina Wilcox Hagberg,1 Catherine Vasilakis-Scaramozza,1 Eleanor Yelland,2 Tim Williams,2 Puja Myles,2 Susan S Jick1,3

1Boston Collaborative Drug Surveillance Program, Lexington, MA, USA; 2Clinical Practice Research Datalink, Medicines and Healthcare products Regulatory Agency, London, UK; 3Boston University School of Public Health, Boston, MA, USA

Correspondence: Rebecca Persson, Boston Collaborative Drug Surveillance Program, 11 Muzzey Street, Lexington, MA, 02421, USA, Email [email protected]

Background: Assessments of strengths and limitations of new data sources are critical for making decisions about suitability for specific research questions. For some studies, it is necessary to capture a drug’s indication for use.
Objective: To assess the presence of indications for prescription use in Clinical Practice Research Datalink (CPRD) Aurum (January 1988–June 2021) by describing the proportion of men in CPRD Aurum who had a recorded indication for use of prescriptions for 5-alpha reductase inhibitors (5-ARI), alpha blockers (AB), or tadalafil, which have multiple indications.
Methods: From a random sample of 154 practices of CPRD Aurum data, we selected 85,597 male patients with a prescription for a 5-ARI, an AB, or tadalafil. Among these patients, we described presence of codes indicating whether the patient had benign prostatic hyperplasia, hypertension, erectile dysfunction, or alopecia using three indication definitions: narrow (specific diagnoses recorded within one year before and up to 90 days after the prescription), broad (specific diagnoses or supporting clinical codes in the time period described above), and widest (diagnoses or supporting codes recorded at any time before the prescription and up to 90 days after the prescription).
Results: Using the narrow indication definition limited to diagnoses only, 39,861 (46.6%) patients’ records contained an indication for use. The broad definitions, which additionally included supporting codes, captured indications for 62,912 (73.5%) patients and the widest definition, which additionally included supporting codes and all available data before the first prescription date, captured indications for 71,478 (83.5%) patients. Indications were present more often for prescriptions in 2005 and later (85.9%).
Conclusion: The findings of this assessment suggest that CPRD Aurum can be used for studies that require information on treatment indications for BPH and potentially for treatments of other chronic diseases managed in the primary care setting.

Keywords: Clinical Practice Research Datalink, CPRD Aurum, data element presence, pharmacoepidemiology, indication for use, prostatic hyperplasia

Introduction

The Clinical Practice Research Datalink (CPRD) Aurum, a database of general practice electronic health records from the United Kingdom (UK), has been available to researchers since 2018.1 Prior studies of CPRD Aurum have described the quality and completeness of recording of diagnoses (pulmonary embolism, myocardial infarction) compared with an external reference and of the internal consistency of recorded diagnoses with other clinical codes within CPRD Aurum (type 2 diabetes, hypercholesterolemia, and anemia). In each of these assessments, the correctness of diagnoses recorded in CPRD Aurum was high while completeness varied by condition.2–4

For some research questions, particularly in pharmacoepidemiology, it is necessary to capture a drug’s indication for use to properly define a study population or exposure cohort. For example, a study of patients treated for benign prostatic hyperplasia (BPH) may include patients with prescriptions for 5-alpha reductase inhibitors (5-ARIs), alpha blockers (ABs), and tadalafil (phosphodiesterase-5 (PDE-5) inhibitor). In addition to BPH, many of these drugs are also indicated for other conditions: hypertension (ABs), erectile dysfunction (ED) (tadalafil), and alopecia (1 mg finasteride, a 5-ARI). While it may be preferable to address confounding by indication by limiting such a hypothetical study to patients with a BPH diagnosis as the indication for use, it is not known if this is feasible in CPRD Aurum. The objective of this study was to assess how often information on indication for use is captured in CPRD Aurum, an assessment of “element presence”,5 by describing the proportion of men in CPRD Aurum who had a recorded indication for use of their prescriptions for a 5-ARI, AB, or tadalafil.

Methods

Data Resources

CPRD Aurum is provided by CPRD, a research service jointly supported by the Medicines and Healthcare products Regulatory Agency and the National Institute for Health Research (NIHR), as part of the UK Department of Health and Social Care. CPRD Aurum is a large, prospectively collected, population-based, anonymized medical record database which, at the time of data extraction for this study (June 2021), encompassed data on 1491 National Health Service (NHS) practices and 40 million patients.6 Medical records include demographic information, prescription details, clinical events, referrals, hospital admissions, laboratory results, and other patient details, such as smoking, alcohol consumption, and height and weight as captured by general practitioners (GPs). CPRD Aurum includes electronic patient data starting in 1988 and contains data captured using the EMIS® patient management software as well as data that has been migrated into EMIS from other platforms (eg, Vision®, SystmOne®).7 GPs in the UK function as the gatekeepers for all NHS care. Thus, providers of  hospital and secondary care are required to send information on patient encounters to the GP. Thus, primary care records are expected to contain diagnoses made by specialists and consultants,8 however some diagnoses may be captured in free-text fields which are unavailable to researchers. Furthermore, except for some speciality treatments, drugs are typically prescribed by GPs regardless of whether the GP or a consultant initiated treatment. In CPRD Aurum, diagnoses and other non-prescription data are coded using a combination of SNOMED CT (UK edition), Read Version 2, and local EMIS Web® software-specific codes that have been cross-mapped by CPRD to a single diagnostic code (“MedCode”). Prescriptions are coded using the Dictionary of Medicines and Devices (dm+d) codes which are a subset of the SNOMED CT terminology and are assigned a “ProdCode” by CPRD.9–11

Study Population

From CPRD Aurum release June 2021, we randomly selected a subset of data containing 154 practices. (These 154 practices were similar to all practices in the full database with respect to mean age, proportion of male patients, mean length of record and geographic region. There was a smaller proportion [3%] of practices in the sample of practices that had migrated from Vision [ie, CPRD GOLD] than in CPRD Aurum overall [14%].) From this subset, we selected male patients with a first prescription for any treatment for BPH (“study drugs”): 5-ARIs (dutasteride and finasteride), ABs (alfuzosin, doxazosin, indoramin, prazosin, terazosin, tamsulosin, tamsulosin + solifenacin), 5-ARI+AB combination treatment (dutasteride + tamsulosin) or tadalafil recorded on 1 January 1988 or later.

We assigned the index date to the date of the first study drug in the patient record. We also estimated a start date and end date of each patient’s active CPRD Aurum electronic record using the available registration, prescription, and clinical data (Supplement A). From this population, we excluded patients who were less than age 20 or older than age 100 on the index date, whose index date was after 31 December 2020 (end of study period) or after the patient’s estimated record end date, or whose records contained less than one year of data between the start of their electronic medical record and the index date (potential prevalent prescription). We also excluded patients with more than one class of study drug on the index date (with the exception of 5-ARI+AB combinations), those with more than one study drug in the same class on the index date and those with a study drug recorded on an invalid date (ie, dates that are missing).

Assignment of Indications for Use

We assigned indications for use by drug type using three progressively broad definitions according to recorded diagnoses and condition-specific symptoms, referrals and other clinical codes recorded around the time of the index date (Table 1). The “narrow” definition included condition-specific diagnoses recorded within 365 days before the index date (first study drug prescription) through 90 days after the index date. This time period was used to capture information recorded by the GP early on in the patient’s clinical course as well as after the GP had received notice of a diagnosis from a specialist. The “broad” definition included condition-specific diagnoses or presence of symptoms and other clinical codes in the same time period defined above, or specialist visits within 30 days before or after the index date. The “widest” definition included all condition-specific diagnoses, symptoms and other clinical codes recorded at any time before the index date through 90 days after, or any specialist visits within 30 days before or after the index date. Note that although the structure of CPRD Aurum allows for prescriptions to be linked to diagnoses and symptoms through “problem” records,1 these linkages were not present for any drug prescriptions assessed in this study. All indications for use were limited to those approved for each drug type. BPH as an indication for tadalafil use was limited to prescriptions recorded in 2012 or later (when the indication for tadalafil was expanded by the European Medicines Agency to include BPH). See Table 1. Code lists are available in Supplement B.

Table 1 Narrow, Broad and Widest Definitions for Assigning Indications for Use by Study Drug Class

Assessment of the Presence of Indication for Use

We described the proportion of patients with a recorded indication for use among all patients and by study drug class (5-ARIs+/- ABs, AB monotherapy, or tadalafil) using progressively broad definitions of indication: narrow, broad and widest for each condition. We then described coding patterns by age and calendar time. We repeated each assessment within two sensitivity analysis populations: 1) among patients with no history of prostate cancer, prostatectomy or other procedures of the prostate, urethra, or bladder neck, as some study drugs are used off-label in these patients and 2) among patients with their first study drug prescription in 2005 or later after the implementation of Quality and Outcomes Framework (QOF) incentives.12

Lastly, we conducted a subgroup analysis among patients with 5-ARI prescriptions who were likely to have BPH as the indication for use. We selected patients with no alopecia codes who had a prescription for dutasteride or finasteride (5mg dose) monotherapy recorded in 2005 or later (after the implementation of data quality QOFs). Among this subpopulation of likely BPH patients, we compared whether patient characteristics and BPH disease severity varied by the definition used to capture the BPH indication. In this analysis, we categorized patients according to mutually exclusive categories using the most stringent definition met by their record: narrow, broad not including narrow, and widest exclusive of narrow or broad, or none.

Results

The final study population included 85,597 patients with a first prescription for a study drug: 9095 (10.6%) received 5-ARIs (with or without ABs), 60,457 (70.6%) received AB monotherapy, and 16,045 (18.7%) received tadalafil (Table 2, Supplement C). The median record length before the index date was approximately 10 years for all study drugs. There were differences in patient characteristics by study drug class. Men with prescriptions for 5-ARIs and ABs were older than those with tadalafil. 5-ARI use was similarly distributed across calendar time after 2000, AB use increased over time, and tadalafil use was highest in the years 2005 through 2014. “Unknown” BMI, smoking and race/ethnicity data were highest in 5-ARI±AB users. History of prostate cancer and diabetes was highest among users of tadalafil, while cardiovascular disease was highest among users of 5-ARIs (Table 2).

Table 2 Patient and CPRD Aurum Record Characteristics at Index Date by Study Drug Class on Index Date

Among users of all study drugs, 39,861 (46.6%) had a recorded indication for use using the narrow definitions (condition-specific diagnoses recorded within one year before and up to 90 days after the index date). The proportion of patients with a recorded indication for use increased with broader definitions. The broad definition (diagnoses, symptoms and other clinical codes within one year before and up to 90 days after the index date) captured indications for use for 62,912 (73.5%) patients and the widest definition (diagnoses, symptoms and other clinical codes at any time before or up to 90 days after the index date) captured indications for use for 71,478 (83.5%) patients (Figure 1). Using the narrow definition, the proportion of patients with an indication for use was highest among tadalafil users. However, using the widest definition, the proportion was highest among AB only users (Table 3). Overall, the results were similar when patients with a history of prostate cancer, prostatectomy or other procedures of the prostate, urethra or bladder neck were excluded. (Supplement D) When the analysis was limited to patients with an index date in 2005 and later, 53,833 of 62,665 (85.9%) users of any study drug had a recorded indication for use using the widest definition (Supplement E).

Table 3 Recorded Indications for Use with Increasingly Broad Definitions, by Study Drug

Figure 1 Numbers of patients with recorded indications captured by various definitions (N = 85,597).

The proportion of patients with a recorded indication for use was similar across most study drug categories stratified by age but was somewhat lower among 5-ARI±AB users younger than age 40. Capture of indication for use was higher in 2005 and later than in earlier years (Table 4). Results were similar in both sensitivity analyses (Supplements F and G).

Table 4 Presence of Any Indication (Widest Definitions) by Study Drug Class, Age and Calendar Year

In the subgroup of patients with 5-ARI prescriptions and likely BPH (ie, a dutasteride or 5 mg finasteride prescription and no alopecia-related codes), patients with no recorded indication for use (ie, “none” in Table 5) were more likely to be less than 60 years old than patients with a recorded indication for use. These patients were also slightly more likely to be missing BMI, smoking, and race/ethnicity information. Patients without a coded indication were also less likely to have BPH-related complications coded in their CPRD Aurum records before or after the index date than patients with a recorded indication for use (Table 5).

Table 5 Patient and CPRD Aurum Record Characteristics for Users of Dutasteride or 5mg Finasteride Monotherapy with and without Recorded BPH Indication with First Study Drug Prescription in 2005 or Later

In this same subgroup analysis, patients who met the narrow BPH definition were somewhat younger and were less likely to have a history of prostate cancer, transurethral resection of the prostate (TURP) or other procedures of the urethra, prostate or bladder neck before the index date than patients who met either the broad (not including narrow definition) or widest (exclusive of broad or narrow) definitions. However, patients who met the narrow criteria were more likely to have had a prostate-specific antigen (PSA) test within the year before the index date. There were fewer differences in the period after the index date between patients who met various BPH definitions. The proportions of patients with urinary tract and kidney infections, prostate cancer and prostatectomy after the index date were similar across groups of patients whose records met the narrow, broad (not including narrow definition) and widest (exclusive of broad or narrow) definitions for BPH. TURP and other procedures after the index date were somewhat less likely in patients who met the widest BPH criteria (Table 5).

Discussion

Capturing the indication for use of prescribed treatments is important for many pharmacoepidemiological studies. In this assessment of CPRD Aurum data, the indications for use were present in the coded electronic record in 83.5% of male patients with prescriptions for BPH treatments. Capture was lower (46.6%) if the indication definition was limited to diagnoses only (excluding symptoms and other clinical codes) or was restricted to the time period around the first prescription. Coded indications for use were more likely to be present for prescriptions recorded in 2005 and later (ie, after the implementation of data quality QOFs).12 These findings support the use of CPRD Aurum for studies that require information on treatment indications for BPH as well as hypertension, erectile dysfunction, alopecia and potentially other chronic diseases managed in the primary care setting.

We chose to assess BPH treatments for this evaluation as BPH is a chronic condition generally managed by a GP and because the drugs prescribed for BPH have multiple indications for use. In their role as the gatekeeper of clinical care in the NHS, it is expected that a patient’s GP would record diagnoses and prescribe continuing treatments regardless of where the initial diagnosis of BPH, hypertension, ED or alopecia was made. As CPRD Aurum is a relatively new data source, it was not known whether GPs reliably record these details in a way that is accessible to researchers. In this assessment, an indication for use was captured for most (>80%) patients with prescriptions for the study drugs of interest. Capture was lower (<50%) when only those diagnoses recorded close to the index date were considered. This is consistent with the structure of primary care electronic health records which, unlike in claims data, may, depending on individual GP recording patterns as well as incentives such as QOFs, only code the presence of a chronic condition at first diagnosis. Researchers using CPRD data would be advised to use all available data before an index or cohort entry date to improve capture of chronic conditions. Issues related to differences in record length may be addressed through matching or other study design choices.

Among patients with a dutasteride or 5mg finasteride prescription and no alopecia (likely BPH), patients without a coded indication for use were more likely to be younger than age 60 and to be missing race/ethnicity information. They were also less likely to have coded diagnoses for BPH-related complications. We cannot know from this assessment whether these patients had less severe disease than patients with a coded BPH indication or whether their GPs were less likely to code any diagnoses in a way that is available to researchers. Among patients whose records did include a coded indication for use, patients whose record met the narrow BPH definition (a diagnosis code close to the index date) were somewhat less likely to have prostate cancer, TURP or related procedures before index date than patients whose records met the broad definition not including narrow, or the widest exclusive of narrow or broad definitions. However, after the index date the proportion of patients with BPH-related complications was similar regardless of which BPH definition they met. Thus, it is likely that severity of disease is not directly associated with the BPH definition used to capture the patient.

It is possible that we misclassified the indication for use for some prescriptions. We reviewed all codes present in the year before and after the index date in a sample of patient records so it is unlikely that we missed important diagnosis or supporting codes, but it is possible that we misinterpreted some codes for symptoms, or referrals, that were associated with other conditions. It is also possible that for some patients, symptoms recorded well before prescription date may have resolved independently and were unrelated to the study prescription. Furthermore, we did not fully assess how these definitions performed among patients with records consistent with more than one indication for use or among patients with concomitant use of more than one class of study drugs. However, the goal of this study was to describe the presence of coded indications for use of BPH treatments, not to assess whether the coded diagnoses were “correct” compared to other internal or external data. Formal validation studies of each condition are still necessary. Prior studies have found correctness of diagnoses recorded in CPRD Aurum to be high for a range of conditions, though completeness/missingness of diagnosis recording in CPRD Aurum varied from ~50% to >90% depending on the condition, QOF reporting requirements, and the method of assessment.2–4

The findings of this study demonstrate that indications for use of BPH treatments are present in CPRD Aurum in a large proportion of patients. It is likely that GP recording practices are similar for treatments of other chronic conditions managed in the primary care setting in the UK. Thus, these results suggest that CPRD Aurum may be useful for research in other clinical areas where capture of indication information is important. Researchers are advised to use symptoms and other supporting clinical codes in addition to diagnosis codes to identify the indication for use including all available data before the first prescription date but recognizing the potential for misclassification of treatment indication, especially where a drug has multiple indications for use. Researchers should also consider methods, such as matching by GP, to reduce biases related to differential recording of drug indication between GPs. Depending on the study objectives, validation of treatment indication through GP questionnaires may be necessary to achieve sufficiently accurate information.13

Ethical Review and Copyright Statement

This study is based in part on data from the Clinical Practice Research Datalink obtained under license from the UK Medicines and Healthcare products Regulatory Agency. The data is provided by patients and collected by the NHS as part of their care and support. The interpretation and conclusions contained in this study are those of the authors alone. This study was approved by the Independent Scientific Advisory Committee (ISAC) for Medicines and Healthcare products Regulatory Agency (protocol no:18_191A), and the protocol was made available to the journal reviewers upon request. Researchers can apply for a limited licence to access CPRD data for public health research, subject to individual research protocols meeting CPRD data governance requirements. More details including data specification, licence fees and applications process are available on the CPRD website (https://www.cprd.com).

Funding

This study was unfunded.

Disclosure

CPRD is jointly sponsored by the UK government’s Medicines and Healthcare products Regulatory Agency and the National Institute for Health Research (NIHR). As a not-for-profit UK government body, CPRD seeks to recoup the cost of delivering its research services to academic, industry and government researchers through research user license fees. PM, EY and TW are employees of CPRD, the data custodians for CPRD Aurum. No funding was received by the Boston Collaborative Drug Surveillance Program (BCDSP) for the conduct of this study. The BCDSP receives industry funding to conduct research using CPRD data. The authors report no other conflicts of interest in this work.

References

1. Wolf A, Dedman D, Campbell J, et al. Data resource profile: Clinical Practice Research Datalink (CPRD) Aurum. Int J Epidemiol. 2019;48:1740-1740g. doi:10.1093/ije/dyz034

2. Jick S, Hagberg KW, Persson R, et al. Quality and completeness of diagnoses recorded in the new CPRD Aurum Database: evaluation of pulmonary embolism. Pharmacoepidemiol Drug Saf. 2020;29(9):1134–1140. doi:10.1002/pds.4996

3. Persson R, Vasilakis-Scaramozza C, Hagberg KW, et al. CPRD Aurum database: assessment of data quality and completeness of three important comorbidities. Pharmacoepidemiol Drug Saf. 2020;29:1456–1464. doi:10.1002/pds.5135

4. Persson R, Sponholtz T, Vasilakis-Scaramozza C, et al. Quality and completeness of myocardial infarction recording in Clinical Practice Research Datalink Aurum. Clin Epidemiol. 2021;13:745–775. doi:10.2147/CLEP.S319245

5. Weiskopf NG, Weng C. Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Informatics Assoc. 2013;20(1):144–151. doi:10.1136/amiajnl-2011-000681

6. CPRD. CPRD Aurum 2021 dataset. medRxiv. 2021. doi:10.48329/pyc2-we97

7. Kontopantelis E, Stevens RJ, Helms PJ, Edwards D, Doran T, Ashcroft DM. Spatial distribution of clinical computer systems in primary care in England in 2016 and implications for primary care electronic medical record databases: a cross-sectional population study. BMJ Open. 2018;8:e020738. doi:10.1136/bmjopen-2017-020738

8. Herrett E, Gallagher AM, Bhaskaran K, et al. Data resource profile: Clinical Practice Research Datalink (CPRD). Int J Epidemiol. 2015;44:827–836. doi:10.1093/ije/dyv098

9. NHS Digital. SNOMED CT. Available from: https://digital.nhs.uk/services/terminology-and-classifications/snomed-ct. Accessed April 28, 2022.

10. NHS Digital. Read codes. Available from: https://digital.nhs.uk/services/terminology-and-classifications/read-codes. Accessed April 28, 2022.

11. NHS Business Services Authority. Dictionary of medicines and devices (dm+d). Available from: https://www.nhsbsa.nhs.uk/pharmacies-gp-practices-and-appliance-contractors/dictionary-medicines-and-devices-dmd. Accessed April 28, 2022.

12. Taggar JS, Coleman T, Lewis S, Szatkowski L. The impact of the Quality and Outcomes Framework (QOF) on the recording of smoking targets in primary care medical records: cross-sectional analyses from The Health Improvement Network (THIN) database. BMC Public Health. 2012;12:329. doi:10.1186/1471-2458-12-329

13. CPRD. CRPD PROVE (PRoviding Online Verification of EHR). Available from: https://cprd.com/cprd-prove. Accessed April 28, 2022.

Creative Commons License © 2022 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.