Hong Kong Med J 2024 Oct;30(5):380–5 | Epub 29 Aug 2024
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Validation of diagnostic coding for chronic obstructive pulmonary disease in an electronic health record system in Hong Kong
WC Kwok, MB, BS, FHKAM (Medicine)1; Terence CC Tam, MB, BS, FHKAM (Medicine)1; CW Sing, PhD2; Esther WY Chan, PhD2; CL Cheung, PhD2
1 Department of Medicine, Queen Mary Hospital, The University of Hong Kong, Hong Kong SAR, China
2 Department of Pharmacology and Pharmacy, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Dr WC Kwok (kwokwch@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Electronic health record databases can facilitate epidemiology research regarding diseases such as chronic obstructive pulmonary disease (COPD), a common medical condition worldwide. We aimed to assess the validity of International Classification of Diseases, 9th Revision (ICD-9) code algorithms for identifying COPD in Hong Kong’s territory-wide electronic health record system, the Clinical Data Analysis and Reporting System (CDARS).
 
Methods: Adult patients diagnosed with COPD at all public hospitals in Hong Kong and specifically at Queen Mary Hospital from 2011 to 2020 were identified using the ICD-9 code 496 (Chronic airway obstruction, not elsewhere classified) within the CDARS. Two respiratory specialists reviewed clinical records and spirometry results to confirm the presence of COPD in a randomly selected group of cases.
 
Results: During the study period, 93 971 and 2479 patients had the diagnostic code for COPD at all public hospitals in Hong Kong and specifically at Queen Mary Hospital, respectively. Two hundred cases were randomly selected from Queen Mary Hospital for validation using medical records and spirometry results. The overall positive predictive value was 81.5% (95% confidence interval=76.1%-86.9%). We also developed an algorithm to identify COPD cases in our cohort.
 
Conclusion: This study represents the first validation of ICD-9 coding for COPD in the CDARS. Our findings demonstrated that the ICD-9 code 496 is a reliable indicator for identifying COPD cases, supporting the use of the CDARS database for further clinical research concerning COPD.
 
 
New knowledge added by this study
  • This is the first validation study of International Classification of Diseases, 9th Revision (ICD-9) coding for chronic obstructive pulmonary disease (COPD) in the Hong Kong Clinical Data Analysis and Reporting System (CDARS).
  • The ICD-9 code 496 demonstrated a high positive predictive value for identifying COPD cases in the CDARS.
Implications for clinical practice or policy
  • This study established an algorithm for identifying COPD cases in the CDARS.
  • The findings provide a basis for territory-wide analysis of COPD in Hong Kong.
 
 
Introduction
Chronic obstructive pulmonary disease (COPD) is a chronic inflammatory lung disease characterised by airflow limitation, which causes symptoms such as difficulty breathing, productive cough, and wheezing. Smoking is the primary risk factor for COPD development.1 Patients with COPD experience gradual deterioration of lung function, with potential intermittent exacerbations.
 
Although COPD is preventable and manageable, it was ranked as the fourth leading cause of death worldwide in the 2019 Global Initiative for Chronic Obstructive Lung Disease guidelines.2 The Global Burden of Disease Study estimated that there were 3.2 million COPD-related deaths in 2015, an increase of 11.6% compared with 1990.3 The prevalence of COPD also increased by 44.2% during the same period, reaching 174.5 million cases in 2015.3 In Hong Kong, the Population Health Survey 2014/15 revealed that 0.5% (0.6% in male individuals; 0.4% in female individuals) of non-institutionalised persons aged ≥15 years had physician-diagnosed COPD.4
 
The prevalence of COPD in Hong Kong among adults aged ≥60 years is 25.9% or 12.4%, depending on the spirometric criteria used (post-bronchodilator ratio of forced expiratory volume in 1 second to forced vital capacity [ie, FEV1/FVC ratio] <70% or lower limit of normal).5 In 2005, the crude mortality rate for COPD was 29.1 per 100 000 population, whereas the crude hospitalisation rate was 193 per 100 000 population.6 From January 2017 to December 2020, there were 78 693 admissions for COPD across all public hospitals in Hong Kong.7 8
 
Population-based or large database studies are valuable for understanding the epidemiology, clinical characteristics, and burden of COPD.9 10 11 12 13 14 15 In countries/regions with electronic health record (EHR) systems, the EHR databases offer extensive information for clinical management, research, and big data analysis of various diseases, including COPD. Studies in the US and the United Kingdom have validated diagnostic codes for COPD and acute exacerbation of COPD. A study of the diagnostic code for COPD in the US showed a positive predictive value (PPV) of 91.7%, sensitivity of 71.7%, and specificity of 94.4%.16 In the United Kingdom, the diagnostic code for acute exacerbation of COPD had a PPV of 85.5% and sensitivity of 62.9%.17 Electronic health records typically contain diagnostic information, associated morbidity and mortality data, and possible longitudinal follow-up data, allowing the evaluation of COPD trends and associated health outcomes. Before research can be conducted using EHR data, the diagnostic coding must be validated. The Clinical Data Analysis and Reporting System (CDARS), an EHR database managed by the Hospital Authority (HA; a public healthcare service provider that manages 43 hospitals/institutions and 123 outpatient clinics18), has covered >90% of the Hong Kong population since 1993. The CDARS captures medical information including diagnoses, drug prescriptions, demographics, admissions, medical procedures, and laboratory results. Although the accuracy of diagnostic coding has been demonstrated for some conditions in Hong Kong,19 20 21 it has not been validated for COPD. In this study, we aimed to assess the validity of International Classification of Diseases, 9th Revision (ICD-9) code algorithms for identifying COPD in the CDARS.
 
Methods
This study was conducted at Queen Mary Hospital (QMH), a territory-wide tertiary and quaternary referral centre under HA for advanced medical services and respiratory diseases. All medical information regarding its patients is captured within the CDARS.
 
Firstly, all adult patients aged ≥40 years with a principal diagnosis of COPD in HA from 1 January 2011 to 31 December 2020 were identified through the CDARS. Then, in the ICD-9 coding validation session, it included adult patients aged ≥40 years with a principal diagnosis of COPD recorded at QMH from 1 January 2011 to 31 December 2020. Potential COPD cases in the CDARS were initially identified using the ICD-9 code 496 (Chronic airway obstruction, not elsewhere classified). Cases with a secondary diagnosis of ICD-9 code 493 (Asthma; indicating potential asthma-COPD overlap [ACO] or asthma) were excluded. The clinical information and spirometry results for all potential COPD cases during the study period were retrieved for validation from the CDARS. The algorithm used for case identification is depicted in Figure 1.
 

Figure 1. Algorithm for identifying chronic obstructive pulmonary disease cases in the Clinical Data Analysis and Reporting System
 
Among potential cases identified in the QMH cohort, 200 were randomly selected for validation. Case validation was performed by two respiratory specialists, based on the clinical information, spirometry results, physician notes, and clinical examination reports. A potential COPD case was regarded as true positive if the specialist concluded that the patient had definite COPD according to the Global Initiative for Chronic Obstructive Lung Disease guidelines.22 A valid case was defined as the presence of symptoms compatible with COPD, along with spirometry results demonstrating airflow limitation (ie, FEV1/FVC ratio <0.7) that could not be fully reversed by the administration of an inhaled bronchodilator. Potential cases not meeting these criteria were regarded as false positive. Patients without spirometry data were excluded from the case validation process. The flow of patient selection is illustrated in Figure 1.
 
The PPV was computed to assess the validity of COPD diagnostic codes in the CDARS, using the definition of the number of true positives (ie, cases identified by ICD-9 codes which met the above criteria) divided by the total number of true positives plus false positives (ie, cases identified by ICD-9 codes which did not meet the above criteria).
 
 
Cohen’s kappa was used to estimate inter-rater reliability and the 95% confidence interval was estimated using a binomial distribution. All statistical analyses were performed using SPSS software (Windows 26.0; IBM Corp, Armonk [NY], US).
 
Results
In total, 2479 potential cases were identified in QMH between 2011 and 2020. During the same period, there were 93 971 cases with a principal diagnostic code of COPD across all public hospitals in Hong Kong. There were no significant differences in age or sex between QMH cases and overall cases throughout the HA (Table 1). Of the QMH cases, 200 were randomly selected for detailed validation. The validation process showed that 163 cases were true positives, resulting in an overall PPV of 81.5% (95% confidence interval=76.1%-86.9%). Major reasons for false positives included ACO, asthma, and bronchiectasis (Table 2). Cohen’s kappa was 0.77, suggesting substantial agreement. The proposed algorithm for identifying COPD cases in the CDARS is illustrated in Figure 2.
 

Table 1. Patient characteristics in all chronic obstructive pulmonary disease cases, 2011-2020
 

Table 2. Reasons for false-positive cases (n=200)
 

Figure 2. Chronic obstructive pulmonary disease validation
 
Discussion
In this validation study, the estimated overall PPV was 81.5% when ICD-9 coding was used to identify COPD cases within the CDARS, the territory-wide EHR system in Hong Kong.
 
A PubMed search using the terms ‘COPD’ AND ‘validation’ OR ‘international classification of disease codes’ did not identify any literature regarding validation of diagnostic codes for COPD in EHRs within Hong Kong. Validation of local diagnostic codes for COPD will facilitate large-scale studies in Hong Kong, which are needed considering the high local prevalence of this disease. Our study showed a PPV >70%, which is the typical validation criterion for case-finding algorithms in population-based cohort studies.23 24 The high PPV in our study may be attributable to the nature of the CDARS database, with high PPV also reported in other local validation studies involving other diseases.21 25 The CDARS database contains EHRs from all public hospitals, where diagnostic facilities and diagnostic protocols are well-established; in contrast, data from claims databases and general practitioners are expected to have lower accuracy. As such, in prior local validation studies with CDARS, they had high reported PPV of 79%25 and 100%21 for interstitial lung diseases and hip fracture, respectively. Also, COPD is a disease that is easier to be recognised by demonstrating airflow obstruction on spirometry, which contributed to the high PPV. Additionally, regular audits by the HA of diagnostic codes in patient discharge summaries to make sure the correct diagnosis were entered further enhance the accuracy of CDARS data.
 
Among the false-positive cases, ACO was the most frequent cause (Table 2). This relationship could be due to incorrect entry of COPD diagnostic codes or to patients with childhood asthma who developed COPD later in life. The lack of a separate ICD-9 diagnostic code for ACO and the absence of diagnostic criteria for this condition contribute to these challenges.26 27 28 29 30 31 32 33 34 Considering the current difficulties in accurate diagnosis of ACO, the actual PPV for COPD could be higher. Thus, our proposed algorithm excludes cases with a secondary diagnosis of asthma in the CDARS to avoid including patients with ACO. Proper education to address this miscoding issue is essential. Asthma was the second most common incorrectly coded diagnosis. This result could be related to initial misdiagnosis at presentation, such as attributing shortness of breath in a smoker to COPD, rather than asthma. Heart failure, which also presents with dyspnoea and wheezing, could be misclassified as COPD in rare instances. Bronchiectasis, pneumonia, silicosis, and interstitial lung disease can also present with chronic productive cough and dyspnoea, similar to COPD.
 
Strengths and limitations
The strengths of this study include its use of territorywide database with >11 million records, which allowed the identification of a sufficient number of cases. The methodology utilised to confirm true-positive COPD cases was both feasible and practical: the medical records and spirometry results for all cases with the COPD diagnostic code were reviewed by respiratory specialists.
 
However, this study had some limitations. First, the patient population mostly comprised adult Chinese patients, consistent with the demographics of patients with COPD in Hong Kong. This ethnicity component may limit generalisability to other populations. Second, only QMH cases were selected for validation. However, because all hospitals and clinics within the HA use a single diagnostic coding system, the diagnostic coding consistency is expected to be high. The high accuracy of ICD-9 coding within the Hong Kong CDARS has been demonstrated in other studies.20 21
 
Conclusion
This study represents the first validation of ICD-9 coding for COPD in Hong Kong. Our findings demonstrated that use of ICD-9 code 496, in conjunction with our algorithm to identify COPD, results in a PPV with sufficient reliability to support utilisation of the CDARS database for future COPD research.
 
Author contributions
Concept or design: WC Kwok, CL Cheung.
Acquisition of data: WC Kwok.
Analysis or interpretation of data: WC Kwok.
Drafting of the manuscript: WC Kwok, CL Cheung.
Critical revision of the manuscript for important intellectual content: TCC Tam, CW Sing, EWY Chan, CL Cheung.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
The research was approved by the Institutional Review Board of The University of Hong Kong / Hospital Authority Hong Kong West Cluster, Hong Kong (Ref No.: UW22-716). The requirement for informed consent is waived by the Board due to the retrospective nature of the research.
 
References
1. Gershon AS, Warner L, Cascagnette P, Victor JC, To T. Lifetime risk of developing chronic obstructive pulmonary disease: a longitudinal population study. Lancet 2011;378:991-6. Crossref
2. Singh D, Agusti A, Anzueto A, et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive lung disease: the GOLD Science Committee Report 2019. Eur Respir J 2019;53:1900164. Crossref
3. Benziger CP, Roth GA, Moran AE. The Global Burden of Disease study and the preventable burden of NCD. Glob Heart 2016;11:393-7. Crossref
4. Centre for Health Protection, Department of Health, Hong Kong SAR Government. Non-Communicable Diseases Watch. Chronic obstructive pulmonary disease: an overview. November 2018. Available from: https://www.chp.gov.hk/files/pdf/ncd_watch_november_2018.pdf. Accessed 9 Aug 2024.
5. Chan HS, Ko FW, Chan JW, et al. Hospital burden of chronic obstructive pulmonary disease in Hong Kong—the trend from 2006 to 2014. Int J Chron Obstruct Pulmon Dis 2023;18:507-19. Crossref
6. Chan-Yeung M, Lai CK, Chan KS, et al. The burden of lung disease in Hong Kong: a report from the Hong Kong Thoracic Society. Respirology 2008;13 Suppl 4:S133-65. Crossref
7. Ko FW, Lau LH, Ng SS, et al. Respiratory admissions before and during the COVID-19 pandemic with mediation analysis of air pollutants, mask-wearing and influenza rates. Respirology 2023;28:47-55. Crossref
8. Chan KP, Ma TF, Kwok WC, et al. Significant reduction in hospital admissions for acute exacerbation of chronic obstructive pulmonary disease in Hong Kong during coronavirus disease 2019 pandemic. Respir Med 2020;171:106085. Crossref
9. Buist AS, McBurnie MA, Vollmer WM, et al. International variation in the prevalence of COPD (the BOLD study): a population-based prevalence study. Lancet 2007;370:741-50. Crossref
10. Grahn K, Gustavsson P, Andersson T, et al. Occupational exposure to particles and increased risk of developing chronic obstructive pulmonary disease (COPD): a population-based cohort study in Stockholm, Sweden. Environ Res 2021;200:111739. Crossref
11. Lindberg A, Lindberg L, Sawalha S, et al. Large underreporting of COPD as cause of death-results from a population-based cohort study. Respir Med 2021;186:106518. Crossref
12. Landt E, Çolak Y, Lange P, Laursen LC, Nordestgaard BG, Dahl M. Chronic cough in individuals with COPD: a population-based cohort study. Chest 2020;157:1446-54. Crossref
13. Lee SC, Son KJ, Han CH, Park SC, Jung JY. Impact of COPD on COVID-19 prognosis: a nationwide population-based study in South Korea. Sci Rep 2021;11:3735. Crossref
14. Du Y, Li Q, Sidorenkov G, et al. Computed tomography screening for early lung cancer, COPD and cardiovascular disease in Shanghai: rationale and design of a population-based comparative study. Acad Radiol 2021;28:36-45. Crossref
15. Bahremand T, Etminan M, Roshan-Moniri N, De Vera MA, Tavakoli H, Sadatsafavi M. Are COPD prescription patterns aligned with guidelines? Evidence from a Canadian population-based study. Int J Chron Obstruct Pulmon Dis 2021;16:751-9. Crossref
16. Chu SH, Wan ES, Cho MH, et al. An independently validated, portable algorithm for the rapid identification of COPD patients using electronic health records. Sci Rep 2021;11:19959. Crossref
17. Rothnie KJ, Müllerová H, Hurst JR, et al. Validation of the recording of acute exacerbations of COPD in UK primary care electronic healthcare records. PLoS One 2016;11:e0151357. Crossref
18. Hospital Authority, Hong Kong SAR Government. Introduction. Caring for our community's health. 2024. Available from: https://www.ha.org.hk/visitor/ha_visitor_index.asp?Content_ID=10008&Lang=ENG&Dimension=100&Parent_ID=10004. Accessed 9 Aug 2024.
19. Chan SM, Chung GK, Chan YH, et al. Resilience and coping strategies of older adults in Hong Kong during COVID-19 pandemic: a mixed methods study. BMC Geriatr 2022;22:299. Crossref
20. Cheung CL, Tan KC, Kung AW. Cohort profile: the Hong Kong Osteoporosis study and the follow-up study. Int J Epidemiol 2018;47:397-8f. Crossref
21. Sing CW, Woo YC, Lee AC, et al. Validity of major osteoporotic fracture diagnosis codes in the Clinical Data Analysis and Reporting System in Hong Kong. Pharmacoepidemiol Drug Saf 2017;26:973-6. Crossref
22. Global Initiative for Chronic Obstructive Lung Disease. 2022 Global Strategy for Prevention, Diagnosis and Management of COPD.
23. Cho SK, Doyle TJ, Lee H, et al. Validation of claims-based algorithms to identify interstitial lung disease in patients with rheumatoid arthritis. Semin Arthritis Rheum 2020;50:592-7. Crossref
24. Papani R, Sharma G, Agarwal A, et al. Validation of claims-based algorithms for pulmonary arterial hypertension. Pulm Circ 2018;8:2045894018759246. Crossref
25. Ye Y, Hubbard R, Li GH, et al. Validation of diagnostic coding for interstitial lung diseases in an electronic health record system in Hong Kong. Pharmacoepidemiol Drug Saf 2022;31:519-23. Crossref
26. Leung JM, Sin DD. Asthma-COPD overlap syndrome: pathogenesis, clinical features, and therapeutic targets. BMJ 2017;358:j3772. Crossref
27. Cosio BG, Soriano JB, López-Campos JL, et al. Defining the asthma-COPD overlap syndrome in a COPD cohort. Chest 2016;149:45-52. Crossref
28. Gibson PG, Simpson JL. The overlap syndrome of asthma and COPD: what are its features and how important is it? Thorax 2009;64:728-35. Crossref
29. Sin DD, Miravitlles M, Mannino DM, et al. What is asthma-COPD overlap syndrome? Towards a consensus definition from a round table discussion. Eur Respir J 2016;48:664-73. Crossref
30. Cataldo D, Corhay JL, Derom E, et al. A Belgian survey on the diagnosis of asthma-COPD overlap syndrome. Int J Chron Obstruct Pulmon Dis 2017;12:601-13. Crossref
31. Soler-Cataluña JJ, Cosío B, Izquierdo JL, et al. Consensus document on the overlap phenotype COPD-asthma in COPD [in English, Spanish]. Arch Bronconeumol 2012;48:331-7. Crossref
32. Koblizek V, Chlumsky J, Zindr V, et al. Chronic obstructive pulmonary disease: official diagnosis and treatment guidelines of the Czech Pneumological and Phthisiological Society; a novel phenotypic approach to COPD with patient-oriented care. Biomed Pap Med Fac Univ Palacky Olomouc Czech Repub 2013;157:189-201. Crossref
33. Miravitlles M, Alvarez-Gutierrez FJ, Calle M, et al. Algorithm for identification of asthma-COPD overlap: consensus between the Spanish COPD and asthma guidelines. Eur Respir J 2017;49:1700068. Crossref
34. Global Initiative for Asthma; Global Initiative for Chronic Obstructive Lung Disease. Diagnosis of Diseases of Chronic Airflow Limitation: Asthma, COPD and Asthma-COPD Overlap Syndrome. Updated 2015. Available from: https://goldcopd.org/wp-content/uploads/2016/04/GOLD_ACOS_2015.pdf. Accessed 2 Aug 2024.