Effect of location on out-of-hospital cardiac arrests involving older adults in Hong Kong: secondary analysis of a territory-wide cohort

Hong Kong Med J 2023 Apr;29(2):142–9 | Epub 29 Mar 2023
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Effect of location on out-of-hospital cardiac arrests involving older adults in Hong Kong: secondary analysis of a territory-wide cohort
Ronald TM Wong, MB, BS, FHKAM (Emergency Medicine)
Department of Emergency Medicine, The University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Dr Ronald TM Wong (drwongtmr@gmail.com)
 
 Full paper in PDF
 
Abstract
Introduction: Most out-of-hospital cardiac arrests in Hong Kong involve older adults. The likelihood of survival varies among locations. This study investigated patient and bystander characteristics, as well as the timing of interventions, that affect the prevalences of shockable rhythm and survival outcomes among cardiac arrests involving older adults in homes, on streets, and in other public places.
 
Methods: This secondary analysis of a territory-wide historical cohort used data collected by the Fire Services Department of Hong Kong from 1 August 2012 to 31 July 2013.
 
Results: Bystander cardiopulmonary resuscitation was primarily performed by relatives in homes but not in non-residential locations. The intervals in terms of receipt of emergency medical services (EMS) call, initiation of bystander cardiopulmonary resuscitation, and receipt of defibrillation were longer for cardiac arrests that occurred in homes. The median interval for EMS to reach patients was 3 minutes longer in homes than on streets (P<0.001). Forty-seven percent of patients who developed cardiac arrest on streets had a shockable rhythm within the first 5 minutes after receipt of EMS call. Defibrillation within 15 minutes after receipt of EMS call was an independent predictor of 30-day survival (odds ratio=4.07; P=0.02). Fifty percent of patients who received defibrillation within 5 minutes in non-residential locations survived.
 
Conclusion: There were significant location-related differences in patient and bystander characteristics, interventions, and outcomes among cardiac arrests involving older adults. A large proportion of patients had a shockable rhythm in the early period after cardiac arrest. Good survival outcomes in out-of-hospital cardiac arrests involving older adults can be achieved through early bystander defibrillation and intervention.
 
 
New knowledge added by this study
  • Among out-of-hospital cardiac arrests involving older adults that occurred at different locations, there were significant differences in patient and bystander characteristics, as well as prehospital interventions, which influenced survival outcomes.
  • Many older adults who experienced cardiac arrest in non-residential locations had a shockable rhythm in the early period after receipt of emergency medical services (EMS) call, and early defibrillation was associated with favourable survival outcomes.
  • Low rates of shockable rhythm and significant delays in bystander and EMS processes were observed within homes.
Implications for clinical practice or policy
  • Additional measures are needed to overcome bystander inertia.
  • Interventions to mitigate the adverse factors related to cardiac arrests occurring in older adult households, such as volunteer dispatch via mobile applications, should be considered.
 
 
Introduction
The proportion of older adults in Hong Kong is expected to increase from 18% in 2019 to 26% by the year 2029.1 Overcrowding is a serious problem, such that population densities of 57 530 people/km2 are present in ageing districts.2 3 Most residents of Hong Kong live in high-rise apartments that require elevators for access, but most elevators cannot accommodate an ambulance stretcher with a patient in a supine position.4 More than 50% of out-of-hospital cardiac arrest (OHCA) events occur in private homes, a location that is associated with poor survival outcomes.5 The proportion of domestic households consisting solely of people aged ≥65 years has increased by approximately 24% between 2011 and 2016, from 8.4% to 10.4%.6 Considering these demographic changes, there is a need for improved overall understanding of the prehospital management of cardiac arrests that involve older adults in homes and other locations. This study investigated patient characteristics, types of bystanders involved, and prehospital interventions that were associated with differences in survival outcomes among cardiac arrests involving older adults in homes, compared with cardiac arrests on streets and in public areas excluding streets (PAES).
 
Methods
Study design and setting
This secondary analysis focused on a historical cohort from a previous study.5 The Emergency Ambulance Service of the Fire Services Department (FSD) provides most emergency medical services (EMS) in Hong Kong through a one-tiered system that serves the entire 1104 km2 region. At the time of data collection, the population was around 7.1 million.7 Ambulance personnel are required to perform cardiopulmonary resuscitation (CPR) on and transfer most cases of OHCA to hospitals. A small number of patients with obvious post-mortem changes (eg, rigor mortis) may be directly transferred to the public mortuary; such patients were not included in this study. Fire Services Department ambulances will only transfer patients to emergency departments under the Hospital Authority. At the time of data collection, callers requesting for EMS for OHCA patients were not provided with post-dispatch instructions to perform CPR.
 
Participants
This secondary analysis included all patients with OHCA who were transferred to the Emergency Departments (EDs) by FSD ground ambulances from 1 August 2012 to 31 July 2013. Exclusion criteria were cardiac arrests caused by trauma, patients not transferred by ground ambulance, and patients directly transferred to the public mortuary. After patient selection from the primary dataset, the following additional exclusions were made: cardiac arrests that involved patients aged <65 years, occurred within residential care homes for the elderly, or occurred in the ambulance en route to hospital.
 
Data sources
Data regarding patient characteristics and prehospital management were prospectively collected by EMS personnel who were directly involved in prehospital care for patients who experienced OHCA. The collected data included patient age and sex, location of cardiac arrest, whether the cardiac arrest was witnessed and the identity of the witness, whether bystander CPR was performed and who performed it, whether defibrillation with an automated external defibrillator (AED) was performed, what electrocardiogram rhythm was first detected, the timings of prehospital events (recognition of cardiac arrest, receipt of EMS call, initiation of bystander CPR, initiation of first defibrillation, EMS arrival at patient’s side, initiation of CPR by EMS personnel, and arrival at the ED), and return of spontaneous circulation (ROSC) before ED arrival.
 
Electronic medical records at the relevant ED (Accident and Emergency Information System, Hong Kong Hospital Authority) were reviewed to determine the time of defibrillation and time of ROSC at the ED, as well as whether a patient survived until admission. A patient was assumed to have received no resuscitative intervention unless specific documentation was present in the ED record. Neurological status upon discharge and survival at 30 days after cardiac arrest were determined from a territory-wide electronic medical record database (Clinical Management System, Hong Kong Hospital Authority).
 
Variables
Streets were defined as paved thoroughfares for pedestrians, including sidewalks. Public areas excluding streets were other areas that were accessible by the public throughout the day; these included outdoors (eg, parks and markets) and indoor facilities (eg, eateries, places of recreation, and day care facilities for older adults). Bystanders were defined in accordance with the guidelines of the Utstein Resuscitation Registry Templates for Out-of-Hospital Cardiac Arrest.8 Fire Services Department first responders dispatched to the scene were classified as EMS personnel. Older adult care workers (OACWs) are individuals who care for residents in various private and public housing arrangements for older adults. Older adult care workers accompanying patients were not dispatched as part of the organised emergency rescue team; thus, they were classified as bystanders. Public access defibrillation (PAD) was defined as a defibrillation shock delivered from an AED when a bystander performed CPR. Shocks delivered when FSD first responders performed CPR were excluded.
 
Time intervals were rounded to the nearest minute. The decision interval was the interval between recognition of cardiac arrest and receipt of EMS call. Call-to-bystander CPR was the interval between receipt of EMS call and initiation of bystander CPR. Call-to-EMS arrival was the interval between receipt of EMS call and EMS arrival at the patient’s side. Time of first defibrillation was defined as the time of the earliest of the following three events: PAD, defibrillation by EMS, or defibrillation in the ED. Call-to-bystander CPR intervals were grouped as 0-2, 3-5, 6-8, 9-11, and 12-31 minutes, as well as no bystander CPR. Call-to-first defibrillation intervals were grouped as 0-5, 6-10, 11-15, 16-20, and 21-55 minutes, as well as no defibrillation (>55 minutes/not applicable).
 
Post-cardiac arrest neurological status was classified using the 5-point Glasgow-Pittsburgh Cerebral Performance Categories (CPC) scale. In the scale, CPC 1 represents patients with good cerebral performance; CPC 2 includes patients who can manage activities of daily living independently or participate in part-time work in a sheltered environment; CPC 3 to CPC 5 ranges from patients who are unable to live independently because of cerebral disability to patients who have experienced brain death. Patients with CPC 1 or CPC 2 were presumed to have a favourable neurological outcome.
 
Statistical methods
Patient characteristics, interventions, and outcomes were analysed using descriptive statistics. Pearson’s χ2 test was used to compare categorical variables; Fisher’s exact test was used if >20% of expected counts were <5. The Kruskal–Wallis rank sum test was used to compare non-parametric time intervals. A P value of <0.05 was considered statistically significant. Predictors of 30-day survival were analysed using univariate and multivariate logistic regression; findings were reported as odds ratios (ORs) with 95% confidence intervals. Adjusted variables included age; sex; arrest location; person witnessing the arrest (relative, OACW or other bystanders, EMS personnel, or unwitnessed); person performing bystander CPR (no bystander CPR, OACW, relative, or other); PAD (yes or no); first monitored rhythm (asystole, pulseless electrical activity, ventricular fibrillation/ventricular tachycardia); and call-to-EMS arrival, call-to-bystander CPR, and call-to-first defibrillation intervals.
 
Statistical analysis was performed using R software, version 3.6.1 (R Foundation for Statistical Computing, Austria). The original study was approved by the Institutional Review Board of The University of Hong Kong/Hospital Authority Hong Kong West Cluster (Ref No.: UW 15-599). No new data were collected for secondary analysis. This manuscript was prepared in accordance with the STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) reporting guidelines.
 
Results
Participant selection and characteristics
Figure 1 describes patient selection from the primary dataset. The original cohort comprised 5154 patients with OHCA who were transferred to the ED by FSD ground ambulances. After the application of exclusion criteria, 2255 patients were included in the analysis. Table 1 compares the patient and bystander characteristics, interventions, and outcomes of OHCA occurring in homes, in PAES, and on streets. Patients who experienced cardiac arrest in homes were significantly older (approximately 5 years; P<0.001) than patients who experienced cardiac arrest on streets or in PAES. In all groups, there were more male patients; the sex disparity was the greatest in the streets group, followed by the PAES group.
 

Figure 1. Patient selection from primary dataset
 

Table 1. Patient and bystander characteristics, interventions, and survival outcomes of out-of-hospital cardiac arrests involving older adults in homes, on streets, and in other public areas
 
Furthermore, most cardiac arrests (66.4% among all patients; P<0.001; Table 1) were unwitnessed. Relatives were the most common type of bystander present in witnessed arrests, whereas there were significant differences in the involvement of OACWs, EMS personnel, and other individuals at the three locations. Compared with EMS personnel, there were more OACWs as bystanders in PAES and more bystanders, represented by the ‘other’ group, in PAES and on streets. There was a significant difference in the proportion of bystanders performing CPR among the three locations (P<0.001), as illustrated in Figure 2. The bystander CPR rate was the highest in PAES and the lowest in homes. Among the nine patients who received PAD, six received it when OACWs provided CPR in PAES.
 

Figure 2. Identity of bystanders performing cardiopulmonary resuscitation in homes, in public areas excluding streets (PAES), and on streets
 
 
Initial monitored rhythm
Notably, asystole was the most common initial monitored rhythm (81.2% among all patients; P<0.001; Table 1). However, cardiac arrests on streets and in PAES had significantly higher rates of shockable rhythm and PEA, compared with cardiac arrests in homes. The prevalences of shockable rhythm relative to the time from receipt of EMS call and the location of cardiac arrest are shown in Figure 3. The highest rates of shockable initial rhythm (SIR) were observed within the first 5 and 10 minutes after receipt of EMS call for cardiac arrests on streets, which were 47% (8/17) and 41% (17/42), respectively.
 

Figure 3. Location, time from receipt of emergency medical services (EMS) call, and proportion of shockable rhythm of cardiac arrests
 
Timing of interventions
Patients with cardiac arrest in homes had significantly longer intervals in terms of receipt of EMS call, initiation of bystander CPR, and receipt of defibrillation (all P<0.001; Table 1). The median interval for EMS to reach patients was 3 minutes longer in homes than on streets. The interval between recognition of cardiac arrest and receipt of EMS call was 0 minutes in 57.5% of patients (1297/2255).
 
Survival and neurological outcomes
Additionally, patients with cardiac arrest in homes had significantly lower rates of ROSC, 30-day survival, and favourable neurological outcomes (all P<0.001; Table 1).
 
Independent predictors of 30-day survival are shown in Table 2. Older age and longer call-to-EMS arrival interval both decreased the overall likelihood of survival (ORs of 0.92 and 0.87, respectively). Pulseless electrical activity and ventricular fibrillation/ventricular tachycardia increased the likelihood of survival compared with asystole (ORs of 6.4 and 15.6, respectively). Cardiac arrest witnessed by EMS personnel and defibrillation within 15 minutes after receipt of EMS call increased the overall likelihood of survival (ORs of 6.23 and 4.07, respectively).
 

Table 2. Independent predictors of 30-day survival of cardiac arrests
 
The relationship among the location, timing of defibrillation, and 30-day survival of cardiac arrest is shown in Figure 4. Overall, patients who received defibrillation within 5 minutes and at 6 to 10 minutes after receipt of EMS call had survival rates of 33% (2/6) and 17% (15/86), respectively. For patients who received defibrillation on streets/in PAES within 5 minutes and at 6 to 10 minutes after receipt of EMS call, the survival rates were 50% (2/4) and 22% (10/45), respectively. Two patients in the homes group received defibrillation within 5 minutes; the survival rate was 0% (0/2).
 

Figure 4. Relationship among location, timing of defibrillation, and survival of cardiac arrests. Only two patients in the home group received defibrillation within 5 minutes. Streets and public areas excluding streets are combined because of the small number of patients in some subgroups
 
Cardiac arrest at home was a predictor of survival in univariate analysis (OR=0.076, 95% confidence interval [CI]=0.038-0.15) but not in multivariable analysis (OR=0.65, 95% CI=0.22-1.90). The effect of location on survival was mediated by the first monitored rhythm, and the call-to-EMS arrival interval.
 
Discussion
This study investigated factors that affect the prevalences of shockable rhythm and survival outcomes among cardiac arrests involving older adults in Hong Kong. The patient characteristics, proportion of witnessed arrests, and rates of SIR and PAD for cardiac arrests involving older adults in homes were similar between the present study and a previous analysis in Japan.9 Unlike many western countries, EMS personnel in Hong Kong and Japan generally do not terminate resuscitation in the field; this similarity facilitates comparison of data between the two studies. A notable difference was that in Japanese homes, 45% of older patients received bystander CPR; this receipt of CPR was associated with rate of ROSC, 30-day survival, and favourable neurological outcomes that were threefold higher than the corresponding rates in Hong Kong.
 
Bystander cardiopulmonary resuscitation
The bystander CPR rate in Hong Kong homes was low (3.8%) [Table 1], and there was a substantial delay in its initiation. Although the type of relatives involved as bystanders was not recorded in the present study, considering the proportion of older adult households in Hong Kong,6 many of the relatives presumably were cohabiting older adults. Such individuals may not be able to follow telephone instructions to perform CPR because of physical limitations or emotional distress10; thus, the provision of post-dispatch instructions and enhancement of community-wide CPR training will not improve survival among these patients.11 Although high-rise apartments create barriers to EMS personnel, they also increase the likelihood that trained volunteers will be present in the vicinity, where they may be dispatched using mobile applications.12 13 14
 
In non-residential locations, most bystanders performing CPR were not relatives of the patients. Fear of legal consequences is reportedly a major cause for intervention inertia in this situation.15 A previous survey in Hong Kong, in which one-third of respondents had prior first aid training, revealed that nearly all respondents were willing to call for help but only one-fifth were willing to perform bystander CPR.16 These findings suggest that knowledge transfer is insufficient to overcome bystander inertia in Hong Kong. Training programmes should ensure that factors inhibiting intervention (eg, legal concerns, fear of disease transmission, and bystander effect) are addressed.17 18
 
Shockable initial rhythm
Previous studies in Hong Kong revealed low rates of SIR in patients with OHCA, ranging from 5% to 14%, along with dismal survival rates of 0.6% to 3%.1 19 20 These low rates imply that aggressive bystander interventions (eg, defibrillation for older adults) are futile. However, the findings of the present study indicate that older adults in non-residential locations have much higher SIR rates in the initial 10 minutes after receipt of EMS call; moreover, early defibrillation is an independent predictor of survival among such patients, and high survival rates can be achieved with early defibrillation.
 
The present study revealed a 2% per-minute decrease in the rate of SIR. This is similar to the findings in a large multinational study from northern Europe.21 Differences in SIR rates between residential and non-residential locations may be partly related to patient factors (eg, age and presence of co-morbidities); they could also be related to differences in the decision interval (ie, time elapsed between recognition of cardiac arrest [as reported by a bystander] and receipt of EMS call). A previous study in Hong Kong showed that efforts to seek advice from relatives often contributed to delayed receipt of EMS call.4 Longer decision intervals and consequential delays in EMS arrival lead to interactions with later parts of the shockable rhythm downslope and lower SIR rates. In practice, the recall of decision intervals by bystanders is unreliable. This is consistent with the decision interval of 0 minutes reported by most bystanders in the present study. Despite this confounding factor, the findings in this study indicate that bystanders should not hesitate to provide aggressive resuscitation and early defibrillation for older patients.
 
Public access defibrillation
Notably, very few patients received PAD in this study, and most instances of PAD administration were performed by OACWs in PAES. According to a nationwide study in Japan, 16.5% of patients received PAD during witnessed ventricular fibrillation cardiac arrest.22 Estimation of the AED coverage rate in Hong Kong using a horizontal level walking route distance model revealed that only 11% of patients with OHCA would have an AED within 100 m.23 Considering the large number of OHCA events occurring within high-rise buildings, the actual coverage rate is presumably lower. Furthermore, there is evidence that most people in Hong Kong do not know the location of the AED nearest to their home or workplace.16 Unless AEDs are easy to locate and readily accessible at all times, PAD rates will remain low.24
 
Barriers to rescue in high-rise buildings
In a previous study in Hong Kong, the proportions of patients with OHCA who could be accessed by elevator or stairs and by stairs alone were 74% and 14%, respectively.4 In the present study, the median interval for EMS to reach patients was 3 minutes longer in homes than on streets. This represents the ‘vertical response time’ component of the call-to-EMS arrival interval.25 In a previous study, survival was lower among patients who experienced cardiac arrest at higher levels within buildings.26 Nearly 70% of lifts in Hong Kong do not have sufficient area to accommodate the ambulance stretcher.4 Therefore, the vertical response time leads to a delay in EMS interventions and deterioration in CPR quality, both of which may contribute to the poor outcomes of cardiac arrests that occur in homes. The use of circulatory adjuncts to enhance cerebral perfusion during head-up position CPR within lifts should be considered.27
 
Limitations
Importantly, only patients transported to hospital by FSD ground ambulances were included in this study; a small number of patients with OHCA may have been transported to hospital by other means.
 
Furthermore, data regarding the timings of recognition of cardiac arrest, bystander CPR, and PAD obtained from bystanders may have been subject to response bias. The lack of blinding of emergency department personnel towards patient factors (eg, absence of shockable rhythm and prehospital defibrillation, longer time to ROSC, co-morbidities, and advanced age) may have led to selection bias regarding treatment decisions, including the termination of resuscitation, arrangement of intensive care unit resources, and coronary angiography; such bias has been reported to negatively influence the survival rate.28 Data regarding pre-arrest co-morbidity and functional status were not available, which may have resulted in a confounding effect on survival outcomes. Additionally, a small number of patients received defibrillation within 5 minutes. All of the factors listed here may have affected the accuracy of conclusions drawn from this subset.
 
This study was based on territory-wide data collected in 2012 to 2013. Thus, it may not reflect the current situation because of changes in patient demography, prevalence of shockable rhythm, and survival enhancement interventions introduced in the past several years. A large multinational study in northern Europe investigated the rate of SIR among OHCA events occurring in residential and public locations from 2006 to 2015. The rate of SIR in public locations remained stable during that period. A decrease was observed in residential locations between 2006 and 2010, but the proportion has remained stable since 2011.21 Therefore, despite these limitations, the findings of the present study add to the broader understanding of OHCA involving older adults.
 
Conclusion
This study revealed significant differences in the patient and bystander characteristics and prehospital interventions among cardiac arrests involving older adults that occurred in homes, on streets, and in other public locations. Many older adults who experienced cardiac arrest in non-residential locations had a shockable rhythm in the early period after receipt of EMS call. Early defibrillation, an independent predictor of survival, was associated with favourable survival outcomes in older adults. These findings suggest that bystanders should provide aggressive resuscitation, including early defibrillation. Additionally, low rates of shockable rhythm and significant delays in bystander and EMS processes were observed within homes. New interventions (eg, volunteer dispatch via mobile applications) are needed to overcome unfavourable factors that affect cardiac arrests occurring within older adult households. Finally, the overall bystander CPR rate was low, indicating that additional measures are needed to overcome bystander inertia. The insights from this study will help to improve survival outcomes in OHCAs involving older adults.
 
Author contributions
The author contributed to the concept or design, analysis or interpretation of data, drafting of the manuscript, and critical revision of the manuscript for important intellectual content. The author had full access to the data, contributed to the study, approved the final version for publication, and takes responsibility for its accuracy and integrity.
 
Conflicts of interest
The author has no conflicts of interest to disclose.
 
Acknowledgement
The author thanks Dr Ling-pong Leung, Emergency Medicine Unit of The University of Hong Kong, for providing the original dataset and permitting its use for secondary analysis in the study. The author also thanks Mr Min Fan and Ms Lujie Chen, both from Emergency Medicine Unit of The University of Hong Kong, for their technical support in this study.
 
Funding/support
This research received no specific grant from any funding in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study is a secondary analysis of a historical cohort study which was approved by the Institutional Review Board of The University of Hong Kong/Hospital Authority Hong Kong West Cluster (Ref No.: UW 15-599). The requirement for informed patient consent was waived because of the retrospective study design. All patient data in the dataset were anonymous.
 
References
1. Census and Statistics Department, Hong Kong SAR Government. Hong Kong population projections 2020-2069. Available from: https://www.censtatd.gov.hk/hkstat/sub/sp190.jsp?productCode=B1120015. Accessed 7 Nov 2020.
2. Census and Statistics Department, Hong Kong SAR Government. The profile of Hong Kong population analysed by District Council district, 2018. Table 2. Proportion of land-based non-institutional population by District Council district and age group, 2017. Available from: https://www.censtatd.gov.hk/en/data/stat_report/ product/FA100096/att/B71807FB2018XXXXB0100.pdf. Accessed 24 Mar 2020.
3. Census and Statistics Department, Hong Kong SAR Government. 2016 Population By-census Office. Main tables (demographic). Population density by District Council district and year. 2017. Available from: https://www.bycensus2016.gov.hk/en/bc-mt.html?search=A202. Accessed 24 Mar 2020.
4. Leung LP, Wong TW, Tong HK, Lo CB, Kan PG. Out-of-hospital cardiac arrest in Hong Kong. Prehosp Emerg Care 2001;5:308-11. Crossref
5. Fan KL, Leung LP, Siu YC. Out-of-hospital cardiac arrest in Hong Kong: a territory-wide study. Hong Kong Med J 2017;23:48-53. Crossref
6. Census and Statistics Department, Hong Kong SAR Government. 2016 Population By-census. Domestic households in Hong Kong. Available from: https://www.bycensus2016.gov.hk/en/Snapshot-04.html. Accessed 24 Mar 2020.
7. Census and Statistics Department, Hong Kong SAR Government. 2011 Hong Kong Population Census. Census results. Available from: https://www.censtatd.gov.hk/en/scode170.html. Accessed 17 Mar 2023.
8. Perkins GD, Jacobs IG, Nadkarni VM, et al. Cardiac arrest and cardiopulmonary resuscitation outcome reports: update of the Utstein Resuscitation Registry Templates for Out-of-Hospital Cardiac Arrest: a statement for healthcare professionals from a task force of the International Liaison Committee on Resuscitation (American Heart Association, European Resuscitation Council, Australian and New Zealand Council on Resuscitation, Heart and Stroke Foundation of Canada, InterAmerican Heart Foundation, Resuscitation Council of Southern Africa, Resuscitation Council of Asia); and the American Heart Association Emergency Cardiovascular Care Committee and the Council on Cardiopulmonary, Critical Care, Perioperative and Resuscitation. Circulation 2015;132:1286-300. Crossref
9. Okabayashi S, Matsuyama T, Kitamura T, et al. Outcomes of patients 65 years or older after out-of-hospital cardiac arrest based on location of cardiac arrest in Japan. JAMA Netw Open 2019;2:e191011. Crossref
10. Dami F, Carron PN, Praz L, Fuchs V, Yersin B. Why bystanders decline telephone cardiac resuscitation advice. Acad Emerg Med 2010;17:1012-5. Crossref
11. Kiyohara K, Nishiyama C, Matsuyama T, et al. Out-of-hospital cardiac arrest at home in Japan. Am J Cardiol 2019;123:1060-8. Crossref
12. Ringh M, Rosenqvist M, Hollenberg J, et al. Mobile-phone dispatch of laypersons for CPR in out-of-hospital cardiac arrest. N Engl J Med 2015;372:2316-25. Crossref
13. Smith CM, Wilson MH, Ghorbangholi A, et al. The use of trained volunteers in the response to out-of-hospital cardiac arrest—the GoodSAM experience. Resuscitation 2017;121:123-6. Crossref
14. Mao DR, Ong ME. High-rise residential resuscitation: scaling the challenge. CMAJ 2016;188:399-400. Crossref
15. Coons SJ, Guy MC. Performing bystander CPR for sudden cardiac arrest: behavioral intentions among the general adult population in Arizona. Resuscitation 2009;80:334-40. Crossref
16. Fan KL, Leung LP, Poon HT, Chiu HY, Liu HL, Tang WY. Public knowledge of how to use an automatic external defibrillator in out-of-hospital cardiac arrest in Hong Kong. Hong Kong Med J 2016;22:582-8. Crossref
17. Resuscitation Council UK. Cardiopulmonary resuscitation, automated defibrillators and the law. 2018. Available from: https://www.resus.org.uk/sites/default/files/2020-05/CPR%20AEDs%20and%20the%20law%20%285%29.pdf. Accessed 24 Mar 2020. Crossref
18. Sayre MR, Barnard LM, Counts CR, et al. Prevalence of COVID-19 in out-of-hospital cardiac arrest: implications for bystander cardiopulmonary resuscitation. Circulation 2020;142:507-9. Crossref
19. Wong TW, Yeung KC. Out-of-hospital cardiac arrest: two and a half years experience of an accident and emergency department in Hong Kong. J Accid Emerg Med 1995;12:34-9. Crossref
20. Lau CL, Lai JC, Hung CY, Kam CW. Outcome of out-of-hospital cardiac arrest in a regional hospital in Hong Kong. Hong Kong J Emerg Med 2005;12:224-7. Crossref
21. Oving I, de Graaf C, Karlsson L, et al. Occurrence of shockable rhythm in out-of-hospital cardiac arrest over time: a report from the COSTA group. Resuscitation 2020;151:67-74. Crossref
22. Kitamura T, Kiyohara K, Sakai T, et al. Public-access defibrillation and out-of-hospital cardiac arrest in Japan. N Engl J Med 2016;375:1649-59. Crossref
23. Fan M, Fan KL, Leung LP. Walking route–based calculation is recommended for optimizing deployment of publicly accessible defibrillators in urban cities. J Am Heart Assoc 2020;9:e014398 Crossref
24. Agerskov M, Nielsen AM, Hansen CM, et al. Public access defibrillation: great benefit and potential but infrequently used. Resuscitation 2015;96:53-8. Crossref
25. Silverman RA, Galea S, Blaney S, et al. The “vertical response time”: barriers to ambulance response in an urban area. Acad Emerg Med 2007;14:772-8. Crossref
26. Drennan IR, Strum RP, Byers A, et al. Out-of-hospital cardiac arrest in high-rise buildings: delays to patient care and effect on survival. CMAJ 2016;188:413-9. Crossref
27. Moore JC, Segal N, Debaty G, Lurie KG. The “do’s and don’ts” of head up CPR: lessons learned from the animal laboratory. Resuscitation 2018;129:e6-e7. Crossref
28. Winther-Jensen M, Kjaergaard J, Hassager C, et al. Resuscitation and post resuscitation care of the very old after out-of-hospital cardiac arrest is worthwhile. Int J Cardiol 2015;201:616-23. Crossref

The real-world impact of the COVID-19 pandemic on patients with cancer: a multidisciplinary cross-sectional survey

Hong Kong Med J 2023 Apr;29(2):132–41 | Epub 14 Apr 2023
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
The real-world impact of the COVID-19 pandemic on patients with cancer: a multidisciplinary cross-sectional survey
Kelvin KH Bao, MB BChir, FRCR; Ka-man Cheung, MB, ChB, FRCR; James CH Chow, MB, ChB, FRCR; Carmen WL Leung, MB, ChB, FRCR; Kam-hung Wong, MB, ChB, FRCR
Department of Clinical Oncology, Queen Elizabeth Hospital, Hong Kong SAR, China
 
Corresponding author: Dr Kelvin KH Bao (bkh641@ha.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The coronavirus disease 2019 (COVID-19) pandemic has caused unprecedented disruptions to cancer care worldwide. We conducted a multidisciplinary survey of the real-world impact of the pandemic, as perceived by patients with cancer.
 
Methods: A total of 424 patients with cancer were surveyed using a 64-item questionnaire constructed by a multidisciplinary panel. The questionnaire examined patient perspectives regarding COVID-19–related effects (eg, social distancing measures) on cancer care delivery, resources, and healthcare-seeking behaviour, along with the physical and psychosocial aspects of patient well-being and pandemic-related psychological repercussions.
 
Results: Overall, 82.8% of respondents believed that patients with cancer are more susceptible to COVID-19; 65.6% expected that COVID-19 would delay anti-cancer drug development. Although only 30.9% of respondents felt that hospital attendance was safe, 73.1% expressed unaltered willingness to attend scheduled appointments; 70.3% of respondents preferred to receive chemotherapy as planned, and 46.5% were willing to accept changes in efficacy or side-effect profile to allow an outpatient regimen. A survey of oncologists revealed significant underestimation of patient motivation to avoid treatment interruptions. Most surveyed patients felt that there was an insufficient amount of information available concerning the impact of COVID-19 on cancer care, and most patients reported social distancing–related declines in physical, psychological, and dietary wellness. Sex, age, education level, socio-economic status, and psychological risk were significantly associated with patient perceptions and preferences.
 
Conclusion: This multidisciplinary survey concerning the effects of the COVID-19 pandemic revealed key patient care priorities and unmet needs. These findings should be considered when delivering cancer care during and after the pandemic.
 
 
New knowledge added by this study
  • Most patients with cancer (73.1%) reported that their willingness to attend scheduled oncology appointments was not affected by the pandemic. All surveyed oncologists underestimated patient motivation to avoid treatment interruptions.
  • Patient acceptance of telerehabilitation varied according to age and socio-economic status, whereas the negative impact of social distancing on patients with cancer were substantial and multidimensional.
  • Psychometric analyses can stratify patients with cancer into psychological risk groups, based on their distinct perceptions of the pandemic.
Implications for clinical practice or policy
  • These findings will help to increase awareness of the effects of the coronavirus disease 2019 pandemic on patients with cancer, revealing their priorities and unmet needs.
  • This work aligns the expectations of oncologists and patients with cancer with respect to modifications of cancer services during the pandemic.
  • These results will promote better resource allocation and earlier multidisciplinary interventions to reduce pandemic-related impact on at-risk populations.
 
 
Introduction
The coronavirus disease 2019 (COVID-19) pandemic poses an unprecedented threat to health systems worldwide. During the first year of the pandemic, there were more than 110 million confirmed COVID-19 cases globally and more than 2.6 million deaths.1 In terms of scale, the number of COVID-19 cases during that period was at least sixfold more than the global number of new cancer cases in 2018; the mortality during that period exceeded the combined mortalities of lung cancer and breast cancer in 2018.2 The many consequences of COVID-19 have included unprecedented disruptions to cancer care services,3 4 5 such as cancellations of outpatient appointments to delays in scheduled systemic treatments and radiotherapy; during periods of increased transmission, such disruptions have forced oncologists to make difficult decisions in attempts to balance patient protection and disease control. There have been similar impact to the delivery of oncology-related allied health services, including physical therapy and occupational therapy,6 dietetics,7 diagnostic imaging,8 and psychological services for patients with cancer9; there has been a particularly large shift in the use of telemedicine. Throughout the COVID-19 pandemic, and particularly during periods of increased transmission, good multidisciplinary coordination has been a crucial aspect of cancer care. By acquiring comprehensive knowledge regarding perceptions of the pandemic, changes in healthcare-seeking behaviour, impact on daily life, and the newly emerged unmet needs (physical, socio-economic, and psychological) of patients with cancer, multidisciplinary cancer caregivers can customise and adjust their services accordingly, thus enabling appropriate resource allocation. Considering these challenges, we designed and conducted a prospective study to comprehensively examine the real-world impact of COVID-19 on patients with cancer; we sought to identify actionable solutions from the perspective of experienced multidisciplinary cancer caregivers.
 
Methods
A prospective survey regarding the perspectives of patients with cancer on the impact of the COVID-19 pandemic was jointly developed by a multidisciplinary team at Queen Elizabeth Hospital, Hong Kong that consisted of clinical oncologists, clinical psychologists, physiotherapists, occupational therapists, and dieticians specialising in cancer care. A pilot survey was administered to 88 patients, followed by interviews to further assess patient understanding of the questions and to develop additional items via thematic analysis. The multidisciplinary team then refined the questionnaire. The final version consisted of 64 items with a combination of Likert scales and polar questions; it covered topics such as patient perceptions of cancer care resources, treatment delivery and quality, changes in healthcare-seeking behaviour, adequacy of available pandemic-related information, social distancing–related adverse impact, and psychological repercussions of the pandemic. We also invited patients who were newly diagnosed with cancer to complete an extended questionnaire which focused on psychometric measurements of post-traumatic stress disorder (PTSD) [the PTSD Checklist for DSM-5 (PCL-5)],10 anxiety and depression (the Emotion Thermometers tool),11 and intolerance of uncertainty (the Intolerance of Uncertainty Scale-12 [IUS-12]).12 Patients were then stratified into risk groups. High-risk individuals had scores of ≥5 on the abbreviated PCL-5 scale,13 ≥3 on the Emotion Thermometers for depression or anxiety,14 and ≥25 on the IUS-12 scale.11 Associations between patient perceptions and psychological risk were then explored. Full details of the patient questionnaire are shown in online supplementary Table 1.
 
Furthermore, we surveyed clinical oncologists in Hong Kong (practising in Queen Elizabeth Hospital, United Christian Hospital, and Buddhist Hospital) regarding their perceptions of the pandemic; the oncologists were also asked to predict the responses of patients with cancer in various domains of interest.
 
The patient survey was conducted between 12 and 22 May 2020 at Queen Elizabeth Hospital. Patients with cancer and survivors aged ≥18 years who attended their outpatient oncology appointments were invited to participate. Patients who could not read English or Chinese were excluded, and participation was voluntary. Detailed survey information was provided on the questionnaire cover sheet, and a patient’s decision to participate in the survey was regarded as informed consent. Hardcopies of the questionnaire were anonymously completed by participants on site, then collected by dedicated nursing staff.
 
Data analysis
Descriptive analysis was used to describe various impact of the pandemic on patients. Patient demographics, disease characteristics, treatment details, and socio-economic information were summarised. Qualitative data are presented as the percentage of respondents who selected a particular response. Chi squared tests were used to determine associations between responses and categorical patient factors. P values <0.05 were considered indicative of statistical significance. Analyses were performed using SPSS software (Windows version 25.0; IBM Corp, Armonk [NY], United States).
 
Results
Demographic characteristics
Between 12 and 22 May 2020, 650 patients with cancer were invited to participate in the survey; 424 responses were received, yielding a response rate of 65.2%. Demographic and clinical characteristics of the participants are presented in the Table. Most survey respondents were female (70.0%), and more than half were aged 46 to 75 years. Nearly half (46.0%) of the respondents were receiving active cancer treatment. The cancer stage was III or below in half of the respondents and 20.3% of respondents were at stage IV; 29.2% of respondents were uncertain of their staging. Almost half (43.9%) of the respondents had a monthly family income of <HK$16 000 (around US$2000), which is Hong Kong’s 2018 poverty line for a family of three.15
 

Table. Demographic and clinical characteristics of patients with cancer (n=424)
 
Impact of coronavirus disease 2019 on cancer resources
As shown in online supplementary Table 1, most respondents (82.8%) believed that patients with cancer are more susceptible to COVID-19, while more than half (52.1%) believed that cancer-related resources will be depleted and 59.3% were concerned that healthcare workforce shortages during the pandemic would harm their treatment. Overall, 65.6% of respondents were concerned that COVID-19 would lead to delays in anti-cancer drug development. These concerns were significantly associated with education level of patients, in which the more educated respondents demonstrated less concern (tertiary level 57.6% vs secondary 64.4% vs primary 71.4%, P=0.01) [Fig 1a].
 

Figure 1. (a) Level of patient concern about the impact of coronavirus disease 2019 (COVID-19) on anti-cancer drug development by education level (n=424). (b) Effect of COVID-19 pandemic on patient willingness to attend hospital appointments by patient demographics (n=424). (c) Perceived availability of information about the impact of COVID-19 on patients with cancer by education level (n=424)
 
Impact of coronavirus disease 2019 on healthcare-seeking behaviour
As shown in online supplementary Table 1, fewer than one-third of respondents (30.9%) felt that it was safe to attend hospital appointments during the pandemic (the proportion was greater among men than among women: 37.8% vs 27.8%). Furthermore, most respondents reported that their willingness to attend oncology clinic appointments (73.1%) or undergo clinical tests (80.2%) was unaffected. Age (P=0.021), sex (P=0.029), and education level (P=0.003) were factors significantly associated with patient willingness; respondents aged 18-45 years or >75 years, female, and more educated individuals were more hesitant to attend their scheduled appointments (Fig 1b).
 
Compared with the pre-pandemic period, most respondents (79.0%) stated that they were equally willing (65.1%) or more willing (13.9%) to seek medical attention now if they felt unwell. Overall, 62.2% of respondents were equally willing or more willing to be hospitalised if requested by their oncologists. Male respondents were more willing to be hospitalised, compared with female respondents (68.3% vs 59.2%); additionally, respondents receiving radical treatment were more willing to be hospitalised, compared with respondents receiving palliative treatment (71.3% vs 59.0%).
 
Effects of social distancing on medical consultation and cancer treatment
Nearly all respondents (98.4%) felt that it was acceptable for medical staff to maintain an increased physical distance from patients during consultations, and most (83.3%) felt that it did not negatively impact the quality of their clinic experience. During the pandemic, some clinically stable patients were exempt from the requirement for oncologist examination prior to medication refills. Overall, 59.9% of respondents felt that such an arrangement should continue beyond the pandemic period (male respondents vs female respondents: 69.3% vs 55.9%; respondents receiving radical treatment vs respondents receiving palliative treatment: 54.4% vs 69.2%).
 
Concerning the effects of the pandemic on plans for cancer treatment, most respondents stated that their decisions to receive chemotherapy (70.3%) or radiotherapy (67.9%) were unaffected. However, 46.5% of the respondents were willing to accept changes in treatment efficacy or side-effect profile to allow an outpatient regimen; this preference was particularly strong among respondents receiving palliative treatment (61.5%), compared with respondents receiving radical treatment (33.7%).
 
Acquisition and adequacy of pandemic-related information
As shown in online supplementary Table 1, half of the respondents (49.1%) spent an average of 10 to 30 minutes daily interacting with news and information sources focused on COVID-19. Their most common news sources were television (41.7%), the internet (28.0%), and newspapers (12.9%). Only 3.5% of respondents received pandemic-related news or information from their hospitals. Young respondents (<45 years) were significantly more likely to receive news primarily from the internet, compared with older respondents (>75 years) (37.6% vs 13.4%; P=0.001); such a difference was also present between respondents with different education levels (tertiary education vs primary education: 35.5% vs 17.4%; P=0.001) and between respondents with different levels of monthly family income (>HK$40 000 vs <HK$16 000: 32% vs 25%; P=0.006).
 
Concerning the adequacy of information received regarding COVID-19 and its impact on patients with cancer, more than half of the respondents (54.1%) felt that it was inadequate; this sentiment was more prevalent among respondents with a higher education level, compared with those who were less educated (tertiary vs primary level: 60.2% vs 40.3%; P=0.017) [Fig 1c].
 
Effects of social distancing on allied health professional services
During the pandemic, social distancing became the new daily norm. Nearly half of the respondents (49.5%) reported exercising less, whereas 10.4% reported exercising more. In general, 55.4% of respondents noticed an overall deterioration in their physical well-being (Fig 2a); about one-third of respondents (32.1%) reporting reduced walking tolerance, and 25.9% of respondents noticed some reduction in limb power (online supplementary Table 1).
 

Figure 2. (a) Patient perception that being homebound during lockdown results in physical and functional deterioration by age-group (n=424). (b) Preferred method of physiotherapy delivery during the coronavirus disease 2019 (COVID-19) pandemic by patient demographics (n=424). (c) Level of patient concern about the waiting time at outpatient department and associated risk of infection, compared with the pre–COVID-19 pandemic period (n=424). (d) Perceived availability of information about the impact of COVID-19 on patients with cancer (n=424)
 
With respect to patient preferences regarding physiotherapy delivery during the pandemic, 42.9% of young respondents (18-45 years) preferred online sessions, whereas 50.0% of older respondents (>75 years) preferred home visits by a therapist [Fig 2b]. Education level (P=0.029) and income (P=0.034) were significantly associated with patient preference. Respondents with a higher education level (tertiary vs primary level: 33.3% vs 16.3%) and respondents with a higher income (monthly income >HK$40 000 vs <HK$16 000: 50.3% vs 26.0%) preferred online sessions, rather than in-person sessions.
 
During the period of social distancing, 64.5% of older respondents (>75 years) felt that their lives had become monotonous and lonely; significantly fewer (39.3%) younger respondents (<45 years) expressed this sentiment. Most respondents (58.0%) agreed that their home care support had improved because family members spent more time together; this sentiment was more prevalent among older respondents (>75 years) [70.9%].
 
As shown in online supplementary Table 1, the pandemic caused dietary habit alterations in 38.9% of respondents. Approximately one-fifth of respondents reported reduced appetite (22.2%) and increased consumption of junk food (processed or ready-made meals) (19.3%). Significantly more respondents in the low-income subgroup reported reduced appetite, compared with respondents in the high-income subgroup (30.6% vs 8.2%, P=0.001). Notably, the use of face masks led to a reduction in meal frequency among 30.7% of respondents; this reduction was more prevalent among respondents with lower income, compared with respondents who had higher income (32.6% vs 23.2%).
 
Psychological impact of coronavirus disease 2019
Overall, 41.0% and 23.1% of respondents had recently experienced anxiety and/or depressed mood. In total, 103 consecutive newly diagnosed patients responded to the extended psychometric questionnaire. The results revealed greater levels of concern regarding the impact of COVID-19 on cancer care manpower and the risk of infection during outpatient clinic waiting time in patients with higher risks of PTSD (P=0.011 and P=0.015, respectively), anxiety (P=0.013 and P=0.034, respectively), depression (P=0.017 and P=0.043, respectively), and uncertainty intolerance (P=0.004 and P=0.044, respectively) [Fig 2c]. A high IUS-12 score (uncertainty intolerance) was associated with the presence of greater concern regarding the effects of the pandemic on cancer research and drug development (P=0.03). As shown in online supplementary Table 2, respondents with a high risk of anxiety were less likely to agree with the ‘no visiting’ policy of hospitals (P=0.013). More respondents with high risks of anxiety (P=0.024) and depression (P=0.044) felt that there was an insufficient amount of information available in the media regarding the impact of COVID-19 on patients with cancer (Fig 2d). Moreover, respondents with a high risk of PTSD demonstrated significantly greater concern when asked about their fear of being infected by their caregiver or family members, compared with respondents who had a low risk of PTSD (P=0.005). Detailed results of the psychometric questionnaire are shown in online supplementary Table 2.
 
Comparison of oncologist and patient perspectives
We invited 30 practising clinical oncologists to predict patient healthcare-seeking behaviour during the pandemic. All 21 responding oncologists predicted significant reductions in patient willingness to attend appointments and patient willingness to be hospitalised, but most patients reported no change in either type of willingness (73.3% and 54.7%, respectively). A greater proportion of oncologists (50.0%) than patients (16.7%) reported a negative impact on their clinic experience because of doctor-patient distancing measures (Fig 3a). Furthermore, when asked about their confidence in identifying the cause of a new fever (COVID-19–related vs other causes), most oncologists reported little (73.3%) or no confidence (13.3%), whereas almost half of the patients (47.4%) reported that they were quite or very confident in their ability to identify the cause of a new fever (Fig 3b).
 

Figure 3. (a) Effect of physical distancing by medical staff during consultations on perception of clinic experience of patients (n=424) and oncologists (n=30) during coronavirus disease 2019. (b) Level of confidence among patients (n=424) and oncologists (n=30) in identifying the cause of a fever in the coming week
 
Discussion
This study investigated the perceptions of patients with cancer regarding the real-world impact of COVID-19 (during the early days of the pandemic) through the perspective of a multidisciplinary team that included clinical oncologists, clinical psychologists, physiotherapists, dieticians, and occupational therapists. Using a comprehensive set of questions, we identified key concerns, unmet needs, perceptions, and expectations of patients with cancer at different stages in their cancer care journeys. Cancer care16 and pandemic management17 are both resource-intensive endeavours. Because the COVID-19 pandemic has become the focus of healthcare worldwide, it is understandable that patients with cancer are concerned about pandemic-related negative impact on cancer care resources. Our results suggest that patients with cancer remained committed to attending scheduled appointments, despite the perceived risk of COVID-19 during the early days of the pandemic. This sustained clinical demand—along with general acceptance among patients regarding COVID-19 adaptive measures (staff-patient distancing), streamlined services (prescription-only clinics), and outcome trade-offs (efficacy and side-effect profile)—allowed our oncology services to continue with minimal disruptions despite the reduced availability of healthcare resources. Nevertheless, only 30.9% of surveyed patients felt that it was safe to attend the hospital; this observation highlights the need to ensure patients are informed about hospital safety measures for COVID-19 management, with details regarding rationale and efficacy. This study also revealed a discrepancy between male and female patients in terms of healthcare-seeking behaviour; moreover, patients receiving radical treatment demonstrated different perceptions and needs, compared with patients receiving palliative care. Oncology healthcare providers should consider the unique needs of various patient groups when implementing pandemic management strategies.
 
Between the initial outbreak and the time of this survey, the general public’s understanding of COVID-19 heavily relied on mainstream media coverage,18 which often did not focus on the needs of specific patient groups. International guidelines regarding cancer management during the pandemic began to emerge later in 2020,19 but they mainly targeted medical professionals. Accordingly, patients with cancer felt that the COVID-19–related information provided to patients was inadequate. This perception was particularly prevalent among patients with a higher education level, who tended to obtain news and information more frequently from multiple sources (eg, the internet and social media). Notably, this situation highlights the phenomenon of ‘the more you know, the more you realise you don’t know’, thereby emphasising the presence of an additional information barrier for underprivileged patient groups.20 The situation is further complicated by the presence of COVID-19–related misinformation, which has been widespread on social media since 2020.21 The findings in this study provide insights concerning the distinct pandemic-related information preferences and needs among patients according to age, education level, and income. Cancer services should focus on addressing these preferences and needs by providing patients with current COVID-19–related information from official sources, ensuring that the hospital remains a source of verified and practical pandemic-related information accessible to all patients.
 
The consequences of social distancing (eg, reduced exercise, poor diet, increased financial burden, and loss of social interactions) are detrimental to the physical and psychological well-being of patients with cancer,22 23 24 potentially reducing cancer treatment tolerance and compromising outcomes. Although the impact of pandemic-related lockdowns on dietary patterns of diabetic patients25 and older population26 have been studied, there are limited prospective data regarding the nutritional status of patients with cancer and their needs during the pandemic. This study has revealed some real-world patient needs, particularly among socio-economically disadvantaged patients; it also highlights the importance of individualised dietetic and occupational health assessments and early interventions (inpatient or outpatient) by dieticians and occupational therapists who specialise in cancer care. Dedicated self-help materials prepared by allied health professionals to address the adverse effects of social distancing may also serve as effective resources.
 
When the pandemic began, telerehabilitation emerged as a promising alternative method for patient–clinician interactions, with effective use in a physiotherapy context.27 28 However, our findings indicate that telerehabilitation may not be universally welcomed, particularly among older patients. Turolla et al29 described the challenges of implementing telerehabilitation; our findings highlight the need to carefully examine telemedicine accessibility and ‘telehealth literacy’30 among socio-economically underprivileged populations.31 32 When possible, conventional physiotherapy and rehabilitation should remain available, particularly for older adults, less-educated individuals, and low-income patients.33 Our findings offer a rationale for triaging appropriate patients towards telemedicine; they also highlight the need for improving telemedicine quality and access, as well as the importance of ensuring that alternatives are available.
 
This study demonstrated that psychometric analysis is a meaningful tool for identifying at-risk populations of patients with cancer during the pandemic. Patients in psychological high-risk groups clearly demonstrated distinct perceptions, expectations, and needs when simultaneously confronted with a cancer diagnosis and the COVID-19 pandemic. Without effective management, such patients could experience long-lasting psychiatric morbidities, as revealed during the severe acute respiratory syndrome epidemic in 2003.34 35 Wang et al36 emphasised the importance of mental healthcare attention and resources for patients with cancer during the COVID-19 pandemic. Along with routine cancer care, targeted psychotherapies and follow-up care for both the acute impact and long-term sequelae of COVID-19 are needed.
 
Oncologists and patients with cancer have different perceptions of cancer symptoms, treatment priorities, and psychosocial needs.37 38 39 We found that oncologists tended to underestimate patient motivation to avoid treatment interruptions, as well as patient risk acceptance, consistent with the observation by Catania et al40 that patients with cancer were more concerned about their cancers than about the pandemic. Moreover, compared with oncologists, a greater proportion of our surveyed patients expressed confidence in identifying COVID-19 symptoms. These results illustrate differences in priorities and perceptions of pandemic severity, along with the challenge of balancing disruptions to cancer treatment and maintaining COVID-19–related safety.
 
Proposed interventions to minimise the impact of coronavirus disease 2019 on cancer patients
The following are some proposed interventions to minimise the impact of COVID-19 on cancer patients:
1. Ensure that patients are informed about hospital safety measures for COVID-19 management, with details regarding rationale and efficacy.
2. Ensure that healthcare staff maintain appropriate physical distance from patients.
3. Operate prescription-only clinics and lengthen follow-up intervals for clinically stable patients.
4. Triage appropriate patients towards telemedicine; enhance general telehealth literacy by implementing user-friendly interfaces, step-by-step demonstrations, and support hotlines.
5. Establish a regularly updated COVID-19–related newsfeed that is customised for patients with cancer.
6. Work with dieticians, physiotherapists, and occupational therapists to create self-help pamphlets that can guide patients with cancer in coping with the effects of social distancing; facilitate the establishment of virtual support groups for patients with cancer.
7. Implement early allied health assessments and interventions for at-risk patients.
8. Ensure early psychological support, particularly for newly diagnosed patients.
9. Compassionately and flexibly enforce restrictive measures for newly diagnosed patients, individuals approaching the end of life, and selected at-risk patients.
10. Periodically review these measures as the pandemic progresses.
 
Study strengths and limitations
To our knowledge, this is one of the first studies to simultaneously explore perceptions of the real-world impact of the COVID-19 pandemic among patients with cancer and oncologists. Importantly, the efforts of the multidisciplinary team to construct the questions contributed to a multidimensional, holistic understanding of issues and unmet needs that affect patients with cancer at different stages of their cancer care journeys. Because of the in-person survey invitation and paper-and-pen methodology, our survey achieved a high response rate of 65%, ensuring that the results are representative of the surveyed population. However, sampling bias was present because survey respondents were patients who physically attended their clinic appointments; data were missing for around 10% of patients who declined to attend their clinic appointments. Furthermore, this survey was conducted within a short interval (2 weeks) towards the end of the first wave of the COVID-19 pandemic in Hong Kong, when there was a gradual easing of lockdown policies and personal protective equipment availability began to improve41; thus, this cross-sectional assessment may not adequately reflect the evolution of patient perceptions regarding the COVID-19 pandemic. Other key limitations of the study include its inclusion of patients from a single cancer centre, as well as the exclusion of patients who could not read Chinese or English and patients who underwent treatment in private clinics. There is a need to repeat the study at various time points throughout the pandemic; future analyses should focus on other affected countries and patient populations.
 
Conclusion
This multidisciplinary survey concerning the effects of the COVID-19 pandemic impact revealed key care priorities among patients with cancer, as well as their unmet needs; in particular, it highlighted the importance of distinct priorities and needs among socio-economically underprivileged groups and patients with diverse psychological phenotypes. Oncologists should be aware that their own perceptions of pandemic-related effects differ from their patients’ perceptions. These findings should be carefully considered as multidisciplinary teams modify their delivery of cancer care services during and after the pandemic.
 
Author contributions
Concept or design: KKH Bao, KM Cheung, JCH Chow.
Acquisition of data: All authors.
Analysis or interpretation of data: KKH Bao, KM Cheung, JCH Chow.
Drafting of the manuscript: KKH Bao, KM Cheung, JCH Chow.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
We thank all multidisciplinary team members and participating patients for their efforts and contributions.
 
Declaration
This research has been presented in part as poster presentations at the following conferences:
1. ESMO Congress 2020, virtual, 19-21 Sep 2020 (title: Cancer patients’ perspectives on the real-world impact of COVID-19 pandemic: a multidisciplinary survey)
2. ESMO Asia Congress 2020, virtual, 20-22 Nov 2020 (title: Psychometric interplay of the perception of the real-life impact of COVID-19 pandemic: a cross-sectional survey of patients with newly diagnosed malignancies)
 
Funding/support
This research was supported by the Hong Kong Hospital Authority Kowloon Central Cluster Research Grant 2020 (Ref No.: KCC/RC/G/2021-B01).
 
Ethics approval
The study was approved by the Kowloon Central/Kowloon East Cluster Clinical Research Ethics Committee of Hospital Authority, Hong Kong (Ref No.: KC/KE-20-0126/ER-1). All eligible respondents explicitly agreed to join the panel and provided informed consent to participate in the study.
 
References
1. World Health Organization. WHO Coronavirus Disease (COVID-19) Dashboard. Available from: https://covid19.who.int. Accessed 7 Nov 2020.
2. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2018;68:394-424. Crossref
3. Jazieh AR, Akbulut H, Curigliano G, et al. Impact of the COVID-19 pandemic on cancer care: a global collaborative study. JCO Glob Oncol 2020;6:1428-38. Crossref
4. Chen-See S. Disruption of cancer care in Canada during COVID-19. Lancet Oncol 2020;21:e374. Crossref
5. Sud A, Torr B, Jones ME, et al. Effect of delays in the 2-week-wait cancer referral pathway during the COVID-19 pandemic on cancer survival in the UK: a modelling study. Lancet Oncol 2020;21:1035-44. Crossref
6. Prvu Bettger J, Thoumi A, Marquevich V, et al. COVID-19: maintaining essential rehabilitation services across the care continuum. BMJ Glob Health 2020;5:e002670. Crossref
7. Thibault R, Coëffier M, Joly F, Bohé J, Schneider SM, Déchelotte P. How the Covid-19 epidemic is challenging our practice in clinical nutrition—feedback from the field. Eur J Clin Nutr 2021;75:407-16. Crossref
8. Maringe C, Spicer J, Morris M, et al. The impact of the COVID-19 pandemic on cancer deaths due to delays in diagnosis in England, UK: a national, population-based, modelling study. Lancet Oncol 2020;21:1023-34. Crossref
9. Liu S, Yang L, Zhang C, et al. Online mental health services in China during the COVID-19 outbreak. Lancet Psychiatry 2020;7:e17-8. Crossref
10. Price M, Szafranski DD, van Stolk-Cooke K, Gros DF. Investigation of abbreviated 4 and 8 item versions of the PTSD Checklist 5. Psychiatry Res 2016;239:124-30. Crossref
11. Beck KR, Tan SM, Lum SS, Lim LE, Krishna LK. Validation of the emotion thermometers and hospital anxiety and depression scales in Singapore: screening cancer patients for distress, anxiety and depression. Asia Pac J Clin Oncol 2016;12:e241-9. Crossref
12. Carleton RN, Norton MA, Asmundson GJ. Fearing the unknown: a short version of the Intolerance of Uncertainty Scale. J Anxiety Disord 2007;21:105-17. Crossref
13. Geier TJ, Hunt JC, Hanson JL, et al. Validation of abbreviated four- and eight-item versions of the PTSD checklist for DSM-5 in a traumatically injured sample. J Trauma Stress 2020;33:218-26. Crossref
14. Schubart JR, Mitchell AJ, Dietrich L, Gusani NJ. Accuracy of the Emotion Thermometers (ET) screening tool in patients undergoing surgery for upper gastrointestinal malignancies. J Psychosoc Oncol 2015;33:1-14. Crossref
15. Office of the Government Economist Financial Secretary’s Office. Census and Statistics Department. Hong Kong SAR Government. Hong Kong Poverty Situation Report 2018. Available from: https://www.povertyrelief.gov.hk/eng/pdf/Hong_Kong_Poverty_Situation_Report_2018(2019.12.13).pdf. Accessed 17 Mar 2023.
16. Mariotto AB, Yabroff KR, Shao Y, Feuer EJ, Brown ML. Projections of the cost of cancer care in the United States: 2010-2020. J Natl Cancer Inst 2011;103:117-28. Crossref
17. Tan-Torres Edejer T, Hanssen O, Mirelman A, et al. Projected health-care resource needs for an effective response to COVID-19 in 73 low-income and middle-income countries: a modelling study. Lancet Glob Health 2020;8:e1372-9. Crossref
18. Casero-Ripollés A. Impact of Covid-19 on the media system. Communicative and democratic consequences of news consumption during the outbreak. El profesional de la información 2020;29:e290223. Crossref
19. Curigliano G, Banerjee S, Cervantes A, et al. Managing cancer patients during the COVID-19 pandemic: an ESMO multidisciplinary expert consensus. Ann Oncol 2020;31:1320-35. Crossref
20. Matsuyama RK, Wilson-Genderson M, Kuhn L, Moghanaki D, Vachhani H, Paasche-Orlow M. Education level, not health literacy, associated with information needs for patients with cancer. Patient Educ Couns 2011;85:e229-36. Crossref
21. Pennycook G, McPhetres J, Zhang Y, Lu JG, Rand DG. Fighting COVID-19 misinformation on social media: experimental evidence for a scalable accuracy-nudge intervention. Psychol Sci 2020;31:770-80. Crossref
22. Di Corrado D, Magnano P, Muzii B, et al. Effects of social distancing on psychological state and physical activity routines during the COVID-19 pandemic. Sport Sci Health 2020;16:619-24. Crossref
23. Ammar A, Brach M, Trabelsi K, et al. Effects of COVID-19 home confinement on eating behaviour and physical activity: results of the ECLB-COVID19 International Online Survey. Nutrients 2020;12:1583. Crossref
24. Galea S, Merchant RM, Lurie N. The mental health consequences of COVID-19 and physical distancing: the need for prevention and early intervention. JAMA Intern Med 2020;180:817-8. Crossref
25. Ruiz-Roso MB, Knott-Torcal C, Matilla-Escalante DC, et al. COVID-19 lockdown and changes of the dietary pattern and physical activity habits in a cohort of patients with type 2 diabetes mellitus. Nutrients 2020;12:2327. Crossref
26. Ceolin G, Moreira JD, Mendes BC, Schroeder J, Di Pietro PF, Rieger DK. Nutritional challenges in older adults during the COVID-19 pandemic [in Spanish]. Rev Nutrição 2020;33:e200174.Crossref
27. Smith AC, Thomas E, Snoswell CL, et al. Telehealth for global emergencies: implications for coronavirus disease 2019 (COVID-19). J Telemed Telecare 2020;26:309-13. Crossref
28. Scherrenberg M, Wilhelm M, Hansen D, et al. The future is now: a call for action for cardiac telerehabilitation in the COVID-19 pandemic from the secondary prevention and rehabilitation section of the European Association of Preventive Cardiology. Eur J Prev Cardiol 2021;28:524-40. Crossref
29. Turolla A, Rossettini G, Viceconti A, Palese A, Geri T. Musculoskeletal physical therapy during the COVID-19 pandemic: is telerehabilitation the answer? Phys Ther 2020;100:1260-4. Crossref
30. Norman CD, Skinner HA. eHealth literacy: essential skills for consumer health in a networked world. J Med Internet Res 2006;8:e9. Crossref
31. Jasemian Y. Elderly comfort and compliance to modern telemedicine system at home. 1st International ICST Workshop on Connectivity, Mobility and Patients’ Comfort. 2008. Available from: https://eudl.eu/doi/10.4108/icst.pervasivehealth2008.2516. Accessed 5 May 2020. Crossref
32. Bujnowska-Fedak MM, Grata-Borkowska U. Use of telemedicine-based care for the aging and elderly: promises and pitfalls. Smart Homecare Technol Telehealth 2015;3:91-105. Crossref
33. Chesser A, Burke A, Reyes J, Rohrberg T. Navigating the digital divide: a systematic review of eHealth literacy in underserved populations in the United States. Inform Health Soc Care 2016;41:1-19. Crossref
34. Chua SE, Cheung V, McAlonan GM, et al. Stress and psychological impact on SARS patients during the outbreak. Can J Psychiatry 2004;49:385-90. Crossref
35. Mak IW, Chu CM, Pan PC, Yiu MG, Chan VL. Long-term psychiatric morbidities among SARS survivors. Gen Hosp Psychiatry 2009;31:318-26.Crossref
36. Wang Y, Duan Z, Ma Z, et al. Epidemiology of mental health problems among patients with cancer during COVID-19 pandemic. Transl Psychiatry 2020;10:263. Crossref
37. Newell S, Sanson-Fisher RW, Girgis A, Bonaventura A. How well do medical oncologists’ perceptions reflect their patients’ reported physical and psychosocial problems? Data from a survey of five oncologists. Cancer 1998;83:1640-51. Crossref
38. Sakai H, Umeda M, Okuyama H, Nakamura S. Differences in perception of breast cancer treatment between patients, physicians, and nurses and unmet information needs in Japan. Support Care Cancer 2020;28:2331-8. Crossref
39. Söllner W, DeVries A, Steixner E, et al. How successful are oncologists in identifying patient distress, perceived social support, and need for psychosocial counselling? Br J Cancer 2001;84:179-85. Crossref
40. Catania C, Spitaleri G, Del Signore E, et al. Fears and perception of the impact of COVID-19 on patients with lung cancer: a mono-institutional survey. Front Oncol 2020;10:584612. Crossref
41. Coronavirus.gov. Hong Kong SAR Government. Together, We Fight the Virus. Available from: https://www.coronavirus.gov.hk/eng/index.html. Accessed 7 Nov 2020.

Implementation of ovarian tissue cryopreservation in Hong Kong

Hong Kong Med J 2023 Apr;29(2):121–31 | Epub 24 Feb 2023
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Implementation of ovarian tissue cryopreservation in Hong Kong
Jacqueline PW Chung, MB, ChB, FHKAM (Obstetrics and Gynaecology)1,2; David YL Chan, BSc (Ulster), DPhil (Oxon)1,2; Y Song, BSc (Peking University), PhD (CUHK)3; Elaine YL Ng, BSc (CUHK), MPhil (CUHK)1; Tracy SM Law, MB, ChB, FHKAM (Obstetrics and Gynaecology)1; Karen Ng, MB, ChB, FHKAM (Obstetrics and Gynaecology)1; Maran BW Leung, BSc (CUHK), PhD (CUHK)3; S Wang, MB, BS, MSc1; HM Wan, BEng (Jinan University), MSc (Jinan University)1; Joshua JX Li, MB, ChB, FHKAM (Pathology)5; CC Wang, MB, BS, PhD (Surgical Sciences in Obstetrics and Gynaecology)1,2,6
1 Assisted Reproductive Technology Unit, Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Hong Kong
2 Fertility Preservation Research Centre, Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Hong Kong
3 Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Hong Kong
4 Department of Obstetrics and Gynaecology, Union Hospital, Hong Kong
5 Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Hong Kong
6 Li Ka Shing Institute of Health Science, School of Biomedical Sciences; and Chinese University of Hong Kong–Sichuan University Joint Laboratory in Reproductive Medicine, The Chinese University of Hong Kong, Hong Kong
 
Corresponding author: Prof Jacqueline PW Chung (jacquelinechung@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Worldwide, >130 babies have been born from ovarian tissue cryopreservation (OTC) and ovarian tissue transplantation (OTT). Ovarian tissue cryopreservation can improve quality of life among young female cancer survivors. Here, we assessed the feasibility of OTC and subsequent OTT in Hong Kong via xenografts in nude mice.
 
Methods: This pilot study was conducted in a university-affiliated tertiary hospital. Fifty-two ovarian tissues were collected from 12 patients aged 29 to 41 years during ovarian surgery, then engrafted into 34 nude mice. The efficacies of slow freezing and vitrification were directly compared. In Phase I, non-ovariectomised nude mice underwent ovarian tissue engraftment. In Phase II, ovariectomised nude mice underwent ovarian tissue engraftment, followed by gonadotrophin administration to promote folliculogenesis. Ovarian tissue viability was assessed by gross anatomical, histological, and immunohistochemical examinations before and after OTC. Follicular density and morphological integrity were also assessed.
 
Results: After OTC and OTT, grafted ovarian tissues remained viable in nude mice. Primordial follicles were observed in thawed and grafted ovarian tissues, indicating that the cryopreservation and transplantation protocols were both effective. The results were unaffected by gonadotrophin stimulation.
 
Conclusion: This study demonstrated the feasibility of OTC in Hong Kong as well as primordial follicle viability after OTC and OTT in nude mice. Ovarian tissue cryopreservation is ideal for patients who cannot undergo the ovarian stimulation necessary for oocyte or embryo freezing as well as prepubertal girls (all ineligible for oocyte freezing). Our findings support the clinical implementation of OTC and subsequent OTT in Hong Kong.
 
 
New knowledge added by this study
  • This study assessed the viability of ovarian tissue cryopreservation and subsequent ovarian tissue transplantation in Hong Kong via xenografts in nude mice.
  • Grafted ovarian tissues remained viable after transplantation, regardless of protocol (slow freezing or vitrification).
Implications for clinical practice or policy
  • Ovarian tissue cryopreservation is ideal for patients who cannot undergo the ovarian stimulation necessary for oocyte or embryo freezing, as well as prepubertal girls (all ineligible for oocyte freezing).
  • These findings support the clinical implementation of ovarian tissue cryopreservation and subsequent ovarian tissue transplantation in Hong Kong.
  • Further studies are needed to clarify optimal cryopreservation and engraftment protocols.
 
 
Introduction
A diagnosis of cancer is disheartening news for every patient in terms of both disease and treatment. Anticancer treatments for common cancers (eg, chemotherapy and radiotherapy) are gonadotoxic and detrimental to future fertility.1 Advances in medical treatment have improved the 5-year survival rates of some cancers to >80% in children and adolescents.2 However, most surviving patients experience illness-related infertility.3 Infertility is also a concern for patients with severe endometriosis, patients with poor ovarian reserve, patients with benign medical conditions requiring chemotherapy, and transgender individuals undergoing gender-affirming surgery.4 Fortunately, advancements in fertility preservation (FP) technologies offer these patients the opportunity to have biological offspring in the future. In Hong Kong, only 45.6% of clinicians,5 22.2% of medical students,6 and 21.7% of the general public7 are familiar with FP. Therefore, FP technologies require greater attention in Hong Kong. Both clinicians and the public should be aware where and how to seek help when patients are diagnosed with cancer and need to use FP technologies to preserve their fertility.
 
For patients who cannot undergo the ovarian stimulation necessary for oocyte or embryo freezing as well as prepubertal girls (all ineligible for oocyte freezing), ovarian tissue cryopreservation (OTC) and subsequent orthotopic or heterotopic ovarian tissue transplantation (OTT) are ideal options for FP after recovery.8 Many European countries have provided OTC for patients with various medical reasons.4 Although OTC and OTT are widely available in Belgium, Denmark, Spain, France,9 Japan, Singapore,10 the United States, India, Australia, the Philippines, Korea,11 and some parts of China,12 these FP technologies remain unavailable in Hong Kong. Thus far, >130 babies worldwide have been born via OTC and OTT,13 and the American Society for Reproductive Medicine removed the ‘experimental’ designation for these technologies in 2019.14 Ovarian tissue cryopreservation can be performed via slow freezing or vitrification. Slow freezing has been the standard treatment for ovarian tissues,15 but there is increasing evidence to support the use of vitrification.16 17 Nevertheless, controversies remain concerning subsequent oocyte viability and the preservation of morphological integrity after ovarian tissues have been processed using these two cryopreservation techniques.18
 
The development of OTC, which can improve quality of life among young female cancer survivors,19 is urgently needed in Hong Kong. Here, we performed a pilot study to assess the feasibility of OTC and subsequent OTT in Hong Kong via xenografts in nude mice. In this study, we collected ovarian tissues, established both slow freezing and vitrification protocols, and evaluated tissue viability and follicle preservation after OTC and OTT in a nude mouse xenograft model. This mouse model provided important insights that will support the clinical implementation of OTC and OTT in Hong Kong. Our primary outcome was ovarian tissue viability after slow freezing or vitrification, as determined by histological analysis and immunohistochemistry. Our secondary outcomes were follicular density and the morphological integrity of grafted ovarian tissues.
 
Methods
This study was conducted between July 2019 and December 2021 at the Prince of Wales Hospital, a university-affiliated tertiary hospital in Hong Kong. All participants received a detailed explanation of the study, then provided written consent for inclusion. All researchers involved in the animal experiments were licensed by the Department of Health of the Hong Kong SAR Government.
 
Ovarian tissue collection
Ovarian tissues were collected from women or transgender individuals who underwent laparoscopic or open, unilateral or bilateral, ovarian cystectomy or salpingo-oophorectomy as treatment for benign ovarian cysts or tumours. During the operation, each patient underwent removal of a small section of ovarian tissue or the whole ovary; each specimen of donated ovarian tissue was retrieved from a routine surgical specimen or directly removed during surgery. To prevent thermal injury during tissue removal, cold scissors were used and diathermy was avoided. The amount of donated tissue varied among patients, depending on their age and clinical condition. For example, larger volumes of ovarian tissue were often collected from patients undergoing oophorectomy. Each patient was assigned a unique identification number linked to an encrypted file containing the patient’s data and demographic information; during analyses of tissue from each patient, the pathologist and research staff who conducted histology and immunohistochemistry analyses were blinded to the contents of the encrypted files.
 
Ovarian tissue cryopreservation
Tissues were transported to the laboratory in a standardised culture medium at 4°C and processed within 30 minutes after collection. After removing the medullary region, the ovarian tissue was frozen in accordance with a controlled-rate slow freezing machine protocol (Ovarian Tissue Cryopreservation Scientific Roundup; Planer, UK)20 or a vitrification manual (Ova Cryo Kit Type M, VT301S; Kitazato Corporation, Japan).21 The cortical region was cut into small fragments with a thickness of approximately 1 mm. Some collected ovarian tissues were very small and could not be sectioned for parallel slow freezing and vitrification fresh tissue controls; these small tissues were either frozen using the standard slow freezing method or vitrification.22 Larger ovarian tissues (typically collected from transgender individuals during oophorectomy) were cut into smaller fragments prior to slow freezing and vitrification, or prior to use as fresh tissue controls. Before the xenograft procedure, fragments were removed from fresh tissue, thawed slow-frozen tissue, and thawed vitrified tissue; these fragments were subsequently compared with grafted tissues to identify any differences related to engraftment.
 
A subset of fresh ovarian tissue fragments was fixed and subjected to histological analysis. When a large amount of ovarian tissue was available from a single patient, we compared cryopreservation methods using tissue from that patient; we also compared the cryopreserved tissue with fresh tissue.
 
Slow freezing
Slow freezing was performed in accordance with a validated protocol.23 24 25 Collected ovarian cortices were equilibrated at 4°C on a tilting shaker for 30 minutes in freezing solution (1.5 mol/L ethylene glycol and 0.1 mol/L sucrose in G-MOPS PLUS; Vitrolife, Sweden). After equilibration, the ovarian tissue pieces were placed into 1.8-mL cryogenic vials that had been pre-filled with 1 mL of freezing solution (two tissue pieces per vial). The cryogenic vials were then placed into an automated, computer-controlled freezing system (Kryo-360; Planer, UK).20 The slow freezing protocol was performed in accordance with the method described by Dolmans et al26 and the Planer Ovarian Tissue Cryopreservation Scientific Roundup.20
 
Vitrification
For vitrification of ovarian cortices, the Ovarian Tissue Vitrification Kit (Ova Cryo Kit Type M, VT301S; Kitazato Corporation, Japan)21 was used. The collected ovarian cortices were cut into 1 × 1 × 1 cm3 cubes using a surgical knife and a square measuring device provided in the kit. Vitrification was then performed in accordance with the kit manufacturer’s protocol, and the ovarian tissues were stored in liquid nitrogen.
 
Thawing of ovarian tissue for transplantation
Slow-frozen tissues were removed from liquid nitrogen and exposed to room temperature air for 5 seconds, then placed in 37°C water for 2 minutes. Subsequently, they were transferred to thawing solution 1 (0.75 mol/L ethylene glycol and 0.25 mol/L sucrose in G-MOPS PLUS) for 10 minutes, then to thawing solution 2 (0.25 mol/L sucrose in G-MOPS PLUS) for 10 minutes, and finally to a handling medium (G-MOPS) for 10 minutes. Vitrified ovarian tissue fragments were thawed using Ova Thawing Kit Type M (V302S; Kitazato Corporation, Japan), in accordance with the manufacturer’s instructions.21
 
Ovarian tissue transplantation into nude mice
The nude mouse xenograft model is ideal for assessment of OTC and OTT outcomes. Ovarian xenografts in immunodeficient nude mice can be used to test follicular viability and development. This approach can reveal whether freezing and thawing cause damage to ovarian tissue; it can also demonstrate the ability of cryopreserved tissue to support the development of large antral follicles.27 Considering the higher rate of immune leakiness in severe combined immunodeficient mice,28 we used BALB/c athymic nude mice to validate our OTC and OTT protocols before clinical implementation. Thirty-four female BALB/c athymic nude mice (age, 4-6 weeks; Laboratory Animal Services Centre, The Chinese University of Hong Kong) were used for this study. To prevent fighting between engrafted mice, only three mice were housed in individually ventilated cages at 28°C under controlled sterile conditions, with a 12-hour light/dark cycle and free access to an autoclaved pelleted diet and water. Mice were anaesthetised by intraperitoneal injection of ketamine (75 mg/kg)/xylazine (10 mg/kg) (AlfaMedic Limited, Hong Kong; manufactured in Holland). Ovarian tissues collected from patients were grafted onto nude mice. During ovarian tissue engraftment, the cortical surface was carefully oriented outward and tightly attached to the subcutaneous tissue or abdominal wall. There were two phases in our study, as described in the following sections (Fig 1).
 

Figure 1. Flowchart depicting Phase I and Phase II studies. In Phase I, tissues were collected from four patients and engrafted into nine mice. In Phase II, tissues were collected from six patients and engrafted into 25 mice
 
Phase I: Analysis of ovarian tissue xenograft viability in non-ovariectomised nude mice
To maintain endogenous hormone secretion and avoid the risk of ovariectomy-related death, mice in this phase were not subjected to ovariectomy. One fresh, slow-frozen, or vitrified tissue of approximately 4 × 6 × 1 mm3 was engrafted into the subcutaneous site on the neck of nine nude mice.29 The mice were then sacrificed by intraperitoneal injection of overdose of the anaesthetic. One mouse engrafted with vitrified tissue, two engrafted with slow-frozen tissues and one engrafted with fresh tissue were sacrificed after 2 weeks. Two mice engrafted with vitrified tissues, one engrafted with slow-frozen tissue and two engrafted with fresh tissues were sacrificed after 5 weeks.
 
Phase II: Analysis of ovarian tissue xenograft viability, folliculogenesis, and ovulation in ovariectomised nude mice
To promote graft survival and growth, mice in this phase were subjected to ovariectomy. Fresh, slow-frozen, or vitrified tissues of approximately 4 × 6 × 1 mm3 were either engrafted into the subcutaneous site on the neck of ovariectomised nude mice, or used for intraperitoneal engraftment in the left abdomen of those mice.29 Mice in this phase were divided into a saline group and a treatment group after 2 or 6 weeks of engraftment. The presence of gonadotrophins can optimise graft establishment and stimulate follicle growth.30 To promote folliculogenesis, mice in the treatment group underwent intraperitoneal injection (in the right abdomen) of 1 IU (100 μL) of follitropin alfa (GONAL-f; Merck Serono, Geneva, Switzerland) every other day for 5 to 8 weeks after 2 or 6 weeks’ engraftment.30 During the same period, mice in the saline group underwent intraperitoneal injection (in the right abdomen) of an equal volume of physiological saline every other day. Thirty-six hours before the mice were sacrificed, both groups of mice received a single dose of 10 international units of human chorionic gonadotrophin (Sigma-Aldrich, St Louis [MO], US) by injection to promote ovulation.
 
Grafted ovarian tissue viability
All grafted tissues were fixed in buffered formalin and embedded in paraffin wax, then sectioned and stained for analysis.
 
Histological analysis
Microscopic observations up to 400 times the original magnification (Leica DMIRB; Leica Microsystem, Wetzlar, Germany) of fresh and thawed ovarian tissues were performed after the tissues had been stained with haematoxylin and eosin (H&E). All follicles from the entire grafted tissue specimen on every slide were counted; section thickness and the presence/absence of a nucleolus were also considered.
 
Immunohistochemical assessment of stromal tissue viability
Stromal tissue viability was determined by assessing the morphologies of stromal cells on H&E-stained sections. Viability was defined as the presence of spindle cells with consistent cellularity; an intact nuclear membrane; the absence of pyknotic figures, apoptosis, or necrosis; and the absence of fibrosis or calcification. Viability was confirmed by immunohistochemical analyses using antibodies to cluster of differentiation 10 (CD10) and oestrogen receptors. Anti-CD10 antibody (clone NCL-CD10-270; Novocastra, Newcastle upon Tyne, UK) was used at a dilution of 1:50 with an incubation time of 30 minutes at a sustained temperature of 37°C. Anti–oestrogen receptor antibody (RM-9101; Thermo Fisher Scientific, Fremont [CA], US) was used at a dilution of 1:150 with an incubation time of 32 minutes at a sustained temperature of 37°C. Antigen retrieval was performed using ethylenediaminetetraacetic acid and microwave. Antibody detection was performed using Roche Diagnostics OptiView DAB IHC Detection Kit (Thermo Fisher Scientific, Waltham [MA], US). Immunohistochemical staining (original magnification × 400) was semi-quantitative and based on signal intensity (absent, weak, moderate, and strong). The presence of at least weak staining intensity in ovarian stromal tissue was regarded as a positive result.
 
Follicular density and quality after freezing, thawing, and transplantation
All follicles from the entire grafted tissue specimen on every H&E-stained slide were counted on multiple levels within thick sections (≥20 μm). The digital images were annotated on QuPath,31 obtaining the two-dimensional area of the slide and number of follicles. Follicular density was calculated by established methods described previously.32 33 Ovarian follicles were classified as primordial, primary, or secondary follicles according to morphological assessment of H&E-stained sections.33 Evaluation of grafted follicle quality was based on basement membrane integrity, cellular density, presence or absence of pyknotic bodies, and oocyte integrity. Only morphologically normal (ie, viable) follicles were counted. The results of gross anatomical examinations were confirmed by histological assessments. Gross tissue integrity was defined as the presence of a distinct vascularised tissue fragment that exhibited firmness and perfusion. Microscopic findings indicating viability were the presence of an intact nuclear membrane and the absence of necrosis, apoptosis, and pyknotic nuclei.
 
Results
In total, 52 ovarian tissues were collected from 12 patients aged 29 to 41 years. Ovarian tissues from different patients were cut into several pieces according to tissue size. These tissues were treated by vitrification or slow freezing, then engrafted into 34 mice as shown in Figure 1. Tables 1 and 2 only show data from patients with follicles to facilitate readability. Although there were nine tissues from four patients (Patients 1, 6, 7, and 10) in Phase I, Table 1 only shows data from the two patients with follicles (Patients 1 and 7). In Phase II, there were 25 tissues from six patients (Patients 1, 5, 8, 9, 11, and 12), but Table 2 only shows data from the four patients with follicles. In total, 18 control tissues were collected from 11 patients (Patients 1, 2, 4, 5, 6, 7, 8, 10, 11, 12, and 13). One patient (Patient 5) provided sufficient ovarian tissue for a comparison of cryopreservation methods using tissue from a single patient. Additionally, we compared fresh and slow-frozen tissues from Patient 5, and compared fresh and vitrified tissues from Patients 1, 5, and 12.
 

Table 1. Follicular density of viable grafted tissues in Phase I analysis: two of four patients with primordial follicles
 

Table 2. Follicular density of viable grafted tissues in Phase II analysis: five of six patients with primordial follicles
 
Graft recovery rate and macroscopic assessment
In Phase I, all xenografts were successfully retrieved from the experimental mice. Macroscopic observations of fresh and thawed tissues did not show substantial differences between cryopreservation methods in terms of tissue integrity or morphology (Fig 2a). Microscopic findings showed the presence of viable nuclei and the absence of necrosis, apoptosis, and pyknotic nuclei.
 

Figure 2. Macroscopic observations of fresh, thawed, and grafted tissues. (a) Fresh tissue (left) and thawed slow-frozen tissue (right). (b) Grafted slow-frozen ovarian tissues at subcutaneous sites on the neck in three BALB/c athymic nude mice. (c) Grafted fresh tissues (left and middle) and vitrified ovarian tissue (right) at intraperitoneal sites in three BALB/c athymic nude mice. Note that angiogenesis was observed around each xenograft
 
In Phase II, all xenografts were successfully retrieved from the experimental mice, with the exception of two calcified tissues. Most subcutaneous sites contained soft tissue fragments that were completely encased in membranes; the graft–murine tissue interface was vascularised (Fig 2b). Intraperitoneal sites contained soft tissue fragments with small vessels visible on the graft surface; the fragments were attached to surrounding tissue, and some grafts were encased in abdominal adipose tissue (Fig 2c).
 
Analysis of stromal tissue morphology
Immunohistochemical staining showed that all retrieved grafts had maintained viability, with the exception of two calcified tissues (Fig 3). Haematoxylin and eosin staining, CD10 staining, and oestrogen receptor staining showed no between-group differences (fresh vs slow-frozen, fresh vs vitrified, and slow-frozen vs vitrified). Moreover, stromal tissue viability did not differ between the treatment and saline groups in Phase II.
 

Figure 3. Haematoxylin and eosin (H&E) and immunohistochemical staining of ovarian stromal tissues in xenografts retrieved from nude mice (400 ×). (a) H&E staining; (b) cluster of differentiation 10 staining; (c) oestrogen receptor staining
 
Follicular histology and density
Retrieved grafts were embedded with paraffin and sectioned at a thickness of 4 or 30 μm. Microscopy analysis revealed primordial, primary, and secondary follicles (Fig 4). Tables 1 and 2 show the follicular densities of retrieved grafts from Phases I and II, respectively. Primordial follicles were observed in fresh and cryopreserved grafts from the same patient (Patient 5), regardless of cryopreservation method (slow freezing or vitrification) or gonadotrophin injection status.
 

Figure 4. Microscopic observation of different stages of follicles in the grafted tissues on nude mice (400 ×). (a) Haematoxylin and eosin (H&E) staining of primordial follicles in 4-μm sections; (b) H&E staining of primordial (left), primary (middle), and secondary (right) follicles in 30-μm sections
 
Discussion
Summary of main findings
Our pilot study demonstrated the feasibility of OTC with subsequent OTT in Hong Kong via xenografts of fresh and cryopreserved ovarian tissues in nude mice. Most grafted ovarian tissues remained viable after engraftment, as demonstrated by CD10 and oestrogen receptor staining results in stromal tissue, along with the presence of viable nuclei and the absence of necrosis, apoptosis, and pyknotic nuclei. Regardless of cryopreservation method, primordial follicles were observed in thawed ovarian tissues after engraftment; thus, both cryopreservation methods are feasible and effective. There were no differences in folliculogenesis after gonadotrophin injection. Overall, these findings validate our protocol for surgical collection of ovarian tissue, cryopreservation via slow freezing or vitrification, and subsequent tissue engraftment into mice; this protocol successfully generated primordial follicles in the xenografts. To our knowledge, this type of protocol was not previously validated in Hong Kong.
 
The benefits of OTC and OTT are not limited to gynaecology patients; they are also useful for patients in other specialties4 (eg, medicine, oncology, and paediatrics), including adolescents,34 as well as women who cannot undergo ovarian stimulation. To our knowledge, this is the first study to demonstrate the feasibility of OTC and OTT in Hong Kong; our findings support the clinical implementation of these technologies at medical centres in Hong Kong.
 
Current condition and success rate of ovarian tissue cryopreservation
Ovarian tissue cryopreservation has become an accepted FP technology in many fertility centres since the removal of its experimental designation.11 14 Notably, OTC allows the preservation of thousands of primordial follicles in a single procedure; compared with mature oocytes, preserved primordial follicles are more resistant to cryodamage.35 This technology is also appropriate for patients who cannot undergo ovulation stimulation because they require urgent chemotherapy or must avoid the enhancement of a hormone-sensitive malignancy36; it is also the only available FP technology for prepubertal girls.36 Furthermore, OTC allows natural conception; several spontaneous pregnancies have been reported after successful orthotopic autotransplantation.9 37 In some instances, both fertility and gonadal function are restored.34 According to a meta-analysis,38 endocrine function was restored in 63.9% of patients; the combined rate of pregnancies and live births was 28.4%. Dolmans et al9 also reported that 26% of women became pregnant and gave birth to one or two infants after the transplantation of frozen-thawed ovarian tissue; the live birth rate was 30.6%.
 
Barriers to clinical implementation of ovarian tissue cryopreservation
Despite its advantages, there are multiple barriers to the clinical implementation of OTC. Effective use of this technology involves two surgical procedures: the initial removal of ovarian tissue (prior to cryopreservation) and a future transplantation procedure, which may cause surgical and ethical problems (particularly in prepubertal patients).39 The technology also requires expertise that is not available in some parts of Asia. A Japanese group reported a live birth in 201540; another successful live birth was reported by a Chinese group in 2021,41 involving the cryopreserved ovarian tissue bank established by the Beijing Obstetrics and Gynecology Hospital.12 However, OTC is not widely available in Hong Kong. Reproductive health centres in Hong Kong may lack sufficient surgical expertise and/or an optimal cryopreservation environment.11 Thus, there is a need to reduce the obstacles to clinical implementation of OTC. From our experience, in terms of laboratory requirements, the protocol, equipment, and consumables can be incorporated into most assisted reproductive technology units. However, practical education is needed regarding OTC, including tissue management (eg, tissue thinning during removal of the medulla) and specific aspects of cryopreservation. Proper records of success measures (eg, freeze-thaw outcomes and graft survival rate) are essential; these data should be carefully documented in laboratory records. Additional equipment is also needed for the clinical implementation of OTC as a routine service because the harvesting surgery may be performed on an urgent basis that differs from the routine assisted reproductive technology laboratory programme.11 While planning for this study, we found that there have been inconsistencies in terms of selection criteria, cryopreservation methods, laboratory management of harvested tissue, and the transplantation technique itself. Although we found no differences in the morphological integrity of ovarian tissue after cryopreservation via slow freezing or vitrification, further studies with larger numbers of patients are needed to confirm the feasibility of follicular stimulation in vivo.
 
Current status of ovarian tissue transplantation
Human OTT remains unavailable in Hong Kong. Notably, our analysis of tissue engraftment was conducted in a mouse model. After the clinical implementation of OTC in Hong Kong, OTT involving autotransplantation could be established as a routine service. Ovarian autotransplantation is performed when a patient has fully recovered from disease, but this approach may carry a small risk of reintroducing malignant cells in patients with cancer.42 The results of some studies have suggested that the risk of reintroducing malignant cells could be minimised by meticulous examination of representative biopsy samples via histology, immunohistochemistry, and molecular biology techniques.43 Moreover, optical coherence tomography can be used to assess malignant cells in thawed ovarian tissue before transplantation.44
 
Limitations
There were several limitations in this study. First, tissue engraftment was not conducted in humans because of ethical concerns; thus, we analysed tissue engraftment in a nude mouse xenograft model, which was the best available model. Second, some ovarian tissues were collected from transgender individuals who had undergone testosterone replacement therapy, which might have affected the hormonal milieu of the ovarian tissue.45 Third, we only retrieved small fragments of ovarian tissue (~1 cm) from random locations in the ovaries of included patients; this may have led to sampling error if the sampled cortical layers did not contain primordial follicles. Fourth, the small number of included patients hindered our ability to compare the effects of cryopreservation methods on tissue from a single patient. Moreover, the small sample size might have reduced the strength of the findings. Finally, there are no standardised protocols for freezing, gonadotrophin stimulation, or transplantation in nude mouse xenograft models. However, our study demonstrated the feasibility of OTC in our centre. Further randomised controlled trials are needed to confirm our findings.
 
Future trends
We plan to conduct a randomised controlled trial of the two cryopreservation methods used in this study to determine which is best for clinical implementation. From the experience of the Danish group Rosendahl et al24 on OTC, they suggested that before applying the technique to humans, each laboratory should thoroughly test and validate the OTC method. In the future, implantation of artificial ovaries or the engraftment of human ovarian tissue into mice may enable fertility restoration without the potential reintroduction of malignant cells. These approaches may be particularly useful in women with a high risk of blood-borne leukaemia or cancers with a high risk of ovarian metastasis, as well as women who cannot undergo autotransplantation.46
 
Conclusion
Our study demonstrated the feasibility and viability of OTC with subsequent OTT in Hong Kong via xenografts in nude mice. These findings support the clinical implementation of OTC and subsequent OTT in Hong Kong, particularly for prepubertal young girls and for women who cannot undergo the ovarian stimulation necessary for oocyte or embryo freezing. Further studies are needed to clarify optimal cryopreservation and engraftment protocols.
 
Author contributions
Concept or design: JPW Chung, DYL Chan.
Acquisition of data: All authors.
Analysis or interpretation of data: JPW Chung, DYL Chan, Y Song, MBW Leung, JJX Li, CC Wang.
Drafting of the manuscript: All authors.
Critical revision of the manuscript for important intellectual content: JPW Chung, DYL Chan, CC Wang.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As an editor of the journal, JPW Chung was not involved in the peer review process of the article. All other authors have no conflicts of interest to disclose.
 
Funding/support
This research was supported by Basecare Medical Device Co., Ltd. and the Theme-based Research Scheme funded by the Research Grants Council of the Hong Kong SAR Government (Ref No.: T13-602/21-N).
 
Ethics approval
The research was approved by the Institutional Review Board of the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee (Ref No.: 2019.356) and overseen by an independent data and safety monitoring committee. The trial was registered with the World Health Organization Primary Registry–Chinese Clinical Trials Registry (Trial No.: ChiCTR2100041611). The experimental animal protocol was approved by The Chinese University of Hong Kong Animal Experimentation Ethics Committee (Ref No.: 19-214-MIS). All participants received a detailed explanation of the study and provided written consent for inclusion. All researchers involved in animal experiments were licensed by the Department of Health of the Hong Kong SAR Government.
 
References
1. Maltaris T, Seufert R, Fischl F, et al. The effect of cancer treatment on female fertility and strategies for preserving fertility. Eur J Obstet Gynecol Reprod Biol 2007;130:148-55. Crossref
2. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin 2019;69:7-34. Crossref
3. Brydøy M, Fosså SD, Dahl O, Bjøro T. Gonadal dysfunction and fertility problems in cancer survivors. Acta Oncol 2007;46:480-9. Crossref
4. ESHRE Guideline Group on Female Fertility Preservation, Anderson RA, Amant F, et al. ESHRE guideline: female fertility preservation. Hum Reprod Open 2020;2020:hoaa052. Crossref
5. Chung JP, Lao TT, Li TC. Evaluation of the awareness of, attitude to, and knowledge about fertility preservation in cancer patients among clinical practitioners in Hong Kong. Hong Kong Med J 2017;23:556-61. Crossref
6. Ng EY, Ip JK, Mak DR, Chan AY, Chung JP. Awareness of fertility preservation among Chinese medical students. Hong Kong Med J 2020;26:184-91. Crossref
7. Yeung SY, Ng EY, Lao TT, Li TC, Chung JP. Fertility preservation in Hong Kong Chinese society: awareness, knowledge and acceptance. BMC Womens Health 2020;20:86. Crossref
8. Dolmans MM, Donnez J. Fertility preservation in women for medical and social reasons: oocytes vs ovarian tissue. Best Pract Res Clin Obstet Gynaecol 2021;70:63-80. Crossref
9. Dolmans MM, von Wolff M, Poirot C, et al. Transplantation of cryopreserved ovarian tissue in a series of 285 women: a review of five leading European centers. Fertil Steril 2021;115:1102-15. Crossref
10. Harzif AK, Santawi VP, Maidarti M, Wiweko B. Investigation of each society for fertility preservation in Asia. Front Endocrinol (Lausanne) 2019;10:151. Crossref
11. Takae S, Lee JR, Mahajan N, et al. Fertility preservation for child and adolescent cancer patients in Asian countries. Front Endocrinol (Lausanne) 2019;10:655. Crossref
12. Jin F, Ruan X, Du J, et al. Analysis on the characteristics of the patients and effects of ovarian tissue cryopreservation in the first ovarian tissues cryopreservation bank in China [in Chinese]. J Capital Univ Med Sci 2021;42:521-5.
13. Donnez J, Dolmans MM. Fertility preservation in women. N Engl J Med 2017;377:1657-65. Crossref
14. Practice Committee of the American Society for Reproductive Medicine. Fertility preservation in patients undergoing gonadotoxic therapy or gonadectomy: a committee opinion. Fertil Steril 2019;112:1022-33. Crossref
15. Rivas Leonel EC, Lucci CM, Amorim CA. Cryopreservation of human ovarian tissue: a review. Transfus Med Hemother 2019;46:173-81. Crossref
16. Marques LS, Fossati AA, Rodrigues RB, et al. Slow freezing versus vitrification for the cryopreservation of zebrafish (Danio rerio) ovarian tissue. Sci Rep 2019;9:15353. Crossref
17. Glujovsky D, Riestra B, Sueldo C, et al. Vitrification versus slow freezing for women undergoing oocyte cryopreservation. Cochrane Database Syst Rev 2014;2014:CD010047. Crossref
18. Bianchi V, Macchiarelli G, Borini A, et al. Fine morphological assessment of quality of human mature oocytes after slow freezing or vitrification with a closed device: a comparative analysis. Reprod Biol Endocrinol 2014;12:110. Crossref
19. Turan V, Oktay K. Sexual and fertility adverse effects associated with chemotherapy treatment in women. Expert Opin Drug Saf 2014;13:775-83. Crossref
20. Planer Ovarian Tissue Cryopreservation Scientific Roundup. Ovarian Tissue Cryopreservation Scientific Roundup. 2019. Available from: https://mail.planer.com/__80258426005B6A5E.nsf/0/C4C23666734672DF802584B1002F3777/$File/Ai091V1-Ovarian-Tissue-Cryopreservation-Scientific-Round-Up-28th-October-2019.pdf?OpenElement. Accessed 1 Mar 2022. Crossref
21. Kitazato Corporation. Ova Cryo Kit—Ovarian Tissue Vitrification Kit. 2021. Available from: https://www.kitazato.co.jp/en/products/vitrification/ova-cryo-kit-type-m. Accessed 31 January 2023.
22. Huang L, Mo Y, Wang W, Li Y, Zhang Q, Yang D. Cryopreservation of human ovarian tissue by solid–surface vitrification. Eur J Obstet Gynecol Reprod Biol 2008;139:193-8. Crossref
23. Jensen AK, Kristensen SG, Macklon KT, et al. Outcomes of transplantations of cryopreserved ovarian tissue to 41 women in Denmark. Hum Reprod 2015;30:2838-45.Crossref
24. Rosendahl M, Greve T, Andersen CY. The safety of transplanting cryopreserved ovarian tissue in cancer patients: a review of the literature. J Assist Reprod Genet 2013;30:11-24. Crossref
25. Macklon KT, Jensen AK, Loft A, Ernst E, Andersen CY. Treatment history and outcome of 24 deliveries worldwide after autotransplantation of cryopreserved ovarian tissue, including two new Danish deliveries years after autotransplantation. J Assist Reprod Genet 2014;31:1557-64. Crossref
26. Dolmans MM, Luyckx V, Donnez J, Andersen CY, Greve T. Risk of transferring malignant cells with transplanted frozen–thawed ovarian tissue. Fertil Steril 2013;99:1514-22. Crossref
27. Nisolle M, Casanas-Roux F, Qu J, Motta P, Donnez J. Histologic and ultrastructural evaluation of fresh and frozen–thawed human ovarian xenografts in nude mice. Fertil Steril 2000;74:122-9. Crossref
28. Bosma GC, Fried M, Custer RP, Carroll A, Gibson DM, Bosma MJ. Evidence of functional lymphocytes in some (leaky) scid mice. J Exp Med 1988;167:1016-33. Crossref
29. Dath C, Van Eyck AS, Dolmans MM, et al. Xenotransplantation of human ovarian tissue to nude mice: comparison between four grafting sites. Hum Reprod 2010;25:1734-43. Crossref
30. Dittrich R, Lotz L, Fehm T, et al. Xenotransplantation of cryopreserved human ovarian tissue—a systematic review of MII oocyte maturation and discussion of it as a realistic option for restoring fertility after cancer treatment. Fertil Steril 2015;103:1557-65. Crossref
31. Bankhead P, Loughrey MB, Fernández JA, et al. QuPath: open source software for digital pathology image analysis. Sci Rep 2017;7:16878. Crossref
32. Schmidt KL, Byskov AG, Nyboe Andersen A, Müller J, Yding Andersen C. Density and distribution of primordial follicles in single pieces of cortex from 21 patients and in individual pieces of cortex from three entire human ovaries. Hum Reprod 2003;18:1158-64. Crossref
33. Gougeon A, Chainy GB. Morphometric studies of small follicles in ovaries of women at different ages. J Reprod Fertil 1987;81:433-42. Crossref
34. Anderson RA, Mitchell RT, Kelsey TW, Spears N, Telfer EE, Wallace WH. Cancer treatment and gonadal function: experimental and established strategies for fertility preservation in children and young adults. Lancet Diabetes Endocrinol 2015;3:556-67. Crossref
35. Anderson RA, Wallace WH, Baird DT. Ovarian cryopreservation for fertility preservation: indications and outcomes. Reproduction 2008;136:681-9. Crossref
36. Jeruss JS, Woodruff TK. Preservation of fertility in patients with cancer. N Engl J Med 2009;360:902-11. Crossref
37. Anderson RA, Wallace WH, Telfer EE. Ovarian tissue cryopreservation for fertility preservation: clinical and research perspectives. Hum Reprod Open 2017;2017:hox001. Crossref
38. Pacheco F, Oktay K. Current success and efficiency of autologous ovarian transplantation: a meta-analysis. Reprod Sci 2017;24:1111-20. Crossref
39. Wallace WH, Kelsey TW, Anderson RA. Fertility preservation in pre-pubertal girls with cancer: the role of ovarian tissue cryopreservation. Fertil Steril 2016;105:6-12. Crossref
40. Suzuki N, Yoshioka N, Takae S, et al. Successful fertility preservation following ovarian tissue vitrification in patients with primary ovarian insufficiency. Hum Reprod 2015;30:608-15. Crossref
41. Sun N, Li Z, Pang W, Wang L, Li W. Live birth after transplantation of cryopreserved ovarian tissue with two-year follow-up: report of the first Chinese case [in Chinese]. Chin J Reprod Contraception 2021;41:1026-30.
42. Peek R, Eijkenboom LL, Braat DD, Beerendonk CC. Complete purging of Ewing sarcoma metastases from human ovarian cortex tissue fragments by inhibiting the mTORC1 signaling pathway. J Clin Med 2021;10:4362. Crossref
43. Lotz L, Dittrich R, Hoffmann I, Beckmann MW. Ovarian tissue transplantation: experience from Germany and worldwide efficacy. Clin Med Insights Reprod Health 2019;13:1179558119867357. Crossref
44. Peters IT, Stegehuis PL, Peek R, et al. Noninvasive detection of metastases and follicle density in ovarian tissue using full-field optical coherence tomography. Clin Cancer Res 2016;22:5506-13. Crossref
45. Irwig MS. Testosterone therapy for transgender men. Lancet Diabetes Endocrinol 2017;5:301-11. Crossref
46. Telfer EE, Andersen CY. In vitro growth and maturation of primordial follicles and immature oocytes. Fertil Steril 2021;115:1116-25. Crossref

Efficacy, toxicities, and prognostic factors of stereotactic body radiotherapy for unresectable liver metastases

Hong Kong Med J 2023 Apr;29(2):105–11 | Epub 30 Mar 2023
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE  CME
Efficacy, toxicities, and prognostic factors of stereotactic body radiotherapy for unresectable liver metastases
Calvin KK Choi, FHKCR, FHKAM (Radiology); Connie HM Ho, FHKCR, FHKAM (Radiology); Matthew YP Wong, MSc; Ronnie WK Leung, MSc; Frank CS Wong, FHKCR, FHKAM (Radiology); Stewart Y Tung, FHKCR, FHKAM (Radiology); Francis AS Lee, FHKCR, FHKAM (Radiology)
Department of Clinical Oncology, Tuen Mun Hospital, Hong Kong SAR, China
 
Corresponding author: Dr Calvin KK Choi (calvinkkchoi@hkbh.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study aims to determine the outcomes of stereotactic body radiotherapy (SBRT) for liver metastases in patients not eligible for surgery.
 
Methods: This study included 31 consecutive patients with unresectable liver metastases who received SBRT between January 2012 and December 2017; 22 patients had primary colorectal cancer and nine patients had primary non-colorectal cancer. Treatments ranged from 24 Gy to 48 Gy in 3 to 6 fractions over 1 to 2 weeks. Survival, response rates, toxicities, clinical characteristics, and dosimetric parameters were evaluated. Multivariate analysis was performed to identify significant prognostic factors for survival.
 
Results: Among these 31 patients, 65% had received at least one prior regimen of systemic therapy for metastatic disease, whereas 29% had received chemotherapy for disease progression or immediately after SBRT. The median follow-up interval was 18.9 months; actuarial in-field local control rates at 1, 2, and 3 years after SBRT were 94%, 55%, and 42%, respectively. The median survival duration was 32.9 months; 1-year, 2-year, and 3-year actuarial survival rates were 89.6%, 57.1%, and 46.2%, respectively. The median time to progression was 10.9 months. Stereotactic body radiotherapy was well-tolerated, with grade 1 toxicities of fatigue (19%) and nausea (10%). Patients who received post-SBRT chemotherapy had significant longer overall survival (P=0.039 for all patients and P=0.001 for patients with primary colorectal cancer).
 
Conclusion: Stereotactic body radiotherapy can be safely administered to patients with unresectable liver metastases, and it may delay the need for chemotherapy. This treatment should be considered for selected patients with unresectable liver metastases.
 
 
New knowledge added by this study
  • Stereotactic body radiotherapy (SBRT) for unresectable liver metastases was effective and well-tolerated. It may delay the need for chemotherapy while prolonging progression-free survival.
  • The receipt of post-SBRT chemotherapy is a significant prognostic factor for survival.
Implications for clinical practice or policy
  • Stereotactic body radiotherapy can be regarded as an alternative to surgery for patients with liver metastases, particularly patients with unresectable tumours.
  • We recommend offering SBRT to patients with unresectable liver metastases if they have good performance status (ie, Eastern Cooperative Oncology Group 0-1), liver tumours ≤6 cm in diameter, three or fewer liver tumours, normal liver volume >700 cm3, adequate organ function, and adequate liver function (Child-Pugh class A).
 
 
Introduction
The liver is a common site of metastases, which most frequently originate from primary colorectal cancer via portal circulation. Surgical resection is the standard treatment for medically and technically operable liver metastases, particularly from primary colorectal cancer. However, most patients are not eligible for surgery because of co-morbidities or unfavourable tumour factors. Most patients receive systemic therapy as initial treatment for liver metastases, but such treatment rarely leads to permanent elimination of the metastases; some form of local ablative intervention is required. For patients with unresectable limited liver metastases, numerous local therapeutic approaches are available, such as radiofrequency ablation, transcatheter arterial chemoembolisation, cryotherapy, and high-intensity focal ultrasound. However, all of these approaches exhibit a degree of invasiveness and are currently limited by tumour size (usually <3 cm), distance from critical structures, and distance from critical vasculature.1
 
In the past, radiotherapy has had a limited role in the management of liver metastases because of concerns regarding radiation-induced liver disease.2 3 Because the liver is subject to the parallel architecture principles of radiobiology, the risk of radiation-induced liver disease is generally proportional to the mean dose of radiation delivered to normal liver tissue. Therefore, small hepatic lesions can be safely treated with high doses of radiation via stereotactic body radiotherapy (SBRT). Advances in tumour imaging, radiotherapy planning and delivery, and motion management have facilitated the delivery of highly precise and four-dimensional SBRT. This non-invasive method can be used to deliver ablative treatments on an outpatient basis, thereby decreasing morbidity and cost.4
 
Ablative techniques offer a minimally invasive treatment option for selected patients with oligometastatic liver disease.5 There is increasing evidence to support the use of SBRT.6 To our knowledge, there is limited published information regarding the role of SBRT in the treatment of unresectable liver metastases in Hong Kong. In this study, we investigated the efficacy, toxicities, and prognostic factors of SBRT in patients with unresectable liver metastases.
 
Methods
Patient eligibility
Data regarding consecutive patients with unresectable liver metastases who received SBRT between January 2012 and December 2017 were retrospectively retrieved from the treatment database of the Department of Clinical Oncology at Tuen Mun Hospital. All patients with liver metastases were evaluated in multidisciplinary team meetings involving radiation oncologists and hepatobiliary surgeons. Eligibility was determined using the following criteria: (1) histologically confirmed malignancy (hepatic lesion biopsy not required); (2) biphasic computed tomography (CT) scan or positron emission tomography–CT of the liver within 4 weeks of radiation planning demonstrating liver tumours ≤6 cm in diameter, presence of three or fewer liver tumours, and normal liver volume >700 cm3; (3) discussion of the case in a multidisciplinary team meeting that included an opinion regarding the lack of qualification for radiofrequency ablation, along with a determination of non-resectability by a qualified hepatic surgeon; (4) patient refusal of surgical treatment; (5) Eastern Cooperative Oncology Group performance status 0 or 1; (6) adequate organ function (absolute neutrophil count ≥1.5×109/L; platelet count ≥75×109/L; creatinine level ≤1.5×upper limit of normal), liver function test results (aspartate aminotransferase and alanine aminotransferase levels ≤1.5×normal level), and Child-Pugh score of ≤6 (class A); (7) controlled extrahepatic disease and life expectancy >6 months; (8) no chemotherapy concurrent with radiotherapy (previous chemotherapy was not an exclusion criterion); and (9) previous treatment with radiofrequency ablation was not an exclusion criterion if recurrence had been confirmed.
 
Radiotherapy treatment
During four-dimensional CT scans, patients were positioned supine on an evacuated foam bag (Klarity Medical, China) with both arms abducted. The extent of tumour motion during respiration was used to determine whether treatment would be administered with free breathing plus abdominal compression or active breathing control. The gross tumour volume (GTV) was determined using contrast CT and co-registered with positron emission tomography–CT. For patients who required optimal abdominal compression to mitigate organ motion, planning was conducted using the mid-ventilation–based planning target volume (PTV) approach, and the GTV was determined using intravenous contrast CT. The clinical target volume was 0 mm outside of the GTV within the liver (ie, equal to GTV); it included the position of the tumour in all phases of respiration. The PTV was defined by adding an isotropic margin of 3 to 5 mm from the clinical target volume or 7 to 10 mm in the cranial-caudal axis and 4 to 6 mm in the anterior-posterior and lateral axes. Pretreatment four-dimensional cone-beam CT was performed prior to each treatment for all patients to adjust for setup uncertainties. Tumour localisation was conducted using the diaphragm or whole liver as a surrogate for the tumour. A two-step four-dimensional registration approach was used to align the diaphragm/liver surrogate to its time-weighted mean position. The SBRT dose, ranging from 8 to 16 Gy × 3 fractions to 5 to 7.5 Gy × 6 fractions, was individualised according to the following normal tissue constraints: (1) maximum spinal cord dose <15 Gy; (2) ≥700 cm3 of liver should receive <15 Gy, and D5% <30 Gy; (3) maximum stomach point dose of 25 Gy; and (4) maximum duodenum point dose of 25 Gy.
 
Evaluation
Patients were evaluated weekly during SBRT, immediately after completion of treatment, at 6 weeks after treatment, every 3 months for the first 2 years, and every 4 months thereafter. Physical examinations and blood tests were performed at each follow-up visit. Triphasic CT of the liver was conducted at 3 months after SBRT and then every 6 months until disease progression. Tumour response was assessed using modified response evaluation criteria for solid tumours.
 
The primary endpoint of the study was local control; secondary endpoints were overall survival and toxicity. Local control was defined as the absence of progressive disease within the PTV. The appearance of new lesions outside of the PTV was regarded as intrahepatic out-field failure. Overall survival was calculated from the start of SBRT until the end of follow-up or death.
 
Toxicity was graded using the National Cancer Institute Common Terminology Criteria for Adverse Events version 4.0. Toxicities were defined as adverse events that occurred <3 months after SBRT. Newly developed toxicities or toxicities that progressed to one grade above baseline were regarded as adverse events. Grade 5 liver failure related to SBRT was defined as death from liver failure in the presence of acute grade 3 liver toxicities during <6 months without intrahepatic progression.
 
Statistical analysis
Data were analysed using SPSS software (Windows version 23.0; IBM Corp, Armonk [NY], United States). Fisher’s exact test and independent t tests were used for univariate analysis of patient, disease, and treatment factors associated with liver toxicity. Binary logistic regression analysis was used for univariate analysis of dose-volumetric parameters associated with liver toxicity. Kaplan–Meier test was used for univariate analysis of overall survival, with a significance threshold of P<0.25; it was used for multivariate analysis of overall survival, with a significance threshold of P<0.05. Cox regression was used for further evaluation of variables which were significant in univariate analysis of overall survival.7 8
 
Results
Patients and treatment
During the study period, 31 consecutive patients with unresectable liver metastases underwent SBRT at our institution. Their characteristics are shown in Table 1. Colorectal cancer was the most common primary cancer. A total of 64.5% of patients received systemic treatment before SBRT; 71% of liver lesions were ≤ 30 mm. All patients received a fixed course of 3 or 6 fractions with total prescribed dose ranges of 24-48 Gy. The mean GTV was 26.9 cm3 (range, 1.5-137) and mean PTV was 91.8 cm3 (range, 21.7-269). The mean biological equivalent dose (BED10) to GTV was 79.8 Gy (range, 43.2-124.8). The median BED10 to GTV was 76.8 Gy. Surgical resection or radiofrequency ablation were performed in 32% of patients before SBRT. Targeted or non-targeted systemic chemotherapy was administered to 65% and 29% of patients before and after SBRT, respectively.
 

Table 1. Patient characteristics
 
Toxicities
Stereotactic body radiotherapy was well-tolerated. There were no grade 2-4 toxicities. Most patients were asymptomatic (grade 0) during radiotherapy; 19% of patients had grade 1 fatigue, 10% of patients had grade 1 nausea, and 3% of patients had skin reaction. No patients exhibited a change in Child-Pugh class after SBRT, and no significant prognostic factors for liver toxicities were identified.
 
Local control, survival, and prognostic factors
The median follow-up interval was 18.9 months. The 1-year, 2-year, and 3-year local control rates were 94% (29/31), 55% (17/31) and 42% (13/31), respectively. Only two patients (9% of all patients) with primary colorectal cancer had in-field recurrence at 1 year after SBRT. Sixteen patients in all treatment groups had out-field recurrence at 1 year after SBRT. The median time to progression was 10.9 months.
 
The median survival duration in all treatment groups was 32.9 months. The 1-year, 2-year, and 3-year survival rates were 89.6%, 57.1%, and 46.2%, respectively. The only significant prognostic factor for overall survival was receipt of post-SBRT chemotherapy for disease progression (P=0.039). Figures 1 and 2 show the survival curves and prognostic factors for all treatment groups. Previous local treatment, rat sarcoma virus status of colorectal cancer, number of liver metastases, extrahepatic metastases, BED to the liver, extrahepatic metastasis status, number of chemotherapy lines before or after SBRT, and carcinoembryonic antigen level after SBRT were not significant prognostic factors for overall survival. Table 2 summarises the factors that affected overall survival.
 

Figure 1. Overall survival of the whole group after stereotactic body radiotherapy (SBRT)
 

Figure 2. Overall survival of patients who received chemotherapy after stereotactic body radiotherapy (SBRT) for disease progression (PD) versus those who did not
 

Table 2. Prognostic factors affecting overall survival
 
The median survival duration in the colorectal cancer subgroup was 32.9 months. The only significant prognostic factor for overall survival was receipt of post-SBRT chemotherapy for disease progression (P=0.001). No other significant prognostic factors for overall survival were identified. Figures 3 and 4 show the survival curves and prognostic factors for the colorectal cancer subgroup.
 

Figure 3. Overall survival of colorectal cancer patients after stereotactic body radiotherapy (SBRT)
 

Figure 4. Overall survival of colorectal cancer patients who received chemotherapy after stereotactic body radiotherapy (SBRT) for disease progression (PD) versus those who did not
 
Discussion
Although surgical resection is the standard treatment for liver metastases, many patients are not eligible for such treatment. Multiple retrospective and prospective studies have demonstrated SBRT is a promising, safe, and non-invasive alternative to surgery for unresectable liver metastases.9 10 To our knowledge, there is limited published information regarding the use of SBRT to treat liver metastases in Hong Kong. In the present study, we retrospectively collected data regarding consecutive patients who received SBRT for unresectable liver metastases after multidisciplinary team evaluation; we assessed outcomes in terms of safety, local control, and survival. Among the 31 patients treated with SBRT, the 1-year and 2-year local control rates were 93% and 55%, respectively. The median survival duration was 32.9 months; the 1-year and 2-year survival rates were 89.6% and 57.1%, respectively. In the colorectal cancer subgroup, the 1-year and 2-year survival rates were 84.7% and 62.1%, respectively.
 
Multiple retrospective and prospective studies have been performed regarding SBRT for liver metastases from colorectal cancers (Table 3).11 12 13 14 In the present study, local control rates and survival rates were comparable with findings in previous reports. Notably, McPartlin et al11 conducted a prospective study using SBRT doses of 22-62 Gy in 6 fractions. The present study, with SBRT doses of 24-48 Gy in 3-6 fractions, demonstrated better 1-year local control (93% vs 50%) and 2-year survival (62.1% vs 26%) than the study by McPartlin et al.11
 

Table 3. Summary of literature regarding stereotactic body radiotherapy for liver metastases from colorectal cancers
 
Three other SBRT trials12 13 14 (45-75 Gy in 3 fractions) all demonstrated better local control rates than the findings in the present study (Table 3). These results indicate that a higher local control rate is associated with a higher radiation dose. Compared with the present study, Scorsetti et al12 and Joo et al14 showed higher 2-year survival rates (65% and 75%, respectively vs 62.1% in the present study), whereas Hoyer et al13 revealed a considerably lower 2-year survival rate (38%). These discrepant findings may be related to radiation dose—Scorsetti et al12 and Joo et al14 reported higher BED than that achieved by Hoyer et al13 and the present study. Among patients with primary colorectal tumours, the survival rate in the present study was comparable with rates in the previous studies.11 12 13 14 However, overall survival is dependent on many factors other than local control of irradiated liver metastases. Compared with earlier studies, overall survival is expected to be better in more recent studies because of stage migration, improvements in diagnostic techniques, and enhanced systemic treatment. Importantly, although the present study showed that post-SBRT chemotherapy was a prognostic factor for longer survival, selection bias may have been involved in the decision to administer chemotherapy to patients with better performance status.
 
In the present study, the incidence of toxicities was low, and there were no grade 2-4 toxicities. Among patients who received SBRT, only grade 1 toxicities were reported (fatigue, nausea, and skin reaction); these findings indicate that SBRT was well-tolerated.
 
Based on our results, we recommend that patients with unresectable liver metastases are evaluated in multidisciplinary team meetings; patients should be offered SBRT if they have good performance status (ie, Eastern Cooperative Oncology Group 0-1), liver tumours ≤6 cm in diameter, three or fewer liver tumours, normal liver volume >700 cm3, adequate organ function, and adequate liver function (Child-Pugh class A). Considering its minimal invasiveness and toxicity, as well as its potential for improving progression-free survival, SBRT should be regarded as an alternative to surgical resection of liver metastases to those patients who refuse surgical treatment.
 
There were some limitations in the present study. First, the BED to the tumour was low (median BED10 >100 Gy was administered to 35.5% of patients), and the mean GTV was high (26.9 cm3). The local control rate may have been influenced by the lower total radiation dose administered and larger tumour volume. Second, this was a retrospective study, and the sample size was small. Thus, a randomised controlled trial with a large number of patients is needed to determine whether SBRT can prolong overall survival in patients with liver metastases.
 
Conclusion
Stereotactic body radiotherapy can be safely administered to patients with unresectable liver metastases, and it may delay the need for chemotherapy. Considering its minimal invasiveness and toxicity, this treatment should be offered to selected patients with unresectable liver metastases; such an approach may improve progression-free survival. A phase III randomised study is needed to confirm these results.
 
Author contributions
All authors contributed to the concept or design of the study, acquisition of data, analysis or interpretation of data, drafting of the manuscript, and critical revision of the manuscript for important intellectual content. All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
The authors declare no conflict of interest.
 
Acknowledgement
The authors thank Mr Jia-jie Huang from Quality and Safety Division of New Territories West Cluster, Hospital Authority, Hong Kong for his statistical analysis support.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
Ethics approval document was issued by New Territories West Cluster Research Ethics Committee of Hospital Authority, Hong Kong (Ref No.: NTWC/REC/20035). Informed consent was obtained from patients for stereotactic body radiotherapy.
 
References
1. Aitken KL, Hawkins MA. Stereotactic body radiotherapy for liver metastases. Clin Oncol (R Coll Radiol) 2015;27:307-15. Crossref
2. Schefter TE, Kavanagh BD, Timmerman RD, Cardenes HR, Baron A, Gaspar LE. A phase I trial of stereotactic body radiation therapy (SBRT) for liver metastases. Int J Radiat Oncol Biol Phys 2005;62:1371-8. Crossref
3. Rusthoven KE, Kavanagh BD, Cardenes H, et al. Multi-institutional phase I/II trial of stereotactic body radiation therapy for liver metastases. J Clin Oncol 2009;27:1572-8. Crossref
4. Pan CC, Kavanagh BD, Dawson LA, et al. Radiation-associated liver injury. Int J Radiat Oncol Biol Phys 2010;76(3 Suppl):S94-100. Crossref
5. Aloia TA, Vauthey JN, Loyer EM, et al. Solitary colorectal liver metastasis: resection determines outcome. Arch Surg 2006;141:460-6. Crossref
6. Høyer M, Swaminath A, Bydder S, et al., Radiotherapy for liver metastases: a review of evidence. Int J Radiat Oncol Biol Phys 2012;82:1047-57. Crossref
7. Prentice RL, Zhao S. Regression models and multivariate life tables. J Am Stat Assoc 2021;116:1330-45. Crossref
8. Hashemi R, Commenges D. Correction of the p-value after multiple tests in a Cox proportional hazard model. Lifetime Data Anal 2002;8:335-48. Crossref
9. Rusthoven CG, Lauro CF, Kavanagh BD, Schefter TE. Stereotactic body radiation therapy (SBRT) for liver metastases: a clinical review. Semin Colon Rectal Surg 2014;25:48-52. Crossref
10. Kobiela J, Spychalski P, Marvaso G, et al. Ablative stereotactic radiotherapy for oligometastatic colorectal cancer: systematic review. Crit Rev Oncol Hematol 2018;129:91-101. Crossref
11. McPartlin A, Swaminath A, Wang R, et al. Long-term outcomes of phase 1 and 2 studies of SBRT for hepatic colorectal metastases. Int J Radiat Oncol Biol Phys 2017;99:388-95. Crossref
12. Scorsetti M, Comito T, Tozzi A, et al. Final results of a phase II trial for stereotactic body radiation therapy for patients with inoperable liver metastases from colorectal cancer. J Cancer Res Clin Oncol 2015;141:543-53. Crossref
13. Hoyer M, Roed H, Traberg Hansen A, et al. Phase II study on stereotactic body radiotherapy of colorectal metastases. Acta Oncol 2006;45:823-30. Crossref
14. Joo JH, Park JH, Kim JC, et al. Local control outcomes using stereotactic body radiation therapy for liver metastases from colorectal cancer. Int J Radiat Oncol Biol Phys 2017;99:876-83. Crossref

Asthenopia prevalence and vision impairment severity among students attending online classes in low-income areas of western China during the COVID-19 pandemic

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE (HEALTHCARE IN MAINLAND CHINA)
Asthenopia prevalence and vision impairment severity among students attending online classes in low-income areas of western China during the COVID-19 pandemic
Y Ding, PhD1; H Guan, PhD1; K Du, PhD2; Y Zhang, PhD1; Z Wang, MD1; Y Shi, PhD1
1 Center for Experimental Economics for Education, Shaanxi Normal University, Xi’an, China
2 College of Economics, Xi’an University of Finance and Economics, Xi’an, China
 
Corresponding author: Dr H Guan (hongyuguan0621@gmail.com)
 
 Full paper in PDF
 
Abstract
Introduction: This study explored the impact of online learning during the coronavirus disease 2019 (COVID-19) pandemic on asthenopia and vision impairment in students, with the aim of establishing a theoretical basis for preventive approaches to vision health.
 
Methods: This balanced panel study enrolled students from western rural China. Participant information was collected before and during the COVID-19 pandemic via questionnaires administered at local vision care centres, along with clinical assessments of visual acuity. Paired t tests and fixed-effects models were used to analyse pandemic-related differences in visual status.
 
Results: In total, 128 students were included (mean age before pandemic, 11.82 ± 1.46 years). The mean total screen time was 3.22 ± 2.90 hours per day during the pandemic, whereas it was 1.97 ± 1.90 hours per day in the pre-pandemic period (P<0.001). Asthenopia prevalence was 55% (71/128) during the pandemic, and the mean visual acuity was 0.81 ± 0.30 logarithm of the minimum angle of resolution; these findings indicated increasing vision impairment, compared with the pre-pandemic period (both P<0.001). Notably, asthenopia prevalence increased by two- to three-fold, compared with the pre-pandemic period. An increase in screen time while learning was associated with an increase in asthenopia prevalence (P=0.034).
 
Conclusion: During the COVID-19 pandemic, students spent more time on online classes, leading to worse visual acuity and vision health. Students in this study reported a significant increase in screen time, which was associated with increasing asthenopia prevalence and worse vision impairment. Further research is needed regarding the link between online classes and vision problems.
 
 
New knowledge added by this study
  • Online learning has become increasingly popular during the coronavirus disease 2019 pandemic. Students reported a nearly twofold increase in screen time during the pandemic, compared with the pre-pandemic period.
  • Students reported greater asthenopia prevalence and demonstrated worse vision impairment during the pandemic, compared with the pre-pandemic period.
  • Screen time was associated with asthenopia prevalence but not with the progression of vision impairment.
Implications for clinical practice or policy
  • Policymakers should carefully consider the prevalence of asthenopia and progression of vision impairment among students who are increasingly using digital devices and enrolling in online classes.
  • Policies regarding vision care should be implemented in response to the increasing use of online learning approaches.
 
 
Introduction
The World Health Organization announced that the coronavirus disease 2019 (COVID-19) outbreak had become an international public health emergency on 30 January 2020; on 11 March 2020, it declared that the outbreak had become a pandemic.1 Governments and public health authorities worldwide implemented public health policies to reduce the risk of viral transmission, including strict physical distancing, severe travel restrictions, and the closure of many businesses and schools. On 25 January 2020, China’s Central Government announced a nationwide travel ban and quarantine policy2; it initiated nationwide school closures as an emergency measure to prevent the spread of COVID-19.3 Thus, >220 million school-aged children and adolescents were confined to their homes; online classes were offered and delivered via the internet.4
 
Vision problems are public health challenges; among school-aged children, these problems often involve asthenopia and vision impairment. Asthenopia is defined as a subjective sensation of visual fatigue, eye weakness, or eyestrain; it can manifest through various symptoms, including epiphora, ocular pruritis, diplopia, eye pain, and dry eye.5 Vision impairment is defined as visual acuity (VA) of 6/12 or worse in either eye6; it is often caused by uncorrected refractive errors, and its estimated prevalence is 43%.7 Although both asthenopia and vision impairment have negative effects on students, the effects of vision impairment are greater. A previous global analysis revealed that vision impairment was present in 12.8 million children aged 5 to 15 years, half of whom lived in China.8 Moreover, students with vision impairment have lower scores on various motor and cognitive tests.9 10
 
Excessive use of digital devices contributes to increases in asthenopia prevalence and vision impairment among school-aged children.4 11 12 13 14 15 The COVID-19 pandemic has led to increased use of digital device–supported online classes,16 17 18 which require extended exposure to those devices.19 20 Importantly, long durations of exposure to digital devices can contribute to many vision problems in children.14
 
Asthenopia and vision impairment related to the excessive use of digital devices during the COVID-19 pandemic have been investigated in developed countries and urban China.4 11 12 To our knowledge, no similar studies have been conducted in western rural China. Additionally, online classes are increasingly implemented in rural areas, and the use of digital devices is becoming more prevalent11; thus, there is a need for research that focus on vision health in students.
 
The primary purpose of this study was to assess screen time, asthenopia prevalence, and vision impairment progression during the COVID-19 pandemic among students in western rural China. To achieve this goal, we first conducted a general descriptive analysis of student characteristics and screen time trends before and during the pandemic. We then investigated the prevalence of asthenopia and progression of vision impairment. Finally, we explored factors influencing the prevalence of asthenopia and progression of vision impairment before and during the pandemic.
 
Methods
Setting
This study focused on areas that were broadly representative of rural western China because of limited resources. Thus, the study was conducted in Shaanxi and Ningxia regions in western China. In 2019, the per capita gross domestic product in Shaanxi Province was US$10 167; this is similar to that in Ningxia Autonomous Region (US$8236).21
 
Sample selection
Vision data were acquired from local vision care centres (VCs), which had been established by the Center for Experimental Economics in Education at Shaanxi Normal University, in cooperation with county-level organisations such as the local education ministries and hospitals.
 
Before the pandemic, VC screenings were performed in each county, except during summer and winter vacations. Staff conducted one to two screenings per week (covering 2 to 4 schools); they completed one round of screening in one town each month. In practice, approximately 1 year is needed to complete one round of vision screening for all eligible children in a particular county. The second round and subsequent rounds of vision screening were performed using a similar workflow. After the completion of vision screening, students who required further assessment were referred to the VC for full eye and refractive examinations. This study included students who had visited the VC 3 months before the beginning of the COVID-19 pandemic.
 
During the pandemic, VC staff could not attend schools to perform vision screenings. To maintain vision screening services for students, we telephoned all students who had visited the VC before the pandemic. Participants in this panel study were students who participated in data collection before and during the COVID-19 pandemic.
 
Data collection
We conducted two cycles of surveys in the VC. The first survey cycle was conducted from October to December 2019 (before the pandemic); the second survey cycle was conducted among a group of students who visited the VC for follow-up from July to December 2020 (during the pandemic), based on their enrolment in the study before the pandemic. The same information was collected during the two survey cycles. During the vision screening process, VC staff administered questionnaires to students for collection of the following information: sex (male=1), age, ethnicity (Han=1), residence (non-rural=1), only-child status (yes=1), parental education (parents with ≥12 years of education=1), and parental migration status (one or both out-migrated=1; defined as one or both parents worked away from home during the semester). Household assets were calculated by summing the values of 13 items owned by the family, in accordance with the China Rural Household Survey Yearbook.22
 
The survey also included the collection of information regarding screen time and asthenopia. Students completed a previously described, self-administered questionnaire concerning mean time spent throughout the day on near activities (including computer and smartphone use, television viewing, and studying/homework after school). Reports of time spent on near activities during different parts of the day were categorised as screen time while learning and screen time while playing. Information regarding asthenopia was collected via three questions focused on ocular discomfort: whether the student had experienced dry eyes (yes=1), eye pain and swelling (yes=1), and eye fatigue and watery eyes (yes=1). Asthenopia was defined as the presence of at least one of these three types of vision health problems (yes=1).23 Furthermore, information regarding VA was collected when students visited the VC. The optometrist in the VC conducted a VA test to measure the clarity of each student’s vision. All students completed VA tests without refractive correction; students with spectacles completed VA tests with their routine method of vision correction.
 
The questionnaire regarding asthenopia was developed and reviewed by a group of health experts from Shaanxi Normal University and Zhongshan Ophthalmic Center, a well-known ophthalmology institution in China. The included questions were constructed to ensure that they could be clearly understood by students aged 9 to 17 years with the aid of trained VC staff. These three questions can serve as good indicators of symptoms representing different degrees of asthenopia in students, and they have been used in previous research.23
 
Visual acuity assessment
Visual acuity was assessed using Early Treatment Diabetic Retinopathy Study tumbling-E charts (Precision Vision, La Salle [IL], United States). In an indoor area with sufficient light, VA was separately assessed for each eye without refraction at a distance of 4 m. Students were first examined using a 6/60 line; if they correctly identified the orientation of at least four of five optotypes, they were examined using a 6/30 line, followed by a 6/15 line and a 6/3 line. In this manner, the VA for an eye was defined as the lowest line on which four of five optotypes were correctly identified. If the participant could not read the top line at a distance of 4 m, they were tested at a distance of 1 m, and the VA result was divided by 4.
 
In this study, VA levels were calculated and compared using the logarithm of the minimum angle of resolution (logMAR) scale, which is a linear scale with regular increments that offers a reasonably intuitive interpretation of VA measurement.24 In this study, vision impairment was defined as logMAR ≥0.3 (ie, VA of 6/12 or worse) in either eye.
 
Statistical methods
This balanced panel study compared student data between two periods (before and during the COVID-19 pandemic). Mean screen time, asthenopia prevalence, and vision impairment progression were compared among students using t tests, after stratification according to various demographic and behavioural factors. Fixed-effects logistic and regression models were used to explore factors influencing the prevalence of asthenopia and progression of vision impairment before and during the pandemic. Fixed-effects models were adjusted for sex, age, ethnicity, rural or non-rural residence, only-child status, parental migration status, parental education level, household assets, screen time while learning, and screen time while playing. All analyses were performed using Stata Statistical Software, version 14.1 (StataCorp, College Station [TX], United States). All tests were two-sided, and P values <0.05 were considered statistically significant.
 
Results
This study included 128 students from western rural China (mean age before pandemic, 11.82 ± 1.46 years; mean age during pandemic, 12.32 ± 1.54 years; 80 girls [62.5%] and 48 boys [37.5%]). All participants had vision impairment and were attending online classes (Table 1).
 

Table 1. Screen time before and during the coronavirus disease 2019 pandemic, stratified according to student characteristics (n=128)
 
During the pandemic, screen time significantly increased because of enrolment in online classes. The mean total screen time during the pandemic was 3.22 hours per day, compared with 1.97 hours during the pre-pandemic period (P<0.001). The mean screen time while learning during the pandemic was 1.70 hours per day, compared with 0.90 hours during the pre-pandemic period (P<0.001); the mean screen time while playing during the pandemic was 1.52 hours per day, compared with 1.33 hours during the pre-pandemic period (P=0.019). Additionally, rural students had significantly greater screen time while learning during the pandemic, compared with the pre-pandemic period (P<0.001); there was no such difference among non-rural students (Table 1).
 
The prevalence of asthenopia and progression of vision impairment significantly differed between the pandemic and pre-pandemic periods. The prevalence of asthenopia during the pandemic was 55% (71/128), whereas it was 27% (35/128) during the pre-pandemic period (P<0.001). The mean logMAR VA was worse during the pandemic compared with the pre-pandemic period (0.81 vs 0.65; P<0.001). The prevalence of asthenopia was higher during the pandemic than during the pre-pandemic period, regardless of the characteristics used to stratify participants. The mean logMAR VA was worse during the pandemic than during the pre-pandemic period, although the difference being insignificant among participants with non-Han ethnicity and participants in the top quartile of household assets (Table 2).
 

Table 2. Asthenopia prevalence and visual acuity (in logarithm of the minimum angle of resolution [logMAR]) before and during the coronavirus disease 2019 pandemic, stratified according to student characteristics (n=128)
 
Fixed-effects logistic models for asthenopia revealed that screen time while learning was associated with asthenopia prevalence, and the probability of asthenopia increased by 24.6% for each 1-hour increase in screen time while learning (95% confidence interval [CI]=1.02-1.53; P=0.034). Additionally, older age (odds ratio [OR]=2.073, 95% CI=1.13-3.81, P=0.019), Han ethnicity (OR=2.405, 95% CI=1.22-4.74; P=0.011), and only-child status (OR=0.488, 95% CI=0.21-1.13; P=0.095) were factors associated with asthenopia; screen time while playing was not (Table 3).
 

Table 3. Fixed-effects logistic analysis of factors associated with asthenopia before and during the coronavirus disease 2019 pandemic (n=128)
 
Fixed-effects regression models showed that residence in a non-rural area (OR=-0.200, 95% CI=-0.355 to -0.046; P=0.011) and only-child status (OR=-0.099, 95% CI=-0.197 to 0.000; P=0.049) were factors associated with logMAR VA. The probability of worse logMAR VA increased by 0.200 in non-rural areas, compared with rural areas. However, screen time while learning and screen time while playing were not associated with vision impairment (Table 4).
 

Table 4. Fixed-effects regression analysis of factors associated with visual acuity (in logarithm of the minimum angle of resolution [logMAR]) before and during the coronavirus disease 2019 pandemic (n=128)
 
Discussion
The global spread of the COVID-19 pandemic has affected the education of >1.5 billion children and adolescents worldwide.25 The participants in our study were representative of this important population. They demonstrated declines in VA and vision health during the pandemic, in relation to the excessive use of digital devices; these findings were consistent with the results of previous studies.19 26
 
All students in our study were attending online classes during the pandemic. We observed an increase in the mean daily time spent on digital devices between the pre-pandemic and pandemic periods; these results are consistent with international findings that screen time was greater during the pandemic than before the pandemic.19 Notably, we found that total screen time and screen time while learning significantly changed among rural students but not among non-rural students; these results are also consistent with previous findings.19 This difference presumably occurred because, compared with rural students, non-rural students were more likely to use digital devices and online classes before the pandemic.
 
We observed a significant difference in asthenopia prevalence among students in low-income areas of western China before and during the pandemic; this finding supports the results of previous studies.26 27 Although the risk of asthenopia reportedly increases with screen time,28 there is no published literature concerning changes in asthenopia among students in relation to the COVID-19 pandemic. Similar to previous studies,14 we found that the prevalence of asthenopia was approximately twofold greater among students aged 13 to 17 years than among those aged 9 to 12 years. Furthermore, Moon et al26 reported that symptoms of dry eye diseases were more common among older children than among younger children. Older children spend more time using digital devices, leading to a higher prevalence of asthenopia.29
 
This study showed significant progression of vision impairment in relation to the pandemic; similarly, a study in eastern China revealed that students had worse vision during the pandemic, compared with their vision at pre-pandemic examinations.4 However, screen time has not been associated with vision impairment among students. Furthermore, evidence regarding the impact of digital devices use on vision impairment has been inconsistent,30 31 with computer screen time made students’ vision worse while television viewing had no effect. We speculate that the association will become clearer as school-aged children spend increasing amounts of time using these devices.
 
This study had three important limitations. First, the screen time data were retrospectively collected through a self-reporting mechanism, which may have led to recall bias. However, considering the resource and measurement limitations that researchers encountered during the pandemic, self-reported recall was regarded as the optimal method for collection of screen time data in the present study. Second, the selection of students with poor vision may lead to underestimation of screen time effects on the general population, and the results should be generalised with caution. Third, the study was not designed to accurately distinguish between vision impairment caused by intrinsic factors and vision impairment caused by pandemic-related eye strain.
 
Our findings provide new evidence regarding the effects of increased screen time on asthenopia and vision impairment among students in western rural China during the pandemic; they can also serve as a basis for future research. Although pandemic-related school closures are temporary, the increasing popularity of online classes may accelerate the overall acceptance of digital devices. The use of online learning approaches is associated with multiple vision problems, which merit attention in future studies.
 
Conclusion
The present study demonstrated that asthenopia and vision impairment among students in western rural China were also affected by the pandemic; these findings provide critical insights regarding the effects of the pandemic on vision health in rural students. Moreover, the findings highlight important issues related to childhood vision health during the pandemic; parents, teachers, and eye care providers should consider evidence-based measures to avoid asthenopia and vision impairment in children. The current pace of economic and technological development is leading to increased use of digital devices and online learning approaches, but vision problems in rural China have not received sufficient consideration. Thus, there is a critical need for greater efforts to monitor VA and vision health among students in this region.
 
Author contributions
Concept or design: All authors.
Acquisition of data: Y Ding, H Guan, K Du.
Analysis or interpretation of data: Y Ding, H Guan, K Du, Y Shi.
Drafting of the manuscript: Y Ding, Y Zhang, Z Wang.
Critical revision of the manuscript for important intellectual content: H Guan, Y Shi.
 
All authors contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As an International Editorial Advisory Board member of the journal, Y Shi was not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Acknowledgement
We thank Dr Wenting Liu, Dr Jiaqi Zhu, and staff from the Center for Experimental Economics in Education of Shaanxi Normal University, China for their valuable contributions.
 
Funding/support
H Guan received funding for this study from the National Natural Science Foundation of China (Grant No.: 7180310) and Soft Science Project of Shaanxi Province (Grant No.: 2023-CX-RKX-127). Y Ding received funding for this study from the Fundamental Research Funds for the Central Universities (Grant No.: 2020CSWY018). This study was supported by the 111 Project (Grant No.: B16031). The funders had no role in designing the study, collecting, analysing or interpreting the data, or in drafting this manuscript.
 
Ethics approval
This study protocol was approved by Sun Yat-sen University, China (Registration No.: 2013MEKY018) and all procedures followed the principles of the Declaration of Helsinki. Permission was obtained from the local boards of education in the study area, as well as the principals of all participating schools. All participating children provided oral assent before baseline data collection, and legal guardians provided written informed consent for their children to be enrolled in the study.
 
References
1. World Health Organization. WHO timeline—COVID-19. 2020. Available from: https://www.who.int/news-room/detail/27-04-2020-who-timeline---covid-19. Accessed 13 Sep 2021.
2. Li D, Liu Z, Liu Q, et al. Estimating the efficacy of quarantine and traffic blockage for the epidemic caused by 2019-nCoV (COVID-19): a simulation analysis. medRxiv [Preprint]. 25 Feb 2020. Available from: https://doi.org/10.1101/2020.02.14.20022913. Accessed 13 Sep 2021. Crossref
3. Wang G, Zhang Y, Zhao J, Zhang J, Jiang F. Mitigate the effects of home confinement on children during the COVID-19 outbreak. Lancet 2020;395:945-7. Crossref
4. Wang J, Li Y, Musch DC, et al. Progression of myopia in school-aged children after COVID-19 home confinement. JAMA Ophthalmol 2021;139:293-300. Crossref
5. Kowalska M, Zejda JE, Bugajska J, Braczkowska B, Brozek G, Malińska M. Eye symptoms in office employees working at computer stations [in Polish]. Med Pr 2011;62:1-8.
6. Cumberland PM, Peckham CS, Rahi JS. Inferring myopia over the lifecourse from uncorrected distance visual acuity in childhood. Br J Ophthalmol 2007;91:151-3. Crossref
7. Pascolini D, Mariotti SP. Global estimates of visual impairment: 2010. Br J Ophthalmol 2012;96:614-8. Crossref
8. Resnikoff S, Pascolini D, Mariotti SP, Pokharel GP. Global magnitude of visual impairment caused by uncorrected refractive errors in 2004. Bull World Health Organ 2008;86:63-70. Crossref
9. Jan C, Li SM, Kang MT, et al. Association of visual acuity with educational outcomes: a prospective cohort study. Br J Ophthalmol 2019;103:1666-71. Crossref
10. Roch-Levecq AC, Brody BL, Thomas RG, Brown SI. Ametropia, preschoolers’ cognitive abilities, and effects of spectacle correction. Arch Ophthalmol 2008;126:252-8. Crossref
11. Zhang Z, Xu G, Gao J, et al. Effects of e-learning environment use on visual function of elementary and middle school students: a two-year assessment—experience from China. Int J Environ Res Public Health 2020;17:1560. Crossref
12. Wong CW, Tsai A, Jonas JB, et al. Digital screen time during COVID-19 pandemic: risk for a further myopia boom? Am J Ophthalmol 2021;223:333-7. Crossref
13. Kim J, Hwang Y, Kang S, et al. Association between sexposure to smartphones and ocular health in adolescents. Ophthalmic Epidemiol 2016;23:269-76. Crossref
14. Mohan A, Sen P, Shah C, Jain E, Jain S. Prevalence and risk factor assessment of digital eye strain among children using online e-learning during the COVID-19 pandemic: digital eye strain among kids (DESK study-1). Indian J Ophthalmol 2021;69:140-4. Crossref
15. Guan H, Yu NN, Wang H, et al. Impact of various types of near work and time spent outdoors at different times of day on visual acuity and refractive error among Chinese school-going children. PLoS One 2019;14:e0215827.Crossref
16. Sultana A, Tasnim S, Hossain MM, Bhattacharya S, Purohit N. Digital screen time during the COVID-19 pandemic: a public health concern. Available from: https://f1000research.com/articles/10-81. Accessed 13 Sep 2021. Crossref
17. Nigg CR, Wunsch K, Nigg C, et al. Are physical activity, screen time, and mental health related during childhood, preadolescence, and adolescence? 11-year results from the German Montorik–Modul Longitudinal Study. Am J Epidemiol 2021;190:220-9. Crossref
18. Schmidt SC, Anedda B, Burchartz A, et al. Physical activity and screen time of children and adolescents before and during the COVID-19 lockdown in Germany: a natural experiment. Sci Rep 2020;10:21780. Crossref
19. Aguilar-Farias N, Toledo-Vargas M, Miranda-Marquez S, et al. Sociodemographic predictors of changes in physical activity, screen time, and sleep among toddlers and preschoolers in Chile during the COVID-19 pandemic. Int J Environ Res Public Health 2020;18:176. Crossref
20. Bates LC, Zieff G, Stanford K, et al. COVID-19 impact on behaviors across the 24-hour day in children and adolescents: physical activity, sedentary behavior, and sleep. Children (Basel) 2020;7:138. Crossref
21. National Bureau of Statistics of China, PRC Government. China Statistical Yearbook 2020. Available from: http://www.stats.gov.cn/tjsj/ndsj/2020/indexch.htm. Accessed 14 Sep 2021.
22. National Bureau of Statistics of China, PRC Government. China Statistical Yearbook 2013. Beijing, China: China State Statistical Press; 2013.
23. Seguí Mdel M, Cabrero García J, Crespo A, Verdú J, Ronda E. A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace. J Clin Epidemiol 2015;68:662-73. Crossref
24. Yi H, Zhang L, Ma X, et al. Poor vision among China’s rural primary school students: prevalence, correlates and consequences. China Econ Rev 2015;33:247-62. Crossref
25. United Nations International Children’s Emergency Fund. Don’t let children be the hidden victims of COVID-19 pandemic. Available from: https://www.unicef.org/press-releases/dont-let-children-be-hidden-victims-covid-19-pandemic. Accessed 6 Oct 2020.
26. Moon JH, Kim KW, Moon NJ. Smartphone use is a risk factor for pediatric dry eye disease according to region and age: a case control study. BMC Ophthalmol 2016;16:188. Crossref
27. Moon JH, Lee MY, Moon NJ. Association between video display terminal use and dry eye disease in school children. J Pediatr Ophthalmol Strabismus 2014;51:87-92. Crossref
28. Rechichi C, De Mojà G, Aragona P. Video game vision syndrome: a new clinical picture in children? J Pediatr Ophthalmol Strabismus 2017;54:346-55.Crossref
29. Mowatt L, Gordon C, Santosh AB, Jones T. Computer vision syndrome and ergonomic practices among undergraduate university students. Int J Clin Pract 2018;72:e13035. Crossref
30. Terasaki H, Yamashita T, Yoshihara N, Kii Y, Sakamoto T. Association of lifestyle and body structure to ocular axial length in Japanese elementary school children. BMC Ophthalmol 2017;17:123. Crossref
31. Fernández-Montero A, Olmo-Jimenez JM, Olmo N, et al. The impact of computer use in myopia progression: a cohort study in Spain. Prev Med 2015;71:67-71. Crossref

Artificial intelligence for detection of intracranial haemorrhage on head computed tomography scans: diagnostic accuracy in Hong Kong

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE  CME
Artificial intelligence for detection of intracranial haemorrhage on head computed tomography scans: diagnostic accuracy in Hong Kong
Jill M Abrigo, MD, FRCR; Ka-long Ko, MPhil; Qianyun Chen, MSc; Billy MH Lai, MB, BS, FHKAM (Radiology); Tom CY Cheung, MB ChB, FHKAM (Radiology); Winnie CW Chu, MB ChB, FHKAM (Radiology); Simon CH Yu, MB, BS, FHKAM (Radiology)
Department of Imaging and Interventional Radiology, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong SAR, China
 
Corresponding author: Dr Jill M Abrigo (jillabrigo@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The use of artificial intelligence (AI) to identify acute intracranial haemorrhage (ICH) on computed tomography (CT) scans may facilitate initial imaging interpretation in the accident and emergency department. However, AI model construction requires a large amount of annotated data for training, and validation with real-world data has been limited. We developed an algorithm using an open-access dataset of CT slices, then assessed its utility in clinical practice by validating its performance on CT scans from our institution.
 
Methods: Using a publicly available international dataset of >750 000 expert-labelled CT slices, we developed an AI model which determines ICH probability for each CT scan and nominates five potential ICH-positive CT slices for review. We validated the model using retrospective data from 1372 non-contrast head CT scans (84 [6.1%] with ICH) collected at our institution.
 
Results: The model achieved an area under the curve of 0.842 (95% confidence interval=0.791-0.894; P<0.001) for scan-based detection of ICH. A pre-specified probability threshold of ≥50% for the presence of ICH yielded 78.6% accuracy, 73% sensitivity, 79% specificity, 18.6% positive predictive value, and 97.8% negative predictive value. There were 62 true-positive scans and 22 false-negative scans, which could be reduced to six false-negative scans by manual review of model-nominated CT slices.
 
Conclusions: Our model exhibited good accuracy in the CT scan–based detection of ICH, considering the low prevalence of ICH in Hong Kong. Model refinement to allow direct localisation of ICH will facilitate the use of AI solutions in clinical practice.
 
 
New knowledge added by this study
  • A deep learning–based artificial intelligence model trained on an international dataset of computed tomography (CT) slices exhibited good accuracy in the detection of intracranial haemorrhage (ICH) on CT scans in Hong Kong.
  • Considering the 6% prevalence of ICH in our institution, and using a pre-specified probability threshold of ≥50%, the model detected 74% of ICH-positive scans; this outcome improved to 93% via manual review of model-nominated images.
Implications for clinical practice or policy
  • Considering the expected clinical applications, model refinement is needed to improve diagnostic performance prior to additional tests in a clinical setting.
  • Our model may facilitate assessment of CT scans by physicians with different degrees of experience in ICH detection, an important aspect of real-world clinical practice.
 
 
Introduction
Head computed tomography (CT) scans constitute the main imaging investigation during the evaluation of trauma and stroke; they are also important in the initial work-up of headache and other non-specific neurological complaints. In Prince of Wales Hospital of Hong Kong alone, >25 000 head CT scans were performed in 2019 during the clinical management of patients who presented to the Accident and Emergency Department. Computed tomography scans are composed of multiple cross-sectional images (ie, slices), which may be challenging to interpret. Typically, these scans are initially reviewed by frontline physicians prior to assessment by radiologists, and delays during the review process can be substantial. Thus, the timely recognition of an acute finding, such as intracranial haemorrhage (ICH), is limited by the competence and availability of frontline physicians.
 
The presence and location or type of ICH impacts the next clinical step, which can be further imaging investigations, medical management, or surgical intervention.1 Furthermore, a confirmation of ICH absence can also be useful in clinical management. For example, it can facilitate safe discharge from the hospital when appropriate; in patients with acute stroke, the absence of ICH is an important exclusion criterion that influences treatment selection.2
 
The use of artificial intelligence (AI) for ICH detection is a topic with global relevance considering its diagnostic impact and ability to optimise workflow, both of which have high practical value.3 4 In the accident and emergency department, AI can facilitate ICH detection in head CT scans during times when a radiologist is unavailable. Although there have been multiple reports of deep learning methods with high accuracy in the detection of ICH, the models in those reports were developed using in-house labelled training datasets and validated using a limited number of cases.3 5 6 7 8 Recently, the Radiological Society of North America (RSNA) publicly released >25 000 multi-centre head CT scans with slices that have been labelled with or without ICH by experts.9 Here, we developed a model using this RSNA dataset, then validated its performance on CT scans from our institution to determine its potential for clinical application in Hong Kong.
 
Methods
Ethical considerations
This study was approved by the Joint Chinese University of Hong Kong—New Territories East Cluster Clinical Research Ethics Committee (Ref No.: 2020.061). The model was developed from a publicly available dataset and validated on retrospectively acquired data from our institution. The requirement for patient consent was waived by the Committee given the retrospective design of the study and anonymisation of all CT scans prior to use.
 
The results of this diagnostic accuracy study are reported in accordance with the Standards for Reporting of Diagnostic Accuracy Studies guidelines.10
 
Public dataset: model development and internal validation
We acquired 25 312 head CT scans from four institutions in North and South America available in the RNSA open dataset,11 and were split into slices (each slice ≥5 mm thick), which were then randomly shuffled and annotated by 60 volunteer experts from the American Society of Neuroradiology. Each CT slice was labelled to indicate the presence and type of ICH. When present, ICH was classified according to its location, namely, intraparenchymal haemorrhage (IPH), subarachnoid haemorrhage (SAH), subdural haemorrhage (SDH), epidural haemorrhage (EDH), and intraventricular haemorrhage (IVH). The RSNA dataset comprised 752 807 CT slices, which were divided into a training subset (85%) and test subset (15%) for internal validation. Each subset consisted of approximately 86% negative ICH slices and 14% positive ICH slices, along with the following proportions of ICH subtypes: 4.8% IPH, 4.7%-4.8% SAH, 6.3% SDH, 0.4% EDH, and 3.4%-3.5% IVH.
 
The convolutional neural network (CNN) VGG (named after the Visual Geometry Group from the University of Oxford, United Kingdom) is an effective end-to-end algorithm for image detection.12 In this study, we adopted the VGG architecture with a customised output layer and loss function optimised for multi-label classification. To adjust for the low prevalence of ICH in the training set, each subtype’s logit outputs zi were concatenated as independent channels after a sigmoid output layer:
 
 
The performance of the CNN model was evaluated by binary cross-entropy loss and Sørensen–Dice loss13:
 
 
The loss functions were linearly combined with weighted values to produce the multi-label classification loss:
 
 
Where wi denotes the class prevalence weight, and α and β denote respective loss mix ratios. For simplicity, wi=1∕(n-1) for all subtype classes and wi=1 for ‘ANY’ was treated as an independent ICH class.
 
The model was trained with software written in our laboratory using the end-to-end open-source machine learning platform TensorFlow on an Nvidia Titan Xp graphics processing unit.
 
During internal validation (ie, slice-level performance for the detection of any type of ICH), the model achieved an area under the curve (AUC) of 0.912 (95% confidence interval [CI]=0.909-0.915) with sensitivity and specificity of 85% and 80%, respectively. Additionally, for the detection of specific types of ICH, the following AUC (95% CI) and sensitivity/specificity values were obtained: 0.860 (0.853-0.867) and 77%/88% for IPH, 0.835 (0.829-0.842) and 75%/82% for SAH, 0.850 (0.845-0.855) and 74%/83% for SDH, 0.813 (0.790-0.836) and 72%/80% for EDH, and 0.870 (0.861-0.879) and 79%/89% for IVH.
 
Prince of Wales Hospital dataset: external validation
The consecutive head CT scans of patients aged ≥18 years who underwent initial brain CT scans in the Accident and Emergency Department of Prince of Wales Hospital from 1 to 31 July 2019 were included, thereby simulating the point prevalence of ICH.
 
Head CT scans were acquired on a 64-slice CT scanner. Data analyses were conducted using reformatted 5-mm-thick slices, which can be accessed and viewed by physicians at all hospital workstations. DICOM (Digital Imaging and Communications in Medicine) images were de-identified prior to data analyses. The large volume of data was explored through the identification of relevant CT data using an automated program which selected scans with specific DICOM tags. Computed tomography scans performed for follow-up purposes or after recent intracranial surgery, as well as scans without radiologist reports, were excluded from the analysis.
 
We reviewed the corresponding radiology reports to determine the presence and type of ICH (IPH, SAH, SDH, EDH, or IVH). The CT scans were assessed by radiologists or senior radiology trainees; the corresponding reports were regarded as scan-level ground truth labels for analysis, consistent with their use as clinical reference standards in Hong Kong. Considering its rarity, EDH was grouped with and labelled as SDH, which has a similar appearance on CT. For scans with false-negative results, we performed post-hoc labelling of model-nominated CT slices. All scans were assessed prior to model construction; thus, the scan reports were established without knowledge of the AI results. Furthermore, all scans comprised the external validation dataset and constituted ‘unseen data’ for the model.
 
Statistical analysis
The diagnostic accuracies of the model for the detection of any type of ICH and each type of ICH were determined by calculation of the AUC with 95% CI, using DeLong et al’s method.14 To construct the confusion matrix during external validation, CT scans were classified as ICH-positive using a pre-specified probability threshold of ≥50%8; the corresponding sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy were calculated. Additional probability thresholds were established to achieve 90% sensitivity and 90% specificity for the presence of any type of ICH. Statistical analysis was performed using R software (version 4.0.2; R Foundation for Statistical Computing, Vienna, Austria), and the threshold for statistical significance was set at P<0.05.
 
Results
Model output
Figure 1 shows an example of the model output. The model report includes an overall probability for the presence of ICH (labelled ‘A’ in Fig 1). Additionally, the model selects five representative CT image slices which are likely to contain ICH (one such slice is labelled ‘B’ in Fig 1), along with the probability of each ICH type in each representative slice (labelled ‘C’ in Fig 1). All scans were successfully analysed by the model.
 

Figure 1. Sample model output, highlighting three types of information provided by the model. A: intracranial haemorrhage (ICH) probability; B: model-nominated image with possible ICH and corresponding slice number in the computed tomography (CT) scan; C: probability of each ICH type for the corresponding CT slice
 
Prince of Wales Hospital data and model validation
The Standards for Reporting of Diagnostic Accuracy Studies diagram and corresponding confusion matrix are shown in Figure 2. In total, 1372 head CT scans (84 [6.1%] with ICH) were included in the analysis. The distribution of ICH types is summarised in the Table.
 

Figure 2. Standards for the Reporting of Diagnostic Accuracy flowchart for external validation of the model using computed tomography (CT) scans from Prince of Wales Hospital. The confusion matrix is shown below the flowchart
 

Table. Distribution of computed tomography (CT) scans without and with intracranial haemorrhage in the Prince of Wales Hospital dataset (n=1372)
 
Diagnostic performance of scan-based detection for any type of intracranial haemorrhage
The model achieved an AUC of 0.842 (95% CI=0.791-0.894; P<0.001) for the identification of any type of ICH. Using a probability threshold of ≥50% for the presence of ICH, the accuracy, sensitivity, specificity, PPV, and NPV were 78.6%, 73%, 79%, 18.6%, and 97.8%, respectively. In total, 62 scans were true positive, 22 were false negative, 1017 were true negative, and 271 were false positive (Fig 2).
 
Among the 62 true-positive scans, the model output in two cases did not contain ICH-positive CT slices: 6-mm IPH in the pons (n=1) and trace SAH in a patient with multiple metastatic tumours (n=1). Figure 3 shows selected cases of model-nominated CT slices with subtle ICH.
 

Figure 3. Representative computed tomography slices from model outputs for selected true-positive scans showing small or subtle intracranial haemorrhage. Arrowheads have been added to indicate intracranial haemorrhage. (a) Haemorrhage within a cystic tumour; (b, d, and e) subarachnoid haemorrhage; (c) intraparenchymal haemorrhage; (f) subdural haemorrhage
 
Among the 22 false-negative scans, 19 had one type of ICH (6 IPH, 7 SAH, 5 SDH, and 1 IVH), two had two types of ICH (1 IPH+SAH and 1 SAH+SDH), and one had three types of ICH (IPH+SAH+IVH). In 16 scans, the model selected at least one ICH-positive CT slice which allowed correct reclassification (Fig 4). The remaining six scans with undetected ICH (Fig 5) comprised small midbrain IPH (n=1), trace SAH (n=3), and thin SDH/EDH (n=2). One of the three cases of undetected trace SAH was visualised on thin CT slices but not on thick CT slices.
 

Figure 4. Representative computed tomography slices from model outputs for false-negative scans showing intracranial haemorrhage. Arrowheads have been added to indicate intracranial haemorrhage. (a) Intraventricular haemorrhage; (b, c, f, g, h, and i) subarachnoid haemorrhage; (d, e, l, m, n, o, and p) intraparenchymal haemorrhage; (j and k) subdural haemorrhage
 

Figure 5. False-negative computed tomography scans with undetected intracranial haemorrhage. Arrowheads have been added to indicate intracranial haemorrhage. (a-e) Representative images of intracranial haemorrhage in thick computed tomography slices [(a) intraparenchymal haemorrhage; (b and d) subdural haemorrhage; (c and e) subarachnoid haemorrhage]. (f) Trace subarachnoid haemorrhage that was visible in reformatted coronal thin computed tomography slices but not thick computed tomography slices
 
A probability threshold of 20.4% yielded a sensitivity of 90% (40% specificity, 9% PPV, and 98.3% NPV), whereas a threshold of 65.7% yielded a specificity of 90% (64% sensitivity, 30% PPV, and 97.4% NPV), for the detection of ICH.
 
Diagnostic performance of scan-based detection for each type of intracranial haemorrhage
At a probability threshold of ≥50%, the following AUC (95% CI) and corresponding sensitivity/specificity were obtained for each type of ICH: 0.930 (0.892-0.968) and 4%/100% for IPH, 0.766 (0.684-0.849) and 12%/96% for SAH, 0.865 (0.783-0.947) and 75%/90% for SDH/EDH, and 0.935 (0.852-1.000) and 85%/93% for IVH.
 
Discussion
In this study, we used a large international training dataset to construct a model for ICH detection, then conducted external validation using data from Hong Kong. To overcome the discrepancy between the training dataset (composed of CT slices) and the validation dataset (composed of CT scans), and considering our goal of clinical application, we designed a model that iteratively conducts assessments at the slice level to generate an overall probability at the scan level, then nominates the slices with the highest ICH probability for clinician evaluation. Furthermore, we performed validation using a point-prevalence approach to determine the diagnostic performance of the model in a real-world setting. Considering the 6% prevalence of ICH in our institution, and using a pre-specified probability threshold of ≥50%, the model detected 74% of ICH-positive scans; this outcome improved to 93% via manual review of model-nominated images.
 
Artificial intelligence for intracranial haemorrhage detection: research and reality
Multiple studies have successfully used AI for ICH detection via deep learning methods, typically involving variants of CNNs. For example, Arbabshirani et al5 (deep CNN, >37 000 training CT scans) reported an AUC of 0.846 on 342 CT scans; Chang et al4 (two-dimensional/three-dimensional CNN, 10 159 training CT scans) reported an AUC of 0.983 on 862 prospectively collected CT scans. Furthermore, Chilamkurthy et al3 (CNN, >290 000 training CT scans) reported an AUC of 0.94 on 491 CT scans; Lee et al7 (four deep CNNs, 904 training CT scans) reported an AUC of 0.96 on 214 CT scans. Finally, Ye et al8 (three-dimensional joint CNN-recurrent neural network, 2537 training CT scans) reported an AUC of 1.0 on 299 CT scans; Kuo et al6 (patch-based fully CNN, 4396 training CT scans) reported an AUC of 0.991 on 200 CT scans. Although these results demonstrate the high diagnostic performance that can be achieved using deep learning methods for ICH detection, the studies were conducted using in-house training datasets, which are laborious to produce and limit subsequent clinical applications. Moreover, the results may not be directly applicable to clinical practice, considering the limited number (generally <500) of CT scans during validation, as well as the effect of prevalence on sensitivity and specificity. Yune et al15 demonstrated this problem with a deep learning model that had an AUC of 0.993 on selected cases, which decreased to 0.834 when validated on CT scans collected over a 3-month period; notably, this is comparable with the AUC of our model. Thus, model performance in a real-world setting can reduce the risk of bias and serve as a better indicator of clinical relevance.16
 
Artificial intelligence for intracranial haemorrhage detection: our approach
The development of an AI model is the first step in a long process of clinical translation. In this study, we aimed to construct an algorithm that was reasonably comparable with radiologist performance, prior to further tests in a clinical setting. We recognise that our model is not an end-product; it constitutes an initial exploration of the potential for an international dataset–derived algorithm to be implemented in our institution. To avoid problems associated with the lack of an annotated dataset from Hong Kong, we utilised a dataset labelled by international experts, which is the most extensive open-access dataset currently available. However, the model achieved limited diagnostic accuracy, mainly because of type 1 error (ie, identification of false positives). The training dataset was composed of CT slices, whereas the model functioned at the CT scan level, iteratively assessing all slices to identify slices with highest ICH probability. If any slice identified in a single scan is considered positive, the model reports the CT scan as ‘ICH-positive’. Thus, any detection of false positives at the slice level will lead to amplification of the false-positive rate at the scan level. This strategy resulted in a low PPV (~19%) and a high NPV (~98%). To reduce the detection of false positives, we included a CT slice nomination feature in the model, which highlights CT slices with the highest probability of ICH. This facilitates manual review and reduces the black-box nature of the model.
 
Potential implications of artificial intelligence–detected intracranial haemorrhage in clinical practice
During validation, the model was tested using an ICH point–prevalence approach to elucidate the potential clinical implications of the classification outcomes. With respect to true positives, most ICH-positive scans were detected; most of these scans had large areas of ICH, which presumably could be easily identified by non-radiologists. However, in six cases, the model correctly nominated CT slices with small areas of ICH. In two cases, the nominated images did not have ICH, which could potentially have led to incorrect reclassification of the scan as a false positive.
 
Furthermore, there were many false positives. Such results may reduce physician confidence despite the correct interpretation of an ICH-negative scan; they may lead to overdiagnosis (with prolonged hospitalisation) or further investigations, such as a follow-up CT scan that involves additional radiation exposure.
 
With respect to false negatives, the model output includes a secondary mechanism of image review that allowed correct reclassification of 16 scans, increasing the rate of ICH detection from 74% to 93%. In five cases, ICH was conspicuous on the nominated images; in 11 cases, the nominated images displayed subtle ICH. In cases of subtle ICH, it is possible to overlook the trace amount of ICH on the nominated CT slice. The same problem may affect true-positive scans, which may be misclassified as false positives unless subtle ICH is recognised in the nominated image. Unfortunately, the model-generated probability of each type of ICH in each selected image did not facilitate the localisation of ICH.
 
Based on our primary clinical motivation to develop this model, we focused on CT scans with reformatted thick CT slices that can be viewed in all hospital workstations by non-radiologists. In practice, radiologists use dedicated imaging workstations to view sub-millimetre thin CT slices with greater sensitivity, which can display smaller or subtler pathologies. Thus, there is limited capacity for ICH detection in thick CT slices; this was highlighted in a case of trauma-related trace SAH, which was visible on thin CT slices but not thick CT slices. Subarachnoid haemorrhage is reportedly the most difficult type of ICH to interpret.17 In practice, a patient with a very small amount of isolated traumatic SAH would likely receive conservative treatment, and the pathology could reasonably await detection via radiologist assessment.
 
Limitations
This study had some limitations. First, diagnostic accuracy would have been more comprehensively assessed using a larger number of CT scans or a longer point prevalence; however, we limited the assessment to CT scans collected over a 1-month period, considering the preliminary stage of model development. Second, the CT scans were assessed by radiologists and senior radiology trainees who may have different degrees of experience in ICH detection17; importantly, this limitation reflects the real-world setting where model deployment is intended. Finally, the model was specifically trained for the detection of ICH; it was not trained for the detection of other clinically significant non-ICH findings (eg, non-haemorrhagic tumours, hydrocephalus, or mass effect). The detection of these other pathologies will require dedicated models with customised training datasets.
 
Conclusion
In this study, we used a CT slice–based dataset to develop an algorithm for CT scan–based ICH detection; we validated the model using our institutional data with a point-prevalence approach, yielding insights regarding its utility in real-world clinical practice. Although the model demonstrated good accuracy, its diagnostic performance is currently limited to the intended clinical application. However, our results support further development of the model to improve its accuracy and incorporate a mechanism that can facilitate visual confirmation of ICH location. These modifications would enhance the interpretability of the deep learning model and would be useful for further evaluation of clinical applications.
 
Author contributions
Concept or design: JM Abrigo, KL Ko, WCW Chu, SCH Yu.
Acquisition of data: JM Abrigo, Q Chen, WCW Chu, BMH Lai, TCY Cheung.
Analysis or interpretation of data: All authors.
Drafting of the manuscript: JM Abrigo, KL Ko, Q Chen.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
We thank our department colleagues Mr Kevin Lo for anonymising and downloading Digital Imaging and Communications in Medicine data, and we thank Mr Kevin Leung for preparing figures for this manuscript.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This research was approved by the Joint Chinese University of Hong Kong—New Territories East Cluster Clinical Research Ethics Committee (Ref No.: 2020.061). The requirement for patient consent was waived by the Committee given the retrospective design of the study and anonymisation of all computed tomography scans prior to use.
 
References
1. Caceres JA, Goldstein JN. Intracranial hemorrhage. Emerg Med Clin North Am 2012;30:771-94. Crossref
2. Powers WJ, Rabinstein AA, Ackerson T, et al. Guidelines for the early management of patients with acute ischemic stroke: 2019 update to the 2018 guidelines for the early management of acute ischemic stroke: a guideline for healthcare professionals from the American Heart Association/American Stroke Association. Stroke 2019;50:e344-418. Crossref
3. Chilamkurthy S, Ghosh R, Tanamala S, et al. Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet 2018;392:2388-96. Crossref
4. Chang PD, Kuoy E, Grinband J, et al. Hybrid 3D/2D convolutional neural network for hemorrhage evaluation on head CT. AJNR Am J Neuroradiol 2018;39:1609-16. Crossref
5. Arbabshirani MR, Fornwalt BK, Mongelluzzo GJ, et al. Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. NPJ Digit Med 2018;1:9. Crossref
6. Kuo W, Häne C, Mukherjee P, Malik J, Yuh EL. Expert-level detection of acute intracranial hemorrhage on head computed tomography using deep learning. Proc Natl Acad Sci U S A 2019;116:22737-45. Crossref
7. Lee H, Yune S, Mansouri M, et al. An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets. Nat Biomed Eng 2019;3:173-82. Crossref
8. Ye H, Gao F, Yin Y, et al. Precise diagnosis of intracranial hemorrhage and subtypes using a three-dimensional joint convolutional and recurrent neural network. Eur Radiol 2019;29:6191-201. Crossref
9. Flanders AE, Prevedello LM, Shih G, et al. Construction of a machine learning dataset through collaboration: The RSNA 2019 Brain CT Hemorrhage Challenge. Radiol Artif Intell 2020;2:e190211. Crossref
10. Bossuyt PM, Reitsma JB, Bruns DE, et al. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. Radiology 2015;277:826-32. Crossref
11. Radiological Society of North America. RSNA intracranial hemorrhage detection: identify acute intracranial hemorrhage and its subtypes. Available from: https://www.kaggle.com/c/rsna-intracranial-hemorrhage-detection/data. Accessed 28 Mar 2023. Crossref
12. Zhang X, Zou J, He K, Sun J. Accelerating very deep convolutional networks for classification and detection. IEEE Trans Pattern Anal Mach Intell 2016;38:1943-55. Crossref
13. Milletari F, Navab N, Ahmadi SA. V-Net: fully convolutional neural networks for volumetric medical image segmentation. 2016 Fourth International Conference on 3D Vision (3DV). USA (CA): Stanford; 2016: 565-71. Crossref
14. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837-45. Crossref
15. Yune S, Lee H, Pomerantz S, et al. Real-world performance of deep-learning–based automated detection system for intracranial hemorrhage. Radiological Society of North America (RSNA) 104th Scientific Assembly and Annual Meeting. McCormick Place, Chicago (IL); 2018.
16. Nagendran M, Chen Y, Lovejoy CA, et al. Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. BMJ 2020;368:m689. Crossref
17. Strub WM, Leach JL, Tomsick T, Vagal A. Overnight preliminary head CT interpretations provided by residents: locations of misidentified intracranial hemorrhage. AJNR Am J Neuroradiol 2007;28:1679-82. Crossref

Evaluation of contemporary olanzapine- and netupitant/palonosetron-containing antiemetic regimens for chemotherapy-induced nausea and vomiting

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Evaluation of contemporary olanzapine- and netupitant/palonosetron-containing antiemetic regimens for chemotherapy-induced nausea and vomiting
Christopher CH Yip1; L Li, MB, BS, FHKAM (Medicine)1; Thomas KH Lau, MB, BS, FHKAM (Medicine)1; Vicky TC Chan, MB, BS, FHKAM (Medicine)1; Carol CH Kwok, MB, BS, FHKAM (Radiology)2; Joyce JS Suen, MB, BS, FHKAM (Radiology)2; Frankie KF Mo, PhD1; Winnie Yeo, MB, BS, FHKAM (Medicine)1
1 Department of Clinical Oncology, Prince of Wales Hospital, Hong Kong
2 Department of Clinical Oncology, Princess Margaret Hospital, Hong Kong
 
Corresponding author: Prof Winnie Yeo (winnieyeo@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This post-hoc analysis retrospectively assessed data from two recent studies of antiemetic regimens for chemotherapy-induced nausea and vomiting (CINV). The primary objective was to compare olanzapine-based versus netupitant/palonosetron (NEPA)-based regimens in terms of controlling CINV during cycle 1 of doxorubicin/cyclophosphamide (AC) chemotherapy; secondary objectives were to assess quality of life (QOL) and emesis outcomes over four cycles of AC.
 
Methods: This study included 120 Chinese patients with early-stage breast cancer who were receiving AC; 60 patients received the olanzapine-based antiemetic regimen, whereas 60 patients received the NEPA-based antiemetic regimen. The olanzapine-based regimen comprised aprepitant, ondansetron, dexamethasone, and olanzapine; the NEPA-based regimen comprised NEPA and dexamethasone. Patient outcomes were compared in terms of emesis control and QOL.
 
Results: During cycle 1 of AC, the olanzapine group exhibited a higher rate of ‘no use of rescue therapy’ in the acute phase (olanzapine vs NEPA: 96.7% vs 85.0%, P=0.0225). No parameters differed between groups in the delayed phase. The olanzapine group had significantly higher rates of ‘no use of rescue therapy’ (91.7% vs 76.7%, P=0.0244) and ‘no significant nausea’ (91.7% vs 78.3%, P=0.0408) in the overall phase. There were no differences in QOL between groups. Multiple cycle assessment revealed that the NEPA group had higher rates of total control in the acute phase (cycles 2 and 4) and the overall phase (cycles 3 and 4).
 
Conclusion: These results do not conclusively support the superiority of either regimen for patients with breast cancer who are receiving AC.
 
 
New knowledge added by this study
  • The olanzapine-based regimen (aprepitant, ondansetron, dexamethasone, and olanzapine) and the NEPA-based regimen (netupitant, palonosetron, and dexamethasone) demonstrated similar efficacies in terms of controlling chemotherapy-induced nausea and vomiting among patients with early-stage breast cancer.
  • Quality of life did not significantly differ between patients receiving the olanzapine-based regimen and patients receiving the NEPA-based regimen.
Implications for clinical practice or policy
  • The available data suggest that olanzapine-containing antiemetic regimens can be used without aprepitant, particularly when seeking to reduce medical expenses.
  • Antiemetic efficacy may potentially be enhanced if NEPA is administered in combination with dexamethasone and olanzapine as a four-drug antiemetic regimen.
 
 
Introduction
Patients with breast cancer receiving (neo)adjuvant treatment exhibit improved prognoses.1 However, chemotherapy regimens for breast cancer are associated with various degrees of chemotherapy-induced nausea and vomiting (CINV). The doxorubicin/cyclophosphamide (AC) regimen is one of the most frequently prescribed regimens for patients with breast cancer who are receiving (neo) adjuvant chemotherapy; AC is among the highly emetogenic chemotherapies with ≥90% risk of nausea and vomiting.
 
In situations where a neurokinin-1 receptor antagonist (NK1RA) is accessible, most current guidelines for AC(-like) chemotherapy recommend the use of a prophylactic triplet antiemetic regimen that consists of an NK1RA, a 5-hydroxytryptamine type-3 receptor antagonist (5HT3RA), and a corticosteroid, with or without olanzapine.2 3 4 In addition to earlier NK1RAs (eg, aprepitant, fosaprepitant, and rolapitant), netupitant/palonosetron (NEPA) [Akynzeo], which is a combination of an NK1RA (netupitant 100 mg) and a second-generation 5HT3RA (palonosetron 0.5 mg), has been available in the past decade. Although palonosetron constitutes a more potent 5HT3RA,5 it also has synergistic interactions with netupitant that include interference with 5HT3 receptor cross-talk and enhancement of the netupitant-mediated effect on NK1 receptor internalisation.6 7
 
In a recent systematic review and meta-analysis, Yokoe et al8 compared different antiemetic regimens to assess their control of CINV in patients receiving highly emetogenic chemotherapy regimens. The authors arbitrarily defined the ‘conventional’ regimen as a three-drug regimen that contained dexamethasone, a first- or second-generation secondgeneration 5HT3RA, and an earlier NK1RA compound (aprepitant, fosaprepitant, or rolapitant); they defined ‘new’ regimens as regimens that contained NEPA or olanzapine. The results indicated that, compared with conventional regimens, new regimens containing NEPA were more effective in terms of producing a complete response (ie, absence of vomiting and no use of rescue therapy). Additionally, Yokoe et al8 showed that olanzapine-containing regimens were most effective in terms of producing a complete response, particularly when olanzapine was added to a triplet regimen of an NK1RA, a 5HT3RA, and dexamethasone. These findings were supported by the results of a prospective randomised study published in 2020, which directly compared an olanzapine-containing four-drug regimen with a standard triplet antiemetic regimen (consisting of aprepitant, ondansetron, and dexamethasone) for the prevention of CINV in patients receiving AC chemotherapy.9
 
Here, we conducted a post-hoc analysis through retrospective assessment of individual patient data from two previously reported prospective antiemetic studies that involved Chinese patients with breast cancer.9 10 We hypothesised that a four-drug antiemetic regimen (consisting of an NK1RA, a 5HT3RA, dexamethasone, and olanzapine) would remain superior to a three-drug regimen (consisting of an NK1RA, a 5HT3RA, and dexamethasone) that included NEPA as a combination NK1RA and 5HT3RA agent. The primary objective was to compare the efficacies of olanzapine- and NEPA-containing antiemetic regimens in terms of controlling CINV during the first cycle of AC. The secondary objectives were: (1) to assess quality of life (QOL) outcomes in patients receiving these treatments during the first cycle of AC, and (2) to assess emesis control outcomes in patients receiving these treatments over multiple cycles of AC.
 
Methods
Patients
This study constituted a post-hoc analysis of data from two recently reported prospective studies. The first prospective study investigated emesis outcomes in patients with breast cancer who received a standard triplet antiemetic regimen (ie, aprepitant, ondansetron, and dexamethasone) with or without olanzapine9; after the first study, a second prospective study was conducted to assess the antiemetic efficacy of NEPA and dexamethasone.10 These studies were conducted with institutional ethics approval and were registered at ClinicalTrials.gov (NCT03386617 and NCT03079219, respectively). For the post-hoc analysis, data were extracted from the first study regarding patients who received an olanzapine plus aprepitant-containing four-drug antiemetic regimen; data were extracted from the second study regarding patients who received NEPA and dexamethasone. These patients were categorised into the ‘olanzapine’ and ‘NEPA’ groups, respectively.
 
Inclusion criteria were similar for the two studies. Specifically, patients were eligible if they were women of Chinese ethnicity, were aged >18 years, had early-stage breast cancer, and planned to receive a regimen of (neo)adjuvant AC. All study participants were required to read, understand, and complete study questionnaires and diaries in Chinese. Exclusion criteria included abnormal bone marrow, renal, or hepatic functions; receipt or planned receipt of radiation therapy to the abdomen or pelvis within 7 days prior to initial administration of study treatment; presence of grade 2 to 3 nausea, as defined by the National Cancer Institute Common Terminology Criteria for Adverse Events version 4.0,11 or vomiting within 24 hours prior to initial administration of the study treatment; presence of an active infection or any uncontrolled disease; a history of illicit drugs, including marijuana or alcohol abuse; mental incapacitation; and/or presence of a clinically significant emotional or psychiatric disorder. Written consent was provided by eligible patients prior to enrolment in the studies.
 
Study treatment
Patients in the olanzapine group received olanzapine 10 mg, aprepitant 125 mg, dexamethasone 12 mg, and ondansetron 8 mg before chemotherapy on day 1; they also received ondansetron 8 mg 8 hours after chemotherapy. Subsequently, they received aprepitant 80 mg daily on days 2-3 and olanzapine 10 mg daily on days 2-5.
 
Patients in the NEPA group received one capsule of NEPA (netupitant 300 mg/palonosetron 0.50 mg) with dexamethasone 12 mg before chemotherapy on day 1. Subsequently, they received dexamethasone 4 mg twice per day on days 2-3.
 
Study assessments
At the initiation of chemotherapy on day 1, individual patients were provided a diary to record the date and time of their symptoms of vomiting and nausea for 120 hours after the AC infusion; the use of any rescue medication was also recorded. On days 2-6, patients rated their symptoms of nausea for the previous 24 hours using a visual analogue scale (in which 0 mm implied no nausea, whereas 100 mm implied nausea that was ‘as bad as it could be’). Additionally, on day 1 (before infusion of AC) and day 6 (after completion of the diary), patients completed the Functional Living Index-Emesis (FLIE) questionnaire. A research nurse/assistant called individual patients on days 2-6 to remind them to take the study medications, complete the patient diary, and complete the FLIE questionnaire.
 
Assessment of efficacy and safety
Antiemetic efficacy was measured across three overlapping time periods. The ‘acute’ phase comprised 0 to 24 hours from the infusion of AC; the ‘delayed’ phase comprised 24 to 120 hours from the infusion of AC; the ‘overall’ phase comprised 0 to 120 hours from the infusion of AC.
 
Variables used to assess antiemetic efficacy were ‘complete response’, ‘no vomiting’, ‘no significant nausea’, ‘no nausea’, ‘no use of rescue therapy’, ‘complete protection’, and ‘total control’; definitions of these variables are provided in Table 1. The proportions of patients who exhibited these variables were recorded separately. Additionally, the ‘time to first vomiting’ in cycle 1 was determined using information recorded in patients’ diaries.
 

Table 1. Definitions of variables used to assess antiemetic efficacy
 
Quality of life was evaluated using the Chinese version of self-reported FLIE questionnaires from individual patients.12 The FLIE questionnaire consists of a nausea domain (9 items) and a vomiting domain (9 items). All scores were transformed to ensure that higher scores indicated worse impact on QOL.
 
Statistical analyses
A modified intention-to-treat approach was used for all efficacy analyses; specifically, analyses included patients who had received chemotherapy, had completed the study procedures from 0 to 120 hours in cycle 1 of AC, and had no major protocol violations.
 
To achieve the primary objective of this study, the efficacies of the two antiemetic regimens were based on the proportions (including 95% confidence intervals) of patients who achieved complete response during the acute, delayed, and overall phases after AC infusion in cycle 1. Other parameters compared in cycle 1 of AC were ‘time to first vomiting’, ‘no vomiting’, ‘no significant nausea’, no nausea’, ‘no use of rescue therapy’, ‘complete protection’, and ‘total control’.
 
To achieve the secondary objectives, QOL was compared between the two antiemetic regimens based on assessments of the nausea domain, vomiting domain, and total score (sum of nausea and vomiting domains) of the FLIE questionnaire during cycle 1 of AC. Emesis control over multiple cycles was compared between the two antiemetic regimens by assessing the proportions (including 95% confidence intervals) of patients who achieved ‘complete response’, ‘complete protection’, and ‘total control’ in the acute, delayed, and overall phases.
 
Comparisons between the two antiemetic regimens were made using the Wilcoxon rank-sum test for continuous data and Pearson’s Chi squared test for dichotomous data. Two-sided P values <0.05 were considered statistically significant. The SAS Software version 9.4 (SAS Institute, Cary [NC], United States) was used for analyses.
 
Results
Patient characteristics
Data from 120 patients were included in this study; 60 patients each were enrolled in the NEPA and olanzapine groups. Fifty-six patients (93.3%) in the olanzapine group completed all four cycles of AC, whereas 60 patients (100%) in the NEPA group completed all four cycles of AC.
 
Patient characteristics, including characteristics that could potentially affect CINV, are shown in Table 2. The olanzapine and NEPA groups had very similar patient characteristics, with median ages of 54.5 and 56 years, respectively. Nearly two-thirds of patients in each group had Stage II breast cancer (63.3% and 66.7%, respectively). The percentage of patients with a history of motion sickness was higher in the NEPA group (35%) than in the olanzapine group (16.7%). Furthermore, 30% of patients in the NEPA group and 20% of patients in the olanzapine group received AC as neoadjuvant treatment.
 

Table 2. Baseline characteristics of Chinese patients with early-stage breast cancer included in this analysis
 
Efficacy assessment
Antiemetic efficacies during cycle 1 of AC in the olanzapine and NEPA groups are shown in Table 3. Complete response rates in acute, delayed, and overall phases in cycle 1 did not differ between groups. In the acute phase, the olanzapine group exhibited a higher rate of ‘no use of rescue therapy’ (olanzapine vs NEPA: 96.7% vs 85.0%, P=0.0225). No parameters differed between groups in the delayed phase. In the overall phase, the olanzapine group exhibited significantly higher rates of ‘no use of rescue therapy’ (91.7% vs 76.7%, P=0.0244) and ‘no significant nausea’ (91.7% vs 78.3%, P=0.0408).
 

Table 3. Comparison of antiemetic efficacy during cycle 1 of doxorubicin/cyclophosphamide between olanzapine and netupitant/palonosetron groups
 
The median time to first vomiting was not reached in either group (P=0.3902). Quality of life results during cycle 1 of AC in the olanzapine and NEPA groups, determined using the FLIE questionnaire, are shown in the Figure. There were no significant differences in the nausea domain, vomiting domain, or total score of the FLIE questionnaire between the two groups.
 

Figure. Comparison of quality of life (assessed using Functional Living Index-Emesis questionnaire) throughout cycle 1 of doxorubicin/cyclophosphamide between olanzapine and netupitant/palonosetron groups
 
Antiemetic efficacies over multiple cycles of AC in the olanzapine and NEPA groups are shown in Table 4. In the acute phase, the NEPA group exhibited significantly higher rates of total control in cycle 2 (olanzapine vs NEPA: 59.6% vs 81.7%, P=0.0087) and cycle 4 (63.2% vs 86.7%, P=0.0032). No parameters differed between groups in the delayed phase. In the overall phase, the NEPA group exhibited significantly higher rates of total control in cycle 3 (55.4% vs 73.3%, P=0.0430) and cycle 4 (54.4% vs 75.0%, P=0.0195).
 

Table 4. Comparison of complete response, complete protection, and total control over multiple cycles between olanzapine and netupitant/palonosetron groups
 
Discussion
Chemotherapy-induced nausea and vomiting is a frustrating adverse effect for patients receiving anticancer treatment.13 The administration of optimal antiemetic prophylaxis can help to maintain QOL, while potentially improving patient compliance in terms of completing planned therapies. In current antiemetic prophylaxis guidelines, the European Society of Medical Oncology/Multinational Association of Supportive Care in Cancer, the American Society of Clinical Oncology, and the United States National Comprehensive Cancer Network offer several options regarding antiemetic regimens for patients receiving AC(-like) chemotherapy. These options mainly involve the combination of a 5HT3RA and corticosteroids, with or without an NK1RA and olanzapine.2 3 4 In particular, the incorporation of olanzapine, an antipsychotic drug with antagonistic effects on various receptors (eg, dopamine and serotonin receptors),14 is increasingly regarded as a component of antiemetic prophylaxis for patients receiving anticancer treatment.
 
In an attempt to identify the best antiemetic regimen, Yokoe et al8 conducted a meta-analysis of randomised trials that tested various antiemetic regimens. The results indicated that olanzapine-based regimens demonstrated the best efficacy. Specifically, olanzapine in combination with an NK1RA, a 5HT3RA, and dexamethasone exhibited the greatest efficacy; other olanzapine-containing regimens (consisting of a 5HT3RA and dexamethasone) were also superior to regimens that lacked olanzapine. Moreover, even in the presence of earlier NK1RAs (eg, aprepitant, fosaprepitant, or rolapitant), regimens lacking olanzapine remained inferior.
 
Similar to the findings with olanzapine, Yokoe et al8 reported that triplet antiemetics involving NEPA were superior to conventional NK1RAs (eg, aprepitant, fosaprepitant, or rolapitant). Furthermore, Zhang et al15 directly compared NEPA-based antiemetic regimens with aprepitant-based triplet regimens in a randomised study that involved 800 patients who underwent administration of a cisplatin-containing regimen. Their results revealed that patients receiving NEPA and dexamethasone exhibited similar control of CINV, compared with patients receiving aprepitant, granisetron, and dexamethasone; however, NEPA-treated patients had a significantly lower requirement for rescue therapy. Additionally, in a recent study focused on patients with breast cancer who were undergoing AC chemotherapy, patients who received NEPA and dexamethasone demonstrated significantly higher rates of complete response, complete protection, and total control with enhanced QOL, compared to historical controls who received aprepitant, ondansetron, and dexamethasone; these benefits persisted over multiple cycles of chemotherapy.10
 
To our knowledge, no study has directly compared olanzapine- and NEPA-containing regimens. Using an indirect comparison approach, the present study showed that the olanzapine-based regimen had higher rates of ‘no use of rescue therapy’ and ‘no significant nausea’ in cycle 1 of AC, compared to the NEPA-based regimen. In contrast, assessments in subsequent cycles revealed that the NEPA-based regimen led to higher rates of total control in the acute phase (cycles 2 and 4) and the overall phase (cycles 3 and 4). The lack of difference in QOL between the two groups of patients may be related to the difference in adverse-effect profiles of the antiemetics used. For instance, the continued use of dexamethasone on days 2-3 in the NEPA group may have affected QOL among those patients because of its effects on mood, insomnia, gastrointestinal symptoms, and metabolic profiles.16 Indeed, a recent meta-analysis showed that, among patients receiving AC or moderately emetogenic chemotherapy, 3 days of dexamethasone did not provide additional benefit compared to 1 day of the agent.17 However, olanzapine has been associated with sedation and somnolence.18 Thus, after the completion of a phase 2 trial in Japan that suggested olanzapine was more effective at 5 mg than at 10 mg,19 the same group of investigators conducted a phase 3 study in which they tested the addition of daily olanzapine 5 mg to an aprepitant-based three-drug regimen; the results showed that, even at a lower dose of olanzapine, the olanzapine-containing regimen remained more efficacious than the olanzapine-free regimen for patients receiving cisplatin.20 Other adverse effects have been reported. Our analysis of olanzapine in combination with aprepitant, ondansetron and dexamethasone revealed a significantly higher incidence of grade ≥2 neutropenia in the olanzapine arm than in the standard arm, although this altered incidence was not associated with a significant difference in neutropenic fever.9 A few cases of olanzapine-induced neutropenia have been reported21; additionally, a recent randomised antiemetic study showed that patients who received an olanzapine-containing regimen had a higher frequency of severe neutropenia (without an increased incidence of neutropenic fever).22 Although the underlying mechanism remains unknown, the results of the aforementioned Japanese study20 suggest that olanzapine 5 mg could reduce the incidence of neutropenia. In contrast, in our previous trial regarding a NEPA-based regimen, we found that patients in the NEPA arm had significantly lower incidences of grade ≥2 neutropenia and neutropenic fever, compared to historical controls who received an aprepitant-based regimen.9 10
 
This study had some potential limitations. First, dexamethasone was only used for 1 day in the olanzapine-based regimen, whereas it was administered for 3 days in the NEPA-based regimen; this difference may have influenced the findings. Second, the use of data from two separate studies may have affected the generalisability of the findings because of slight variations in patient characteristics; the lack of blinding in both studies also increased the potential for patient-related reporting biases. Nonetheless, the original studies were consecutively conducted during the period from 2017 to 2019; both the data from Chinese patients enrolled in a homogenous group with early-stage breast cancer who were receiving (neo)adjuvant AC chemotherapy and the present analysis were analysed based on individual patient data. These factors support the validity of our comparison approach.
 
Conclusion
In conclusion, the present findings do not conclusively support the superiority of either the olanzapine-based regimen or the NEPA-based regimen in terms of antiemetic efficacy or QOL among patients with breast cancer who are receiving AC. Our previous study demonstrated that aprepitant has a limited effect when used with a 5HT3RA and dexamethasone23; we also found that NEPA was superior to aprepitant.10 Overall, the available data suggest that olanzapine-containing antiemetic regimens can be used without aprepitant, particularly when seeking to reduce medical expenses. Moreover, the available data support the previous conclusion that, in parts of the world where socio-economic limitations restrict the availability of NK1RAs, the use of olanzapine combined with a 5HT3RA and dexamethasone may be an effective low-cost alternative antiemetic regimen.8 24 Antiemetic efficacy may be enhanced if NEPA is administered in combination with dexamethasone and olanzapine as a four-drug antiemetic regimen; however, the efficacy of an olanzapine plus NEPA regimen in terms of controlling CINV should be confirmed in a trial setting.
 
Author contributions
Concept or design: W Yeo.
Acquisition of data: FKF Mo, W Yeo.
Analysis or interpretation of data: W Yeo, CCH Yip, FKF Mo.
Drafting of the manuscript: W Yeo, CCH Yip.
Critical revision of the manuscript for important intellectual content: L Li, TKH Lau, VTC Chan, CCH Kwok, JJS Suen, FKF Mo.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
W Yeo has been involved in the Chemotherapy-Induced Nausea and Vomiting (CINV) Network in Asia and has provided lectures on CINV at events organised by Mundipharma International Limited, which supported the design of the NEPA study10 analysed in this post-hoc analysis but had no role in the present comparative analysis, data collection and analysis, decision to publish, or preparation of the manuscript.
 
Acknowledgement
We thank Ms Dong KT Lai, Ms Elizabeth Pang, Ms Vivian Chan, and Ms Maggie Cheung of the Department of Clinical Oncology, The Chinese University of Hong Kong for their support in contributing to patient enrolment and study monitoring.
 
Declaration
Data from this study were presented at the European Society of Medical Oncology Asia Virtual Congress 2020 on 27 November 2020.
 
Funding/support
This study was supported by an education grant from Madam Diana Hon Fun Kong Donation for Cancer Research (Grant No.: CUHK Project Code 7104870). The Donation had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
 
Ethics approval
The studies examined in this post-hoc analysis were approved by The Joint Chinese University of Hong Kong–New Territories East Cluster Institution Review Board of The Chinese University of Hong Kong and the Hong Kong Hospital Authority, and the Kowloon West Cluster Research Ethics Committee of the Hong Kong Hospital Authority (Ref No.: CREC 2016.013, CREC 2017.1609 and KW/FR-18-019[119-19]). All patient data in this study were anonymous and were based on the abovementioned reported studies. There was no additional work on retrieving patient records in this study.
 
References
1. Early Breast Cancer Trialists’ Collaborative Group (EBCTCG); Peto R, Davies C, et al. Comparisons between different polychemotherapy regimens for early breast cancer: meta-analyses of long-term outcome among 100,000 women in 123 randomised trials. Lancet 2012;379:432-44. Crossref
2. National Comprehensive Cancer Network. NCCN Clinical Practice Guidelines in Oncology, Antiemesis Version 1; 2019.
3. Herrstedt J, Roila F, Warr D, et al. 2016 updated MASCC/ESMO consensus recommendations: prevention of nausea and vomiting following high emetic risk chemotherapy. Support Care Cancer 2017;25:277-88. Crossref
4. Hesketh PJ, Kris MG, Basch E, et al. Antiemetics: American Society of Clinical Oncology Clinical Practice Guideline update. J Clin Oncol 2017;35:3240-61. Crossref
5. Popovic M, Warr DG, Deangelis C, et al. Efficacy and safety of palonosetron for the prophylaxis of chemotherapy-induced nausea and vomiting (CINV): a systematic review and meta-analysis of randomized controlled trials. Support Care Cancer 2014;22:1685-97. Crossref
6. Thomas AG, Stathis M, Rojas C, Slusher BS. Netupitant and palonosetron trigger NK1 receptor internalization in NG108-15 cells. Exp Brain Res 2014;232:2637-44. Crossref
7. Stathis M, Pietra C, Rojas C, Slusher BS. Inhibition of substance P-mediated responses in NG108-15 cells by netupitant and palonosetron exhibit synergistic effects. Eur J Pharmacol 2012;689:25-30. Crossref
8. Yokoe T, Hayashida T, Nagayama A, et al. Effectiveness of antiemetic regimens for highly emetogenic chemotherapy-induced nausea and vomiting: a systematic review and network meta-analysis. Oncologist 2019;24:e347-57. Crossref
9. Yeo W, Lau TK, Li L, et al. A randomized study of olanzapine-containing versus standard antiemetic regimens for the prevention of chemotherapy-induced nausea and vomiting in Chinese breast cancer patients. Breast 2020;50:30-8. Crossref
10. Yeo W, Lau TK, Kwok CC, et al. NEPA efficacy and tolerability during (neo)adjuvant breast cancer chemotherapy with cyclophosphamide and doxorubicin. BMJ Support Palliat Care 2022;12:e264-70.
11. National Cancer Institute. Cancer Therapy Evaluation Program. 2021. Available from: https://ctep.cancer.gov/protocoldevelopment/electronic_applications/ctc.htm#ctc_40. Accessed 3 Feb 2023.
12. Martin AR, Pearson JD, Cai B, Elmer M, Horgan K, Lindley C. Assessing the impact of chemotherapy-induced nausea and vomiting on patients’ daily lives: a modified version of the Functional Living Index-Emesis (FLIE) with 5-day recall. Support Care Cancer 2003;11:522-7. Crossref
13. Griffin AM, Butow PN, Coates AS, et al. On the receiving end. V: patient perceptions of the side effects of cancer chemotherapy in 1993. Ann Oncol 1996;7:189-95. Crossref
14. Bymaster FP, Nelson DL, DeLapp NW, et al. Antagonism by olanzapine of dopamine D1, serotonin2, muscarinic, histamine H1 and alpha 1-adrenergic receptors in vitro. Schizophr Res 1999;37:107-22. Crossref
15. Zhang L, Lu S, Feng J, et al. A randomized phase III study evaluating the efficacy of single-dose NEPA, a fixed antiemetic combination of netupitant and palonosetron, versus an aprepitant regimen for prevention of chemotherapy-induced nausea and vomiting (CINV) in patients receiving highly emetogenic chemotherapy (HEC). Ann Oncol 2018;29:452-8. Crossref
16. Roila F, Ruggeri B, Ballatori E, et al. Aprepitant versus metoclopramide, both combined with dexamethasone, for the prevention of cisplatin-induced delayed emesis: a randomized, double-blind study. Ann Oncol 2015;26:1248-53. Crossref
17. Okada Y, Oba K, Furukawa N, et al. One-day versus three-day dexamethasone in combination with palonosetron for the prevention of chemotherapy-induced nausea and vomiting: a systematic review and individual patient data-based meta-analysis. Oncologist 2019;24:1593-600. Crossref
18. Navari RM, Qin R, Ruddy KJ, et al. Olanzapine for the prevention of chemotherapy-induced nausea and vomiting. N Engl J Med 2016;375:134-42. Crossref
19. Abe M, Hirashima Y, Kasamatsu Y, et al. Efficacy and safety of olanzapine combined with aprepitant, palonosetron, and dexamethasone for preventing nausea and vomiting induced by cisplatin-based chemotherapy in gynecological cancer: KCOG-G1301 phase II trial. Support Care Cancer 2016;24:675-82. Crossref
20. Hashimoto H, Abe M, Tokuyama O, et al. Olanzapine 5 mg plus standard antiemetic therapy for the prevention of chemotherapy-induced nausea and vomiting (J-FORCE): a multicentre, randomised, double-blind, placebo-controlled, phase 3 trial. Lancet Oncol 2020;21:242-9. Crossref
21. Malhotra K, Vu P, Wang DH, Lai H, Faziola LR. Olanzapine-induced neutropenia. Ment Illn 2015;7:5871. Crossref
22. Gjafa E, Ng K, Grunewald T, et al. Neutropenic sepsis rates in patients receiving bleomycin, etoposide and cisplatin chemotherapy using olanzapine and reduced doses of dexamethasone compared to a standard antiemetic regimen. BJU Int 2021;127:205-11. Crossref
23. Yeo W, Mo FK, Suen JJ, et al. A randomized study of aprepitant, ondansetron and dexamethasone for chemotherapy-induced nausea and vomiting in Chinese breast cancer patients receiving moderately emetogenic chemotherapy. Breast Cancer Res Treat 2009;113:529-35. Crossref
24. Babu G, Saldanha SC, Kuntegowdanahalli Chinnagiriyappa L, et al. The efficacy, safety, and cost benefit of olanzapine versus aprepitant in highly emetogenic chemotherapy: a pilot study from South India. Chemother Res Pract 2016;2016:3439707. Crossref

Chest computed tomography analysis of lung sparing morphology: differentiation of COVID-19 pneumonia from influenza pneumonia and bacterial pneumonia using the arched bridge and vacuole signs

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Chest computed tomography analysis of lung sparing morphology: differentiation of COVID-19 pneumonia from influenza pneumonia and bacterial pneumonia using the arched bridge and vacuole signs
Tiffany Y So, FRANZCR1; Simon CH Yu, FRCR1; WT Wong, FRCR2; Jeffrey KT Wong, FRCR1; Heather Lee, FRCR3; YX Wang, MMed, PhD1
1 Department of Imaging and Interventional Radiology, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong
2 Department of Anaesthesia and Intensive Care, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong
3 Department of Diagnostic Radiology, Princess Margaret Hospital, Hong Kong
 
Corresponding author: Prof YX Wang (yixiang_wang@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study evaluated the arched bridge and vacuole signs, which constitute morphological patterns of lung sparing in coronavirus disease 2019 (COVID-19), then examined whether these signs could be used to differentiate COVID-19 pneumonia from influenza pneumonia or bacterial pneumonia.
 
Methods: In total, 187 patients were included: 66 patients with COVID-19 pneumonia, 50 patients with influenza pneumonia and positive computed tomography findings, and 71 patients with bacterial pneumonia and positive computed tomography findings. Images were independently reviewed by two radiologists. The incidences of the arched bridge sign and/or vacuole sign were compared among the COVID-19 pneumonia, influenza pneumonia, and bacterial pneumonia groups.
 
Results: The arched bridge sign was much more common among patients with COVID-19 pneumonia (42/66, 63.6%) than among patients with influenza pneumonia (4/50, 8.0%; P<0.001) or bacterial pneumonia (4/71, 5.6%; P<0.001). The vacuole sign was also much more common among patients with COVID-19 pneumonia (14/66, 21.2%) than among patients with influenza pneumonia (1/50, 2.0%; P=0.005) or bacterial pneumonia (1/71, 1.4%; P<0.001). The signs occurred together in 11 (16.7%) patients with COVID-19 pneumonia, but they did not occur together in patients with influenza pneumonia or bacterial pneumonia. The arched bridge and vacuole signs predicted COVID-19 pneumonia with respective specificities of 93.4% and 98.4%.
 
Conclusion: The arched bridge and vacuole signs are much more common in patients with COVID-19 pneumonia and can help differentiate COVID-19 pneumonia from influenza and bacterial pneumonia.
 
 
New knowledge added by this study
  • On computed tomography, the arched bridge sign is characterised by ground-glass opacities or consolidation with an arched margin outlining unaffected lung parenchyma. The vacuole sign refers to a focal oval or round lucent area (typically <5 mm) that is present within ground-glass opacities or sites of consolidation.
  • These signs were commonly observed in patients with coronavirus disease 2019 (COVID-19) in Hong Kong, consistent with data from other populations.
  • Patients with COVID-19 pneumonia are much more likely to exhibit the arched bridge sign and/or the vacuole sign, compared with patients who have influenza pneumonia or bacterial pneumonia.
Implications for clinical practice or policy
  • The presence of the arched bridge sign and/or the vacuole sign on computed tomography may support a diagnosis of COVID-19 pneumonia and assist in differentiation from other types of pneumonia.
  • The duration of total hospitalisation did not differ between patients with COVID-19 pneumonia who had and did not have these two signs, suggesting that they do not indicate a better or worse prognosis if appropriate treatments are administered.
 
 
Introduction
A diagnosis of coronavirus disease 2019 (COVID-19) is made on the basis of epidemiological and clinical history, as well as the results of severe acute respiratory syndrome coronavirus 2 real-time reverse transcriptase polymerase chain reaction (RT-PCR) testing. Chest computed tomography (CT) has been proposed as a useful alternative investigation method for COVID-19 diagnosis or triage, particularly in healthcare settings with restricted access to RT-PCR testing and in the context of lower RT-PCR sensitivity during early stages of the disease; it may also be useful for imaging-mediated evaluation of disease severity and progression.1 2 The most common CT findings in early-stage COVID-19 pneumonia (illness days 0-5) are pure ground-glass opacities (GGOs); the second most common finding is consolidation.3 4 In the later stages (illness days 6-17), findings usually evolve to a combination of GGOs, consolidation, and reticular opacities with architectural distortion.4 These imaging features are not specific to COVID-19 pneumonia; they can overlap with other types of viral or bacterial pneumonia, particularly influenza pneumonia, as well as other non-infectious inflammatory lung diseases.5 6 Influenza, one of the most common causes of viral pneumonia,7 and bacterial pneumonia, historically the most common type of community-acquired pneumonia worldwide,8 maintained high incidences during the early COVID-19 pandemic when this study was conducted; thus, they had the potential to substantially contribute to hospitalisations in this period. However, COVID-19 pneumonia and other types of viral or bacterial pneumonia distinctly differ in terms of their disease course, temporal progression, and available therapeutics9 10 11; thus, there is a need for early and accurate differentiation among these entities.
 
Studies in 2020 revealed several CT imaging features that can aid in differential diagnosis. Compared with influenza pneumonia, patients with COVID-19 pneumonia are more likely to exhibit a peripheral distribution,12 13 14 patchy combination of GGOs and consolidation,15 fine reticular opacities,16 and vascular thickening or enlargement16 17; patients with influenza pneumonia are more likely to exhibit nodules,18 tree-in-bud sign,18 bronchial wall thickening,15 lymphadenopathy,16 and pleural effusions.12 In the past, diffuse airspace consolidation, centrilobular nodules, bronchial wall thickening, and mucous impaction19 have been identified as typical signs of bacterial pneumonia. Nevertheless, CT assessment of COVID-19 generally remains challenging, with reported accuracies for radiologists ranging from 60 to 83%16 in terms of differentiating patients with COVID-19 pneumonia from patients with influenza pneumonia; considering these rates, further studies of relevant imaging findings are needed.
 
A report by Wu et al20 highlighted the arched bridge sign, which may be a distinct CT feature of COVID-19 pneumonia. In their analysis of 11 patients with COVID-19 pneumonia, the sign was present in 72.7%.20 The arched bridge sign refers to a specific pattern of GGOs or consolidation, commonly in a subpleural location, which forms an arched contour with a smooth concave margin towards the pleural side. The arched margin outlines the spared parenchyma between the GGOs or consolidation and the pleural surface. Another reported sign, regarded as the vacuole sign,21 22 23 24 is presumably based on the morphological pattern of parenchymal sparing in areas of affected lung. The vacuole sign refers to a focal oval or round lucent area (typically <5 mm) that is observed within GGOs or sites of consolidation. In clinical practice, we often observed these two novel signs on CT scans of patients with COVID-19 pneumonia. We hypothesised that these two signs are common in patients with COVID-19 pneumonia and thus could be used to differentiate such pneumonia from other types of infection-related pneumonia. However, considering the limited prior evidence (solely from small retrospective studies20 21 22 23 24) regarding the prevalence of the vacuole sign in COVID-19 pneumonia, and because the arched bridge sign has—to our knowledge—only been reported in a single previous publication,20 additional assessments of these signs are needed. The utilities of the arched bridge and vacuole signs in COVID-19 pneumonia have not been directly assessed in prior reports, nor have they been compared between COVID-19 pneumonia and other types of infection-related pneumonia. In this study, we evaluated the arched bridge and vacuole signs in patients with COVID-19 pneumonia, then examined whether these signs could be used to differentiate such pneumonia from influenza pneumonia or bacterial pneumonia.
 
Methods
Patients
This retrospective study included consecutive patients who were admitted to two hospitals in Hong Kong (Prince of Wales Hospital and Princess Margaret Hospital) with RT-PCR–confirmed COVID-19, along with positive CT findings, from 24 January 2020 to 16 April 2020. These patients represent most patients with COVID-19 in Hong Kong during the study period, when all patients with confirmed COVID-19 were hospitalised regardless of clinical status; moreover, Princess Margaret Hospital also served as a centralised treatment centre for patients with COVID-19. The study recruitment period reflects the early days of the COVID-19 pandemic in Hong Kong, during which CT examinations were commonly performed during the diagnosis and treatment of patients with COVID-19. All patients with COVID-19 underwent complete PCR-based assessment of multiple respiratory pathogens on admission; patients with COVID-19 were excluded from the present study if they exhibited evidence of other concomitant viral or bacterial respiratory infections.
 
The influenza pneumonia and bacterial pneumonia comparison groups comprised consecutive patients who were admitted to Prince of Wales Hospital in Hong Kong, with pure influenza pneumonia or pure bacterial pneumonia and positive CT findings from 20 February 2018 to 13 January 2020. The diagnosis of pure influenza pneumonia was determined by RT-PCR–mediated detection of influenza A or B viral RNA, in the absence of evidence (eg, respiratory or blood cultures, PCR tests, or serological tests) suggesting concomitant infection with other viral or bacterial pathogens. The diagnosis of pure bacterial pneumonia was determined by positive bacterial culture on sputum or bronchoalveolar lavage, in the absence of evidence suggesting concomitant infection with other viral or bacterial pathogens. Patients with pre-existing lung parenchymal disease (eg, interstitial lung disease) or known lung malignancy were excluded from the study.
 
Image acquisition
Computed tomography scans were performed using 64-section multidetector scanners (LightSpeed VCT or LightSpeed Pro 32, GE Medical Systems, Milwaukee [WI], United States). The following scan parameters were used: voltage, 120 kV; tube current, 50-502 mA; and slice thickness, 0.625 mm or 1.25 mm. Scans were performed with the patient in the supine position during end-inspiration.
 
Image evaluation
All CT images were reviewed in random order by two trained radiologists (TY So and YX Wang) with 7 and 5 years of experience in diagnostic chest imaging, respectively, using a dedicated picture archiving and communication system workstation. Each radiologist was blinded to demographic and clinical information for all patients. The images were independently reviewed by each radiologist, and the consensus findings for any discrepancies from discussion are reported.
 
Each CT image was initially subjected to broad assessment of abnormalities. Subsequently, the arched bridge and vacuole signs were specifically assessed; the presence or absence of each sign was recorded. The arched bridge sign was defined as the presence of GGOs or consolidation with an arched concave margin outlining a region of spared lung; the vacuole sign was defined as the presence of a vacuole-like region of normal lung (<5 mm) within GGOs or sites of consolidation.21
 
For patients with COVID-19 pneumonia and patients with influenza pneumonia, CT findings of GGOs (hazy areas of parenchymal opacities that did not conceal underlying vessels), consolidation (parenchymal opacities that concealed underlying vessels), reticular opacities (coarse linear or curvilinear opacities, interlobular septal thickening, or subpleural reticulation), and crazy paving pattern (GGOs with interlobular and intralobular septal thickening) were recorded. Other signs such as air bronchograms (air-filled bronchi on a background of opaque lung), nodules (small rounded focal opacities <3 cm), cavitation (gas-filled spaces within sites of pulmonary consolidation), bronchiectasis, pleural retraction or thickening, pleural effusion, pericardial effusion, pneumothorax, and mediastinal lymphadenopathy (lymph nodes >1 cm in short-axis diameter) were also recorded. The distributions of pulmonary abnormalities were classified as unilateral or bilateral, and peripheral (involving mainly the peripheral one-third of the lung), central (involving mainly the central two-thirds of the lung), or diffuse (involving both peripheral and central regions). Lobar involvement was also recorded (right upper lobe, right middle lobe, right lower lobe, left upper lobe, and/or left lower lobe). For patients with bacterial pneumonia, only the arched bridge and vacuole signs were recorded. Other CT changes and their distributions were not individually recorded. This component of the analysis was determined based on reports that it is easier to differentiate COVID-19 pneumonia from bacterial pneumonia, whereas it is more difficult to differentiate COVID-19 pneumonia from other types of viral pneumonia.6 25 26 This manuscript was written in accordance with the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines for reporting observational studies.
 
Statistical analysis
Imaging findings were compared using the Chi squared test or Fisher’s exact test, as appropriate, followed by Bonferroni correction. Comparisons of disease stage, severity, and clinical course among patients with COVID-19 who had the arched bridge and/or vacuole signs were performed using the non-parametric Mann-Whitney U test. P values <0.05 were considered indicative of statistical significance. For the arched bridge and vacuole signs, the sensitivity, specificity, positive predictive value, and negative predictive value were calculated, along with the respective 95% confidence intervals. All analyses were conducted using SPSS software (Windows version 25.0; IBM Corp, Armonk [NY], United States).
 
Results
Patients
Among 76 patients with bacterial pneumonia who were admitted for treatment during the study period, five patients with pre-existing lung parenchymal disease were excluded from the analysis: organising pneumonia (n=2), non-specific interstitial pneumonia (n=1), and idiopathic interstitial pneumonia of uncertain subtype (n=2). No patients with COVID-19 required exclusion because of concomitant viral or bacterial infections. The final study population comprised 187 patients: 66 patients with COVID-19 pneumonia, 50 patients with influenza pneumonia, and 71 patients with bacterial pneumonia. The following organisms were detected in patients with bacterial pneumonia: Streptococcus pneumoniae, Staphylococcus aureus, Haemophilus influenzae, Enterococcus spp., Klebsiella pneumoniae, Pseudomonas aeruginosa, Escherichia coli, Stenotrophomonas spp., Serratia spp., Acinetobacter spp., and Moraxella catarrhalis. Demographic and clinical characteristics of the study population are shown in Table 1. Compared with patients in the influenza pneumonia and bacterial pneumonia groups, patients with COVID-19 pneumonia tended to be younger and healthier.
 

Table 1. Comparison of demographic characteristics among patients with pneumonia in Hong Kong
 
Arched bridge and vacuole signs
The arched bridge and vacuole signs were present in 42 (63.6%) and 14 (21.2%) of 66 patients with COVID-19 pneumonia, respectively (Table 2). The arched bridge sign was commonly in a subpleural location, and there was a smooth arched margin outlining the underside of the GGO or consolidation in all cases (Fig a and b). The vacuole sign was present with GGOs or sites of consolidation in various locations (Fig c and d). The arched bridge sign was much more common in patients with COVID-19 pneumonia than in patients with influenza pneumonia (63.6% vs 8.0%) or bacterial pneumonia (63.6% vs 5.6%, P<0.001). Similarly, the vacuole sign was much more common in patients with COVID-19 pneumonia than in patients with influenza pneumonia (21.2% vs 2.0%, P=0.005) or bacterial pneumonia (21.2% vs 1.4%, P<0.001).
 

Table 2. Comparison of patterns of lung sparing morphology among patients with pneumonia in Hong Kong
 

Figure. (a) The arched bridge sign. Axial computed tomography (CT) image (i) and magnified view of boxed area (ii) in a 56-year-old woman with coronavirus disease 2019 (COVID-19) pneumonia showing an arched ground-glass opacity (GGO) with a sharp underside outlining a semicircular region of spared lung. The typical subpleural location for the sign is evident. (b) Axial CT image (i) and magnified view of boxed area (ii) in a 35-year-old man with COVID-19 pneumonia showing a subpleural GGO with a sharp arched margin outlining two adjacent regions of spared lung, demonstrating a double arched bridge appearance. Other GGOs are also evident involving both central and peripheral lung parenchyma. (c) Vacuole sign. Axial CT image (i) and magnified view of boxed area (ii) in a 55-year-old woman with COVID-19 pneumonia showing a subpleural GGO with a few very small vacuole-like regions of sparing in the affected region. (d) Axial CT image (i) and magnified view of boxed area (ii) in a 56-year-old man with COVID-19 pneumonia showing a subpleural GGO with multiple very small vacuoles
 
The arched bridge and vacuole signs occurred together in 11 (16.7%) of 66 patients with COVID-19 pneumonia, but they did not occur together in any patients with influenza pneumonia or bacterial pneumonia. Additionally, a review of the five excluded patients with bacterial pneumonia and concurrent pre-existing lung parenchymal disease revealed that none of those patients exhibited the arched bridge sign or the vacuole sign.
 
In this study, the arched bridge and vacuole signs exhibited high specificities (93.4% and 98.4%, respectively) in terms of identifying COVID-19 pneumonia (Table 3), with moderate or low sensitivities (63.6% and 21.2%, respectively). They also exhibited high positive predictive values (84.0% and 87.5%, respectively) and high or moderate negative predictive values (82.5% and 69.6%, respectively).
 

Table 3. Diagnostic performances of the arched bridge and vacuole signs for coronavirus disease 2019 pneumonia
 
The relationships of the arched bridge and vacuole signs with disease course are shown in Table 4. Computed tomography was mainly performed during admission, at a mean of 5.3 days after admission, suggesting these two signs generally appeared at an early stage. Comparisons of patients with COVID-19 pneumonia who had and did not have these two signs revealed that the arched bridge sign was associated with more extensive lung involvement (diseased lobes: 4.0 [present] vs 2.4 [absent], P<0.001). This trend was not evident for the vacuole sign (diseased lobes: 3.8 [present] vs 3.3 [absent]). There was no significant difference in the duration of total hospitalisation between patients with COVID-19 who had and did not have these two signs, suggesting they were not associated with a better or worse prognosis if appropriate treatment was administered.
 

Table 4. Comparison of disease stage, severity, and clinical course among patients with coronavirus disease 2019 pneumonia according to arched bridge sign and vacuole sign statuses
 
Other computed tomography findings
Table 5 shows the comparison of other CT findings between COVID-19 pneumonia and influenza pneumonia. No significant differences were observed in the incidences of GGOs, consolidation, reticular opacities, or crazy paving between patients with COVID-19 and patients with influenza pneumonia (all P>0.05). Air bronchograms (P=0.003), nodules (P=0.009), cavitation (P=0.004), bronchiectasis (P<0.001), pleural effusion (P<0.001), pericardial effusion (P=0.032), and mediastinal lymphadenopathy (P<0.001) were significantly more common in patients with influenza pneumonia.
 

Table 5. Comparison of other computed tomography findings between patients with coronavirus disease 2019 pneumonia and patients with influenza pneumonia
 
Abnormalities were more commonly bilateral in patients with COVID-19 pneumonia (77.3%) and patients with influenza pneumonia (96%). The distribution was more likely to be peripheral in patients with COVID-19 pneumonia (51.5% vs 2.0%, P<0.001), and was more likely to be diffuse in patients with influenza pneumonia (98% vs 48.5%, P<0.001). The right upper lobe (P<0.001), right middle lobe (P<0.001), and left upper lobe (P<0.001) were less commonly involved in patients with COVID-19 pneumonia than in patients with influenza pneumonia.
 
Discussion
Arched bridge and vacuole signs
This study evaluated the incidences and diagnostic values of the arched bridge and vacuole signs among patients with COVID-19 pneumonia in a Hong Kong Chinese population. Since the initial description of Wu et al20 in a series of 11 patients with COVID-19, our study is the first to validate the arched bridge sign in patients with COVID-19. To our knowledge, this is also the first study to evaluate the vacuole sign in non–COVID-19–related pneumonia. The arched bridge sign was significantly more common in COVID-19 pneumonia than in influenza pneumonia or bacterial pneumonia. Additionally, the incidences of the vacuole sign and both signs observed in combination were higher (or tended to be higher) in patients with COVID-19 pneumonia than in patients with influenza pneumonia or bacterial pneumonia. Our results imply that these two signs generally appeared at an early stage; the arched bridge sign is more likely to be observed in patients with more severe lung pathology. These results suggest that the arched bridge and vacuole signs can be used in CT-based identification of COVID-19 pneumonia, as well as efforts to differentiate COVID-19 pneumonia from other types of infection-related pneumonia. Currently, chest CT is not recommended for the screening or diagnosis of COVID-19 pneumonia when RT-PCR tests are available. In selected cases, CT can be used to monitor clinical progress and identify complications of the disease. In some scenarios, CT can be a useful alternative investigation method for COVID-19 diagnosis or triage, such as healthcare settings with restricted access to RT-PCR tests.27 28 When these two signs are observed on CTs performed for COVID-19 pneumonia or other indications during the COVID-19 pandemic, physicians should carefully consider a diagnosis of COVID-19 pneumonia. However, our findings indicated there was no significant difference in the duration of total hospitalisation between patients with COVID-19 pneumonia who had and did not have these two signs, suggesting that they are not indicative of a better or worse prognosis if appropriate treatments are administered.
 
The underlying pathophysiological mechanisms behind these signs remain unclear. However, the morphological appearances of the arched bridge and vacuole signs may indicate different pathophysiological mechanisms of lung sparing that occur during infection-related pneumonia. Histopathological examinations of lung biopsy tissues from patients with COVID-19 pneumonia have provided evidence of variations in diffuse alveolar damage.29 30 The curved concave margin in the arched bridge sign may be the result of secondary pulmonary lobule sparing, with the interlobular septum of the secondary pulmonary nodule forming some resistance to the spread of infection among lobules.20 In contrast, the vacuole sign (ie, a very small focal lucent area) may reflect a spared alveolar cluster or dilated alveolar sac within an area of otherwise involved lung.21 23 Zhang et al23 reported that the vacuole sign was often present in patients with advanced COVID-19 pneumonia, where alveolar sac dilation could result from damage to the alveolar wall.
 
The incidence of the arched bridge sign in patients with COVID-19 (63.6%) was similar to the incidence reported by Wu et al20 (72.7%). The incidence of the vacuole sign (21.2%) in patients with COVID-19 pneumonia is also within the range reported in prior studies describing this sign (17-66%).21 22 23 24 Notably, three additional case series have revealed spared airspaces in patients with COVID-19 pneumonia, comprising ‘round cystic changes’31, ‘cystic air spaces’32 and ‘cavity signs’,33 with prevalences of 10 to 30%; these phenomena may include the vacuole sign. However, these case series did not include formal definitions of the findings. The differences in definitions of the vacuole sign (and phenomena that include the sign) may also explain the disparate prevalences (17%-66%, as noted above) reported in the literature.
 
The arched bridge and vacuole signs differentiated COVID-19 pneumonia from influenza pneumonia and bacterial pneumonia with high specificities and high positive predictive values, suggesting that these signs can help to provide a specific imaging diagnosis of COVID-19 pneumonia. When encountering inconclusive CT features of COVID-19 pneumonia, these signs can be identified with minimal additional effort; their presence may be sufficient to increase suspicion or add to the evidence confirming a diagnosis of COVID-19 pneumonia. The respective sensitivities of the arched bridge and vacuole signs were moderate (63.6%) and low (21.2%); the arched bridge sign may be more useful in this context. Our findings suggest that the combined presence of the arched bridge and vacuole signs strongly supports a diagnosis of COVID-19 pneumonia.
 
Consistent with previous studies, the presence of nodules, cavitations, bronchiectasis, pleural effusion, pericardial effusion, and/or mediastinal lymphadenopathy was uncommon in patients with COVID-19 pneumonia; these features were more common in patients with influenza pneumonia.12 16 17 18 34 35 Our results indicated that COVID-19-related abnormalities on CT were generally bilateral and peripheral, compatible with the findings in prior studies.12 13 14
 
Limitations
This study had several limitations. First, it used a retrospective design, and patients were imaged in a cross-sectional manner at various time intervals after symptom onset. Computed tomography was not regularly performed, which hindered the monitoring or analysis of imaging signs over time. Second, CT was not routinely performed for patients with influenza pneumonia or bacterial pneumonia; it was performed based on clinical judgement, generally because of patient deterioration or poor response to treatment. We did not assess differences in the clinical features of patients with influenza pneumonia and patients with bacterial pneumonia between patients who did and did not undergo CT. Third, we attempted to implement diversity in our analysis of COVID-19 pneumonia by comparisons with influenza pneumonia and bacterial pneumonia, whereas prior studies have generally been limited to comparisons of COVID-19 pneumonia with influenza pneumonia. However, we did not examine other types of viral pneumonia; we also did not conduct subgroup analysis according to influenza subtype. Additionally, we did not systematically compare the prognoses of patients with non–COVID-19 pneumonia who had and did not have the arched bridge or vacuole signs. This comparison was hindered by the sample size, because these two signs were very uncommon in patients with non–COVID-19 pneumonia. However, additional analysis did not reveal a clear pattern whereby these two signs would be predictive for clinical prognosis in patients with non–COVID-19 pneumonia. Finally, the sample size in this study was moderate. Although the prevalences of the arched bridge and vacuole signs in our patients with COVID-19 pneumonia were consistent with findings in the literature, their diagnostic specificities should be validated in other types of pneumonia. Despite these limitations, the high diagnostic specificities of these CT signs provide insights that will be useful in future studies. Additional work is needed regarding the relationships of these CT signs with clinical status, and our findings require validation in larger and more diverse patient populations.
 
Conclusion
In conclusion, two morphological patterns of lung sparing, namely the arched bridge and vacuole signs, are much more common in patients with COVID-19 pneumonia; they have the potential to differentiate COVID-19 pneumonia from influenza pneumonia and bacterial pneumonia. In this study, these signs had high specificities and positive predictive values for COVID-19 pneumonia. The identification of these signs in clinical practice may be useful for increasing suspicion or providing confirmatory evidence to support a diagnosis of COVID-19 pneumonia.
 
Author contributions
Concept and design: TY So, SCH Yu, JYX Wang.
Acquisition of data: All authors.
Analysis and interpretation of data: All authors.
Drafting of the manuscript: TY So, S Yu, JYX Wang.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
The authors have no conflicts of interest to disclose.
 
Acknowledgement
The authors thank Ms Apurva Sawhney, Department of Imaging and Interventional Radiology, The Chinese University of Hong Kong, for assistance with data collection.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study was approved by the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee (REC Ref. No.: 2020.232), which waived the requirement for informed consent due to the retrospective nature of the study. The study was conducted in compliance with the established ethical standards and principles of the Declaration of Helsinki.
 
References
1. Kim H. Outbreak of novel coronavirus (COVID-19): what is the role of radiologists? Eur Radiol 2020;30:3266-7. Crossref
2. Yang W, Sirajuddin A, Zhang X, et al. The role of imaging in 2019 novel coronavirus pneumonia (COVID-19). Eur Radiol 2020;30:4874-82. Crossref
3. Pan F, Ye T, Sun P, et al. Time course of lung changes at chest CT during recovery from coronavirus disease 2019 (COVID-19). Radiology 2020;295:715-21. Crossref
4. Wang Y, Dong C, Hu Y, et al. Temporal changes of CT findings in 90 patients with COVID-19 pneumonia: a longitudinal study. Radiology. 2020;296:E55-64. Crossref
5. Koo HJ, Lim S, Choe J, Choi SH, Sung H, Do KH. Radiographic and CT features of viral pneumonia. Radiographics 2018;38:719-39. Crossref
6. Sun Z, Zhang N, Li Y, Xu X. A systematic review of chest imaging findings in COVID-19. Quant Imaging Med Surg 2020;10:1058-79. Crossref
7. Marcos MA, Esperatti M, Torres A. Viral pneumonia. Curr Opin Infect Dis 2009;2:143-7. Crossref
8. Apisarnthanarak A, Mundy LM. Etiology of community-acquired pneumonia. Clin Chest Med 2005;26:47-55. Crossref
9. Sanders JM, Monogue ML, Jodlowski TZ, Cutrell JB. Pharmacologic treatments for coronavirus disease 2019 (COVID-19): a review. JAMA 2020;323:1824-36. Crossref
10. Wiersinga WJ, Rhodes A, Cheng AC, Peacock SJ, Prescott HC. Pathophysiology, transmission, diagnosis, and treatment of coronavirus disease 2019 (COVID-19): a review. JAMA 2020;324:782-93. Crossref
11. Zayet S, Kadiane-Oussou NJ, Lepiller Q, et al. Clinical features of COVID-19 and influenza: a comparative study on Nord Franche-Comte cluster. Microbes Infect 2020;22:481-8. Crossref
12. Lin L, Fu G, Chen S, et al. CT manifestations of coronavirus disease (COVID-19) pneumonia and influenza virus pneumonia: a comparative study. AJR Am J Roentgenol 2021;216:71-9. Crossref
13. Chung M, Bernheim A, Mei X, et al. CT imaging features of 2019 novel coronavirus (2019-nCoV). Radiology 2020;295:202-7. Crossref
14. Song F, Shi N, Shan F, et al. Emerging 2019 novel coronavirus (2019-nCoV) pneumonia. Radiology 2020;295:210-7. Crossref
15. Wang H, Wei R, Rao G, Zhu J, Song B. Characteristic CT findings distinguishing 2019 novel coronavirus disease (COVID-19) from influenza pneumonia. Eur Radiol 2020;30:4910-7. Crossref
16. Bai HX, Hsieh B, Xiong Z, et al. Performance of radiologists in differentiating COVID-19 from non–COVID-19 viral pneumonia on chest CT. Radiology 2020;296:E45-54. Crossref
17. Yin Z, Kang Z, Yang D, Ding S, Luo H, Xiao E. A comparison of clinical and chest CT findings in patients with influenza A (H1N1) virus infection and coronavirus disease (COVID-19). AJR Am J Roentgenol 2020;215:1065-71. Crossref
18. Liu M, Zeng W, Wen Y, Zheng Y, Lv F, Xiao K. COVID-19 pneumonia: CT findings of 122 patients and differentiation from influenza pneumonia. Eur Radiol 2020;30:5463-9. Crossref
19. Tanaka N, Matsumoto T, Kuramitsu T, et al. High resolution CT findings in community-acquired pneumonia. J Comput Assist Tomogr 1996;20:600-8. Crossref
20. Wu R, Guan W, Gao Z, et al. The arch bridge sign: a newly described CT feature of the coronavirus disease- 19 (COVID-19) pneumonia. Quant Imaging Med Surg 2020;10:1551-8. Crossref
21. Zhou S, Wang Y, Zhu T, Xia L. CT features of coronavirus disease 2019 (COVID-19) pneumonia in 62 patients in Wuhan, China. AJR Am J Roentgenol 2020;214:1287-94. Crossref
22. Sabri YY, Nassef AA, Ibrahim IM, Abd El Mageed MR, Khairy MA. CT chest for COVID-19, a multicenter study— experience with 220 Egyptian patients. Egypt J Radiol Nucl Med 2020;51:144. Crossref
23. Zhang L, Kong X, Li X, et al. CT imaging features of 34 patients infected with COVID-19. Clin Imaging 2020;68:226-31. Crossref
24. Zhou S, Zhu T, Wang Y, Xia L. Imaging features and evolution on CT in 100 COVID-19 pneumonia patients in Wuhan, China. Eur Radiol 2020;30:5446-54. Crossref
25. Elmokadem AH, Bayoumi D, Abo-Hedibah SA, El-Morsy A. Diagnostic performance of chest CT in differentiating COVID-19 from other causes of ground-glass opacities. Egypt J Radiol Nucl Med 2021;52:12. Crossref
26. Zheng F, Li L, Zhang X, et al. Accurately discriminating COVID-19 from viral and bacterial pneumonia according to CT images via deep learning. Interdiscip Sci 2021;13:273-85. Crossref
27. Simpson S, Kay FU, Abbara S, et al. Radiological Society of North America Expert consensus statement on reporting chest CT findings related to COVID-19. Endorsed by the Society of Thoracic Radiology, the American College of Radiology, and RSNA - Secondary Publication. J Thorac Imaging 2020;35:219-27. Crossref
28. Dennie C, Hague C, Lim RS, et al. Canadian Society of Thoracic Radiology/Canadian Association of Radiologists consensus statement regarding chest imaging in suspected and confirmed COVID-19. Can Assoc Radiol J 2020;71:470-81. Crossref
29. Bradley BT, Maioli H, Johnston R, et al. Histopathology and ultrastructural findings of fatal COVID-19 infections in Washington State: a case series. Lancet 2020;396:320-32. Crossref
30. Zhang H, Zhou P, Wei Y, et al. Histopathologic changes and SARS-CoV-2 immunostaining in the lung of a patient with COVID-19. Ann Intern Med 2020;172:629-32. Crossref
31. Shi H, Han X, Jiang N, et al. Radiological findings from 81 patients with COVID-19 pneumonia in Wuhan, China: a descriptive study. Lancet Infect Dis 2020;20:425-34. Crossref
32. Rodrigues RS, Barreto MM, Werberich GM, Marchiori E. Cystic airspaces associated with COVID-19 pneumonia. Lung India 2020;37:551-3. Crossref
33. Kong W, Agarwal PP. Chest imaging appearance of COVID-19 infection. Radiol Cardiothorac Imaging 2020;2:e200028. Crossref
34. Salehi S, Abedi A, Balakrishnan S, Gholamrezanezhad A. Coronavirus disease 2019 (COVID-19): a systematic review of imaging findings in 919 patients. AJR Am J Roentgenol 2020;215:87-93. Crossref
35. Xu Z, Pan A, Zhou H. Rare CT feature in a COVID-19 patient: cavitation. Diagn Interv Radiol 2020;26:380-1. Crossref

Fracture incidence and fracture-related mortality decreased with decreases in population mobility during the early days of the COVID-19 pandemic: an epidemiological study

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Fracture incidence and fracture-related mortality decreased with decreases in population mobility during the early days of the COVID-19 pandemic: an epidemiological study
Janus SH Wong, MB, BS, MRCSEd1; Christian X Fang, FRCSEd, FHKCOS1; Alfred LH Lee, MB, BS, MRCP2; Dennis KH Yee, FRCSEd, FHKCOS3; Kenneth MC Cheung, FRCS (Eng), FHKCOS1; Frankie KL Leung, FRCSEd, FHKCOS1
1 Department of Orthopaedics and Traumatology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong
2 Department of Microbiology, Prince of Wales Hospital, Hong Kong
3 Department of Orthopaedics and Traumatology, Alice Ho Miu Ling Nethersole Hospital, Hong Kong
 
Corresponding author: Dr Christian X Fang (cfang@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: We investigated the impact of coronavirus disease 2019 (COVID-19) social distancing measures on fracture incidence and fracture-related mortality, as well as associations with population mobility.
 
Methods: In total, 47 186 fractures were analysed across 43 public hospitals from 22 November 2016 to 26 March 2020. Considering the smartphone penetration of 91.5% in the study population, population mobility was quantified using Apple Inc’s Mobility Trends Report, an index of internet location services usage volume. Fracture incidences were compared between the first 62 days of social distancing measures and corresponding preceding epochs. Primary outcomes were associations between fracture incidence and population mobility, quantified by incidence rate ratios (IRRs). Secondary outcomes included fracture-related mortality rate (death within 30 days of fracture) and associations between emergency orthopaedic healthcare demand and population mobility.
 
Results: Overall, 1748 fewer fractures than projected were observed during the first 62 days of COVID-19 social distancing (fracture incidence: 321.9 vs 459.1 per 100 000 person-years, P<0.001); the relative risk was 0.690, compared with mean incidences during the same period in the previous 3 years. Population mobility exhibited significant associations with fracture incidence (IRR=1.0055, P<0.001), fracture-related emergency department attendances (IRR=1.0076, P<0.001), hospital admissions (IRR=1.0054, P<0.001), and subsequent surgery (IRR=1.0041, P<0.001). Fracture-related mortality decreased from 4.70 (in prior years) to 3.22 deaths per 100 000 person-years during the COVID-19 social distancing period (P<0.001).
 
Conclusion: Fracture incidence and fracture-related mortality decreased during the early days of the COVID-19 pandemic; they demonstrated significant temporal associations with daily population mobility, presumably as a collateral effect of social distancing measures.
 
 
New knowledge added by this study
  • A significant reduction in fracture incidence was observed during the early days of the coronavirus disease 2019 pandemic.
  • Daily fracture incidence was temporally associated with population mobility.
Implications for clinical practice or policy
  • Data regarding population mobility could facilitate estimation of fracture incidence and be used (along with many other factors) to estimate clinical service demand for timely management of public health responses involving changes in population mobility.
  • As digital literacy increases, population digital usage patterns could support epidemiological investigations and address gaps in conventional data sources.
 
 
Introduction
The coronavirus disease 2019 (COVID-19) pandemic, which began in early 2020, has resulted in unprecedented large-scale public health responses. Stringent regional social distancing measures (eg, quarantine, school closures, and restrictions at work and recreation destinations) were rapidly implemented during the early days of the pandemic as forms of non-pharmacological intervention.1 Although there is evidence that such measures can temporarily contain the spread of severe acute respiratory syndrome coronavirus 2,2 collateral effects among non–COVID-19–related conditions have also been reported.3 Trauma is the leading cause of death and disability among young adults worldwide,4 but the effects of the COVID-19 pandemic on injuries and fracture incidence within Hong Kong have not been fully elucidated. This uncertainty has hindered healthcare resource deployment and clinical service demand estimation in times of stringency. We sought to address this problem using ‘big data’ sources and regional clinical data repositories, which allow researchers to rapidly delineate epidemiological associations with potential applications in forecasting models, while avoiding resource-intensive collection of conventional epidemiological information and protecting patient anonymity.
 
We presumed that restrictions on citizen mobility, in concert with social distancing, were associated with reductions in musculoskeletal injuries during the early days of the COVID-19 pandemic. Specifically, we hypothesised that reduced population mobility was associated with reductions in fracture incidence and fracture-related healthcare needs during the early days of the pandemic. We investigated these relationships by analysing daily multicentre hospital data registries in Hong Kong, along with digital population mobility datasets published by a technology company. Our main outcome measurement was skeletal fractures, which served as a specific surrogate for musculoskeletal trauma.
 
Methods
Data collection
This study was conducted in Hong Kong, a highincome region (with gross domestic product per capita of HK$357 667 in 20205) that was among the first areas affected by COVID-19; social distancing measures were implemented during the early days of the pandemic.
 
Using the Clinical Data Analysis and Reporting System of the Hospital Authority, anonymised patient records were retrieved from all 43 public hospitals in Hong Kong for the period from 22 November 2016 to 20 May 2020. In Hong Kong, up to 90% of hospital bed-days occur in public hospitals, which manage nearly all critical emergencies in the region.6 Anonymised clinical data were retrieved, including time of initial injury presentation, emergency department triage, trauma category, hospital admission, diagnosis, and surgical procedures. Diagnoses and procedures were encoded in accordance with the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) by treating physicians based on clinical and radiological investigations, intraoperative findings, and date of hospital discharge. The ICD-9-CM codes that met the inclusion criteria (which included fractures under the purview or commonly admitted under the care of orthopaedic and traumatology service) were all codes from 805 to 829 (inclusive). Duplicate records from fracture reassessment related to follow-up attendances, hospitalisation after emergency department attendance, and elective hospital re-admissions (ie, episodes assigned to the same patient unique identifier with identical diagnostic codes, which occurred within 30 days of the index episode) were regarded as a single event to avoid double counting. Pathological fractures and records with missing diagnosis codes or admission times were excluded from the analysis.
 
Time intervals
The ‘COVID-19 epoch’ was defined as 25 January 2020 (activation of the government’s ‘emergency’ response and commencement of social distancing policies7) to 26 March 2020; this arbitrarily chosen 62-day period included all patients with fractures who presented during that period. This epoch was compared with the 9 weeks preceding the onset of the COVID-19 pandemic (ie, 22 November 2019 to 24 January 2020), as well as the same period over the past 3 years to adjust for seasonality-related variations8 (ie, 25 January to 26 March in 2017, 2018, and 2019). Differences between actual and projected daily fracture incidences were calculated based on mean values at the same time of year over the past 3 years. Fracture-related mortality rates, defined as the numbers of deaths within 30 days after initial fracture presentation per 100 000 person-years, were compared. The Chi squared test was used to detect differences in fracture incidence and fracture-related mortality during the COVID-19 pandemic and pre-pandemic epochs.
 
Quantifying population mobility
Surrogate data concerning population mobility were retrieved from Mobility Trends Reports9—an aggregate daily measure of geographical direction requests on Apple Maps, a service established by Apple Inc, which holds the largest market share of electronic mobile devices (including smartphones and tablets) in Hong Kong.10 Walking index was regarded as an index of population mobility, considering the smartphone penetration of 91.5%11 among the 7.50 million residents of Hong Kong.12
 
Data analysis
Associations between daily fracture incidence and population mobility were determined by incidence rate ratios (IRRs) using quasi-Poisson regression. Secondary analysis involved associations between mobility index and fracture repair surgeries, all types of orthopaedic emergency department attendances, orthopaedic hospital admissions, and emergency orthopaedic surgeries.
 
Because medical records were timestamped in Hong Kong time (8 hours ahead of Greenwich Mean Time), they were converted to Pacific Time to match the time intervals listed in Mobility Trends Reports; this conversion ensured that data were temporally matched for analysis.
 
To determine whether mobility associations simply reflected health-seeking behaviour, we included analyses of diseases which lacked a physiological basis and were not associated with population mobility—these ‘controls’ included appendicitis, cellulitis, and abscess (ICD-9-CM diagnosis codes 540 and 682). Statistical analysis was performed using R software, version 3.6.2 (R Foundation for Statistical Computing, Vienna, Austria). Quasi-Poisson regression was used to model the relationship between the population mobility index and the daily incidences of fractures and fracture-related events; the population mobility index was the explanatory variable, whereas the
 
daily incidences of various events were response variables. A quasi-Poisson distribution was preferred over a Poisson distribution, considering the presence of significant overdispersion among some response variables (in the form of count data) when a dispersion parameter was included. In accordance with standard statistical methods, the natural logarithm was utilised as the link function. Incidence rate ratios were reported and represented by the following formula:
 
Estimated incidence = IRRPMI × BIR
 
where IRR represents the incidence rate ratio, PMI represents the population mobility index, and BIR represents the baseline incidence rate. The IRR, which quantifies the relationship between the mobility index and fracture incidence, is multiplicative in nature—for every unit increase in the mobility index, there is a corresponding multiplicative increase in the IRR. If the IRR is <1, it is expected to decrease in a multiplicative manner for every unit decrease in the mobility index. Multiple comparisons were adjusted by Bonferroni correction, and the threshold for statistical significance was regarded as P<0.00227 (0.05/22).
 
Results
In total, 59 931 fracture-related medical records from orthopaedic emergency department attendances, hospital admissions, and surgeries were reviewed. After exclusion of 11 498 linked episodes, 284 pathological fractures, 786 follow-up attendances, 175 hospital re-admissions, and two episodes with missing admission times, 47 186 fractures were included in the analysis. Descriptive statistics regarding daily fracture incidences, controls, and fracture-related surgeries during COVID-19 social distancing are shown in Table 1. Intra-year and inter-year comparison cohorts are presented in Table 2.
 

Table 1. Incidences of fractures and surgeries during the early days of the coronavirus disease 2019 social distancing (25 January to 26 March 2020)
 

Table 2. Incidences of fractures before and during the early days of the coronavirus disease 2019 pandemic
 
Fracture incidence during COVID-19 social distancing
A reduction of 1748 fractures in the actual versus projected incidence (321.9 vs 459.1 per 100 000 person-years, P<0.001) was observed during the COVID-19 epoch; the relative risk was 0.690 (95% confidence interval [CI]=0.678-0.702), compared with mean incidences in the previous 3 years (ie, inter-year cohort) [Table 2]. Differences in fracture incidence between the pandemic and pre-pandemic epochs are shown in Figure 1.
 

Figure 1. Daily fracture incidences (triangles) before (22 November 2019 to 24 January 2020) and during (25 January to 26 March 2020) the early days of coronavirus disease 2019 social distancing, with comparison to the same period in the previous 3 years (dots in different shades of grey). There were 1748 fewer fractures than projected
 
Fracture incidences, population mobilities, and controls are depicted in Figure 2. The first two COVID-19 cases in Hong Kong were reported on 23 January 202013; three additional cases were reported on 24 January 2020. Social distancing measures were implemented on 25 January 2020; these included suspension of schools, initiation of ‘work from home’ measures among civil servants, and suspension of hospital visitations. Mandatory border quarantine was enforced on 8 February 2020. The sharpest decrease in mobility was observed on 24 January 2020; population mobility subsequently remained at low levels, in conjunction with cancellations of large-scale social and sporting events, as well as the imposition of travel restrictions with quarantine measures for returning travellers.7
 

Figure 2. Fracture incidences, population mobilities (based on mobility index data from Apple Inc’s Mobility Trend Reports), and controls over time in the study population. The largest decrease in population mobility coincided with the first confirmed case of coronavirus disease 2019 in Hong Kong
 
Associations of fracture incidence with population mobility
Fracture incidence was positively associated with the population mobility index (IRR=1.0055, 95% CI=1.0044-1.0066, P<0.001). Analyses of fracture incidence according to anatomical location revealed associations of the population mobility index with upper limb fractures (IRR=1.0073, 95% CI=1.0057-1.0088, P<0.001) and lower limb fractures (IRR=1.0045, 95% CI=1.0030-1.0060, P<0.001) [Fig 3].
 

Figure 3. Associations of fracture incidence with population mobility. Fracture incidence was associated with mobility index according to quasi-Poisson regression, with incidence rate ratios of 1.0055 (95% confidence interval [CI]=1.0044-1.0066) for all fractures, 1.0073 (95% CI=1.0057-1.0088) for upper limb fractures, and 1.0045 (95% CI=1.0030-1.0060) for lower limb fractures (all P<0.001)
 
The population mobility index was associated with the incidences of fractures involving the radius and ulna (IRR=1.0079, 95% CI=1.0057-1.0101, P<0.001), hand and fingers (IRR=1.0069, 95% CI=1.0039-1.0098, P<0.001), femoral neck (IRR=1.0065, 95% CI=1.0035-1.0095, P<0.001), and tibia and fibula (IRR=1.0097, 95% CI=1.0044-1.0151, P<0.001) [Fig 4]. However, after Bonferroni correction, the population mobility index did not exhibit statistically significant associations with trochanteric hip fractures (IRR=1.0008, P=0.683), spine fractures (IRR=0.996, P=0.183), or pelvic fractures (IRR=1.0064, P=0.00799).
 

Figure 4. Incidence rate ratios indicating relationships between fracture incidence and population mobility index. Incidence rate ratios of fractures are grouped according to anatomical locations with 95% confidence intervals indicated on each bar. Bars in dark grey and asterisks in y-axis labels indicate statistically significant associations (P<0.00227). Note ‘control groups’ of diseases in grey, which were included to investigate possibility of confounding between mobility index and disease incidence by alterations in health-seeking behaviour; no statistically significant associations were present in these groups
 
Stronger associations were observed among fractures, such that some patients presented at a younger age (eg, patients with tibia, fibula, hand, and finger fractures), whereas other patients presented at an older age (eg, patients with femoral neck fractures). Digital literacy, manual dexterity and visual acuity, and higher internet and smartphone usage among younger residents11 are among the factors that cause the population mobility index to have increased sensitivity for analysis in such age-groups.
 
The incidences of cellulitis, abscesses, and appendicitis were not associated with the population mobility index (P>0.00227). These findings support the hypothesis that changes in associations between fracture incidence and population mobility were not solely caused by changes in health-seeking behaviour; if they had been caused by changes in such behaviour, corresponding reductions in those conditions would have been observed.
 
Secondary exploratory analysis of surgeries, emergency department attendances, and hospital admissions
The daily population mobility index was associated with the number of patients admitted on a particular day who subsequently underwent fracture repair surgeries (IRR=1.0041, 95% CI=1.0020-1.0062, P<0.001). The population mobility index was also associated with all types of emergency orthopaedic surgeries (IRR=1.0040, 95% CI=1.0021-1.0058, P<0.001), attendances at orthopaedic emergency departments (IRR=1.0076, 95% CI 1.0064-1.0087, P<0.001), and emergency orthopaedic hospital admissions (IRR=1.0054, 95% CI=1.0043-1.0064, P<0.001). Additionally, the numbers of orthopaedic patients triaged as critical, emergent, and urgent (ie, patients who require physician attention within 30 minutes of attendance) were also associated with the population mobility index (IRR=1.0063, 95% CI=1.0054-1.0073, P<0.001). Whereas the numbers of traffic-related and sports-related trauma cases were associated with the population mobility index (IRR=1.008, 95% CI=1.0063-1.0097 and IRR=1.013, 95% CI=1.0092-1.0158, respectively, both P<0.001), the number of assault-related trauma cases was not (P=0.238).
 
Fracture-related mortality rate
Forty-nine patients with fractures died within 30 days of presentation during the COVID-19 epoch. This constituted a mortality rate of 3.22 deaths per 100 000 person-years, which was lower than the rate of 4.70 deaths per 100 000 person-years during the period before the pandemic (P<0.001); thus, there were around 19 fewer fracture-related deaths in the Hong Kong population during the 62-day study period. Four patients with fractures had COVID-19 (ie, they had positive results in nasopharyngeal swab reverse transcriptase-polymerase chain reaction tests for severe acute respiratory syndrome coronavirus 2) and survived beyond 30 days after initial fracture presentation. The change in mortality was presumably explained by reduced fracture incidence: 30-day mortality among patients with fractures did not significantly differ between the COVID-19 epoch (1.2%, 49 deaths in 4101 patients) and the preceding period (1.0%, 175 deaths in 17 198 patients) [P=0.305].
 
Discussion
This study analysed 47 186 fractures in Hong Kong, prior to and during the early days of the COVID-19 pandemic. Population mobility was assessed through aggregate digital footprints using the volume of location service requests as a surrogate marker, considering the high smartphone and internet penetration in Hong Kong11; importantly, datasets of aggregate digital footprints have been published to facilitate efforts to control COVID-19.9 The findings support our hypothesis in terms of the relationship between fracture incidence and population mobility.
 
Fractures incur substantial healthcare costs; for example, fragility fracture-related costs incurred costs of 37.5 billion euros, along with the loss of 1.0 million quality-adjusted life years, among the six largest European countries in 2017.14 Some fractures (eg, hip fractures) warrant early surgical management to mitigate the morbidity and mortality associated with surgical delays.15 Guidance regarding early surgical management remained in effect, even during the early days of the COVID-19 pandemic.16 Despite the best available tools, fracture prediction remains difficult; there are additional challenges associated with epidemiological projections of specific time points when such fractures occur. Accordingly, hospitals and public health entities experience difficulties in terms of estimating emergency trauma service load and allocating limited healthcare resources. Our findings suggest that population mobility indices, which are freely and publicly accessible, can provide insights regarding fracture incidence. Population mobility may be useful in quantitative modelling of fracture-related inpatient and surgical theatre service demand, using the IRRs described in this study.
 
Although there is evidence to support the efficacy of social distancing measures with respect to COVID-19 transmission,2 our findings emphasise the collateral impacts of pandemic-related interventions on non-communicable diseases. We found that fracture incidence decreased when population mobility was hindered by social distancing measures; the relative reduction in overall fractures appeared to be similar to the effect of established pharmacological interventions on fragility fractures.17 Although this relationship appears to contradict the common notion that physical activity confers a protective effect against fractures in both young and old age-groups, 18 19 associations of increased fracture risk with specific types of exercises (eg, bicycling), or regular participation in other exercise and sports activities, have been described.20 Thus, long-term benefits (eg, increased bone mineral density) may be accrued at the expense of increased exposure to fracture risk when engaging in physical activity. Although the long-term impact of reduced population mobility on fracture incidence remains unclear, vitamin D deficiency caused by prolonged time indoors (ie, without sunlight exposure) is an established risk factor for future fractures.21
 
The strengths of our study include its inclusion of data from all public hospitals in Hong Kong, which allowed extensive analysis of rare events such as fractures. Our database has a high (>96%) positive predictive value for fractures,22 presumably because data entry is conducted by impartial registered medical practitioners. Furthermore, high internet and smartphone penetration increased the sensitivity of the population mobility analysis, such that the mobility index was geographically specific to the study population. Pedestrian and road traffic densities, which are indirectly represented by the population mobility index, could also precipitate accidents, falls, and subsequent fracture risk. Additionally, potential confounding based on health-seeking behaviour was partially mitigated by the inclusion of ‘control’ groups. Fortunately, all hospitals involved in the study maintained full emergency service during the early days of the COVID-19 pandemic23; this maintenance of emergency service minimised potential confounding by hospitals that were unable to provide service to patients with fractures.
 
Limitations of the study involved deficiencies in the population mobility index. For example, travel between familiar places and travel where navigation guidance is unnecessary, as well as the usage of alternative electronic service providers, were not considered. Therefore, the population mobility index served as a more specific (rather than sensitive) tool for assessment of population mobility. Global positioning system (GPS)–based mobility tracking would theoretically allow more extensive data collection, thus providing greater detection sensitivity; however, such mobility tracking would cause substantial privacy issues, resulting in legal and ethical challenges.
 
Notably, older adults are less adept in smartphone usage (62.2% of residents aged ≥65 years reported internet usage in 202011), and the digital population mobility index does not adequately illustrate this division in the population. Furthermore, fractures in older adults are largely caused by osteoporosis, whereas high-energy injury mechanisms are observed in younger individuals.24 Therefore, social distancing may have a negligible effect on the incidences of osteoporotic fractures sustained indoors. We caution against using population mobility data as the sole source of estimates for health service planning because that approach could underestimate fragility fracture service demand.
 
Additionally, the use of fracture incidence data from a public healthcare database only included approximately 90% of the population health demand. During the early days of the COVID-19 pandemic, instances of diversion to the private sector, attendances in private clinics, and visits to alternative practitioners were not coded; the lack of these data may have led to underestimation of total fracture incidence. Finally, we caution against generalising these findings to regions with less internet and smartphone penetration.
 
Conclusion
During the early days of the COVID-19 pandemic, fracture incidence and fracture-related mortality considerably decreased with the implementation of government social distancing measures that targeted population mobility. This unique opportunity enabled the identification of collateral associations and revealed that population mobility could be used (along with many other factors) to estimate clinical service demand.
 
Author contributions
Concept or design: JSH Wong, DKH Yee.
Acquisition of data: JSH Wong.
Analysis or interpretation of data: JSH Wong, ALH Lee, DKH Yee, CX Fang.
Drafting of the manuscript: JSH Wong, ALH Lee, CX Fang.
Critical revision of the manuscript for important intellectual content: CX Fang, DKH Yee, FKL Leung, KMC Cheung.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
Ethics approval was granted by the Institutional Review Board of The University of Hong Kong/ Hospital Authority Hong Kong West Cluster (HKU/HA HKW IRB Ref No.: UW 20-275), and investigations were carried out in accordance with the Declaration of Helsinki. The requirement for patient informed consent was waived by the Board because the study used anonymised data and the risk of identification was low.
 
References
1. Leung K, Wu JT, Liu D, Leung GM. First-wave COVID-19 transmissibility and severity in China outside Hubei after control measures, and second-wave scenario planning: a modelling impact assessment. Lancet 2020;395:1382-93. Crossref
2. Cousins S. New Zealand eliminates COVID-19. Lancet 2020;395:1474. Crossref
3. De Filippo O, D’Ascenzo F, Angelini F, et al. Reduced rate of hospital admissions for ACS during Covid-19 outbreak in Northern Italy. N Engl J Med 2020;383:88-9. Crossref
4. Krug EG, Sharma GK, Lozano R. The global burden of injuries. Am J Public Health 2000;90:523-6. Crossref
5. Census and Statistics Department, Hong Kong SAR Government. Table 31: Gross Domestic Product (GDP), implicit price deflator of GDP and per capita GDP. 2020. Available from: https://www.censtatd.gov.hk/en/web_table.html?id=31. Accessed 26 Apr 2020.
6. Food and Health Bureau, Hong Kong SAR Government. Report of the Strategic Review on Healthcare Manpower Planning and Professional Development. 2017. Available from: https://www.fhb.gov.hk/en/press_and_publications/otherinfo/180500_sr/srreport.html. Accessed 20 May 2020.
7. Leung GM, Cowling BJ, Wu JT. From a sprint to a marathon in Hong Kong. N Engl J Med 2020;382:e45. Crossref
8. Yee DK, Fang C, Lau TW, Pun T, Wong TM, Leung F. Seasonal variation in hip fracture mortality. Geriatr Orthop Surg Rehabil 2017;8:49-53. Crossref
9. Apple Inc. COVID-19–mobility trends reports. Available from: https://www.apple.com/covid19/mobility. Accessed 26 Apr 2020.
10. Statcounter GlobalStats. Mobile & tablet vendor market share Hong Kong. Jan – Mar 2020. Available from: https://gs.statcounter.com/vendor-market-share/mobile-tablet/hong-kong/#monthly-202001-202003. Accessed 26 Apr 2020.
11. Census and Statistics Department, Hong Kong SAR Government. Thematic Household Survey Report No. 69: Personal Computer and Internet Penetration. 2020. Available from: https://www.ogcio.gov.hk/en/about_us/facts/doc/householdreport2020_69.pdf. Accessed 5 May 2020.
12. Census and Statistics Department, Hong Kong SAR Government. Population– Overview. 2020. Available from: https://www.censtatd.gov.hk/hkstat/sub/so20.jsp. Accessed 12 Apr 2020.
13. Centre for Health Protection, Department of Health, Hong Kong SAR Government. Latest situation of novel coronavirus infection in Hong Kong. Available from: https://chp-dashboard.geodata.gov.hk/covid-19/en.html. Accessed 12 Apr 2022.
14. Borgström F, Karlsson L, Ortsäter G, et al. Fragility fractures in Europe: burden, management and opportunities. Arch Osteoporos 2020;15:59. Crossref
15. Leung F, Lau TW, Kwan K, Chow SP, Kung AW. Does timing of surgery matter in fragility hip fractures? Osteoporos Int 2010;21 Suppl 4:S529-34. Crossref
16. British Orthopaedic Association. COVID BOAST-Management of patients with urgent orthopaedic conditions and trauma during the coronavirus pandemic. Available from: https://www.boa.ac.uk/resources/covid-19-boasts-combined1.html. Accessed 13 Feb 2023.
17. Tsuda T, Hashimoto Y, Okamoto Y, Ando W, Ebina K. Meta-analysis for the efficacy of bisphosphonates on hip fracture prevention. J Bone Miner Metab 2020;38:678-86. Crossref
18. Fritz J, Cöster ME, Nilsson JA, Rosengren BE, Dencker M, Karlsson MK. The associations of physical activity with fracture risk—a 7-year prospective controlled intervention study in 3534 children. Osteoporos Int 2016;27:915-22. Crossref
19. Morseth B, Ahmed LA, Bjørnerem Å, et al. Leisure time physical activity and risk of non-vertebral fracture in men and women aged 55 years and older: the Tromsø study. Eur J Epidemiol 2012;27:463-71. Crossref
20. Appleby PN, Allen NE, Roddam AW, Key TJ. Physical activity and fracture risk: a prospective study of 1898 incident fractures among 34,696 British men and women. J Bone Miner Metab 2008;26:191-8. Crossref
21. Nilson F, Moniruzzaman S, Andersson R. A comparison of hip fracture incidence rates among elderly in Sweden by latitude and sunlight exposure. Scand J Public Health 2014;42:201-6. Crossref
22. Sing CW, Woo YC, Lee AC, et al. Validity of major osteoporotic fracture diagnosis codes in the Clinical Data Analysis and Reporting System in Hong Kong. Pharmacoepidemiol Drug Saf 2017;26:973-6. Crossref
23. Hospital Authority, Hong Kong SAR Government. HA adjusts service provision to focus on combatting epidemic. 2020. Press Release. Available from: https://www.info.gov.hk/gia/general/202002/10/P2020021000711.htm. Accessed 20 May 2020.
24. Bergh C, Wennergren D, Möller M, Brisby H. Fracture incidence in adults in relation to age and gender: a study of 27,169 fractures in the Swedish Fracture Register in a well-defined catchment area. PLoS One 2020;15:e0244291. Crossref

Ten-year refractive and visual outcomes of intraocular lens implantation in infants with congenital cataract

© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE  CME
Ten-year refractive and visual outcomes of intraocular lens implantation in infants with congenital cataract
Joyce JT Chan, FRCOphth; Emily S Wong, FCOphthHK; Carol PS Lam, FCOphthHK; Jason C Yam, FRCSEd
Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong Eye Hospital, Hong Kong
 
Corresponding author: Dr JC Yam (yamcheuksing@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: There is no consensus regarding optimal target refraction after intraocular lens implantation in infants. This study aimed to clarify relationships of initial postoperative refraction with long-term refractive and visual outcomes.
 
Methods: This retrospective review included 14 infants (22 eyes) who underwent unilateral or bilateral cataract extraction and primary intraocular lens implantation before the age of 1 year. All infants had ≥10 years of follow-up.
 
Results: All eyes exhibited myopic shift over a mean follow-up period of 15.9 ± 2.8 years. The greatest myopic shift occurred in the first postoperative year (mean=-5.39 ± +3.50 dioptres [D]), but smaller amounts continued beyond the tenth year (mean=-2.64 ± +2.02 D between 10 years postoperatively and last follow-up). Total myopic shift at 10 years ranged from -21.88 to -3.75 D (mean=-11.62 ± +5.14 D). Younger age at operation was correlated with larger myopic shifts at 1 year (P=0.025) and 10 years (P=0.006) postoperatively. Immediate postoperative refraction was a predictor of spherical equivalent refraction at 1 year (P=0.015) but not at 10 years (P=0.116). Immediate postoperative refraction was negatively correlated with final best-corrected visual acuity (BCVA) (P=0.018). Immediate postoperative refraction of ≥+7.00 D was correlated with worse final BCVA (P=0.029).
 
Conclusion: Considerable variation in myopic shift hinders the prediction of long-term refractive outcomes in individual patients. When selecting target refraction in infants, low to moderate hyperopia (<+7.00 D) should be considered to balance the avoidance of high myopia in adulthood with the risk of worse long-term visual acuity related to high postoperative hyperopia.
 
 
New knowledge added by this study
  • The greatest myopic shift occurred in the first year after cataract surgery, but smaller shifts continued beyond the tenth year. Overall, 50% of eyes exhibited myopic shift >-2.00 dioptres between the tenth postoperative year and last follow-up.
  • Considerable variation in refractive change after intraocular lens implantation in infants aged <1 year hinders the prediction of long-term refractive outcomes in individual patients. Immediate postoperative refraction was not correlated with spherical equivalent refraction at 10 years postoperatively.
  • Immediate postoperative refraction of ≥+7.00 dioptres was correlated with worse final visual acuity.
Implications for clinical practice or policy
  • When selecting target refraction in infants, low to moderate hyperopia (<+7.00 dioptres) should be considered to balance the avoidance of high myopia in adulthood with the risk of worse long-term visual acuity related to high postoperative hyperopia.
 
 
Introduction
Appropriate optical correction after cataract extraction in infants is important for efforts to avoid amblyopia. Primary intraocular lens (IOL) implantation allows constant in situ optical correction during the critical years of visual development, while avoiding the expenses and compliance issues associated with contact lenses.1 Disadvantages include increased rates of surgical complications and re-operations,2 as well as the inability to modify IOL power during ocular growth. A recent report by the American Academy of Ophthalmology suggested that IOL implantation can be safely conducted in children aged >6 months.3 However, because of the unpredictable nature of ocular growth, it remains challenging to select a target refraction in infants that allows achievement of optimal long-term visual and refractive outcomes.
 
Surgeons target various initial hyperopia values, ranging from +5.00 dioptres (D) to +10.50 D,4 5 6 7 8 9 to compensate for the rapid myopic shift that occurs during infancy. However, prediction of the myopic shift remains difficult; significant hyperopia in infants requires stringent optical correction to prevent amblyopia, and some studies have linked high initial hyperopia to worse visual acuity.10 11 This retrospective study aimed to clarify the relationships of initial postoperative refraction with 10-year spherical equivalent refraction (SER) and long-term best-corrected visual acuity (BCVA) after IOL implantation in infants.
 
Methods
Inclusion and exclusion criteria
This retrospective study included patients who underwent unilateral or bilateral congenital cataract extraction and primary IOL implantation before the age of 1 year between 1997 and 2009 at a single secondary and tertiary referral eye centre. Only patients with ≥10 years of follow-up were included. Eyes with associated ocular co-morbidities (eg, persistent foetal vasculature and glaucoma) were excluded.
 
Surgical technique and follow-up
The patients’ baseline characteristics (eg, age, axial length [determined by applanation A-scan biometry], and keratometry) were recorded. Intraocular lens powers were calculated using the Sanders–Retzlaff–Kraff II formula. The operating surgeon selected the target refraction and IOL power, considering the patient’s age (all cases) and refractive error in the fellow eye (unilateral cases). All operations were performed using similar techniques, including the creation of a 3.0-mm scleral tunnel, anterior continuous curvilinear capsulorhexis, and lens removal by automated irrigation and aspiration. Heparin-surface-modified polymethyl methacrylate IOLs or acrylic foldable IOLs were implanted. The IOL was placed in the capsular bag or in the sulcus. All wounds were sutured. In some cases, primary posterior curvilinear capsulorhexis and anterior vitrectomy were performed. Because of reports that a significant number of eyes in young infants required secondary posterior capsule opening despite primary posterior capsulotomy,12 13 this procedure was omitted in some eyes to increase the likelihood of achieving capsular IOL implantation. Postoperatively, all eyes were treated with intensive topical steroid and antibiotic medication. Patients were assessed on postoperative day 1, week 1, week 2, and week 4; they were then assessed every 3 to 6 months. When clinically significant posterior capsular opacification developed, secondary posterior capsulotomy was performed promptly. Glasses were used for postoperative optical correction; in some cases, contact lenses were also used. Amblyopia treatment by patching was performed as necessary.
 
Outcome measures and statistical analysis
Spherical equivalent refraction at 2 weeks postoperatively was regarded as immediate postoperative refraction. Serial refractions at each year of postoperative follow-up were recorded, and SERs were calculated as the algebraic sum of the sphere and half the cylindrical power. Postoperative axial length was measured using non-contact optical biometry, which was less invasive than applanation biometry.
 
Statistical analysis was performed using Microsoft Excel and SPSS (Windows version 21.0; IBM Corp, Armonk [NY], United States). Best-corrected visual acuities were converted to logarithm of the minimum angle of resolution (logMAR) values for statistical analysis. Correlations between continuous variables were assessed by Spearman correlation. Differences between groups were analysed by the Mann–Whitney U test. Preoperative axial length and keratometry were compared with values at the last follow-up using the paired-samples Wilcoxon signed-rank test. The independent-sample Kruskal–Wallis test was used to compare 10-year SER and BCVA values among groups with immediate postoperative refraction ≤+3.50 D, +3.50 to +7.00 D, and ≥+7.00 D. Partial correlation analysis was performed to detect correlations of immediate postoperative refraction with spherical refraction at 1 year and 10 years after adjustment for age at operation. Multiple linear regression was performed for multivariate analysis of statistically significant factors identified during univariate analysis. P values <0.05 were considered statistically significant.
 
Results
Twenty-two eyes of 14 patients were included in this study. One eye in one patient with bilateral cataract was excluded because it was surgically treated after the patient reached 1 year of age. One eye in another patient with bilateral cataract was excluded because it exhibited secondary glaucoma. During surgery, heparin-surface-modified polymethyl methacrylate IOLs were implanted in three eyes, whereas acrylic foldable IOLs were implanted in 19 eyes. The IOL was placed in the capsular bag in 18 eyes and in the sulcus in four eyes. Additionally, primary posterior curvilinear capsulorhexis and anterior vitrectomy were performed in 13 eyes. For postoperative optical correction, all 14 patients wore glasses; four patients (including two with unilateral cataract) also wore contact lenses. Thirteen patients underwent amblyopia treatment by patching.
 
The Table summarises the baseline characteristics, refractive outcomes, and visual outcomes of eyes included in this study. All 22 eyes exhibited myopic shift, ranging from -21.88 to -3.75 D at 10 years. Figures 1 and 2 show the amounts of myopic shift and SER, respectively, at 1 to 10 years postoperatively and at last follow-up. The greatest myopic shift occurred in the first postoperative year, but smaller shifts continued beyond the tenth year. Ninety percent of eyes exhibited myopic shift >-2.00 D between the third postoperative year and last follow-up (mean myopic shift: -6.40 ± +3.29 D; range, -12.00 to -1.63 D). These proportions were 82% between the sixth postoperative year and last follow-up (mean myopic shift: -4.14 ± +2.35 D; range, -9.38 to -1.13 D), and 50% between the tenth postoperative year and last follow-up (mean myopic shift: -2.64 ± +2.02 D; range, -0.125 to -6.75 D).
 

Table. Baseline characteristics and follow-up results of patients who underwent unilateral or bilateral cataract extraction and primary intraocular lens implantation before the age of 1 year
 

Figure 1. Magnitude of myopic shift per year from 1 to 10 years postoperatively, and between 10 years postoperatively and last follow-up, after primary implantation of intraocular lens in infants aged <1 year. Boxes: quartile 1 to quartile 3 (interquartile range). Lines: medians. Whiskers: maximum and minimum values excluding potential outliers and extreme values. Circles: potential outliers, more than 1.5 interquartile ranges but at most 3 interquartile ranges below quartile 1 or above quartile 3. Asterisks: extreme values, more than 3 interquartile ranges below quartile 1 or above quartile 3
 

Figure 2. Spherical equivalent refraction immediately after operation, at 1 year to 10 years, and at last follow-up after primary intraocular lens implantation in infants aged <1 year. Boxes: quartile 1 to quartile 3 (interquartile range). Lines: medians. Whiskers: maximum and minimum values excluding potential outliers and extreme values. Circles: potential outliers, more than 1.5 interquartile ranges but at most 3 interquartile ranges below quartile 1 or above quartile 3
 
Factors affecting myopic shift at 1 year and at 10 years
In univariate analysis, a larger myopic shift at 1 year postoperatively was correlated with younger age at operation (R2=0.585, P=0.004), more hyperopic immediate postoperative refraction (R2=-0.533, P=0.011), and a need for secondary posterior capsulotomy (U=20, z=-2.066, P=0.04). One-year myopic shift was not correlated with initial axial length (R2=0.038, P=0.878), and it did not differ between unilateral (median=-5.81 D) and bilateral cases (median=-4.38 D) [U=41.5, z=0.469, P=0.652]. Multiple linear regression was performed for statistically significant factors identified during univariate analysis. Only age at operation remained statistically significant (P=0.025); immediate postoperative refraction (P=0.191) and a need for secondary posterior capsulotomy (P=0.781) were not significant in multivariate analysis.
 
The total amount of myopic shift at 10 years postoperatively was correlated with age at operation (R2=0.579, P=0.006), but it was not correlated with immediate postoperative refraction (R2=-0.339, P=0.133) or initial axial length (R2=0.291, P=0.241). There was no difference in the amount of myopic shift at 10 years between unilateral (median=-14.62 D) and bilateral cases (median=-10.50 D) [U=40.5, z=1.357, P=0.185] or between eyes that required secondary posterior capsulotomy (median=-11.25 D) and eyes that did not (median = -6.19 D) [U=24, z=-1.645, P=0.112].
 
Factors affecting spherical equivalent refraction at 1 year and at 10 years
Spherical equivalent refraction at 1 year did not significantly differ between unilateral (median=-2.69 D) and bilateral cases (median=+1.13 D) [U=59, z=1.959, P=0.053] or between eyes that required secondary capsulotomy (median=+0.31 D) and eyes that did not (median=+1.42 D) [U=32.5, z=-1.143, P=0.261]. Partial correlation analysis showed that after adjustment for age at operation, immediate postoperative refraction (R2=0.522, P=0.015) was a statistically significant predictor of SER at 1 year.
 
In contrast, SER at 10 years postoperatively was significantly more myopic in unilateral cases (median=-10.63 D) than in bilateral cases (median=-4.81 D) [U=49.5, z=2.264, P=0.017]. This finding may be related to surgeon preference for less hyperopic target refractions in unilateral cases, which can match the refraction of the fellow eye and potentially prevent significant postoperative anisometropia. Indeed, after adjustment for age, immediate postoperative SER was significantly less hyperopic in unilateral cases than in bilateral cases (P=0.025). A need for secondary posterior capsulotomy (U=28, z=-1.325, P=0.205) was not correlated with SER at 10 years. After adjustment for laterality, both age at operation (P=0.066) and immediate postoperative refraction (P=0.116) were not statistically significant predictors of SER at 10 years. There was no significant difference in 10-year SER among eyes with immediate postoperative refraction ≤+3.50 D, +3.50 to +7.00 D, and ≥+7.00 D (P=0.439), as shown in Figure 3.
 

Figure 3. Ten-year spherical equivalent refraction in eyes with immediate postoperative refraction ≤+3.50 dioptres (D), +3.50 to +7.00 D, and ≥+7.00 D. Boxes: quartile 1 to quartile 3 (interquartile range). Lines: medians. Whiskers: maximum and minimum values excluding potential outliers and extreme values. Circles: potential outliers, more than 1.5 interquartile ranges but at most 3 interquartile ranges below quartile 1 or above quartile 3
 
Subgroup analysis was performed for bilateral cases only. Multiple regression analysis showed that at 1 year, both age at operation (P=0.014) and immediate postoperative refraction (P=0.024) remained significant predictors of SER after unilateral cases had been excluded. At 10 years postoperatively, age at operation was a significant predictor of SER (P=0.015), whereas immediate postoperative refraction was not (P=0.135).
 
Axial length and keratometry
Mean preoperative axial length was 19.12 mm, whereas mean axial length at 10 years was 24.82 mm. There were no differences in initial axial length (U=32, z=0.894, P=0.421) or total axial length change (U=22, z=-0.224, P=0.875) between unilateral and bilateral cases. Final axial length was significantly greater than preoperative axial length (z=3.823, P<0.0005). Total axial length change was strongly correlated with total myopic shift (R2=-0.791, P<0.0005). There was no difference between preoperative and final keratometry values (z=0.081, P=0.936). The total change in the mean keratometry value was not correlated with total myopic shift (R2=-0.168, P=0.490).
 
Final best-corrected visual acuity
At the last follow-up, 11 eyes (50%) had a final BCVA of 0.18 logMAR or better, six eyes (27%) had moderate amblyopia with BCVA of 0.3 to 0.6 logMAR, and five eyes (23%) had severe amblyopia with BCVA of 0.7 to 1.0 logMAR. There was a statistically significant correlation between immediate postoperative refraction and final BCVA (R2=0.440, P=0.041). Best-corrected visual acuity was worse in eyes that required secondary capsulotomy (U=74.5, z=1.995, P=0.049). Multiple regression revealed that a need for secondary capsulotomy was no longer a significant predictor for BCVA (P=0.299), whereas immediate postoperative refraction remained a significant predictor for BCVA (P=0.018). Best-corrected visual acuity was significantly worse in eyes with immediate postoperative refraction of +7.00 D or higher than in eyes with lower levels of immediate postoperative hyperopia (P=0.029) [Fig 4]. There were no significant correlations of final BCVA with age at operation (R2=-0.041, P=0.856), SER at 10 years (R2=0.011, P=0.963), SER at last follow-up (R2=-0.122, P=0.589), or laterality (U=48.5, z=-1.087, P=0.300).
 

Figure 4. Logarithm of the minimum angle of resolution (logMAR) best-corrected visual acuity in eyes with immediate postoperative refraction ≤+3.50 dioptres (D), +3.50 to +7.00 D, and ≥+7.00 D. Boxes: quartile 1 to quartile 3 (interquartile range). Lines: median. Whiskers: maximum and minimum values excluding potential outliers and extreme values. Circles: potential outliers, more than 1.5 interquartile ranges but at most 3 interquartile ranges below quartile 1 or above quartile 3. Asterisks: extreme values, more than 3 interquartile ranges below quartile 1 or above quartile 3
 
Complications and re-operations
Seventeen eyes underwent 21 re-operations in total, 17 of which were secondary posterior capsulotomies. All nine eyes that did not undergo primary posterior capsulorhexis and anterior vitrectomy required secondary capsulotomy; one of the nine eyes required secondary capsulotomy twice. Seven of 13 eyes with primary posterior capsulorhexis and anterior vitrectomy required secondary capsulotomy. Three eyes underwent injection of intracameral tissue plasminogen activator, one eye underwent dissection of fibrinous membrane, and one eye required removal of anterior capsular phimosis. Notably, anterior capsular phimosis did not develop in any other eyes. One eye developed secondary glaucoma and was excluded from this study.
 
Discussion
Two important goals in the management of congenital cataract include achievement of good long-term visual acuity and minimisation of refractive error in adulthood. This study focused on long-term outcomes after primary IOL implantation in infants, all of whom had ≥10 years of follow-up. Myopic shift was present in all eyes, and its magnitude considerably varied. Immediate postoperative refraction was not a statistically significant predictor of SER at 10 years. Moreover, there was a statistically significant negative correlation between immediate postoperative refraction and final BCVA. Finally, immediate postoperative refraction of +7.00 D or higher was correlated with worse final BCVA.
 
Refractive change in a growing eye
Refractive change in a normal growing eye involves a complex interaction among axial length elongation, corneal curvature flattening, and the reduction of crystalline lens power.14 Additional effects on ocular growth (eg, related to the presence or laterality of congenital cataract, age at corrective surgery, initial axial length, postoperative visual input, and compliance with postoperative amblyopia therapy) remain uncertain.15 The presence of an intraocular lens magnifies myopic shift in a growing eye—the intraocular lens exhibits constant power and moves anteriorly away from the retina during ocular growth, hindering the extrapolation of data from phakic eyes.5 Figure 5 shows the mean SER during the first decade of life in pseudophakic eyes from patients in the present study, compared with normal eyes from the ongoing population-based Hong Kong Children Eye Study.16 At 10 years after corrective surgery, the mean SER was -6.48 D in pseudophakic eyes, whereas it was -0.72 D in normal eyes of age-matched children. The mean axial length at 10 years was 24.82 mm in pseudophakic eyes, whereas it was 23.79 mm in normal eyes of age-matched children.16 These data imply the presence of a greater myopic shift and greater increase in axial length among pseudophakic eyes which continues beyond the first 2 years of life; notably, these increases are relative to the mean growth of normal eyes in Hong Kong children, who exhibit a higher prevalence of myopia compared with other populations.16
 

Figure 5. Mean spherical equivalent refraction during the first decade of life in pseudophakic eyes from patients in the present study compared with normal eyes from the Hong Kong Children Eye Study
 
Refractive change after primary intraocular lens implantation in infants
Several other studies of myopic shift have revealed considerable refractive change after primary IOL implantation in infants aged <1 year. At 5 years postoperatively, the Infant Aphakia Treatment Study revealed a mean myopic shift of -8.97 D for infants surgically treated at the age of 1 month and -7.22 D for infants surgically treated at the age of 6 months,9 whereas Negalur et al17 found a median myopic shift of -8.43 D after the same duration of follow-up in infants operated before the age of 6 months. Fan et al18 reported a mean myopic shift of -7.11 D at 3 years postoperatively in infants operated before the age of 1 year; Lu et al19 reported a mean myopic shift of -6.46 D at 2 years in 22 eyes, as well as a mean myopic shift of -8.67 D at 6 years in three eyes, among infants operated between the age of 6 and 12 months. In our study, which had a longer follow-up period, the mean 10-year myopic shift was -11.62 ± 5.14 D and myopic progression continued beyond 10 years postoperatively. These findings highlight the importance of using long-term data to guide management decisions, including the selection of target refraction and the determination of appropriate timing for enhancement procedures (eg, IOL exchange).
 
Our results showed that myopic shift was greatest in the first postoperative year and was correlated with age at operation, which is consistent with findings in the literature.9 10 17 18 19 20 21 Because age at operation is most frequently associated with the magnitude of refractive change, many surgeons prefer to adjust initial hyperopia according to age. McClatchey et al22 recommended targets of +5.00 to +8.00 D in infants aged <1 year, whereas Valera Cornejo et al4 selected targets of +7.00 to +9.00 D in infants of the same age-group. The results of the Infant Aphakia Treatment Study suggested that, to achieve emmetropia at 5 years, immediate postoperative hyperopia should be +10.50 D from 4 to 6 weeks of age and +8.50 D from 7 weeks to 6 months of age.9 However, our results showed considerable variation in myopic shift at 10 years (range, -21.88 to -3.75 D); after adjustment for age, immediate postoperative refraction was not a statistically significant predictor of SER at 10 years. Other studies have shown that initial refraction and IOL undercorrection were not significantly associated with the magnitude or rate of myopic shift9 18 22 23; they also revealed large and unpredictable variations in refractive outcomes after IOL implantation in young infants.10 20 21 22 24 25 At the 3-year follow-up, refractive change ranged from +2.00 to -15.50 D in a study by Gouws et al26 and from -0.47 to -10.69 D in a study by Fan et al.18 Although we observed a trend towards more myopic 10-year refractions in groups with lower initial postoperative hyperopia, there were no significant differences because of substantial variability in the data (Fig 3). The Infant Aphakia Treatment Study showed that the actual and expected amounts of myopic shift differed in a large percentage of patients; 50% of patients exhibited differences of +3.00 to +14.00 D from expected values.9 Therefore, age-adjusted suggested targets only compensate for the mean expected myopic shift; large interpatient variability will often result in unanticipated long-term outcomes for individual patients. Correlation analysis in our study revealed that age at operation only explained 58% of the variance in myopic shift at 10 years. This correlation is presumably influenced by other factors that contribute to myopic progression, such as genetics, ethnicity, outdoor exposure, education level, and extent of near work.27
 
Long-term best-corrected visual acuity
The achievement of optimal long-term BCVA is another important goal of surgical treatment for congenital cataract. In our study, immediate postoperative refraction of ≥+7.00 D was correlated with worse BCVA. Similarly, in a study of infants who underwent surgery between the ages of 2 and 21 months, with ≥4 years of follow-up, Magli et al10 found that BCVA was higher in infants with initial spherical refraction between +1.00 and +3.00 D than in infants with initial spherical refraction >+3.00 D. In a study that included older children who underwent surgery at or before the age of 8.5 years, Lowery et al11 found that low early postoperative hyperopia (+1.75 to +5.00 D) yielded better longterm BCVA, compared with refractions <+1.75 or >+5.00 D in unilateral cases; no difference was observed in bilateral cases. Another study of older children (surgically treated between the ages of 2 and 6 years) revealed no difference in BCVA between initial postoperative refractive errors of near emmetropia versus undercorrection of +2.00 to +5.50 D23; no patients had initial refraction values >+5.50 D. High initial postoperative hyperopia requires good compliance with refractive correction; in infants, such hyperopia also requires amblyopia treatment because younger children are at higher risk of developing amblyopia. Hyperopia is more amblyogenic than myopia because young children have higher demands for near vision28; moreover, hyperopia causes defocusing in both distance and near vision, particularly among patients who exhibit pseudophakia related to accommodation loss. Studies have shown variable compliance with optical correction and amblyopia treatment after congenital cataract surgery19 29; the use of high-plus spectacles is associated with various optical aberrations. Additionally, contact lenses are suboptimal because one of the original aims of intraocular lens implantation is to avoid the need for contact lens. Myopia is comparatively less amblyogenic because it allows retention of near vision, particularly if the amount of myopia remains low until later in childhood when visual development is more mature.30 Therefore, parental motivation and the likelihood of compliance should be included in decisions regarding postoperative refraction. Ideally, high myopia in adulthood should be minimised. However, this goal should be balanced with the risks of amblyopia and long-term poor vision. Therefore, the selection of high hyperopia (>+7.00 D) as an initial postoperative target refraction should be avoided when possible.
 
Strengths and limitations
A major strength of our study was the long follow-up period. Additionally, it only included infants who underwent IOL implantation before the age of 1 year because the refractive change in this group exhibits the greatest variability and is most challenging to predict.5
 
There were some limitations in this study. First, it used a retrospective design and included a small number of patients. Second, there was no objective monitoring of compliance with optical correction or amblyopia treatment. Third, few unilateral cases were included, which may have hindered the detection of larger myopic shifts in post–IOL implantation in unilateral cases. Notably, some previous studies revealed larger myopic shifts after IOL implantation in such cases.4 17 21 22
 
In conclusion, the large and variable refractive change after IOL implantation in infants aged <1 year hinders the prediction of long-term refractive outcomes in individual patients. When selecting target refraction in infants, low to moderate hyperopia (<+7.00 D) should be considered to balance the avoidance of high myopia in adulthood with the risk of worse long-term visual acuity related to high postoperative hyperopia
 
Author contributions
Concept or design: All authors.
Acquisition of data: JJT Chan.
Analysis or interpretation of data: JJT Chan.
Drafting of the manuscript: JJT Chan.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As an editor of the journal, JC Yam was not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
Ethics approval was granted by the Research Ethics Committee (Kowloon Central/Kowloon East), Hospital Authority (Ref No.: KC/KE-19-0059/ER-4). The requirement for patient consent was waived by the ethics board due to the retrospective nature of the study. The study is conducted in accordance with the ethical principles of the Declaration of Helsinki.
 
References
1. Kumar P, Lambert SR. Evaluating the evidence for and against the use of IOLs in infants and young children. Expert Rev Med Devices 2016;13:381-9. Crossref
2. Infant Aphakia Treatment Study Group; Lambert SR, Lynn MJ, et al. Comparison of contact lens and intraocular lens correction of monocular aphakia during infancy: a randomized clinical trial of HOTV optotype acuity at age 4.5 years and clinical findings at age 5 years. JAMA Ophthalmol 2014;132:676-82. Crossref
3. Lambert SR, Aakalu VK, Hutchinson AK, et al. Intraocular lens implantation during early childhood: a report by the American Academy of Ophthalmology. Ophthalmology 2019;126:1454-61. Crossref
4. Valera Cornejo DA, Flores Boza A. Relationship between preoperative axial length and myopic shift over 3 years after congenital cataract surgery with primary intraocular lens implantation at the National Institute of Ophthalmology of Peru, 2007-2011. Clin Ophthalmol 2018;12:395-9. Crossref
5. McClatchey SK, Parks MM. Theoretic refractive changes after lens implantation in childhood. Ophthalmology 1997;104:1744-51. Crossref
6. Enyedi LB, Peterseim MW, Freedman SF, Buckley EG. Refractive changes after pediatric intraocular lens implantation. Am J Ophthalmol 1998;126:772-81. Crossref
7. Astle WF, Ingram AD, Isaza GM, Echeverri P. Paediatric pseudophakia: analysis of intraocular lens power and myopic shift. Clin Experiment Ophthalmol 2007;35:244-51. Crossref
8. Yam JC, Wu PK, Ko ST, Wong US, Chan CW. Refractive changes after pediatric intraocular lens implantation in Hong Kong children. J Pediatr Ophthalmol Strabismus 2012;49:308-13.
9. Weakley DR Jr, Lynn MJ, Dubois L, et al. Myopic shift 5 years after intraocular lens implantation in the Infant Aphakia Treatment Study. Ophthalmology 2017;124:822-7. Crossref
10. Magli A, Forte R, Carelli R, Rombetto L, Magli G. Long-term outcomes of primary intraocular lens implantation for unilateral congenital cataract. Semin Ophthalmol 2015;31:1-6. Crossref
11. Lowery RS, Nick TG, Shelton JB, Warner D, Green T. Long-term visual acuity and initial postoperative refractive error in pediatric pseudophakia. Can J Ophthalmol 2011;46:143-7. Crossref
12. Vasavada A, Chauhan H. Intraocular lens implantation in infants with congenital cataracts. J Cataract Refract Surg 1994;20:592-8. Crossref
13. Plager DA, Yang S, Neely D, Sprunger D, Sondhi N. Complications in the first year following cataract surgery with and without IOL in infants and older children. J AAPOS 2002;6:9-14. Crossref
14. Gordon RA, Donzis PB. Refractive development of the human eye. Arch Ophthalmol 1985;103:785-9. Crossref
15. Indaram M, VanderVeen DK. Postoperative refractive errors following pediatric cataract extraction with intraocular lens implantation. Semin Ophthalmol 2018;33:51-8. Crossref
16. Yam JC, Tang SM, Kam KW, et al. High prevalence of myopia in children and their parents in Hong Kong Chinese population: the Hong Kong Children Eye Study. Acta Ophthalmol 2020;98:e639-48. Crossref
17. Negalur M, Sachdeva V, Neriyanuri S, Ali M, Kekunnaya R. Long-term outcomes following primary intraocular lens implantation in infants younger than 6 months. Indian J Ophthalmol 2018;66:1088-93. Crossref
18. Fan DS, Rao SK, Yu CB, Wong CY, Lam DS. Changes in refraction and ocular dimensions after cataract surgery and primary intraocular lens implantation in infants. J Cataract Refract Surg 2006;32:1104-8. Crossref
19. Lu Y, Ji YH, Luo Y, Jiang YX, Wang M, Chen X. Visual results and complications of primary intraocular lens implantation in infants aged 6 to 12 months. Graefes Arch Clin Exp Ophthalmol 2010;248:681-6. Crossref
20. O’Keefe M, Fenton S, Lanigan B. Visual outcomes and complications of posterior chamber intraocular lens implantation in the first year of life. J Cataract Refract Surg 2001;27:2006-11. Crossref
21. Hoevenaars NE, Polling JR, Wolfs RC. Prediction error and myopic shift after intraocular lens implantation in paediatric cataract patients. Br J Ophthalmol 2011;95:1082-5. Crossref
22. McClatchey SK, Dahan E, Maselli E, et al. A comparison of the rate of refractive growth in pediatric aphakic and pseudophakic eyes. Ophthalmology 2000;107:118-22. Crossref
23. Lambert SR, Archer SM, Wilson ME, Trivedi RH, del Monte MA, Lynn M. Long-term outcomes of undercorrection versus full correction after unilateral intraocular lens implantation in children. Am J Ophthalmol 2012;153:602-8.e1. Crossref
24. Barry JS, Ewings P, Gibbon C, Quinn AG. Refractive outcomes after cataract surgery with primary lens implantation in infants. Br J Ophthalmol 2006;90:1386-9. Crossref
25. Plager DA, Kipfer H, Sprunger DT, Sondhi N, Neely DE. Refractive change in pediatric pseudophakia: 6-year follow-up. J Cataract Refract Surg 2002;28:810-5. Crossref
26. Gouws P, Hussin HM, Markham RH. Long term results of primary posterior chamber intraocular lens implantation for congenital cataract in the first year of life. Br J Ophthalmol 2006;90:975-8. Crossref
27. Morgan IG, French AN, Ashby RS, et al. The epidemics of myopia: aetiology and prevention. Prog Retin Eye Res 2018;62:134-49. Crossref
28. Pascual M, Huang J, Maguire MG, et al. Risk factors for amblyopia in the vision in preschoolers study. Ophthalmology 2014;121:622-9.e1. Crossref
29. Drews-Botsch CD, Hartmann EE, Celano M, Infant Aphakia Treatment Study Group. Predictors of adherence to occlusion therapy 3 months after cataract extraction in the Infant Aphakia Treatment Study. J AAPOS 2012;16:150-5. Crossref
30. O’Hara MA. Pediatric intraocular lens power calculations. Curr Opin Ophthalmol 2012;23:388-93. Crossref

Pages