Assessment of healthcare quality among village clinicians in rural China: the role of internal work motivation

Hong Kong Med J 2023 Feb;29(1):57-65 | Epub 9 Feb 2023
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE (HEALTHCARE IN MAINLAND CHINA)
Assessment of healthcare quality among village clinicians in rural China: the role of internal work motivation
Q Gao, PhD; L Peng, MSc; S Song, MSc; Y Zhang, MSc; Y Shi, PhD
Center for Experimental Economics in Education, Shaanxi Normal University, Xi'an, China
 
Corresponding author: Prof Y Shi (shiyaojiang7@gmail.com)
 
 Full paper in PDF
 
Abstract
Introduction: The quality of primary care is important for health outcomes among residents in China. There is evidence that internal work motivation improves the quality of healthcare provided by clinicians. However, few empirical studies have examined the relationship between internal work motivation and clinical performance among village clinicians in rural China. This study was performed to evaluate healthcare quality among village clinicians, then explore its relationships with internal work motivation among those clinicians.
 
Methods: We collected survey data using a standardised patient method and a structured questionnaire. We observed 225 interactions between standardised patients and village clinicians from 21 counties in three provinces. We used logistic regression models to analyse the relationships between work motivation and healthcare quality, then conducted heterogeneity analysis.
 
Results: Healthcare quality among village clinicians was generally low. There was a significantly positive correlation between internal work motivation and healthcare quality among village clinicians (P<0.1). Additionally, the positive effect of internal work motivation on healthcare quality was strongest among clinicians who received financial incentives and had a lighter workload (fewer patients per month) [P<0.1].
 
Conclusion: Healthcare quality among village clinicians requires urgent improvement. We recommend implementing financial incentives to stimulate internal work motivation among village clinicians, thus improving their clinical performance.
 
 
New knowledge added by this study
  • Internal work motivation was positively correlated with healthcare quality among village clinicians in rural China.
  • The positive correlation was strongest among clinicians who received financial incentives and had a lighter workload (fewer patients per month).
Implications for clinical practice or policy
  • Healthcare quality among village clinicians in rural China should be enhanced by improving their internal work motivation.
  • Interventions that include financial incentives should be implemented to strengthen the positive effect of internal work motivation on healthcare quality among clinicians.
 
 
Introduction
Village clinics, the first tier of rural health systems in China, are responsible for preventing and treating common diseases among rural residents.1 2 However, the quality of healthcare provided by village clinicians may be unsatisfactory in rural China.3 4 Village clinicians generally have a low level of education and limited medical qualifications.4 There is some evidence that, among village clinicians, the first records of formal schooling are primarily vocational school degrees; most (84.3%) of these clinicians only have the basic medical certification necessary to practise medicine in rural areas.5 Moreover, despite limited empirical evaluation, available data indicate that rural primary clinicians have low diagnostic quality and provide poor management of chronic diseases.6 A 2012 study in Shaanxi Province revealed that 41% of diagnoses were incorrect; treatments were considered correct or partially correct in 53% of clinician-patient interactions.5 A systematic review of 24 studies between 2000 and 2012 showed the rate of antibiotic use in rural clinics was much higher than the rate recommended by the World Health Organization.7 8
 
The Chinese Government has recognised the need to strengthen primary healthcare in rural areas. To improve health among rural residents, the government has recently issued multiple policies that are intended to improve service capacity within primary medical systems.9 10 For example, to improve clinical knowledge among village clinicians, several government departments jointly implemented a plan in 2013, which focused on the provision of continuing education for clinicians.11 In 2019, the Basic Medical and Health Promotion Law of the People’s Republic of China emphasised the need to support the development of primary medical institutions and implement various policies that would improve primary medical service capabilities.12
 
Although improvements in internal work motivation among village clinicians may help to enhance their medical performance, few empirical studies have examined the relationship between these two characteristics among village clinicians in rural China. Theory-focused researches indicate that internal work motivation is important for improvements to clinician performance.13 14 Other theory-based researches in China have suggested that clinicians with higher internal motivation are more likely to deliver higher-quality work.15 16 Quantitative analyses of clinician behaviour, primarily conducted in other countries, have also revealed positive effects of internal work motivation on healthcare quality and work performance of clinicians.17 18 19 To our knowledge, empirical studies of work motivation in China have primarily focused on individuals in business careers and similar occupations; few have considered groups of clinicians.20 21 22 Thus, there have been few empirical studies involving village clinicians in rural China.
 
This study explored the relationship between internal work motivation and healthcare quality among village clinicians in rural China. First, using a standardised patient method and questionnaire interviews, we evaluated healthcare quality and internal work motivation among village clinicians. Second, we examined the relationships between internal work motivation and healthcare quality among village clinicians. Third, we conducted heterogeneity analysis with a focus on clinician workload and financial incentives.
 
Methods
Sampling and data collection
Our study sampling was conducted in the rural areas of three prefectures, each located in one of the following three provinces: Sichuan, Shaanxi, and Anhui. Representative samples were selected using a multi-level random method. First, 21 sample counties were randomly selected from the sample prefectures. Next, 10 townships from each sampled county were randomly chosen as sample townships; 209 sample townships were selected because one sample county contained only nine townships. Then, one village was randomly selected from each township. Finally, all village clinics in the sample village were included; one standardised patient interaction was completed in the sample village.
 
We conducted two sets of surveys to collect data regarding basic characteristics, internal work motivation, and healthcare quality among village clinicians in 2015. In the first set of surveys, we primarily gathered information regarding the characteristics of village clinics and village clinicians. Specifically, we used a facility structured questionnaire to enquire about the value of each sample clinic’s medical instruments and institutional net income in 2014 (both in Renminbi [RMB]), and length of daily lunch break (hours). We recorded the following characteristics of sample village clinicians: age, gender, level of education and clinical qualifications, duration of service, monthly salary (in RMB), number of training days in 2014, clinician workload (mean number of patients per month), mean duration of consultation per patient (minutes), and any financial incentives. Additionally, we asked village clinicians to respond to questions regarding internal work motivation.
 
In the second set of surveys, we used a standardised patient method to evaluate the quality of healthcare provided by sample village clinicians. This method avoids problems such as the Hawthorne effect and recall bias, accurately assesses healthcare quality among clinicians, and is widely used in other countries.23 24 We recruited 63 individuals (ie, standardised patients; 21 in each province) to present three predetermined disease cases of diarrhoea, tuberculosis, and unstable angina in a standardised manner. Generally, we randomly allocated one standardised patient to each sample clinic to report a case that had been randomly selected prior to allocation.
 
Measurement of healthcare quality
We evaluated the quality of healthcare provided by village clinicians using three indicators: process quality, diagnostic accuracy, and treatment accuracy. We assigned a process quality value of 1 to clinicians who completed more than the mean percentage of suggested items, indicating a high-quality enquiry process. Otherwise, the process quality value was 0. Regarding diagnostic and treatment accuracies, we assigned a value of 0 to an ‘incorrect’ result, based on predetermined criteria. Otherwise, ‘correct’ or ‘partly correct’ results were assigned an accuracy value of 1. The treatment was also considered correct if the clinician referred the patient to a higher-level hospital.
 
Measurement of internal work motivation
According to Amabile and Mueller,25 an individual’s work motivation is defined as internal work motivation if it originates from love and interest. The internal motivation instrument in our study included four items, such as ‘because I like what I do for a living'. The responses of four items were rated on a 7-point Likert-type scale, ranging from 1 = strongly disagree to 7 = strongly agree. In this study, we assigned a value of 0 to responses indicating disagreement or neutrality (with original score of 1-4) and a value of 1 to responses indicating agreement (with original score of 5-7). The total score of the four items on our instrument represented a clinician's level of internal work motivation. The total score ranges from 0 to 4; a higher score indicated a higher level of motivation.
 
The Cronbach’s α value of the internal work motivation questionnaire was 0.826, which indicated that the scale had good internal consistency. The Kaiser–Meyer–Olkin value of the questionnaire was 0.705, indicating that the scale had good structural validity. These results confirmed that the questionnaire was an acceptable measurement tool.
 
Statistical analysis
STATA15.0 software (Stata Corporation; College Station [TX], United States) was used to perform descriptive and regression analyses of the collected data. Logistic regression models with a significance threshold of P<0.1 were used to analyse relationships between internal work motivation and healthcare quality.26 27 28 Two items, clinician workload × internal motivation interaction and financial incentive × internal motivation interaction, were added to the model for analyses of heterogeneity. All regression analyses were adjusted for fixed effects of disease cases, standardised patients, and the coder.
 
Results
Characteristics of sample village clinicians and clinics
In total, 225 village clinicians from 225 village clinics were included in this study. Table 1 describes the basic characteristics of sample village clinicians. The mean age of the clinicians was 49.20 years, and 196 clinicians (87.11%) were men. Among the 225 clinicians, 25 (11.11%) had attended college or above, whereas seven (3.11%) had a practising clinician qualification. Each clinician examined a mean of 171 patients per month. Mean salaries for village clinicians were particularly low (slightly >1900 RMB per month), and 103 clinicians (45.78%) had received financial incentives.
 

Table 1. Characteristics of sample village clinics and clinicians (n=225)
 
Table 1 also describes the characteristics of sample village clinics. The mean value of medical equipment was 920 RMB, and the mean institutional net income in 2014 was 25 500 RMB. However, only 86 clinics (38.22%) had a medical equipment value above the mean. This result indicates that the value of medical equipment considerably varied among sample clinics, and the value of medical equipment in most clinics was inadequate. Notably, clinics had a mean lunch break length of <1 hour.
 
Healthcare quality among village clinicians
The unannounced standardised patients completed 225 disease cases (57, 87, and 81 cases of diarrhoea, angina, and tuberculosis, respectively). Table 2 shows the healthcare quality among sample village clinicians determined via three disease cases. On average, the clinicians completed 17% of the recommended consultation and examination items. Furthermore, 129 clinicians (57.33%) completed fewer than the mean number of recommended consultation and examination items. Among all types of cases, 73 clinicians (32.44%) provided a completely or partially correct diagnosis. Furthermore, 94 clinicians (41.78%) provided correct or partly correct treatments across all types of cases. Although the results of these three indicators varied among diseases, the percentages of clinicians with number of recommended consultation and examination items above the mean, number of correct diagnoses, and number of treatments for each disease were generally low.
 

Table 2. Healthcare quality among sample village clinicians determined via three disease cases
 
Internal work motivation of village clinicians
Table 3 shows the levels of internal work motivation among sample village clinicians. Overall, 213 clinicians (94.67%) believed that ‘I like what I do for a living’ or ‘I enjoy my job’ motivated their work in clinics. Furthermore, 187 (83.11%) and 206 (91.56%) clinicians indicated that their respective main work motivations were ‘because my job is interesting’ and ‘because my job is fun’. Integration of the scores for the four items revealed that the mean overall score for internal work motivation was 3.64 ± 0.85 (range, 0-4).
 

Table 3. Internal work motivation of sample village clinics (n=225)
 
Relationships between internal work motivation and healthcare quality among village clinicians
Table 4 presents the results of logistic regression analysis of the relationship between internal work motivation and healthcare quality among village clinicians. Internal work motivation had a positive effect on clinical performance among sample clinicians. Specifically, for each one-unit increase in internal work motivation, village clinicians were 42.17% (P<0.1) and 45.61% (P<0.1) more likely to provide a correct or partially correct diagnosis and treatment, respectively.
 

Table 4. Logistic regression analysis of relationships between internal work motivation and healthcare quality among sample village clinicians
 
Table 5 shows the results of heterogeneity analysis from the perspective of clinician workload and financial incentives. The clinician workload × internal motivation interaction was significantly negatively correlated with diagnostic accuracy, whereas the financial incentive × internal motivation interaction was significantly positively correlated with treatment accuracy (P<0.1). These results indicate that a heavier workload could hinder the positive effect of internal motivation on diagnostic accuracy among village clinicians. Furthermore, among village clinicians who received financial incentives, the positive effect of their internal work motivation on their treatments was stronger than the corresponding effect among village clinicians who did not receive financial incentives.
 

Table 5. Heterogeneity analysis of the relationship between internal work motivation and healthcare quality among sample village clinicians
 
Discussion
This study evaluated the healthcare quality among village clinicians in rural China and its relationship with internal work motivation among these clinicians, through an analysis of 225 rural village clinicians from three provinces in 2015. There were three main findings. First, healthcare quality among village clinicians needed to be improved. Second, village clinicians with stronger internal work motivation were more likely to offer appropriate treatment. Third, village clinicians with a lighter workload (fewer patients per month) or financial incentives exhibited a stronger positive correlation between internal motivation and healthcare quality.
 
Generally, interactions between unannounced standardised patients and sample village clinicians showed that poor healthcare quality was provided by village clinics in rural China. On average, village clinicians completed only 17% of the recommended consultation and examination items. The rates of diagnostic accuracy and treatment accuracy (including correct or partly correct treatment) were 32.44% and 41.78%, respectively. Our findings of poor healthcare quality are comparable with the results of other studies performed at primary health centres in rural China. For example, a study based on the patient’s perspective, conducted in Guangdong Province, highlighted the difficulty in maintaining adequate coordination among primary medical services.29 A survey using a standardised patient method revealed that healthcare quality was worse in rural China than in primary care settings in Nairobi, Kenya.30 A systematic analysis of rural township health centres in Shandong Province also indicated a need for improved healthcare quality among primary care clinicians.16
 
We found that internal work motivation was generally high among village clinicians. The mean internal work motivation score was 3.64 ± 0.85, indicating that most village clinicians liked their jobs and were interested in their careers. Consistent with our findings, previous studies in other countries showed that most medical workers had high levels of internal work motivation.31 32 33 Although few empirical studies have evaluated internal work motivation among village clinicians, there is some evidence that rural primary care clinicians in China experience meaning and pleasure from engaging in medical work.13 Additionally, similar to results in other countries, we found that among the intrinsic factors, most village clinicians believed that a love for their career motivated them to work.33 34 35
 
Consistent with data from studies in other countries,17 19 our empirical analysis demonstrated significant positive correlations between internal work motivation and healthcare quality among village clinicians in rural China. According to affect heuristic theory, this relationship presumably arises because individuals rely on emotions to make behavioural decisions, and a positive attitude will lead to higher-quality behaviours.36 Empirical results from other countries support this assumption. A study in the United States demonstrated the importance of internal work motivation in medical behaviour decisions; clinicians with higher internal motivation were more willing to maintain higher quality in their work.17 The findings of studies in developing countries, such as Ghana and Indonesia, also indicated that work motivation can significantly improve the quality of medical services provided by clinicians.19 37 Thus, efforts to stimulate internal work motivation among village clinicians may help to improve their healthcare quality.
 
The results of heterogeneity analysis showed that the positive effect of internal work motivation on healthcare quality varied according to clinician workload and financial incentives. Specifically, internal work motivation had a stronger positive effect on clinical performance among village clinicians who had a lighter workload (fewer patients per month). This is presumably because clinicians with a heavier workload (more patients) are more likely to experience burnout38 and a decreased sense of autonomy,39 40 which could reduce internal motivation and ultimately lead to a decline in work performance.14 39 41 Additionally, compared with village clinicians who did not receive financial incentives, clinicians who received financial incentives experienced a stronger positive effect on healthcare quality because of their internal work motivation. Studies of clinicians, combined with the results of theoretical analyses in other fields (ie, motivational synergy theory and self-decision theory), suggest that the provision of financial incentives encourages a belief of greater competence among clinicians; this belief, in conjunction with internal work motivation, enables clinicians to maintain high quality in their work.13 14 41 42 Previous empirical studies have also demonstrated that performance-related financial incentives can improve internal work motivation among employees, leading to improvements in performance.43
 
The results of these heterogeneity analyses support efforts to enhance the positive effect of internal work motivation on healthcare quality by providing appropriate incentives for clinicians. Consistent with this perspective, the Chinese Government has been implementing incentive programmes during the past decade to improve healthcare quality among primary care clinicians in rural China.10 11 For example, the government is actively restructuring the salary and performance system, while asserting that healthcare systems at all levels should engage in combined efforts to provide additional financial incentives.44
 
To further promote internal work motivation among clinicians and improve their work performance, we recommend the revision of governmental incentives policies, based on existing policies. Specifically, medical institutions at all levels should establish performance accountability45; and emphasis should be placed on including physician performance in assessments to incentivise high-quality healthcare. Furthermore, medical institutions at all levels should provide additional financial incentives to clinicians based on assessments of patient experiences. These programmes could strengthen the positive effect of internal motivation on work performance among clinicians and improve their healthcare quality. Additionally, the workload of primary clinicians should be carefully managed to preserve the positive effect of their intrinsic motivation on job performance.
 
This study had a few limitations. First, it was a cross-sectional study, and the results represent correlations rather than causal relationships. Second, because we randomly selected samples from Sichuan, Shaanxi, and Anhui provinces, our results may not be fully representative of village clinicians and village clinics throughout rural China. Third, the reported level of internal work motivation may have been overestimated because this variable was self-reported by village clinicians.
 
Conclusion
Overall, healthcare quality was poor among village clinicians in rural China. Furthermore, there were positive correlations between internal work motivation and healthcare quality among rural village clinicians; these positive correlations were stronger among clinicians with financial incentives and lighter workload. Our findings suggest that the Chinese Government should implement policies to provide financial incentives for clinicians, with the goal of enhancing internal work motivation among village clinicians and improving their healthcare quality.
 
Author contributions
Concept or design: Q Gao, Y Shi, L Peng.
Acquisition of data: L Peng, S Song, Y Zhang.
Analysis or interpretation of data: L Peng, Q Gao, S Song.
Drafting of the manuscript: L Peng, Q Gao, Y Shi.
Critical revision of the manuscript for important intellectual content: All authors.
 
Conflicts of interest
As an International Editorial Advisory Board member of the journal, Y Shi was not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Acknowledgement
All authors thank the standardised patients and investigators for their contribution and hard work.
 
Funding/support
This work received funding from 111 Project (Grant No.: B16031), National Natural Science Foundation of China (Grant No.: 72203134) and Innovation Capability Support Program of Shaanxi, China (Grant No.: 2022KRM007).
 
Ethics approval
Ethical approval was obtained from the Institutional Review Board of Sichuan University, China (Protocol No.: K2015025). The board approved the verbal consent procedure. Participants in this study were informed of the survey procedure and consented to publication.
 
References
1. Babiarz KS, Miller G, Yi H, Zhang L, Rozelle S. China’s new cooperative medical scheme improved finances of township health centers but not the number of patients served. Health Aff (Millwood) 2012;31:1065-74. Crossref
2. Shi Y, Xue H, Wang H, Sylvia S, Medina A, Rozelle S. Measuring the quality of doctors’ health care in rural China: an empirical research using standardized patients [in Chinese]. Stud Labour Econ 2016;4:48-71.
3. Guo W, Sylvia S, Umble K, Chen Y, Zhang X, Yi H. The competence of village clinicians in the diagnosis and treatment of heart disease in rural China: a nationally representative assessment. Lancet Reg Health West Pac 2020;2:100026. Crossref
4. Tan J, Yao Y, Wang Q. Comparative analysis of the development and service utilisation of health resources in Guangxi township hospitals and the whole country [in Chinese]. Soft Sci Health 2019;33:46-50.
5. Sylvia S, Shi Y, Xue H, et al. Survey using incognito standardized patients shows poor quality care in China’s rural clinics. Health Policy Plan 2015;30:322-33. Crossref
6. Li X, Lu J, Hu S, et al. The primary health-care system in China. Lancet 2017;390:2584-94. Crossref
7. Yin X, Song F, Gong Y, et al. A systematic review of antibiotic utilization in China. J Antimicrob Chemother 2013;68:2445-52. Crossref
8. Virtual Health Library. Using indicators to measure country pharmaceutical situations: fact book on WHO level I and level II monitoring indicators. 2006. Available from: https://pesquisa.bvsalud.org/portal/resource/pt/mis-19254. Accessed 30 Sep 2021.
9. Communist Party of China Central Committee, State Council, People’s Republic of China. “Healthy China 2030” Programme [in Chinese]. 2016. Available from: http://www.gov.cn/zhengce/2016-10/25/content_5124174.htm. Accessed 30 Sep 2021.
10. State Council, People’s Republic of China. “Thirteenth Five-Year” Sanitation and Health Plan [in Chinese]. 2017. Available from: http://www.gov.cn/zhengce/content/2017-01/10/content_5158488.htm. Accessed 30 Sep 2021.
11. National Health and Family Planning Commission, National Development and Reform Commission, Ministry of Education of the People’s Republic of China, Ministry of Finance of the People’s Republic of China, National Administration of Traditional Chinese Medicine. National Rural Doctor Education Plan (2011-2020) [in Chinese]. 2013. Available from: http://www.gov.cn/gzdt/2013-10/30/content_2518099.htm. Accessed 20 Jan 2023.
12. State Council, People’s Republic of China. The 15th Meeting of the Standing Committee of the 13th National People’s Congress. Basic Medical Hygiene and Health Promotion Law of the People’s Republic of China [in Chinese]. 2019. Available from: http://www.gov.cn/xinwen/2019-12/29/content_5464861.htm. Accessed 5 Oct 2021.
13. Yuan B, Meng Q. Behaviour determinants of rural health workers: based on work motivation theory analysis [in Chinese]. Chin Health Econ 2012;31:50-2.
14. Kao AC. Driven to care: aligning external motivators with intrinsic motivation. Health Serv Res 2015;50 Suppl 2:2216-22. Crossref
15. Yuan B, Meng Q, Hou Z, Sun X, Song K. Analysis on incentive mechanism and motivation of rural health providers [in Chinese]. Chin J Health Policy 2010;3:3-9.
16. Yuan S, Meng Q, Sun X. Studying on the quality of medical services in township health centres in view of structural quality [in Chinese]. Chin Health Serv Manage 2012;29:841-4.
17. Green EP. Payment systems in the healthcare industry: an experimental study of physician incentives. J Econ Behav Organ 2014;106:367-78. Crossref
18. Mangkunegara AP, Agustine R. Effect of training, motivation and work environment on physicians’ performance. Acad J Interdiscip Stud 2016;5:173-88. Crossref
19. Al Aluf W, Sudarsih S, Musemedi DP, Supriyadi S. Assessing the impact of motivation, job satisfaction, and work environment on the employee performance in healthcare services. Int J Sci Technol Res 2017;6:337-41.
20. Hou X, Lu F. Effects of work values of millennial employees, intrinsic motivation on job performance: the moderating effect of organizational culture. Manage Rev 2018;30:157-68.
21. Li W, Mei J. Intrinsic motivation and employee performance: based on the effects of work engagement as intermediary [in Chinese]. Manage Rev 2013;25:160-7.
22. Liao J, Jing Z, Liu W, Wang X. Promotion opportunity stagnation and job performance: the mediating—role of intrinsic motivation and perceived insider status. Ind Eng Manage 2015;20:15-21.
23. Das J, Holla A, Das V, Mohanan M, Tabak D, Chan B. In urban and rural India, a standardized patient study showed low levels of provider training and huge quality gaps. Health Aff (Millwood) 2012;31:2774-84. Crossref
24. Rethans JJ, Gorter S, Bokken L, Morrison L. Unannounced standardised patients in real practice: a systematic literature review. Med Educ 2007;41:537-49. Crossref
25. Amabile TM, Mueller J. Studying creativity, its processes, and its antecedents: an exploration of the componential theory of creativity. In: Zhou J, Shalley CE, editors. Handbook of Organizational Creativity. New York: Lawrence Erlbaum Associates; 2008: 33-64.
26. Lee KI, Koval JJ. Determination of the best significance level in forward step-wise logistic regression. Commun Stat Simul Comput 1997;2:559-75. Crossref
27. Maneejuk P, Yamaka W. Significance test for linear regression: how to test without P-values? J Appl Stat 2020;31:827-45. Crossref
28. Feise R. Do multiple outcome measures require p-value adjustment? BMC Med Res Methodol 2002;17:1471-2288-2-8. Crossref
29. Feng S. Analysis of the quality of primary care and the influencing factors in rural Guangdong Province—based on demander perspective [in Chinese]. Chin Health Serv Manage 2016;33:824-7.
30. Daniels B, Dolinger A, Bedoya G, et al. Use of standardised patients to assess quality of healthcare in Nairobi, Kenya: a pilot, cross-sectional study with international comparisons. BMJ Glob Health 2017;2:e000333.
31. Settineri S, Merlo EM, Frisone F, et al. The experience of health and suffering in the medical profession. Mediterr J Clin Psychol 2018;6:1-14.
32. Toode K, Routasalo P, Helminen M, Suominen T. Hospital nurses’ individual priorities, internal psychological states and work motivation. Int Nurs Rev 2014;61:361-70. Crossref
33. Tsounis A, Sarafis P, Bamidis PD. Motivation among physicians in Greek public health-care sector. Br J Med Med Res 2014;4:1094-105. Crossref
34. Ferraro T, dos Santos NR, Moreira JM, Pais L. Decent work, work motivation, work engagement and burnout in physicians. Int J Appl Posit Psychol 2020;5:13-35. Crossref
35. Lochner L, Wieser H, Mischo-Kelling M. A qualitative study of the intrinsic motivation of physicians and other health professionals to teach. Int J Med Educ 2012;3:209-15. Crossref
36. Slovic P, Finucane ML, Peters E, MacGregor DG. The affect heuristic. Eur J Oper Res 2007;177:1333-52. Crossref
37. Alhassan RK, Spieker N, van Ostenberg P, Ogink A, Nketiah-Amponsah E, de Wit TF. Association between health worker motivation and healthcare quality efforts in Ghana. Hum Resour Health 2013;11:37. Crossref
38. Ward ZD, Morgan ZJ, Peterson LE. Family physician burnout does not differ with rurality. J Rural Health 2021;37:755-61. Crossref
39. Wang Y, Hu XJ, Wang HH, et al. Follow-up care delivery in community-based hypertension and type 2 diabetes management: a multi-centre, survey study among rural primary care physicians in China. BMC Fam Pract 2021;22:224. Crossref
40. Shirom A, Nirel N, Vinokur AD. Work hours and caseload as predictors of physician burnout: the mediating effects by perceived workload and by autonomy. Appl Psychol 2010;59:539-65. Crossref
41. Deci EL, Ryan RM. Intrinsic Motivation and Self-Determination in Human Behavior. New York: Springer; 1985. Crossref
42. Amabile TM. Motivational synergy: toward new conceptualizations of intrinsic and extrinsic motivation in the workplace. Hum Resour Manage Rev 1993;3:185-201. Crossref
43. Eisenberger R, Aselage J. Incremental effects of reward on experienced performance pressure: positive outcomes for intrinsic interest and creativity. J Organiz Behav 2009;30:95-117. Crossref
44. Qin J, Li S, Lin C. Reform progress and development strategy of incentive mechanism for training and use of general practitioners in China [in Chinese]. Chin Gen Pract 2020;23:2351-8.
45. Li X, Krumholz HM, Yip W, et al. Quality of primary health care in China: challenges and recommendations. Lancet 2020;395:1802-12. Crossref

Cost-minimisation analysis of intravenous versus subcutaneous trastuzumab regimen for breast cancer management in Hong Kong

Hong Kong Med J 2023 Feb;29(1):16-21 | Epub 3 Feb 2023
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE  CME
Cost-minimisation analysis of intravenous versus subcutaneous trastuzumab regimen for breast cancer management in Hong Kong
Vivian WY Lee, PharmD1; Franco WT Cheng, MClinPharm2
1 Centre for Learning Enhancement And Research, The Chinese University of Hong Kong, Hong Kong
2 Department of Pharmacology and Pharmacy, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong
 
Corresponding author: Prof Vivian WY Lee (vivianlee@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: In 2017, breast cancer was the most common cancer and third leading cause of cancer death among women in Hong Kong. Approximately 20% of patients were human epidermal growth factor receptor-2 (HER2)-positive. This study was conducted to investigate cost differences between intravenous and subcutaneous trastuzumab regimens in Hong Kong using medical resources utilisation data from other countries.
 
Methods: A cost-minimisation model was developed to compare the cost of total care, including direct medical cost and full-time equivalent (FTE) hours. The drug acquisition cost was obtained from the manufacturer, whereas the costs for hospitalisation and clinic visits were acquired from the Hong Kong Gazette. Time (in FTE hours) was determined by literature review. All costs were expressed in US dollars (US$1 = HK$7.8). Costs were not discounted because of the short time horizon. One-way deterministic sensitivity analysis was performed to identify the effects of changes in drug acquisition cost, changes in FTE hours (based on confidence intervals reported), and changes in body weight (±20%).
 
Results: Literature review indicated that 0.18 FTE hour of nursing time (7.9 hours) and 0.14 FTE hour of pharmacist time (6.2 hours) could be saved each week if the subcutaneous formulation was used. Using data in 2017, after 18 cycles of treatment with subcutaneous trastuzumab, the drug acquisition and healthcare professional time costs were reduced by US$9451.28 and US$566.16, respectively, yielding an annual savings of over US$8 million.
 
Conclusion: The subcutaneous formulation of trastuzumab is a potential cost-saving therapy for HER2-positive breast cancer patients in Hong Kong. The drug acquisition cost was the parameter with the greatest effect on the total cost of treatment.
 
 
New knowledge added by this study
  • The results of this study suggest that the subcutaneous formulation of trastuzumab would be a cost-saving therapy for HER2-positive breast cancer patients in Hong Kong.
  • The drug acquisition cost was the parameter with the greatest effect on the total cost of treatment.
Implications for clinical practice or policy
  • The high drug acquisition cost of trastuzumab may prevent patients from receiving effective treatment.
  • The subcutaneous formulation of trastuzumab is expected to remain more cost-effective, despite the potential emergence of biosimilar trastuzumab.
 
 
Introduction
In 2017, breast cancer was the most common cancer and third leading cause of cancer death among women in Hong Kong.1 Additionally, an estimated 20% of breast cancers in Hong Kong were human epidermal growth factor receptor-2 (HER2)-positive.2 3
 
Intravenous (IV) trastuzumab, in combination with chemotherapy, is licensed for the treatment of HER2-positive early-stage breast cancer and metastatic breast cancer. It must be reconstituted into solution for loading dose infusion over a duration of 90 minutes, followed by maintenance dose infusion over a duration of 30 minutes.4 Additionally, IV trastuzumab is dosed according to each patient’s body weight, with a loading dose of 8 mg/kg followed by a maintenance dose of 6 mg/kg every 3 weeks.4 This regimen consumes considerable healthcare resources, including drug preparation and administration time, clinic and chair time, and physician time dedicated to patient interaction.5
 
A fixed-dose subcutaneous (SC) formulation of trastuzumab was developed to allow drug administration over approximately 5 minutes, which is much shorter than the duration of IV infusion. The 600-mg dose of SC trastuzumab every 3 weeks is non-dinferior to the IV formulation with respect to efficacy and tolerability.6 7 Furthermore, approximately 90% of patients preferred SC over IV administration of trastuzumab in the PrefHer (Preference for subcutaneous or intravenous administration of trastuzumab in patients with HER2-positive early breast cancer) randomised crossover trials,8 9 which were designed to assess patient preference and healthcare professional satisfaction with both treatment options.
 
Data from other countries have demonstrated that for SC formulation of trastuzumab, less time is required for drug preparation and administration; moreover, fewer consumables are used.10 11 12 13 A cost-minimisation analysis (CMA) study in Greece demonstrated that the total cost of therapy per patient was 21 870 euros (€) when using the SC formulation of trastuzumab, whereas it was €23 118 when using the IV formation of trastuzumab. The investigators concluded that use of the SC formulation of trastuzumab would provide cost savings for the Greek healthcare system.10 A study in Spain revealed similar findings: the use of the SC formulation of trastuzumab led to a 19.4 to 28.8% cost savings in the hospital.11 Additionally, a time-and-motion study in New Zealand compared medical resource utilisation between the IV and SC formulations of trastuzumab in patients with HER2-positive breast cancer. The potential cost saving was NZ$96.94 per patient per cycle.12 Furthermore, a time-and-motion sub-study13 from the PrefHer trials involving eight countries (Canada, France, Switzerland, Denmark, Italy, Russia, Spain, and Turkey) demonstrated time savings for patient chair, administration by healthcare professionals, and drug preparation.
 
The SC formulation of trastuzumab is expected to provide cost savings in other countries. However, healthcare systems and modes of clinical services differ between Hong Kong and other countries. Therefore, this study was conducted to investigate cost differences between IV and SC trastuzumab regimens in Hong Kong medical settings, using medical resources utilisation data from other countries.
 
Methods
Cost methods and data sources
A CMA model was developed to compare the cost of total care. The CMA approach was used because the clinical efficacy and safety profiles of IV and SC trastuzumab regimens are similar, as demonstrated in the previous studies7 14 15; this fulfils the CMA requirement for two treatments to demonstrate similar efficacy. The following steps were followed in the CMA. We compared direct medical costs related to the IV and SC trastuzumab regimens that produced equivalent health outcomes. The CMA solely focuses on selection of the least costly option. In this study, the CMA was conducted from a hospital perspective. All direct medical costs and full-time equivalent (FTE) hours were included in this study. Drugs, clinic visits for drug administration, specialist out-patient clinic visits, and consumables were regarded as direct medical costs. The time horizon was 18 cycles of treatment, which mimics the duration of treatment for early-stage HER2-positive breast cancer. Drug acquisition cost data were obtained from the manufacturer, whereas costs for hospitalisation and clinic visits were acquired from the 2017 Hong Kong Gazette.16 The drug acquisition cost was based on the dose used in previous clinical trials: IV loading dose of 8 mg/kg and maintenance dose of 6 mg/kg every 3 weeks versus SC fixed dose of 600 mg every 3 weeks. A mean body weight of 57.3 kg was used, based on data from the 2016 Hong Kong Cancer Registry.3
 
Estimated FTE hour values were obtained from previous literature. These values were regarded as the time (in hours) required for drug preparation and administration, divided by 44 hours, the weekly average working hours for such tasks. The FTE hour values were then converted to monetary values, calculated as the median hourly rate received by individuals in each position. In Hong Kong, nurses and pharmacists are mainly involved in drug preparation and administration; thus, the salaries of these positions were used for estimation of FTE hour values.
 
All costs were expressed in US dollars (US$1 = HK$7.8), using 2016 as the fiscal year. Because of the short time horizon in the study, no costs were discounted.
 
Literature review
Medical resources and FTE hour values were determined by literature review in Embase and MEDLINE, using the key words ‘subcutaneous’, ‘trastuzumab’, ‘time’, ‘cost’, and ‘medical resources’.
 
Statistical analyses
The CMA was conducted from the healthcare payer perspective. All continuous variables were described as means ± standard deviations and medians with ranges.
 
A drug budget impact forecast analysis was performed to determine how changes in the total cost of treatment regimens, including direct medical costs and FTE hours, would impact healthcare expenditures in Hong Kong. Each individual parameter, namely drug acquisition cost for each formulation (±20%), patient body weight (±20%), and time and consumables reported in the literature (based on confidence intervals reported) were analysed independently within specified ranges, whereas other factors were fixed at base-case values. The analysis parameters were chosen based on the findings in previous cost-effectiveness studies.17 A simulation model was used to run 10 000 iterations of the forecast model; for each iteration, model parameters were input as shown in Table 1. We assumed that cost changes were consistent with the beta distribution around the mean. One-way deterministic sensitivity analysis was also performed to evaluate the extent to which the total cost would be affected by changes in the drug acquisition cost for each formulation (±20%), changes in times and consumables obtained from literature (based on confidence intervals reported), and changes in body weight (±20%); this approach is consistent with the methodology used in another cost-effectiveness analysis focused on trastuzumab.17 Figure 1 summarises the analysis process of this study.
 

Table 1. Parameters and costs of the drug budget impact forecast model
 

Figure 1. Flowchart of the analysis process
 
Results
In total, 11 studies were identified, eight of which were eligible for analysis.12 18 19 20 21 22 23 24 Three studies were excluded because they did not report the time required for administration or preparation. There are a total of six studies with information on pharmacist time on preparation and nursing time on administration for IV and SC trastuzumab; the remaining two only reported time differences between the two formulations. Among the six studies that reported the time for preparation, four reported the total drug preparation time required for IV and SC trastuzumab, whereas the remaining two only reported time differences. If the SC formulation was used, 0.18 FTE hour of nursing time (7.9 hours) and 0.14 FTE hour of pharmacist time (6.2 hours) could be saved each week. Table 2 summarises the findings from these studies.
 

Table 2. Summary of findings on healthcare professional time
 
After 18 cycles of treatment with SC trastuzumab, the drug acquisition and healthcare professional time costs were reduced by US$9451.28 and US$566.16, respectively, compared with IV trastuzumab. Therefore, US$10 017.44 could be saved for each patient who completed 18 cycles of treatment. The cost of consumables was excluded because only two studies reported this information, and the contributions to overall costs were minimal (NZ$15.2712 and GBP0.6421, respectively). Table 3 summarises the direct medical costs of IV and SC formulations.
 

Table 3. Total cost of care for 18 cycles of treatment with intravenous trastuzumab versus subcutaneous trastuzumab
 
Sensitivity analysis
The drug budget impact forecast model was most affected by body weight and drug acquisition cost. Cost differences between the IV and SC formulations were reduced by decreases in body weight and IV trastuzumab cost, as well as an increase in SC trastuzumab cost. The effects of changes in nursing time and pharmacist time were smaller. Table 1 summarises the model parameters, and Figure 2 illustrates the effects of each variable on cost differences.
 

Figure 2. Tornado diagram of factors affecting total cost of treatment
 
Drug budget impact forecast
In 2017, 4373 women were diagnosed with invasive breast cancer,1 and approximately 20% of them were HER2-positive.2 3 Furthermore, trastuzumab was the most commonly used targeted therapy (95.3%).3 Assuming that the SC formulation was used (instead of the IV formulation) for all HER2-positive patients receiving trastuzumab and using the 2017 data stated here, an annual saving of over US$8.3 million could be achieved in Hong Kong.
 
Discussion
The results of this study suggest that SC trastuzumab would be more cost-effective than its IV counterpart in Hong Kong. Even if lower-cost biosimilar trastuzumab becomes available, the SC formulation will remain less expensive unless there is a substantial reduction in the acquisition cost of IV trastuzumab.
 
As body weight decreases, the necessary dosage and corresponding expenditures are expected to decrease. Paradoxically, ≤20% increases in body weight had a neutral effect in the analysis. This result could be related to a substantial amount of drug wastage when using weight-based IV trastuzumab, which is consistent with previous findings.19 25 Therefore, further studies are needed to determine the optimal route of administration for patients who are underweight or do not require full doses of trastuzumab because of their clinical conditions.
 
Although the SC formulation is expected to save time for healthcare professionals,26 27 28 the present analysis suggests that its contribution to the total cost of care is minimal. The cost of drug acquisition has the greatest effect on financial burden.
 
The use of data from previous time-and-motion studies in other countries may not be appropriate for medical settings in Hong Kong. Further studies should be conducted in Hong Kong to estimate the actual cost savings with respect to healthcare professional time, although theoretical time savings may not accurately represent actual time savings because of clinical activities conducted during administration of trastuzumab.29 Furthermore, data from other countries exhibited wide distributions in terms of standard deviation and range. Nevertheless, the influence of the SC formulation on the total cost-saving effect may be limited, as demonstrated in the sensitivity analysis.
 
Although the costs of clinic visits and chemotherapy were assumed to be identical throughout 18 cycles of treatment between the two formulations, some patients can receive SC trastuzumab in ambulatory care settings. Thus, the mean savings may have been underestimated in our model.
 
There were several limitations in this study. First, because of the small number of studies identified in the literature review, consumables could not be included in the CMA. Second, societal cost and patient preferences were not considered because such information is unavailable in Hong Kong. A more patient-centred approach would provide greater insights. Third, time-and-motion analysis and waste handling in Hong Kong were not considered; these factors may have specific impact on drug preparation time and administration time and costs. Fourth, costs for adverse drug reactions were not included because these costs were assumed to be equal for IV and SC trastuzumab regimens. However, this assumption may be incorrect, particularly with regard to infusion-related reactions.
 
Conclusion
The results of this study suggest that the SC formulation of trastuzumab would be a cost-saving therapy for HER2-positive breast cancer patients in Hong Kong. The drug acquisition cost was the parameter with the greatest effect on the total cost of treatment.
 
Author contributions
Concept or design: VWY Lee.
Acquisition of data: FWT Cheng.
Analysis or interpretation of data: Both authors.
Drafting of the manuscript: FWT Cheng.
Critical revision of the manuscript for important intellectual content: VWY Lee.
 
Both authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
Both authors disclosed no conflicts of interest.
 
Declaration
The datasets generated and/or analysed in this study are available from the corresponding author on reasonable request.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
Not applicable because this study did not involve human participants.
 
References
1. Hong Kong Cancer Registry, Hospital Authority, Hong Kong SAR Government. Female breast cancer in 2017. Available from: https://www3.ha.org.hk/cancereg/pdf/factsheet/2017/breast_2017.pdf. Accessed 11 May 2020.
2. Yau TK, Sze H, Soong IS, Hioe F, Khoo US, Lee AW. HER2 overexpression of breast cancers in Hong Kong: prevalence and concordance between immunohistochemistry and in-situ hybridisation assays. Hong Kong Med J 2008;14:130-5.
3. Hong Kong Breast Cancer Foundation. Hong Kong Breast Cancer Registry Report No. 8. 2016. Available from: http://www.hkbcf.org/download/bcr_report8/hkbcf_report_2016_full_report.pdf. Accessed 21 May 2017.
4. Genentech. Highlights of prescribing information. 2016. Available from: http://www.gene.com/download/pdf/herceptin_prescribing.pdf. Accessed 8 Aug 2016.
5. Kruse GB, Amonkar MM, Smith G, Skonieczny DC, Stavrakas S. Analysis of costs associated with administration of intravenous single-drug therapies in metastatic breast cancer in a U.S. population. J Manag Care Pharm 2008;14:844-57. Crossref
6. Ismael G, Hegg R, Muehlbauer S, et al. Subcutaneous versus intravenous administration of (neo)adjuvant trastuzumab in patients with HER2-positive, clinical stage I-III breast cancer (Hannah study): a phase 3, open-label, multicentre, randomised trial. Lancet Oncol 2012;13:869-78. Crossref
7. Jackisch C, Kim SB, Semiglazov V, et al. Subcutaneous versus intravenous formulation of trastuzumab for HER2-positive early breast cancer: updated results from the phase III HannaH study. Ann Oncol 2015;26:320-5. Crossref
8. Pivot X, Gligorov J, Müller V, et al. Preference for subcutaneous or intravenous administration of trastuzumab in patients with HER2-positive early breast cancer (PrefHer): an open-label randomised study. Lancet Oncol 2013;14:962-70. Crossref
9. Pivot X, Gligorov J, Müller V, et al. Patients’ preferences for subcutaneous trastuzumab versus conventional intravenous infusion for the adjuvant treatment of HER2- positive early breast cancer: final analysis of 488 patients in the international, randomized, two-cohort PrefHer study. Ann Oncol 2014;25:1979-87. Crossref
10. Mylonas C, Kourlaba G, Fountzilas G, Skroumpelos A, Maniadakis N. Cost-minimization analysis of trastuzumab intravenous versus trastuzumab subcutaneous for the treatment of patients with HER2+ early breast cancer and metastatic breast cancer in Greece. Value Health 2014;17:A640-1. Crossref
11. Gutierrez F, Nazco G, Viña M, Bullejos M, Gonzalez I, Valcarcel C. Economic impact of using subcutaneous trastuzumab. Value Health 2014;17:A641. Crossref
12. North RT, Harvey VJ, Cox LC, Ryan SN. Medical resource utilization for administration of trastuzumab in a New Zealand oncology outpatient setting: a time and motion study. Clinicoecon Outcomes Res 2015;7:423-30. Crossref
13. De Cock E, Pivot X, Hauser N, et al. A time and motion study of subcutaneous versus intravenous trastuzumab in patients with HER2-positive early breast cancer. Cancer Med 2016;5:389-97. Crossref
14. Van den Nest M, Glechner A, Gold M, Gartlehner G. The comparative efficacy and risk of harms of the intravenous and subcutaneous formulations of trastuzumab in patients with HER2-positive breast cancer: a rapid review. Syst Rev 2019;8:321. Crossref
15. Jackisch C, Stroyakovskiy D, Pivot X, et al. Subcutaneous vs intravenous trastuzumab for patients with ERBB2-positive early breast cancer: final analysis of the HannaH phase 3 randomized clinical trial. JAMA Oncol 2019;5:e190339. Crossref
16. Government Logistics Department, Hong Kong SAR Government. Hospital Authority Ordinance (Chapter 113). Revisions to list of charges. Available from: https://www.gld.gov.hk/egazette/pdf/20172124/egn201721243884.pdf. Accessed 30 May 2017.
17. Kurian AW, Thompson RN, Gaw AF, Arai S, Ortiz R, Garber AM. A cost-effectiveness analysis of adjuvant trastuzumab regimens in early HER2/neu-positive breast cancer. J Clin Oncol 2007;25:634-41. Crossref
18. Olofsson S, Norrlid H, Karlsson E, Wilking U, Ragnarson Tennvall G. Societal cost of subcutaneous and intravenous trastuzumab for HER2-positive breast cancer—an observational study prospectively recording resource utilization in a Swedish healthcare setting. Breast 2016;29:140-6. Crossref
19. Ponzetti C, Canciani M, Farina M, Era S, Walzer S. Potential resource and cost saving analysis of subcutaneous versus intravenous administration for rituximab in non-Hodgkin’s lymphoma and for trastuzumab in breast cancer in 17 Italian hospitals based on a systematic survey. Clinicoecon Outcomes Res 2016;8:227-33. Crossref
20. De Cock E, Pan YI, Tao S, Baidin P. Time savings with trastuzumab subcutaneous (SC) injection verse trastuzumab intravenous (IV) infusion: a time and motion study in 3 Russian centers. Value Health 2014;17:A653. Crossref
21. Nawaz S, Samanta K, Lord S, Diment V, Mcnamara S. Cost savings with Herceptin® (trastuzumab) SC vs IV administration: a time & motion study. Breast 2013;22(S1):S112.
22. De Cock E, Tao S, Alexa U, Pivot X, Knoop A. Abstract P5- 15-07: Time savings with trastuzumab subcutaneous (SC) injection vs. trastuzumab intravenous (IV) infusion: first results from a Time-and-Motion study (T&M). Cancer Res 2014;72(24 Suppl):P5-15-07. Crossref
23. De Cock E, Tao S DM-P, Millar D CN. Time savings with trastuzumab subcutaneous (SC) injection vs. trastuzumab intravenous (IV) infusion: a time and motion study in 5 Canadian centres. Proceedings of the Canadian Association for Population Therapeutics (CAPT) Annual Conference; 2013 Nov 17-19; Toronto, Canada.
24. Samanta K, Moore L, Jones G, Evason J, Owen G. PCN39 potential time and cost savings with herceptin (trastuzumab) subcutaneous (SC) injection versus herceptin intravenous (IV) infusion: results from three different English patient settings. Value Health 2012;15:A415. Crossref
25. Nestorovska A, Naumoska Z, Grozdanova A, et al. Subcutaneous vs intravenous administration of trastuzumab in HER2+ breast cancer patients: a Macedonian cost-minimization analysis. Value Health 2015;18:A463. Crossref
26. Rojas L, Muñiz S, Medina L, et al. Cost-minimization analysis of subcutaneous versus intravenous trastuzumab administration in Chilean patients with HER2-positive early breast cancer. PLoS One 2020;15:e0227961. Crossref
27. O’Brien GL, O’Mahony C, Cooke K, et al. Cost minimization analysis of intravenous or subcutaneous trastuzumab treatment in patients with HER2-positive breast cancer in Ireland. Clin Breast Cancer 2019;19:e440-51. Crossref
28. Lopez-Vivanco G, Salvador J, Diez R, et al. Cost minimization analysis of treatment with intravenous or subcutaneous trastuzumab in patients with HER2-positive breast cancer in Spain. Clin Transl Oncol 2017;19:1454-61. Crossref
29. Papadmitriou K, Trinh XB, Altintas S, Van Dam PA, Huizing MT, Tjalma WA. The socio-economical impact of intravenous (IV) versus subcutaneous (SC) administration of trastuzumab: future prospectives. Facts Views Vis Obgyn 2015;7:176-80.

Correlation between primary family caregiver identity and maternal depression risk in poor rural China

Hong Kong Med J 2022;28(6):457–65 | Epub 7 Dec 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE (HEALTHCARE IN MAINLAND CHINA)
Correlation between primary family caregiver identity and maternal depression risk in poor rural China
N Wang, PhD; M Mu, MSc; Z Liu, MSc; Z Reheman, MSc; J Yang, PhD; W Nie, MSc; Y Shi, PhD; J Nie, PhD
Center for Experimental Economics in Education, Shaanxi Normal University, Xi’an, China
 
Corresponding author: Dr J Yang (jyang0716@163.com)
 
 Full paper in PDF
 
Abstract
Introduction: Prenatal and postpartum depression are important public health challenges because of their long-term adverse impacts on maternal and neonatal health. This study investigated the risk of maternal depression among pregnant and postpartum women in poor rural China, along with the correlation between primary family caregiver identity and maternal depression risk.
 
Methods: Pregnant women and new mothers were randomly selected from poor rural villages in the Qinba Mountains area in Shaanxi. Basic demographic information was collected regarding the women and their primary family caregivers. The Edinburgh Postnatal Depression Scale was used to identify women at risk of depression, and the Perceived Social Support Scale was used to evaluate perceived family support.
 
Results: This study included 220 pregnant women and 473 new mothers. The mean proportions of women at risk of prenatal and postpartum depression were 19.5% and 18.6%, respectively. Regression analysis showed that identification of the baby’s grandmother as the primary family caregiver was negatively correlated with maternal depression risk (β=-0.979, 95% confidence interval [CI]=-1.946 to -0.012, P=0.047). However, the husband’s involvement in that role was not significantly correlated with maternal depression risk (β=-0.499, 95% CI=-1.579 to 0.581, P=0.363). Identification of the baby’s grandmother as the primary family caregiver was positively correlated with family support score (β=0.967, 95% CI=-0.062 to 1.996, P=0.065).
 
Conclusion: Prenatal and postpartum depression are prevalent in poor rural China. The involvement of the baby’s grandmother as the primary family caregiver may reduce maternal depression risk, but the husband’s involvement in that role has no effect.
 
 
New knowledge added by this study
  • Prenatal and postpartum depression are prevalent in poor rural areas of China. Despite evidence regarding the importance of family support during prenatal and postpartum periods, husbands in poor rural China did not provide effective support.
  • There was a persistent risk of maternal depression during both prenatal and postpartum periods.
  • Maternal depression persists in the absence of external interventions.
Implications for clinical practice or policy
  • High-quality family support is necessary to ensure that pregnant women maintain good mental health. Compared with husbands, grandmothers may be better primary caregivers because they are experienced in terms of parenting and housework.
  • Husbands in poor rural China should receive training that enables them to provide effective maternal care.
 
 
Introduction
Maternal depression is a common mental health problem during the prenatal and postpartum periods. The World Health Organization estimates that approximately 10% of pregnant women and 13% of postpartum women worldwide have mental health problems, mainly depression.1 In China, the prevalence of maternal depression ranges from 8.2% to 28.5%.2 3 4 5 6 7 Women in urban areas have access to specialised maternity care services and mental health services that can help manage these mental health problems and difficulties.8 However, these commercialised services are usually expensive and distant from poor rural areas of China. Therefore, it is particularly important for pregnant and postpartum women in poor rural areas to rely on family and social relationships for reasonable care and support.
 
There is evidence that the level of perceived social support, particularly family support, is associated with a woman’s mental health status during pregnancy.9 10 China’s rapid societal and economic development have resulted in substantial changes to family structure in both urban and rural areas. For example, modern couples are more likely to live with only their children, rather than with family members from multiple generations.11 When grandparents are absent from a family’s daily life, the role of the husband becomes more important because he must be more engaged in housework12 and provide greater support.
 
The changes in primary family caregiver identity during prenatal and postpartum periods reflect this transformation of family structure.13 The results of multiple studies in developed countries and the urban areas of China have suggested that husbands are able to care for their wives and children during pregnancy and after delivery; moreover, a husband’s companionship has a positive impact on the mental health status of his pregnant wife.2 10 14 However, in poor rural areas, no consensus has been reached concerning whether a husband can provide effective family support for his pregnant wife.15 For example, husbands usually have lower awareness of maternity care because of limited education and limited housework experience. However, in a traditional Chinese family with patrilocal features, the husband is the main worker and is responsible for the economic well-being of the family,16 whereas the wife stays at home and cares for the family. This stereotype of traditional household arrangement prevents some men from providing maternal care, regardless of their presence at home. Accordingly, grandmother, the mother of the baby’s mother, becomes a possible caregiver for the mother and baby,17 although this may lead to mother-in-law conflict.18
 
Here, using data from a large-scale survey of pregnant and postpartum women in poor rural areas, we analysed the status of maternal mental health in poor rural areas, with family support as an intermediate variable, to understand the correlation between primary family caregiver identity and maternal depression risk.
 
Methods
Sampling
The data analysed in this study were collected during a survey of maternal and neonatal health and nutrition statuses among residents of poor rural villages in the Qinba Mountains area; the survey was conducted by Shaanxi Normal University from March 2019 to April 2019. The Qinba Mountains area spans six provinces including Gansu, Sichuan, Shaanxi, Chongqing, Henan, and Hubei. Its primary portion is situated in Shaanxi’s southern region. In 2019, the per capita annual disposable income in the Qinba Mountains area was RMB 11 443, similar to that of rural residents in poverty-stricken counties (RMB 11 567).19 In 2018, the mean poverty rate in this area was 3.6%; for comparison, the national mean was 1.7% and the rate in poverty-stricken counties was 4.5%.20 This study included women aged ≥18 years who were either pregnant (≥4 weeks of gestation) or in the postpartum period (0-6 months after delivery).
 
The following multilevel cluster-based random sampling method was used in this study. First, 13 national-level poor counties in two prefectures in the Qinba Mountains area were selected. Then, a list of villages was obtained for each county, and the total numbers of pregnant women and households with babies aged ≤6 months in each village were counted with assistance from local government officials. Considering the financial limitations and overall feasibility of the study, villages with a small sample size (<3) or large sample size (>15) were excluded. Finally, we used Stata 15.0 (Stata Corp, College Station [TX], United States) to analyse the data. The sample size was estimated to achieve, for an average incidence of independent variables of 0.15 in consideration of our pilot study, a sampling standard error (SE) of 0.03 with a 95% confidence interval (CI). The final 131 villages were randomly selected as sample villages, and all households in the sample villages that met the above criteria were considered eligible for the study.
 
Data collection
The data used in this study were collected through face-to-face interviews. To ensure accuracy and consistency during data collection, enumerators were selected from a group of interested university students in Xi’an. The enumerators underwent extensive training, then completed a pilot study with 20 participants prior to formal data collection. Each eligible participant received a consent form with information regarding programme objectives, procedures, potential risks, and benefits, as well as an explanation of privacy protection. Participants provided oral consent for inclusion in the study before engaging in a face-to-face interview with a single enumerator. Each interview only involved the participant, and interruptions from other family members were avoided.
 
Assessments
Basic participant information
A questionnaire was used to collect basic participant information, including their age, education level, and self-rated health status, along with whether the baby had been born and whether it was the firstborn child. The women were also asked whether they had access to any support groups where mothers could seek help and exchange information concerning parenting experiences. Furthermore, they were asked nine yes/no questions regarding family assets (eg, possession of a computer, an air conditioner, and a car). The above questions were also included in our questionnaire to better understand maternal social interactions and household assets in order to control for them in the regression analysis and thus produce more accurate regression results. Each participant’s decision-making power was measured using a scale of eight items compiled by Peterman et al.21 A higher score on the decision-making power scale was presumed to indicate greater autonomy concerning childcare and the management of other family issues.
 
Primary family caregivers
A questionnaire was used to collect information about all family members living in the participant’s home for >3 months, who were more likely to be the primary caregivers and to have an impact on maternity. Each participant was asked to identify the family member who served as the primary family caregiver, providing the most care for the participant and her baby during the prenatal and postpartum periods. Considering the sample size and sample distribution, three primary family caregiver categories were used in this study: the husband, the baby’s grandmother (the mother of the baby's mother or the baby's father), and other family members or no caregivers.
 
Edinburgh Postnatal Depression Scale
The Edinburgh Postnatal Depression Scale (EPDS) is a 10-item scale used to identify women at risk of maternal depression.22 23 The total EPDS score ranges from 0 to 30, where a higher score indicates a greater risk of depression. Although the original cut-off value was an EPDS score of ≥13 points, we used the standard cut-off value in China (≥9.5 points24 25) as an indicator of sufficient depression risk to merit psychiatric examination and possible treatment. Previous research has demonstrated that the EPDS has satisfactory reliability and validity. Specifically, Wang et al26 reported that the EPDS had a content validity ratio of 0.93 and good internal consistency (Cronbach’s α coefficient of 0.76). The correlation coefficients between the 10 individual item scores and the total score ranged from 0.37 to 0.67, with P values <0.01.
 
Perceived Social Support Scale
The Perceived Social Support Scale, developed by Zimet et al27 and translated into Chinese by Jiang,28 is a 12-item self-assessment questionnaire that measures three sources of social support (ie, three subscales): family support, friends’ support, and other people’s support. Responses to questionnaire items are recorded using a seven-point Likert scale that ranges from ‘completely negative’ to ‘completely positive’ (1-7 points), indicating the respondent’s level of agreement with each item. The total score is 84 points (28 points per subscale), and a higher score indicates the receipt of greater social support. The Cronbach’s α coefficient of the scale is 0.88; the Cronbach’s α coefficients for family support, friends’ support, and other people’s support subscales are 0.81, 0.85, and 0.91, respectively.27 Because this study focused on family support, only the family support subscale was used as an intermediate variable to analyse the correlation between primary family caregiver identity and maternal depression risk.
 
Statistical methods
STATA 15.1 software was used to clean the data and perform statistical analysis. Descriptive statistical analysis was performed and presented as mean ± standard deviation. F-test and t test were used to detect differences in depression scores among subgroups of women with different characteristics. Multiple linear regression was used to explore correlations between primary family caregiver identity and maternal depression risk or family support score. P values <0.05 were considered statistically significant. Additionally, we adjusted the SE at the village level and calculated coefficients with greater precision because individual values within the same village are correlated, which might result in biased SE in multiple linear regression.
 
Results
In total, 715 women were interviewed, including 220 pregnant women and 495 new mothers. Twenty-two samples with missing values were excluded to ensure sample uniformity throughout the analysis procedure. Finally, analyses in this study were based on the data of 693 participants (220 pregnant women and 473 new mothers) and the questionnaire return efficiency was 96.9%, which is the percentage of survey responses that were valid.
 
Maternal depression risk in poor rural areas
Among the 220 pregnant women, 37 (16.8%), 66 (30.0%), and 117 (53.2%) were in the early, middle, and late stages of pregnancy, respectively (Table 1). In total, 226 of the 473 new mothers (47.8%) had babies aged 1 to 3 months, whereas 247 new mothers (52.2%) had babies aged 4 to 6 months.
 

Table 1. Maternal depression risk in poor rural areas
 
The mean maternal EPDS score was 5.85 and the proportion of women at risk of depression was 18.9% (131/693). The proportion of women at risk of depression was generally stable regardless of pregnancy stage. Specifically, the proportion of women at risk of depression during early pregnancy was 16.2% (6/37); during middle and late pregnancy, the proportions of women at risk were slightly increased. The proportions of women at risk of depression were 16.8% (38/226) and 20.2% (50/247) at 1-3 months and 4-6 months after delivery, respectively. However, the maternal EPDS scores and proportions of women at risk of depression did not significantly differ according to pregnancy stage or time since delivery.
 
Univariate analysis of maternal depression risk
Overall, the mean participant age was 28.13 ± 4.70 years. In total, 239 women (34.5%; mean age, 25.52 ± 3.95 years) reported that the current pregnancy or ≤6-month-old baby was their firstborn child. The remaining 454 women (65.5%; mean age, 29.50 ± 4.49 years) were experienced mothers who have already had children and are familiar with caring for them. Overall, 116 women (16.7%) had an education level above junior high school. The self-rated health status was good in 89 women (12.8%), and 102 women (14.7%) were involved in a parenting support group. Table 2 summarises the participant characteristics.
 

Table 2. Univariate analysis of maternal depression risk (n=693)
 
As shown in Table 2, the participants were clustered into three groups according to primary caregiver identity: the husband for 151 women (21.8%), the baby’s grandmother for 452 women (65.2%), and other family members or no caregiver for 90 women (13.0%). The mean EPDS scores of women in the three groups were 6.23 ± 4.34, 5.56 ± 4.01, and 6.63 ± 4.84, respectively (P=0.039). Additionally, univariate analysis revealed statistically significant differences in depression scores according to education level, self-rated health status, and parenting support group involvement. There were no statistically significant differences in other variables.
 
Correlation between primary family caregiver identity and maternal depression risk
As shown in Table 3, identification of the baby’s grandmother as the primary family caregiver was significantly negatively correlated with EPDS score (β=-0.979, 95% CI=-1.946 to -0.012, P=0.047). However, identification of the husband as the family caregiver was not significantly correlated with EPDS score (β=-0.499, 95% CI=-1.579 to 0.581, P=0.363).
 

Table 3. Multiple linear regression analysis of correlation between primary family caregiver identity and maternal depression risk
 
Correlation between primary family caregiver identity and family support score
As shown in Table 4, after adjustment for other variables, there was no significant correlation between identification of the husband as the primary family caregiver and the family support score (β=0.375, 95% CI=-0.704 to 1.455, P=0.493). However, identification of the baby’s grandmother as the primary family caregiver was significantly positively correlated with family support score (β=0.967, 95% CI=-0.062 to 1.996, P=0.065). Furthermore, identification of the baby’s grandmother as the primary family caregiver had the largest standardised regression coefficient among the three caregiver categories, indicating that pregnant and postpartum women felt the greatest family support when the baby’s grandmother was the primary family caregiver.
 

Table 4. Multiple linear regression analysis of correlation between primary family caregiver identity and family support score
 
Discussion
Maternal depression risk in poor rural areas
In this study, the overall proportion of women at risk of maternal depression was 18.9%, including a mean proportion of 19.5% among pregnant women and a mean proportion of 18.6% among women ≤6 months postpartum. This overall proportion of women at risk of maternal depression is much higher than the proportion in a western urban area of China (12.4%)29 and comparable with the proportions in low- and middle-income countries such as Ethiopia (19.9%)30—both previous studies also used the EPDS to identify women at risk of maternal depression. The high proportion in the present study may be related to the location (poor rural areas): compared with women in urban areas, women in poor rural areas are more likely to have a lower socio-economic status.31 The lack of knowledge regarding mental health and its services in rural areas also makes women in such areas more likely to become depressed if they do not receive timely treatment for mental health problems.32 Therefore, the mental health of rural mothers should receive greater attention from their family members and the relevant health departments.
 
This study also revealed a persistent risk of depression during the prenatal and postpartum periods (Table 1). Notably, the proportion did not substantially decrease by 6 months after delivery. Yue et al33 investigated the mental health of caregivers for babies aged 6 to 36 months in a rural area in western China. Their results showed that the proportion of caregivers at risk of depression was similar to the proportion in the present study. These findings suggest that maternal depression persists in the absence of external intervention. Thus, there is an urgent need for timely external mental health interventions among pregnant women and mothers of young children. The present study also showed that the maternal depression risk in poor rural areas is influenced by factors such as a woman’s education level, self-rated health status, and parenting support group involvement. These results are consistent with the findings by Zhou et al,7 Lancaster et al,10 and Lee et al.18
 
Correlation between primary family caregiver identity and maternal depression risk
Our results showed that identification of the husband as the primary family caregiver was not significantly correlated with maternal depression risk in poor rural areas (Table 3). This finding was considerably different from the results of previous studies in urban areas. Xie et al34 found that insufficient or poor-quality emotional support from the husband was significantly associated with an increased risk of postpartum depression among mothers in Changsha, Hunan Province, China. In contrast, Wan et al2 found that the proportions of women at risk of maternal depression were 1.9- to 2.6-fold higher among women without support from the husband before and after delivery than among women with support from the husband, based on a study of mothers in Beijing, China. The results of these studies suggest that the husband’s involvement as the primary family caregiver can reduce the risk of maternal depression in urban areas, but this effect was not apparent in poor rural areas.
 
We also found that maternal depression risk was significantly lower when the baby’s grandmother was identified as the primary family caregiver (Table 3). Our results are consistent with the findings by Wan et al2 in a study of 342 pregnant women in Beijing, China: during the ‘confinement’ period, care and support from the baby’s grandmother(s) were important for relieving depression. However, Lee et al18 showed that mother-in-law conflict remains prominent in China, which may have negative emotional outcomes for grandmothers and new mothers. Although pregnant and postpartum women in poor rural areas may experience similar conflict, our findings suggest that support from the baby’s grandmother(s) remains predominantly positive.
 
Correlation between family support and maternal depression risk
We attempted to determine why support from the husband did not reduce maternal depression risk in poor rural areas through the analysis of an intermediary variable. Initially, we hypothesised that the positive effect of the husband acting as the primary family caregiver would be offset by the loss of income caused by the husband’s inability to seek work opportunities in other locations. However, data analysis revealed that the husband’s role as the primary caregiver had no impact on the family income and family asset index (online supplementary Table 1). Thus, we explored the effect of family support. Multiple previous studies demonstrated that family support influenced maternal depression risk14; consistent with those findings, our analysis showed that family support was significantly negatively correlated with maternal depression risk (online supplementary Table 2).
 
There may be two main reasons for this negative correlation. First, husbands in poor rural areas have insufficient knowledge and skills related to maternal care.16 Husbands do not have first-hand experience in childbirth and can only acquire it through education. However, compared with men in urban areas, men in poor rural areas have lower levels of education and may be less inclined to learn on their own, making it more difficult to acquire such knowledge and skills.35 In contrast, grandmothers are more experienced overall, which may enable them to provide more effective family support. For example, based on their own experience, grandmothers can help new mothers to prepare for and manage pain that sometimes occurs during breastfeeding, which can alleviate anxiety and provide a feeling of greater support.17 Second, in poor rural areas, husbands may lack sufficient time and energy to provide effective family care. Compared with families in urban areas, families in poor rural areas are more economically disadvantaged33; therefore, husbands in such families may prioritise financial stability and be unable to expend time or energy in support of maternal care, despite their physical presence in the home. In contrast, the baby’s grandmother(s) may have sufficient time and energy to provide effective maternal care (eg, by feeding the baby and changing its diapers), thus relieving the mother’s psychological stress.
 
The findings in this analysis of women in poor rural areas differ from the results of studies in urban areas, indicating important differences in family structure between urban and rural areas. There is evidence that a gradual transformation of the family is underway in urban areas, whereby husbands have begun to actively engage in caregiving. However, the transformation of family structure is much slower in poor rural areas,13 and husbands in those areas are not yet prepared for this new role. Because of constraints regarding their education level and skills, as well as family finances, husbands in poor rural areas continue to prioritise financial stability36; their support does not have a positive impact on the risk of maternal depression. Thus, women in poor rural areas must continue to rely on family members outside of the nuclear family, such as the baby’s grandmother(s), to assume some caregiving responsibilities.
 
Commercialised and specialised mental health counselling services in urban areas play important roles in improving maternal mental health.8 Xiao37 found that postnatal care through a menstrual club provided continuous physical, psychological, and emotional support that was sufficient to reduce the incidence of postpartum depression. However, such clubs are not available in poor rural areas. Therefore, it is important to promote better caregiving from family members, including husbands. For example, husbands could receive training that enables them to provide practical support, as well as guidance concerning the early identification of depressive tendencies and the development of communication skills for psychological adjustment.
 
Limitations
This study had some limitations. First, its cross-sectional design prevented the assessment of maternal depression trends during pregnancy and after delivery, although such an assessment could have been conducted in a cohort study. Second, this study focused on primary family caregiver identity and did not explore the type or form of caregiving provided. Third, all participants were residents of rural northwest China, and thus the results may not be generalisable to other populations. These limitations should be addressed in future studies.
 
Conclusions and policy implications
The prevalence of maternal depression is high in poor rural areas of Shaanxi Province. Identification of the husband as the family caregiver was not significantly correlated with maternal depression risk, whereas the involvement of the baby’s grandmother in that role was significantly negatively correlated with maternal depression risk. Based on our findings, we make the following suggestions. In rural areas, high-quality family support is necessary to ensure that pregnant women maintain good mental health. Compared with husbands, grandmothers may be better primary caregivers because they are more experienced in terms of parenting and housework. Husbands in poor rural China should receive training that enables them to provide effective maternal care.
 
Author contributions
Concept or design: N Wang, M Mu, J Yang, Y Shi, J Nie.
Acquisition of data: N Wang, M Mu, Z Liu, R Zulihumaer, W Nie.
Analysis or interpretation of data: N Wang, M Mu, J Yang, J Nie.
Drafting of the manuscript: N Wang, M Mu, J Yang, Y Shi, J Nie.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As an International Editorial Advisory Board member of the journal, Y Shi was not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Acknowledgement
The authors thank the study participants and the enumerators who conducted data collection.
 
Funding/support
The authors are supported by the 111 Project (Grant No. B16031), Soft Science Research Project of Xi’an Science and Technology Plan (Grant No. 2021-0059), the Fundamental Research Funds for the Central Universities (Grant No. 2021CSWY024) and the Fundamental Research Funds for the Central Universities (Grant No. 2021CSWY025) of China.
 
Ethics approval
The study was approved by the Medical Ethics Committee of Shaanxi Normal University and Xi’an Jiaotong University of China (No: 2020-1240). Each eligible participant received a consent form with information regarding programme objectives, procedures, potential risks, and benefits, as well as an explanation of privacy protection. Participants provided oral consent for inclusion in the study before engaging in a face-to-face interview with a single enumerator.
 
References
1. Ceulemans M, Foulon V, Ngo E, et al. Mental health status of pregnant and breastfeeding women during the COVID- 19 pandemic—a multinational cross-sectional study. Acta Obstet Gynecol Scand 2021;100:1219-29. Crossref
2. Wan EY, Moyer CA, Harlow SD, Fan Z, Jie Y, Yang H. Postpartum depression and traditional postpartum care in China: role of Zuoyuezi. Int J Gynecol Obstet 2009;104:209-13. Crossref
3. Zeng Y, Cui Y, Li J. Prevalence and predictors of antenatal depressive symptoms among Chinese women in their third trimester: a cross-sectional survey. BMC Psychiatry 2015;15:66. Crossref
4. Zhang Y, Zou S, Cao Y, Zhang Y. Relationship between domestic violence and postnatal depression among pregnant Chinese women. Int J Gynecol Obstet 2012;116:26-30. Crossref
5. Song Y, Li W. The impact of social support and antepartum emotion on postpartum depression [in Chinese]. China J Health Psychol 2014;22:909-11.
6. Xu M, Li C, Zhang K, Sun S, Zhang H, Jin J. Comparison of prenatal depression and social support differences between Korean and Han pregnant women in Yanbian region [in Chinese]. Matern Child Health Care China 2015;30:524-7.
7. Zhou X, Liu H, Li X, Zhang S, Li F, Zhao Z. Depression and risk factors of women in their second-third trimester of pregnancy in Shaanxi province [in Chinese]. Chin Nurs Manage 2019;19:1005-11.
8. Liu H, Zhang C. Comparative study on service efficiency of China’s urban and rural health systems [in Chinese]. China Soft Sci 2011;10:102-13.
9. Leahy-Warren P, McCarthy G, Corcoran P. First-time mothers: social support, maternal parental self-efficacy and postnatal depression. J Clin Nurs 2012;21:388-97. Crossref
10. Lancaster CA, Gold KJ, Flynn HA, Yoo H, Marcus SM, Davis MM. Risk factors for depressive symptoms during pregnancy: a systematic review. Am J Obstet Gynecol 2010;202:5-14. Crossref
11. Tang C. A review of modernisation theory and its development on family [in Chinese]. Sociol Stud 2010;25:199-222,246.
12. Högberg U. The World Health Report 2005: “make every mother and child count”—including Africans. Scand J Public Health 2005;33:409-11. Crossref
13. Zhao F, Ji Y, Chen F. Conjugal or parent-child relationship? Factors influencing the main axis of Chinese family relations in transition period [in Chinese]. J Chin Women Stud 2021;4:97-112.
14. Li D, Xu X, Liu J, Wu P. The relationship between life event and pregnancy stress: the mediating effect of mental health and the moderating effect of husband support [in Chinese]. J Psychol Sci 2013;36:876-83.
15. Dennis CL, Ross L. Women’s perceptions of partner support and conflict in the development of postpartum depressive symptoms. J Adv Nurs 2006;56:588-99. Crossref
16. Yang H, Lu Y, Ren L. A study on the impact of bearing two children on urban youth’s work-family balance: the empirical analysis based on the third survey of Chinese women’s social status [in Chinese]. Popul Econ 2016;9:1-9.
17. Hu J, Wang Y. Study on related risk factors of patients with pre-and postnatal depression [in Chinese]. Chin Nurs Res 2010;24:765-7.
18. Lee DT, Yip AS, Chan SS, Lee FF, Leung TY, Chung TK. Determinants of postpartum depressive symptomatology—a prospective multivariate study among Hong Kong Chinese Women [in Chinese]. Chin Ment Health J 2005;9:54-9.
19. National Bureau of Statistics of China. Income of villagers in poor rural areas in 2019 [in Chinese]. Available from: http://www.stats.gov.cn/xxgk/sjfb/zxfb2020/202001/t20200123_1767753.html. Accessed 30 Nov 2022.
20. Sun J, Zhang J, Li C, Lu Y. Strategic judgements and development suggestions on the development of poverty-stricken areas in China [in Chinese]. Manage World 2019;35:150-9,185.
21. Peterman A, Schwab B, Roy S, Hidrobo M, Gilligan DO. Measuring women’s decision making: indicator choice and survey design experiments from cash and food transfer evaluations in Ecuador, Uganda, and Yemen. World Dev 2021;141:105387. Crossref
22. Adouard F, Glangeaud-Freudenthal NM, Golse B. Validation of the Edinburgh postnatal depression scale (EPDS) in a sample of women with high-risk pregnancies in France. Arch Womens Ment Health 2005;8:89-95. Crossref
23. Zhang H, Li L. Comparative analysis of 3 postpartum depression scales abroad [in Chinese]. Chin J Nurs 2007;2:186-8.
24. Lee DT, Yip SK, Chiu HF, et al. Detecting postnatal depression in Chinese women. Validation of the Chinese version of the Edinburgh Postnatal Depression Scale. Br J Psychiatry 1998;172:433-7. Crossref
25. Liu Y, Zhang L, Guo N, Li J, Jiang H. Research progress of Edinburgh Postnatal Depression Scale in screening of perinatal depression [in Chinese]. Chin J Mod Nurs 2021;27:5026-31.
26. Wang Y, Guo X, Lau Y, Chan KS, Yin L, Chen J. Psychometric evaluation of the Mainland Chinese version of the Edinburgh Postnatal Depression Scale. Int J Nurs Stud 2009;46:813-23. Crossref
27. Zimet GD, Dahlem NW, Zimet SG, Farley GK. The multidimensional scale of perceived social support. J Pers Assess 1988;52:30-41. Crossref
28. Yan R, Liu L, Zhang L. Analysis of the relationship between social support self-efficacy and health behaviours among college students [in Chinese]. Chin J Sch Health 2010;31:362-3.
29. Zhang X, Li X, Li Y, Quan X, Li X. Study on status and influencing factors of antenatal depression among urban women in western China [in Chinese]. Matern Child Health Care China 2019;34:5275-7.
30. Dibaba Y, Fantahun M, Hindin MJ. The association of unwanted pregnancy and social support with depressive symptoms in pregnancy: evidence from rural southwestern Ethiopia. BMC Pregnancy Childbirth 2013;13:135. Crossref
31. Wu H, Rao J. Research review on left-behind women [in Chinese]. J Chin Agric Univ Soc Sci 2009;26:18-23.
32. Hou F, Cerulli C, Wittink MN, Caine ED, Qiu P. Depression, social support and associated factors among women living in rural China: a cross-sectional study. BMC Womens Health. 2015;15:28. Crossref
33. Yue A, Gao J, Yang M, Swinnen L, Medina A, Rozelle S. Caregiver depression and early child development: a mixed-methods study from rural China. Front Psychol 2018;9:2500. Crossref
34. Xie RH, Yang J, Liao S, Xie H, Walker M, Wen SW. Prenatal family support, postnatal family support and postpartum depression. Aust N Z J Obstet Gynaecol 2010;50:340-5. Crossref
35. Wei Q, Zhang C, Hao B, Wang X. Study on parenting concepts and behaviours of caregivers with children under 3 years old in rural areas of China [in Chinese]. Matern Child Health Care China 2017;32:1759-61.
36. Hao J, Wang F, Huang J. Male labour migration, women empowerment and household protein intake: evidence from less developed rural areas in China [in Chinese]. China Rural Econ 2021;8:125-44.
37. Xiao G. A practical study of puerperal health care in a menstrual centre [in Chinese]. Chin J Mode Drug Appl 2016;10:271-2.

Paediatric high-grade osteosarcoma and its prognostic factors: a 10-year retrospective study

Hong Kong Med J 2022;28(6):447-56 | Epub 25 Nov 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Paediatric high-grade osteosarcoma and its prognostic factors: a 10-year retrospective study
Grace PY Tong, MB, BS1; WF Hui, MB, ChB, MSc2; KC Wong, MB, ChB, MD (CUHK)3; Benjamin ST Fong, MB, BS4; CW Luk, MB, BS2; CK Li, MB, BS, MD (CUHK)5
1 Department of Paediatrics, Queen Elizabeth Hospital, Hong Kong
2 Department of Paediatrics, Hong Kong Children’s Hospital, Hong Kong
3 Department of Orthopaedics and Traumatology, Prince of Wales Hospital, Hong Kong
4 Department of Orthopaedics and Traumatology, Queen Elizabeth Hospital, Hong Kong
5 Department of Paediatrics, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong
 
Corresponding author: Dr Grace PY Tong (grace.tong@ha.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This retrospective study was conducted to identify the characteristics of paediatric high-grade osteosarcoma and define its prognostic factors.
 
Methods: We identified paediatric patients (aged <19 years at diagnosis) diagnosed with high-grade osteosarcoma from 1 January 2009 to 31 December 2018 in two hospitals in Hong Kong, then retrospectively evaluated their medical records to identify prognostic factors.
 
Results: In total, 52 patients were included in this study (22 girls, 42.3%). Femoral tumour was the most common form of osteosarcoma. Most patients (78.8%) had localised disease at diagnosis. The lung was the most common site of metastasis. Almost half (n=23, 46.9%) of the patients showed a good response to chemotherapy (ie, chemonecrosis >90%). Most patients (n=40, 80%) underwent limb-salvage surgery. The event-free survival and overall survival rates were 55.8% and 71.2%, respectively. Prognostic factors independently associated with poor event-free survival and poor overall survival were the presence of metastasis at diagnosis, poor tumour chemonecrosis, and the need for amputation.
 
Conclusion: This multicentre review of paediatric high-grade osteosarcoma showed that the baseline patient demographics, event-free survival, and overall survival in Hong Kong were similar to previous findings in other countries. Patients with metastatic disease at diagnosis and poor chemonecrosis had worse survival outcomes. Molecular analyses of genetic abnormalities may help to identify targeted therapies in future studies.
 
 
New knowledge added by this study
  • The need for amputation was a prognostic factor independently associated with poor event-free survival and poor overall survival.
  • Many conventional biochemical markers were not useful as prognostic factors for event-free survival or overall survival.
Implications for clinical practice or policy
  • An updated protocol is needed for the management of paediatric high-grade osteosarcoma. Factors that can be incorporated for early risk stratification include local tumour aggressiveness and the need for amputation; genetic mutations in the tumour may also be useful.
 
 
Introduction
Osteosarcoma arises from primitive bone-forming mesenchymal cells.1 It is the most common primary malignant bone tumour worldwide,1 2 with an annual incidence of 4.8 per million population in the US.2 Moreover, it was one of the most common types of childhood malignancy in Hong Kong in 2017 to 2019.3
 
In Hong Kong, paediatric patients with high-grade osteosarcoma are treated in accordance with the Hong Kong Paediatric Haematology and Oncology Study Group Treatment Protocol for High-Grade Osteosarcoma (Fig 1), which consists of neoadjuvant chemotherapy, followed by tumour resection and subsequent adjuvant chemotherapy. Histological response to neoadjuvant chemotherapy, defined as tumour chemonecrosis according to the Huvos system,4 is used to stratify patients into good responders (patients with ≥90% tumour chemonecrosis) and poor responders (patients with <90% tumour chemonecrosis). These two groups receive different chemotherapy regimens.
 

Figure 1. Roadmap of the Hong Kong Paediatric Haematology and Oncology Study Group treatment protocol for high-grade osteosarcoma
 
A few prognostic factors for survival have been recognised; these factors include the presence of metastasis at diagnosis5 6 7 and poor tumour chemonecrosis.5 6 7 8 9 However, no specific studies have examined the validity of these factors for patients with osteosarcoma in Hong Kong. Moreover, the presence of lung nodules in computed tomography (CT) scans is a common finding at diagnosis and reassessment. In this study, we reviewed paediatric patients with osteosarcoma at two of the largest paediatric oncology centres in Hong Kong. We sought prognostic factors for event-free survival (EFS) and overall survival (OS), and we assessed lung nodules and metastasis in these patients.
 
Methods
This study included paediatric patients (aged <19 years at diagnosis) with biopsy-proven high-grade osteosarcoma, who were diagnosed from 1 January 2009 to 31 December 2018 at Queen Elizabeth Hospital and Prince of Wales Hospital. The patients’ medical records were reviewed and the following information was collected: demographic data (sex, age at diagnosis, ethnicity, and time from symptom onset to presentation), clinical characteristics (location of tumour, largest dimension of tumour, staging, maximal alkaline phosphatase [ALP] level, calcium and phosphate levels before the start of treatment, and presence of pathological fracture), treatment-related characteristics (time from diagnosis to start of chemotherapy, surgical treatment approach, and histological results of the resected tumour [including tumour chemonecrosis and surgical margin]), and details of metastasis (timing, location, size, and treatment). Radiological reports for the primary tumour were reviewed to determine the presence of local anatomical aggressiveness, which was defined as intra-articular tumour involvement or neurovascular bundle involvement. These risk factors made surgical resection with a negative tumour margin difficult or impossible. Thus, amputation was expected to provide the best local control. There were three indications for limb-salvage surgery (ie, tumour resection and reconstruction). First, clinical and radiological responses were observed during neoadjuvant chemotherapy. Clinical responses were reduction or stabilisation of tumour size and a significant decrease in pain. Radiological responses were an increase in consolidated calcification of the tumour on plain radiographs, reduction or stabilisation of tumour size, and decreased peritumoral oedema on magnetic resonance images. Second, magnetic resonance images showed no neurovascular involvement. Third, there was no need for extensive muscle resection that would render the limb non-functional. Amputation was considered when patients did not meet the above criteria for limb-salvage surgery. It was also considered as a palliative treatment for patients who had large painful tumours with metastatic disease.
 
The characteristics of lung nodules identified in chest CT scans were recorded from CT reports. The initial and final sizes, laterality, timing of appearance, and mediastinal lymph node involvement were analysed to determine whether their characteristics were sufficiently different for clear distinction. Non-specific lung nodules were either small or remained stable in subsequent scans; they were not biopsied or surgically excised for histological diagnosis. Lung metastases were either large when first observed, had radiological features of metastasis, or demonstrated enlargement in subsequent follow-up scans.
 
Statistical analysis
Descriptive data were expressed as median (interquartile range) or frequency (percentage). The Pearson Chi squared test or Fisher’s exact test was used for comparisons of categorical variables. The Mann-Whitney U test was used for comparisons of continuous variables.
 
The primary outcomes were OS and EFS. Overall survival was defined as the time from diagnosis to death. Event-free survival was defined as the time from diagnosis to the appearance of a new metastasis, progression of an existing metastasis, or death (whichever occurred first). The study end date was 31 December 2020. Patients who had no events were censored at the time of the last follow-up (if they had been lost to follow-up) or at the study end date. Kaplan-Meier curves and log-rank tests were used for survival analysis. To identify prognostic factors for OS and EFS, unadjusted hazard ratios (with 95% confidence intervals) were determined for each potential factor by using Cox proportional hazards models. Significant factors in univariate analyses were included in subsequent multivariate analyses; adjusted hazard ratios (with 95% confidence intervals) were generated in the multivariate analyses.
 
The secondary outcome was the differentiation of lung nodules. Their initial and final sizes, initial and final lateralities, timing of appearance, and mediastinal lymph node involvement were compared to identify statistical differences.
 
The SPSS software (Windows version 23.0; IBM Corp, Armonk [NY], US) was used for statistical analysis. P values <0.05 were considered statistically significant.
 
Results
Patient characteristics
In total, 52 paediatric patients (22 girls and 30 boys; age 5-18 years) with high-grade osteosarcoma were included in this study. Their baseline demographics are shown in Table 1. Two patients (3.8%) had a predisposing condition: one had Rothmund–Thomson syndrome and the other had osteofibrous dysplasia, from which the high-grade osteosarcoma developed.
 

Table 1. Baseline demographics (n=52)
 
For the histological diagnosis, 45 patients (86.5%) had conventional high-grade osteosarcomas. The other subtypes of osteosarcoma were: two giant cell-rich osteosarcomas, one telangiectatic osteosarcoma, one chondrosarcomatous-predominant osteosarcoma, one osteoblastoma-like osteosarcoma, and one chondroblastic osteosarcoma. One patient had features suggestive of small round-cell sarcoma; the tumour was subsequently treated as a conventional high-grade osteosarcoma.
 
Treatment
All patients received neoadjuvant chemotherapy with doxorubicin, cisplatin, and high-dose methotrexate (Fig 1). After two courses of chemotherapy, the patients underwent surgical resection of tumours. Almost all patients (n=50, 96.2%) underwent resections of primary tumours. Most patients (n=44, 84.6%) completed the whole course of treatment with adjuvant chemotherapy. Five patients (9.6%) experienced disease progression during treatment. They had terminated chemotherapy early, received palliative care, and eventually died. One patient (1.9%) had disease progression and left Hong Kong to seek a second opinion. In one patient (1.9%), the last course of chemotherapy was omitted because previous chemotherapy had induced clinically significant renal impairment. In one patient (1.9%), the last course of chemotherapy was omitted because of disease progression while receiving treatment.
 
Metastasis and lung nodules
The lung was the most common site of distant metastasis, both synchronous (n=10, 90.9%) and metachronous (n=12, 80.0%) [Table 2]. Lung nodules in chest CT scans were observed in 39 patients (75.0%). Seventeen nodules (43.6%) were non-specific, while 22 nodules (56.4%) were lung metastases. Factors that could potentially be used to differentiate lung metastases from non-specific lung nodules included laterality of nodules in serial follow-up scans (P=0.004), number of initial nodules (P=0.006), maximal number of nodules (P<0.001), size of initial nodule (P<0.001), and size of the largest nodule (P<0.001) [Table 3].
 

Table 2. Metastasis characteristics (n=25)
 

Table 3. Lung nodule characteristics (n=39)
 
For the 15 patients who had no lung nodules throughout the course of treatment, OS was very good (93%) [Fig 2]. Only one patient died of local recurrence. For the 17 patients with non-specific lung nodules (Figs 2 and 3), OS was excellent (100%), regardless of the timing of nodule appearance (ie, at diagnosis or later). Among these 17 patients, survival ranged from 2 years 4 months to 11 years 9 months from diagnosis. For the 10 patients with synchronous lung metastasis (Fig 3), OS was 60%, whereas for the 12 patients with metachronous lung metastasis (Fig 2), OS was 33.3%. This difference was not statistically significant (P=0.666).
 

Figure 2. Lung nodules, nature, treatment, and outcomes for patients with no lung nodules at diagnosis
 

Figure 3. Lung nodules, nature, treatment, and outcomes for patients with lung nodules at diagnosis
 
Survival analysis
All patients were followed up until 31 December 2020; thus, the follow-up interval for the last recruited patient was 2 years from diagnosis. The median follow-up interval for all patients was 53 months from diagnosis (range, 4 months to 11 years 11 months). The EFS and OS were 55.8% and 71.2%, respectively. The EFS for patients with localised disease at diagnosis was significantly better than the EFS for patients with metastatic disease at diagnosis (65.9% vs 18.2%; P=0.007). The OS for patients with localised disease was better than the OS for patients with metastatic disease (75.6% vs 54.5%), but the difference was not statistically significant (P=0.26). Similarly, patients with good tumour chemonecrosis had better EFS and OS, compared with poor responders. These values were 78.3% versus 42.3% (P=0.019) and 82.6% versus 65.4% (P=0.209), respectively. Notably, patients with localised disease and good tumour chemonecrosis had very good outcomes, with EFS of 80% and OS of 86.7%. All deaths in this study were caused by disease progression. No patients died of treatment complications or other causes.
 
The prognostic factors identified for both EFS and OS included the presence of metastasis at diagnosis and throughout, as well as the need for amputation to manage the primary tumour. Local aggressiveness and a larger tumour dimension contributed to OS, while poor tumour chemonecrosis contributed to EFS (Table 4). The following factors did not have a statistically significant impact on EFS or OS: sex, age at diagnosis, primary tumour location in the femur, the presence of a pathological fracture, time from symptom onset to presentation, time from diagnosis to start of chemotherapy, and the histological subtype of the primary tumour.
 

Table 4. Factors associated with poor event-free survival and poor overall survival
 
Multivariate analysis (Table 5) revealed that the presence of metastasis at diagnosis, the need for amputation, and poor tumour chemonecrosis were prognostic factors independently associated with poor EFS, while the presence of metastasis at diagnosis and the need for amputation were prognostic factors independently associated with poor OS. Figure 4 shows the Kaplan-Meier analysis of the effects of the above three independent prognostic factors on EFS.
 

Table 5. Multivariate analysis of factors associated with poor event-free survival and poor overall survival
 

Figure 4. (a) Comparison of event-free survival between localised and metastatic disease groups. (b) Comparison of event-free survival between good and poor responder groups. (c) Comparison of event-free survival between limb-salvage surgery and amputation groups
 
Discussion
Patient characteristics
To our knowledge, this is the first multicentre review of the demographic characteristics and prognostic factors of paediatric high-grade osteosarcoma in Hong Kong. The number of patients included in the study constituted 76.5% of children with osteosarcoma diagnosed in Hong Kong during the study period (unpublished data). Thus, our findings are likely to be representative of the courses of disease and treatment for children with osteosarcoma in Hong Kong. In this study, slightly more patients with high-grade osteosarcoma were boys, the median age at diagnosis was 13.5 years, and the femur was the most common site of involvement. All of these results are comparable to findings in western countries.1 10 Only one patient had a high-grade osteosarcoma in the humerus; no patients had a primary nodule in the axial skeleton, demonstrating the rarity of such nodules. Although our study was limited by a short follow-up interval in some patients, the EFS and OS were comparable to the findings of large studies conducted in other countries,6 8 10 11 as well as the results of a study performed in Hong Kong in 2009.12 The shortest follow-up interval was 2 years from diagnosis. Most events occurred in the first 2 to 3 years, but some poor responders experienced late relapse at 4 to 5 years after initial diagnosis. Thus, a longer follow-up interval is necessary to better determine patient outcomes and more comprehensively assess the incidence of late toxicity.
 
Prognostic factors
In terms of prognostic factors, our findings in a cohort of Hong Kong patients confirm the validity of some important prognostic factors recognised in studies performed elsewhere, including the largest dimension of the primary tumour,8 13 14 presence of metastasis at diagnosis,6 8 9 11 and poor tumour chemonecrosis.6 8 9 13 14 15 Additionally, our study identified the need for amputation as an independent prognostic factor for EFS and OS. With respect to mortality, the prognostic effect of the need for amputation has varied among studies.6 13 14 15 16 This variation is presumably because the decision to amputate depends on many factors, including the opinions of orthopaedic surgeons and parental acceptance. In our study, 10 patients underwent amputation; in two of these patients, the procedure was performed with palliative intent to achieve symptomatic control. Both of those patients had metastatic disease at diagnosis, which involved large and painful primary tumours. Thus, risk stratification of patients according to the need for amputation may enable the selection of patients with more advanced disease. Nonetheless, in our cohort, all surgical margins were negative in both limb-salvage surgery and amputation groups. Moreover, local recurrence was uncommon. Thus, our findings may provide insights that can be used to update risk stratification protocols for paediatric patients with osteosarcoma. In the current treatment protocol, there is a considerable delay between initial diagnosis and risk stratification (at week 16), which is performed after tumour resection and when tumour chemonecrosis data are available. However, the surgical approach is usually determined after approximately 4 to 5 weeks of treatment, when reassessment imaging is conducted. If aggressive features are observed at diagnosis (eg, a massive tumour, early intra-articular tumour involvement, or early neurovascular bundle involvement), the discussion of possible amputation may have already begun. Thus, the presence of such features, or the early recognition of the need for amputation, may be suitable for risk stratification after validation in larger-scale studies.
 
Surrogate markers of aggressiveness
In recent decades, there has been extensive research into surrogate markers for aggressiveness in osteosarcoma. In some studies, the levels of ALP8 9 14 15 17 and lactate dehydrogenase18 19 20 were identified as significant prognostic factors. Although the maximal ALP level was not associated with EFS or OS in our study, some studies have demonstrated a positive association between the serum ALP level and tumour volume.21 This association is presumably related to the increased rate of bone remodelling in the tumour. However, the serum ALP level is also elevated in teenagers because of increased bone remodelling during periods of rapid growth. Thus, a universal cut-off for all paediatric patients may not provide the greatest prognostic accuracy.22 Some studies23 have explored methods to increase the age specificity of serum ALP levels. However, more validation and larger-scale studies are required before these methods can be widely adopted. Whereas the serum lactate dehydrogenase level showed a weaker association with tumour volume,21 a change in this level is more likely to be associated with a non-specific increase in tumour metabolism. Currently, the serum lactate dehydrogenase level is not included in pretreatment staging and investigation protocols; thus, it is not routinely checked. For investigation purposes, it should also be included in baseline investigations.
 
Because most biochemical parameters are not accurate surrogate markers for the aggressiveness of osteosarcoma, technological advancements have enabled molecular and genetic profiling of osteosarcomas to become the focus of research in the past 10 to 15 years.22 24 25 These studies may provide insights concerning the ‘non-conforming’ behaviour of certain tumours in our patients; examples include locally aggressive primary tumours that warrant amputation but demonstrate good tumour chemonecrosis, or tumours that show disease progression despite good tumour chemonecrosis. The current literature suggests that paediatric osteosarcoma is a heterogeneous disease,21 26 although general knowledge of the disease remains incomplete. Despite advancements in surgical techniques for primary tumour resection,27 improvements in the accuracy of staging imaging, and enhancements of supportive care, further revisions are needed concerning the medical treatment of paediatric osteosarcoma. Thus, molecular and genetic studies are essential and may facilitate further stratification of patients with osteosarcoma into different risk groups, which may require tailored treatment regimens for better outcomes. Future studies may also enable the identification of molecular nodules for targeted therapy or immunotherapy, which lead to considerable advances in the treatment of osteosarcoma.
 
Lung nodule analysis
Lung nodules were common in the initial staging and serial follow-up CT scans of our patients. The small size of some nodules hindered characterisation. They might represent benign lung pathologies. However, they may also represent micro-metastases which were responsive to chemotherapy. Thus, they remained stable in size and number on subsequent scans, suggesting that they persisted as scars. Patients with non-specific lung nodules had an excellent prognosis, with an OS rate of 100% in our study. However, repeated scans are needed for the follow-up and characterisation of such lung nodules. Additionally, the appearance of any new lung nodules on CT scans creates a considerable psychological burden for patients and their families. Our study identified parameters that can help to differentiate true lung metastases from non-specific nodules, including the number of initial nodules, maximal number of nodules, initial nodule size, and largest nodule size. A study in Hong Kong in 201128 also identified number (≤5 vs >5), size, and laterality of lung nodules as important prognostic factors for survival, whereas a study in the US in 201129 found that survival was worse for central lung metastases than for peripherally located lung metastasis. Additional larger-scale risk stratification studies are needed to clearly delineate the cut-offs for size, number, laterality, and location of initial lung nodules. The establishment of a scoring system that considers parameters of lung nodules observed in chest CT scans may enable prediction of the risk of malignancy, thus improving early detection of lung metastasis and reducing unnecessary anxiety for patients and their families. Furthermore, standardised reviews of CT scans will help to ensure more uniform classification of lung nodules, particularly when the nodules are small.
 
With respect to confirmed lung metastasis, the situation becomes increasingly complicated. For a single synchronous or metachronous lung metastasis, metastasectomy was conducted whenever surgically feasible because it has been regarded as the primary treatment approach in multiple studies.30 31 32 However, given the diversity of treatments, there was no consensus regarding treatment strategies for multiple metastases or local recurrence. This is presumably because the effects of different chemotherapy regimens and targeted therapies remain under investigation.11 33 34 35 36 In our study, surgical resection of lung metastasis whenever possible, together with zoledronic acid, generally achieved durable clinical remission for >5 years. Long-term randomised controlled trials of different chemotherapy or targeted therapy regimens should be conducted to determine the most cost-effective regimens that improve survival for relapsed paediatric patients with high-grade osteosarcoma. Many centres in other nations are performing karyotyping and genetic analysis of osteosarcoma tumour cells to identify targetable mutations.11 24 37 However, these tests are not routinely conducted for standard care in Hong Kong. Collaborations with centres in other nations should be pursued to facilitate the implementation of international standards in Hong Kong.
 
Conclusion
To our knowledge, this represents the first multicentre review of paediatric high-grade osteosarcoma in Hong Kong. Important prognostic factors, including metastatic disease at diagnosis and poor tumour chemonecrosis, were validated. The need for amputation may reflect local aggressiveness, which influences OS. Larger-scale studies of high-grade osteosarcoma in paediatric patients should be conducted over a longer period of time to better understand the characteristics, patterns, and prognostic factors.
 
Author contributions
Concept or design: GPY Tong, CK Li.
Acquisition of data: GPY Tong.
Analysis or interpretation of data: GPY Tong, WF Hui, CK Li.
Drafting of the manuscript: GPY Tong, WF Hui, KC Wong, CK Li.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
The authors have no conflicts of interest to disclose.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study was approved by the Research Ethics Committee (Kowloon Central/Kowloon East Cluster and New Territories East Cluster), Hospital Authority Hong Kong (Ref: KC/KE-19-0301/ER-1, 2019.712). The requirement for patient informed consent was waived.
 
References
1. Ottaviani G, Norman J. The epidemiology of osteosarcoma. In: Jaffe N, Bruland OS, Bielack S, editors. Pediatric and Adolescent Osteosarcoma. Boston, MA: Springer; 2009: 3-13. Crossref
2. North American Association of Central Cancer Registries (NAACCR), 2021.
3. Hong Kong Cancer Registry. Hong Kong Cancer Statistics 2015-2019.
4. Huvos AG, Rosen G, Marcove RC. Primary osteogenic sarcoma: pathologic aspects in 20 patients after treatment with chemotherapy en bloc resection, and prosthetic bone replacement. Arch Pathol Lab Med 1977;101:14-8.
5. Hung GY, Yen HJ, Yen CC, et al. Experience of pediatric osteosarcoma of the extremity at a single institution in Taiwan: prognostic factors and impact on survival. Ann Surg Oncol 2015;22:1080-7. Crossref
6. Pakos EE, Nearchou AD, Grimer RJ, et al. Prognostic factors and outcomes for osteosarcoma: an international collaboration. Eur J Cancer 2009;45:2367-75. Crossref
7. Abou Ali B, Salman M, Ghanem KM, et al. Clinical prognostic factors and outcome in pediatric osteosarcoma: effect of delay in local control and degree of necrosis in a multidisciplinary setting in Lebanon. J Glob Oncol 2019;5:1-8. Crossref
8. Vasquez L, Tarrillo F, Oscanoa M, et al. Analysis of prognostic factors in high-grade osteosarcoma of the extremities in children: a 15-year single-institution experience. Front Oncol 2016;6:22. Crossref
9. Mialou V, Philip T, Kalifa C, et al. Metastatic osteosarcoma at diagnosis: prognostic factors and long-term outcome—the French pediatric experience. Cancer 2005;104:1100-9. Crossref
10. Marina NM, Smeland S, Bielack SS, et al. Comparison of MAPIE versus MAP in patients with a poor response to preoperative chemotherapy for newly diagnosed highgrade osteosarcoma (EURAMOS-1): an open-label, international, randomised controlled trial. Lancet Oncol 2016;17:1396-408. Crossref
11. Gorlick R, Janeway K, Lessnick S, Randall RL, Marina N. Children’s Oncology Group’s 2013 blueprint for research: bone tumors. Pediatr Blood Cancer 2013;60:1009-15. Crossref
12. Yang JY, Cheng FW, Wong KC, et al. Initial presentation and management of osteosarcoma, and its impact on disease outcome. Hong Kong Med J 2009;15:434-9.
13. Xin S, Wei G. Prognostic factors in osteosarcoma: a study level meta-analysis and systematic review of current practice. J Bone Oncol 2020;21:100281. Crossref
14. Bramer JA, van Linge JH, Grimer RJ, Scholten RJ. Prognostic factors in localized extremity osteosarcoma: a systematic review. Eur J Surg Oncol 2009;35:1030-6. Crossref
15. Hu J, Zhang C, Zhu K, et al. Treatment-related prognostic factors in managing osteosarcoma around the knee with limb salvage surgery: a lesson from a long-term follow-up study. Biomed Res Int 2019;2019:3215824. Crossref
16. Jauregui JJ, Nadarajah V, Munn J, et al. Limb salvage versus amputation in conventional appendicular osteosarcoma: a systematic review. Indian J Surg Oncol 2018;9:232-40. Crossref
17. Zamzam MA, Moussa EA, Ghoneimy AE, et al. Outcomes and prognostic factors for non-metastatic osteosarcoma of the extremity. SM J Pediatr 2017;2:1013.
18. González-Billalabeitia E, Hitt R, Fernández J, et al. Pre-treatment serum lactate dehydrogenase level is an important prognostic factor in high-grade extremity osteosarcoma. Clin Transl Oncol 2009;11:479-83. Crossref
19. Fu Y, Lan T, Cai H, Lu A, Yu W. Meta-analysis of serum lactate dehydrogenase and prognosis for osteosarcoma. Medicine (Baltimore) 2018;97:e0741. Crossref
20. Chen J, Sun M, Hua Y, Cai Z. Prognostic significance of serum lactate dehydrogenase level in osteosarcoma: a meta-analysis. J Cancer Res Clin Oncol 2014;140:1205-10. Crossref
21. Lindsey BA, Markel JE, Kleinerman ES. Osteosarcoma overview. Rheumatol Ther 2017;4:25-43. Crossref
22. Savitskaya YA, Rico-Martínez G, Linares-González LM, et al. Serum tumor markers in pediatric osteosarcoma: a summary review. Clin Sarcoma Res 2012;2:9. Crossref
23. Shimose S, Kubo T, Fujimori J, Furuta T, Ochi M. A novel assessment method of serum alkaline phosphatase for the diagnosis of osteosarcoma in children and adolescents. J Orthop Sci 2014;19:997-1003.Crossref
24. de Nigris F, Zanella L, Cacciatore F, et al. YY1 overexpression is associated with poor prognosis and metastasis-free survival in patients suffering osteosarcoma. BMC Cancer 2011;11:472. Crossref
25. Morrow JJ, Khanna C. Osteosarcoma genetics and epigenetics: emerging biology and candidate therapies. Crit Rev Oncog 2015;20:173-97. Crossref
26. Poos K, Smida J, Maugg D, et al. Genomic heterogeneity of osteosarcoma–shift from single candidates to functional modules. PLoS One 2015;10:e0123082. Crossref
27. Wong KC, Kumta SM. Use of computer navigation in orthopedic oncology. Curr Surg Rep 2014;2:47. Crossref
28. Rasalkar DD, Chu WC, Lee V, Paunipagar BK, Cheng FW, Li CK. Pulmonary metastases in children with osteosarcoma: characteristics and impact on patient survival. Pediatr Radiol 2011;41:227-36. Crossref
29. Letourneau PA, Xiao L, Harting MT, et al. Location of pulmonary metastasis in pediatric osteosarcoma is predictive of outcome. J Pediatr Surg 2011;46:1333-7. Crossref
30. Daw NC, Chou AJ, Jaffe N, et al. Recurrent osteosarcoma with a single pulmonary metastasis: a multi-institutional review. Br J Cancer 2015;112:278-82. Crossref
31. Matsubara E, Mori T, Koga T, et al. Metastasectomy of pulmonary metastases from osteosarcoma: prognostic factors and indication for repeat metastasectomy. J Respir Med 2015;2015:1-5. Crossref
32. de Bree E, Drositis I, Michelakis D, Mavroudis D. Resection of pulmonary metastases in osteosarcoma. Is it justified? Hellenic J Surg 2018;90:293-8. Crossref
33. Geller DS, Gorlick R. Osteosarcoma: a review of diagnosis, management, and treatment strategies. Clin Adv Hematol Oncol 2010;8:705-18.
34. Warwick AB, Malempati S, Krailo M, et al. Phase 2 trial of pemetrexed in children and adolescents with refractory solid tumors: a Children’s Oncology Group study. Pediatr Blood Cancer 2013;60:237-41. Crossref
35. Jacobs S, Fox E, Krailo M, et al. Phase II trial of ixabepilone administered daily for five days in children and young adults with refractory solid tumors: a report from the children’s oncology group. Clin Cancer Res 2010;16:750-4. Crossref
36. Geoerger B, Chisholm J, Le Deley MC, et al. Phase II study of gemcitabine combined with oxaliplatin in relapsed or refractory paediatric solid malignancies: an innovative therapy for children with Cancer European Consortium Study. Eur J Cancer 2011;47:230-8. Crossref
37. Saraf AJ, Fenger JM, Roberts RD. Osteosarcoma: accelerating progress makes for a hopeful future. Front Oncol 2018;8:4. Crossref

Effects of primary granulocyte-colony stimulating factor prophylaxis on neutropenic toxicity and chemotherapy dose delivery in Chinese patients with breast cancer who received adjuvant docetaxel plus cyclophosphamide chemotherapy: a retrospective cohort study

Hong Kong Med J 2022;28(6):438-46 | Epub 20 Oct 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Effects of primary granulocyte-colony stimulating factor prophylaxis on neutropenic toxicity and chemotherapy dose delivery in Chinese patients with breast cancer who received adjuvant docetaxel plus cyclophosphamide chemotherapy: a retrospective cohort study
Carol CH Kwok, MB, ChB1; WH Wong, BSc, MSc1; Landon L Chan, MB, ChB1; Sabrina PY Wong, MB, BS, FCSHK2; F Wang, BMed, PhD3; Martin CS Wong, MD, MPH3; Shelly LA Tse, BMed, PhD3
1 Department of Oncology, Princess Margaret Hospital, Hong Kong
2 Department of Surgery, Princess Margaret Hospital, Hong Kong
3 JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong
 
Corresponding author: Dr Carol CH Kwok (kwokch@ha.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study was performed to examine the effects of primary granulocyte-colony stimulating factor (G-CSF) prophylaxis on neutropenic toxicity, chemotherapy delivery, and hospitalisation among Chinese patients with breast cancer in Hong Kong.
 
Methods: This retrospective study included patients with breast cancer who received adjuvant docetaxel plus cyclophosphamide chemotherapy from November 2007 to October 2013 at Princess Margaret Hospital. Data were collected regarding the usage of G-CSF prophylaxis; incidences of grade 3 or 4 neutropenia, febrile neutropenia, non-neutropenic fever, and infection; hospital admissions, and chemotherapy dose delivery. Patients who began to receive G-CSF prophylaxis during the first cycle of chemotherapy and continued such prophylaxis in subsequent cycles were regarded as the primary G-CSF prophylaxis group.
 
Results: In total, 231 female Chinese patients with breast cancer were included in the analysis. Overall, 193 (83.5%) patients received primary G-CSF prophylaxis. The demographics and tumour characteristics were comparable between patients with and without primary G-CSF prophylaxis. Primary G-CSF prophylaxis significantly reduced febrile neutropenia incidence from 31.6% to 14.5% (relative risk=0.45, 95% confidence interval=0.25-0.81). Primary G-CSF prophylaxis also significantly reduced the incidence of grade 3 or 4 neutropenia from 57.9% to 24.7% (relative risk=0.43, 95% confidence interval=0.30-0.62) and the incidence of febrile neutropenia–related hospital admission from 31.6% to 12.4% (P=0.025). Finally, it enabled more patients to receive adequate chemotherapy dose delivery.
 
Conclusion: Primary G-CSF prophylaxis effectively reduced the incidences of grade 3 or 4 neutropenia and febrile neutropenia, while enabling adequate chemotherapy dose delivery and reducing hospital admissions among Chinese patients with breast cancer who received adjuvant docetaxel plus cyclophosphamide chemotherapy.
 
 
New knowledge added by this study
  • Primary granulocyte-colony stimulating factor (G-CSF) prophylaxis was associated with reduced neutropenic toxicity from adjuvant docetaxel plus cyclophosphamide (TC) chemotherapy.
  • Our 4-day G-CSF schedule helped to maintain the planned chemotherapy regimen and reduce the rate of hospital admission.
Implications for clinical practice or policy
  • Routine use of primary G-CSF prophylaxis enabled successful chemotherapy treatment and adequate chemotherapy dose delivery for patients with early breast cancer who received adjuvant TC chemotherapy.
  • Primary G-CSF prophylaxis could reduce hospital admissions for the management of febrile neutropenia; it may reduce in-patient bed occupancy and offset hospitalisation costs.
  • G-CSF prophylaxis can be extended to patients on all docetaxel-containing regimens of neoadjuvant and adjuvant chemotherapy.
 
 
Introduction
Adjuvant chemotherapy significantly improves disease-free survival and overall survival in patients with early breast cancer.1 For intermediate-risk patients who have axillary lymph node-negative early breast cancer,2 a common regimen comprises four cycles of doxorubicin plus cyclophosphamide. In 2009, Jones et al3 published a 7-year follow-up study of patients with stages I-III operable breast cancer in the United States; they reported that superior disease-free survival and overall survival could be achieved with four cycles of docetaxel plus cyclophosphamide (TC), compared with four cycles of doxorubicin plus cyclophosphamide. Since then, TC has been increasingly regarded as an alternative chemotherapy regimen to doxorubicin plus cyclophosphamide for patients with early-stage breast cancer. However, docetaxel causes significant myelotoxicity, characterised by high incidences of grade 3 or 4 neutropenia and febrile neutropenia (FN). Chemotherapy-induced neutropenia is a major type of toxicity that limits the dose of cancer therapy; FN is associated with substantial morbidity, mortality, and financial costs.4 Febrile neutropenia is considered a medical emergency, which often requires immediate hospitalisation and empirical administration of broad-spectrum antibiotics. Severe (grade 3 or 4) neutropenia or FN is the most common cause of dose reductions and/or cycle delays that lead to lower chemotherapy dose intensity; such changes may influence clinical outcomes, particularly when treatment is intended to be curative or to prolong survival.5 6
 
There is substantial evidence that granulocyte-colony stimulating factor (G-CSF) prophylaxis reduces the incidence of chemotherapy-associated FN in patients with diverse malignancies, including patients with breast cancer who are receiving chemotherapy and have moderately high/high FN risk; this prophylaxis can result in fewer chemotherapy dose reductions or delays.5 7 8 9 Current guidelines consistently recommend G-CSF prophylaxis during chemotherapy treatment for patients with cancer who have a high estimated risk of FN (ie, approximately 20%), as well as patients with cancer who have a history of FN.5 10 11 12 The administration of G-CSF should also be used to facilitate the maintenance of chemotherapy dose intensity for patients in whom reduced chemotherapy dose intensity or density is likely to cause a poor outcome (eg, patients receiving adjuvant or potentially curative treatment, or patients receiving treatment to prolong survival).5 10 12
 
We began using TC chemotherapy in 2007 at Princess Margaret Hospital, but we encountered a high incidence of FN. Our initial solution comprised dose reduction; after the first episode of FN, doses of chemotherapeutic agents were reduced by 10% to 25% in subsequent cycles. Then, G-CSF prophylaxis was administered if the second episode of FN occurred to avoid further dose reduction and to ensure delivery of planned chemotherapy; it was also intended to prevent the occurrence of further FN. This study was conducted to investigate the effects of primary G-CSF prophylaxis on neutropenic toxicity, chemotherapy delivery, and hospitalisation in patients with breast cancer if G-CSF was administrated from the first cycle of TC chemotherapy.
 
Methods
Patient selection
This retrospective cohort study was performed at Princess Margaret Hospital, Hong Kong. We reviewed the medical records of female Chinese patients with breast cancer who received adjuvant TC chemotherapy from November 2007 to October 2013. Exclusion criteria were as follows: previous receipt of chemotherapy, mixed TC and doxorubicin plus cyclophosphamide or other chemotherapy regimens; more than four cycles of TC; failure to complete four cycles of chemotherapy; and use of G-CSF after the occurrence of FN. Data were retrieved from the included patients’ out-patient and in-patient records, chemotherapy charts, and discharge summaries.
 
Tumour characteristics
Tumour staging, histological type, histological grade, lymphovascular invasion, oestrogen receptor status, progesterone receptor status, human epidermal growth factor receptor 2 (HER2) status, and Ki67 status were extracted from each patient’s pathology report. Oestrogen and/or progesterone receptor statuses were considered positive if either percentage of immunohistochemical staining was ≥1% (ie, an H score of ≥50 or an Allred score of ≥3). The HER2 status was considered positive if the immunohistochemical score was 3, or if fluorescence in situ hybridisation showed HER2 gene amplification if immunohistochemical score was equivocal.
 
Docetaxel plus cyclophosphamide treatment protocol
Chemotherapy was initiated within 6 to 8 weeks after surgery. The chemotherapeutic regimen consisted of docetaxel 75 mg/m2 and cyclophosphamide 600 mg/m2 administered by intravenous infusion over 60 minutes and 30 minutes, respectively, on day 1 at 3-week intervals for four cycles; dexamethasone premedication and standard anti-emetic were administered during each cycle. In accordance with standard protocol in the Department of Oncology at Princess Margaret Hospital, patients were required to have an Eastern Cooperative Oncology Group performance status of 0 or 1. At baseline and before each cycle of chemotherapy, complete blood counts were performed, along with tests of renal and hepatic function. To proceed with treatment, patients were required to have a white blood cell count of ≥3 × 109/L, an absolute neutrophil count (ANC) of ≥1.5 × 109/L, and a platelet count of ≥100 × 109/L. For patients with insufficient blood counts, chemotherapy was deferred for ≥1 week until counts reached the required levels. For patients with an elevated alanine transaminase level (ie, ≥1.5-fold above the upper limit of normal), the dose of docetaxel was reduced by 15%, in accordance with prescribing information. Complete blood counts were also checked at nadir (ie, the lowest white blood cell count or ANC recorded within 21 days of the first cycle of chemotherapy, typically on day 10 after chemotherapy) to assess the severity of neutropenia; based on blood count findings, chemotherapy dosage was adjusted (if necessary) in subsequent cycles. The dosage reduction ranged from 10% to 25% according to the occurrence of grade 4 neutropenia or FN in prior cycles. Hepatitis status was checked at baseline. Prophylactic antiviral therapy was administered to patients who had a positive test result for hepatitis B surface antigen.
 
Granulocyte-colony stimulating factor use
We suggested G-CSF prophylaxis (on a self-financed basis during the study period) to each patient who was scheduled to receive adjuvant TC, unless they had contra-indications mentioned in the prescribing information. The intent of G-CSF prophylaxis was to prevent the occurrence of FN, which would lead to cycle delay and chemotherapy dose reduction. We defined primary G-CSF prophylaxis as upfront use in the first chemotherapy cycle and continuation in subsequent cycles. The administration of G-CSF after an episode of FN during a previous chemotherapy cycle was considered secondary use; patients who received such treatment were excluded from the analysis, as indicated in the Patient selection subsection. Antibiotic treatment was administered to patients who showed grade 3 or 4 neutropenia at nadir.13 Neupogen (filgrastim) 30 MU was the form of G-CSF used during the study period; this treatment was administered subcutaneously from day 4 to day 7 of each chemotherapy cycle.
 
Febrile neutropenia and other adverse events
Febrile neutropenia was defined as a single reading of oral temperature ≥38.3°C or a sustained (≥1 h) oral temperature of ≥38.0°C, with either an ANC of <0.5 × 109 cells/L or an ANC of <1.0 × 109 cells/L and predicted decrease to <0.5 × 109 cells/L over the next 48 hours.14 Haematological and other non-haematological adverse events were categorised and graded in accordance with the National Cancer Institute’s Common Terminology Criteria for Adverse Events (version 3.0); they were expressed as maximum toxicity per patient. Other adverse effects were reported as grade 3 or 4.
 
Hospital admission for chemotherapy-related toxicities
An admission was regarded as a single hospital admission that occurred between two consecutive chemotherapy cycles; admissions were recorded until 1 month after the final cycle of chemotherapy. If neutropenic fever was diagnosed or suspected, patients were admitted for isolation, sepsis tests, and antibiotic treatment; if patients refused admission, they were prescribed oral antibiotics. If indicated, patients were also admitted for treatment of chemotherapy-related adverse effects. Admissions were categorised according to the primary diagnosis at the time of admission.
 
Statistical analyses
Descriptive statistics were used to summarise baseline patient and tumour characteristics, chemotherapy delivery, and adverse events. The Chi squared test (or Fisher’s exact test) and Student’s t test (or Mann–Whitney U test) were used for comparisons of categorical and continuous variables, respectively. Multivariate logistic regression models were used to calculate the relative risk and 95% confidence interval for the occurrence of FN with primary G-CSF prophylaxis after adjustment for age.
 
The total percentage of planned doses received was calculated as the sum of the percentage of planned doses over four cycles divided by the number of cycles of chemotherapy administered. Dose intensity was calculated as the cumulative dose (mg/m2) divided by the total duration of chemotherapy (wk). Planned dose intensity was calculated as the planned cumulative dose (mg/m2) divided by the planned treatment duration (wk). Relative dose intensity was calculated as the ratio of delivered dose intensity to planned dose intensity. Chemotherapy dose delay was defined as a delay of ≥3 days from the planned treatment date. In all statistical analyses, P<0.05 was considered indicative of statistical significance. All data analyses were performed using SPSS software (Windows version 26.0; IBM Corp, Armonk [NY], United States).
 
Results
Baseline characteristics of patients according to primary granulocyte-colony stimulating factor prophylaxis status
During the initial review, 261 patients were identified and eight of them presented with synchronous bilateral breast cancer. In total, 30 patients were excluded from analysis for the following reasons: previous receipt of chemotherapy (n=2), mixed TC and doxorubicin plus cyclophosphamide or other chemotherapy regimens (eg, epirubicin plus cyclophosphamide, or cyclophosphamide plus methotrexate plus 5-fluorouracil) [n=14], or more than four cycles of TC (n=6); failure to complete four cycles of chemotherapy (n=3; 1 died of pneumonia, 1 discontinued chemotherapy after one cycle because of advanced age, and 1 discontinued chemotherapy after three cycles for unspecified reasons); and use of G-CSF after the occurrence of FN (n=5). Finally, 231 female Chinese patients with breast cancer were included in this study; 193 patients (83.5%) received primary G-CSF prophylaxis (Table 1). Age at diagnosis, body weight and height, body mass index, and menopausal and co-morbidity statuses were comparable between patients with and without primary G-CSF prophylaxis. The distributions of tumour stages, histological subtypes, histological grades, and molecular biomarker statuses were also similar between the two groups.
 

Table 1. Characteristics of patients with breast cancer, stratified according to primary granulocyte-colony stimulating factor prophylaxis status
 
Development of febrile neutropenia and other chemotherapy-related toxicities
In total, 106 patients (45.9%) had ≥1 episode of neutropenia; 69 patients (65.1%) developed grade 3 or 4 neutropenia (Table 2). Among patients with neutropenia, the incidences of FN were 40.0% (12/30) in the non-G-CSF group and 36.8% (28/76) in the G-CSF group. Compared with patients who did not receive G-CSF prophylaxis, primary G-CSF prophylaxis was associated with a lower incidence of FN (31.6% vs 14.5%) and a lower incidence of grade 3 or 4 neutropenia (57.9% vs 24.7%) [Table 2]; the relative risks were 0.45 (95% confidence interval=0.25-0.81) and 0.43 (95% confidence interval=0.30-0.62), respectively. However, G-CSF prophylaxis was not associated with reduced incidences of non-neutropenic fever and infection. The incidences of grade 3 or 4 chemotherapy-related toxicities other than neutropenia were very low among our patients. Only four episodes of grade 3 or 4 chemotherapy-related adverse events were observed (one episode of anaemia, one episode of non-neutropenic leukopenia, and two episodes of diarrhoea).
 

Table 2. Incidences of chemotherapy-related toxicities during docetaxel plus cyclophosphamide chemotherapy in patients with breast cancer, stratified according to primary granulocyte-colony stimulating factor prophylaxis status
 
Delivery of chemotherapy
Primary G-CSF prophylaxis helped to maintain the planned regimen of TC chemotherapy (Table 3). When compared with the non-G-CSF group, the proportion of patients who received the standard dose of TC was higher among patients in the G-CSF group (63.2% vs 74.6%); moreover, the proportion of patients with an overall dose deduction of >10% for docetaxel or cyclophosphamide was lower among patients in the G-CSF group (18.4% vs 14.0% and 13.2% vs 7.3%, respectively), although the difference was not statistically significant. However, compared with the non-G-CSF group, more patients in the G-CSF group experienced chemotherapy cycle delays (26.3% vs 56.5%). There were no significant differences in dose intensity or relative dose intensity for both docetaxel and cyclophosphamide between the G-CSF and non-G-CSF group (online supplementary Table 1).
 

Table 3. Delivery of docetaxel plus cyclophosphamide chemotherapy to patients with breast cancer, stratified according to primary granulocyte-colony stimulating factor prophylaxis status
 
Rate of hospital admission
Sixty patients had ≥1 hospital admission for a severe chemotherapy-related adverse event; 36 of these patients (60%) were diagnosed with FN. Compared with the non-G-CSF group, the hospital admission rate was lower in the primary G-CSF prophylaxis group, particularly in terms of admission for FN (31.6% vs 12.4%) [Table 4].
 

Table 4. Hospital admissions among patients with breast cancer, stratified according to primary granulocyte-colony stimulating factor prophylaxis status
 
Discussion
Primary granulocyte-colony stimulating factor prophylaxis reduced febrile neutropenia and severe neutropenia
To our knowledge, this is the first study to demonstrate an association between the use of a fixed schedule of G-CSF prophylaxis (days 4 to 7) and the reduction of neutropenia and FN incidences in patients with early-stage breast cancer who received adjuvant TC chemotherapy. Our study demonstrated that patients who received primary G-CSF prophylaxis were significantly more likely to maintain their planned regimen of chemotherapy and have a lower rate of hospital admission. These findings suggest that G-CSF prophylaxis can enhance treatment efficacy and conserve medical resources.
 
Chinese patients on docetaxel-based chemotherapy had higher incidences of neutropenia/febrile neutropenia
Our results are consistent with previous reports that the FN rate was higher among patients receiving TC chemotherapy in the absence of G-CSF.3 15 16 17 Rates of myelosuppression and neutropenia during docetaxel-based chemotherapy were higher in our Chinese patients compared with those in Caucasian patients in previous studies.18 19 20 In the original TC study, incidence of FN was only 5%.3 Inter-individual and inter-ethnic variations in pharmacokinetics and pharmacodynamics may be linked to variations in docetaxel toxicity.21 22 23
 
Four-day course of granulocyte-colony stimulating factor prophylaxis was effective
With regard to the schedule of G-CSF prophylaxis, international guidelines recommend initiation between 24 and 72 hours after the last day of chemotherapy, with continuation until sufficient and stable ANC recovery has been achieved after nadir.5 10 11 The optimal clinical benefits of filgrastim have been achieved with approximately 11 daily injections; ANC recovery typically requires 10 to 11 days.5 Therefore, the median recommended duration of daily filgrastim injections is 10 to 11 days.5 11 Nevertheless, the chemotherapy schedule varies in clinical practice.5 24 25 In a previous study, von Minckwitz et al8 found that daily G-CSF was most frequently administered at five to seven doses per cycle. We initiated G-CSF on day 4 in accordance with the recommendation (mentioned above) that G-CSF should be initiated between 24 and 72 hours after chemotherapy. According to the docetaxel prescribing information, a median of 7 days is needed to reach nadir; the median duration to reach severe neutropenia (<500 cells/mL) is also 7 days. Based on our previous experience concerning nadir, we examined complete blood counts from day 7 to day 11; we observed that the duration of neutrophil nadir for docetaxel was short and the ANC rapidly rebounded after day 10. Because the median neutrophil nadir of docetaxel typically occurs on day 7, the administration of G-CSF until day 7 would constitute the shortest duration of G-CSF injection; the increased neutrophil count as a result of G-CSF injection (due to the effect of G-CSF) would presumably have a protective effect during the expected neutrophil nadir period.
 
In this study, FN occurred after the first cycle in 28 of 193 (14.5%) patients who received primary G-CSF prophylaxis; all of those patients received G-CSF on days 4 to 7. It is important to consider whether this finding suggests that our G-CSF schedule was insufficient for FN prevention. Notably, there have been reports that the initial episode of neutropenia most frequently occurred during the first cycle in patients receiving cancer chemotherapy.6 25 26 The lower apparent risk after the first cycle presumably results from subsequent dose reductions and delays, or from the secondary use of a white blood cell growth factor.6 27 The high frequency of first-cycle FN has been proposed to emphasise the need for early (during the first cycle) initiation of G-CSF to reduce the risk of FN. A longer duration of G-CSF prophylaxis may further reduce the incidence of first-cycle FN, but this hypothesis requires further investigation.
 
Granulocyte-colony stimulating factor prophylaxis enabled adequate chemotherapy dose delivery
We found that primary G-CSF prophylaxis influenced the incidences of FN and FN-related hospitalisation and also enabled adequate chemotherapy dose delivery. Our findings were similar to the results reported by von Minckwitz et al8; however, their study involved a comparison of primary prophylaxis with long-acting pegfilgrastim to either no G-CSF treatment or any cycle of G-CSF/pegfilgrastim. In the group of patients who received G-CSF, the longer duration of chemotherapy might have offset the effect of the higher total percentage of scheduled chemotherapy doses on dose intensity and relative dose intensity. These findings were analogous to the findings in a Cochrane review, which showed CSF treatment did not help much in maintaining the planned chemotherapy schedules.9 Additionally, von Minckwitz et al8 reported that neutropenia prophylaxis influenced chemotherapy dose reductions (≥15%) but did not affect the incidence of chemotherapy dose delays (≥3 days). Similarly, in a study that evaluated the effect of pegfilgrastim during the first and subsequent cycles versus placebo, the proportion of patients who received their planned dose on time (defined as receiving ≥80% of the planned dose and no dose ≥3 days late) did not significantly differ between the two groups. The authors of the study concluded that no difference had been present because patients who developed FN were allowed to receive pegfilgrastim in subsequent cycles, which (because of study design) prevented the identification of a difference between the pegfilgrastim and placebo groups.26
 
Although the reduction of FN incidence is an important clinical outcome, G-CSF prophylaxis might facilitate the maintenance of chemotherapy dose intensity7; G-CSF has also been used as an adjunct to achieve moderate increases in dose intensity. Early clinical trials of patients with solid malignancies demonstrated a limited survival benefit for patients who received higher dose therapy.28 In clinical studies, dose-dense schedules (ie, with shortened treatment intervals) have shown increased survival, whereas the benefit in dose escalation studies has been less consistent.6 Notably, a meta-analysis showed that the receipt of primary G-CSF prophylaxis was associated with a modest reduction in all-cause mortality, compared with the absence of primary prophylaxis.29 Recently published meta-analyses confirm the survival benefit of dose-dense chemotherapy.29 30 These provide supporting evidence that ensuring chemotherapy dose intensity is an important consideration for treatment outcome which is particularly relevant in adjuvant chemotherapy settings. Further studies are awaited to assess the effects of chemotherapy dose delivery on survival outcomes in our patients with early breast cancer.
 
Our results showed that G-CSF prophylaxis reduced the rate of hospital admission for FN. Conventional management of FN involves hospital admission with intravenous administration of broad-spectrum antibiotics for 5 to 7 days. The mean length of hospitalisation for FN may exceed 1 week; patients must be placed in isolation rooms to undergo numerous diagnostic procedures and receive intravenous antibiotic support, and there is a need to consider the potential complications of such therapy (American Society of Clinical Oncology guideline 1994, 2000).31 The benefit of reducing the rate of hospital admission is that it can reduce demands on the resources of a public healthcare system with a limited number of in-patient beds. Additionally, the reduced rate of hospital admission can help to minimise disruption for patients and their families, thereby avoiding negative impacts on quality of life.
 
Study implications
The patients in this study received treatment from November 2007 to October 2013. Among the patients, 14.7% had luminal A–like breast cancer subtypes according to histopathological criteria (online supplementary Table 2); thus far, patients with such breast cancer subtypes have experienced minimal benefits from chemotherapy. With the increasing use of gene expression profiling as a personalised medicine approach for adjuvant chemotherapy in patients with hormone receptor–positive, HER2-negative breast cancers, the use of adjuvant chemotherapy has become increasingly selective for such patients; nevertheless, it remains important for patients with HER2-positive and triple-negative breast cancers. The TC regimen is an important type of adjuvant chemotherapy; it is recommended within the National Comprehensive Cancer Network guidelines. Because it is an anthracycline-free regimen, TC chemotherapy has been compared with anthracycline-containing regimens in some large, randomised trials; it has demonstrated excellent results.32 33 The TC regimen is considered an efficacious and less-toxic option in lower-risk patients, as well as patients with known cardiac disease or pre-existing risk factors for cardiac toxicity.32 33 Routine G-CSF prophylaxis has made adjuvant TC chemotherapy safer and more successful. Currently, the 4-day G-CSF schedule is widely used for patients in our hospital who receive other docetaxel-containing regimens, such as TCH (ie, trastuzumab, carboplatin, and docetaxel) and docetaxel 100 mg/m2 regimens; the outcomes are generally positive.
 
Study limitations
First, this study used a retrospective cohort design and only included patients from a single centre. Thus, the overall sample size was moderate and the non-G-CSF group included a small number of patients. Moreover, because this was an observational study without randomisation, the number of patients who received G-CSF prophylaxis substantially differed from the number of patients who did not receive such prophylaxis. Our results require confirm in multicentre studies with diverse patient populations. Second, indication bias might have been present, such that patients who received G-CSF might constitute a distinct group, compared with patients who did not receive G-CSF. We suggested the use of primary G-CSF prophylaxis to each patient who was scheduled to receive adjuvant TC chemotherapy; most patients accepted this suggestion (193/231). Among 43 patients who did not receive G-CSF treatment at the first cycle of chemotherapy, only five received secondary administration of G-CSF, which indicated that cost was the main factor influencing receipt of primary G-CSF prophylaxis. Nevertheless, we used multivariate logistic regression models to adjust for potential confounding factors (eg, baseline differences between the two groups). Third, incomplete documentation of adverse effects might have influenced our findings because only significant adverse effects were recorded for most study participants. Febrile neutropenia and hospital admission were the major clinical outcomes recorded in medical records; therefore, medication-related adverse effects might have been neglected. Finally, because only short-term toxicity (ie, neutropenic toxicity) was examined in this study, the long-term effects of chemotherapy (eg, neurotoxicity) and G-CSF prophylaxis on quality of life should be addressed in future studies.
 
Conclusion
Our study demonstrated that the use of 4-day primary G-CSF prophylaxis can reduce neutropenic toxicity from adjuvant TC chemotherapy; it enables successful chemotherapy treatment and facilitates adequate chemotherapy dose delivery. Further studies are needed to assess the effects of primary G-CSF prophylaxis and chemotherapy dose delivery on survival outcomes in patients with breast cancer.
 
Author contributions
Concept or design: CCH Kwok, F Wang, SLA Tse.
Acquisition of data: CCH Kwok, SPY Wong.
Analysis or interpretation of data: WH Wong, CCH Kwok, SLA Tse, F Wang, LL Chan.
Drafting of the manuscript: CCH Kwok.
Critical revision of the manuscript for important intellectual content: CCH Kwok, F Wang, SLA Tse, MCS Wong.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
As the Editor-in-Chief and adviser of the journal, respectively, MCS Wong and SLA Tse were not involved in the peer review process. Other authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study was approved by the Hong Kong Hospital Authority Kowloon West Cluster Research Ethics Committee (Ref: KW/EX-12-068 [53-03]). The requirement for informed consent was waived.
 
References
1. Early Breast Cancer Trialists’ Collaborative Group (EBCTCG). Effects of chemotherapy and hormonal therapy for early breast cancer on recurrence and 15-year survival: an overview of the randomized trials. Lancet 2005;365:1687-717. Crossref
2. Goldhirsch A, Wood WC, Gelber RD, et al. Progress and promise: highlights of the international expert consensus on the primary therapy of early breast cancer 2007. Ann Oncol 2007;18:1133-44. Crossref
3. Jones S, Holmes FA, O’Shaughnessy J, et al. Docetaxel with cyclophosphamide is associated with an overall survival benefit compared with doxorubicin and cyclophosphamide: 7-year follow-up of US oncology research trial 9735. J Clin Oncol 2009;27:1177-83. Crossref
4. Kuderer NM, Dale DC, Crawford J, Cosler LE, Lyman GH. Mortality, morbidity, and cost associated with febrile neutropenia in adult cancer patients. Cancer 2006;106:2258-66. Crossref
5. Aapro MS, Bohlius J, Cameron DA, et al. 2010 Update of EORTC guidelines for the use of granulocyte-colony stimulating factor to reduce the incidence of chemotherapy-induced febrile neutropenia in adult patients with lymphoproliferative disorders and solid tumours. Eur J Cancer 2011;47:8-32. Crossref
6. Lyman GH. Impact of chemotherapy dose intensity on cancer patient outcomes. J Natl Compr Canc Netw 2009;7:99-108. Crossref
7. Kuderer NM, Dale DC, Crawford J, Lyman GH. Impact of primary prophylaxis with granulocyte colony-stimulating factor on febrile neutropenia and mortality in adult cancer patients receiving chemotherapy: a systematic review. J Clin Oncol 2007;25:3158-67. Crossref
8. von Minckwitz G, Schwenkglenks M, Skacel T, et al. Febrile neutropenia and related complications in breast cancer patients receiving pegfilgrastim primary prophylaxis versus current practice neutropenia management: Results from an integrated analysis. Eur J Cancer 2009;45:608-17. Crossref
9. Renner P, Milazzo S, Liu JP, Zwahlen M, Birkmann J, Horneber M. Primary prophylactic colony-stimulating factors for the prevention of chemotherapy-induced febrile neutropenia in breast cancer patients. Cochrane Database Syst Rev 2012;(10):CD007913. Crossref
10. Smith TJ, Khatcheressian J, Lyman GH, et al. 2006 Update of recommendations for the use of white blood cell growth factors: an evidence-based clinical practice guideline. J Clin Oncol 2006;24:3187-205. Crossref
11. Crawford J, Caserta C, Roila F; ESMO Guidelines Working Group. Hematopoietic growth factors: ESMO clinical practice guidelines for the applications. Ann Oncol 2010;21 Suppl 5:v248-51. Crossref
12. Crawford J, Becker PS, Armitage JO, et al. Myeloid Growth Factors, Version 2.2017, NCCN Clinical Practice Guidelines in Oncology. J Natl Compr Canc Netw 2017;15:1520-41. Crossref
13. Herbst C, Naumann F, Kruse EB et al. Prophylactic antibiotics or G-CSF for the prevention of infections and improvement of survival in cancer patients undergoing chemotherapy. Cochrane Database Syst Rev 2009;(1):CD007107. Crossref
14. Taplitz RA, Kennedy EB, Bow EJ, et al. Antimicrobial prophylaxis for adult patients with cancer-related immunosuppression: ASCO and IDSA clinical practice guideline update. J Clin Oncol 2018;36:3043-54. Crossref
15. Rayson D, Lutes S, Sellon M, et al. Incidence of febrile neutropenia during adjuvant chemotherapy for breast cancer: a prospective study. Curr Oncol 2012;19:e216-8. Crossref
16. Myers R, Higgins B, Jeffrey M, et al. Chemotherapy induced febrile neutropenia of docetaxel with cyclophosphamide (TC) for adjuvant therapy of breast cancer in the community—reality check [abstract 2092]. Cancer Res 2009;69(24 Suppl):2092. Crossref
17. Younis T, Rayson D, Thompson K. Primary G-CSF prophylaxis for adjuvant TC or FEC-D chemotherapy outside of clinical trial settings: a systematic review and meta-analysis. Support Care Cancer 2012;20:2523-30. Crossref
18. Chow LW, Biganzoli L, Leo AD, et al. Toxicity profile differences of adjuvant docetaxel/cyclophosphamide (TC) between Asian and Caucasian breast cancer patients. Asia Pac J Clin Oncol 2017;13:372-8. Crossref
19. Yau TK, Ng TY, Soong IS, et al. Toxicity of docetaxel plus cyclophosphamide as adjuvant therapy for breast cancer in Chinese patients—the Hong Kong experience. Asia Pac J Clin Oncol 2009;5:123-8. Crossref
20. Ye X, Zhai Q, Wang ZY, Du Q, Zhu B, Yu B. Neutropenic complications in Chinese patients with breast cancer in a real-world setting. Int J Clin Exp Pathol 2017;10:651-60.
21. Nieuweboer AJ, de Morrée ES, de Graan AJ, Sparreboom A, de Wit R, Mathijssen RH. Inter-patient variability in docetaxel pharmacokinetics: a review. Cancer Treat Rev 2015;41:605-13. Crossref
22. Puisset F, Alexandre J, Treluyer JM, et al. Clinical pharmacodynamic factors in docetaxel toxicity. Br J Cancer 2007;97:290-6. Crossref
23. Ling WH, Lee SC. Inter-ethnic differences—how important is it in cancer treatment? Ann Acad Med Singap 2011;40:356-61.
24. Frasci G. Treatment of breast cancer with chemotherapy in combination with filgrastim: approaches to improving therapeutic outcome. Drugs 2002;62 Suppl 1:17-31. Crossref
25. Crawford J, Dale DC, Kuderer NM, et al. Risk and timing of neutropenic events in adult cancer patients receiving chemotherapy: the results of a prospective nationwide study of oncology practice. J Natl Compr Canc Netw 2008;6:109-18. Crossref
26. Vogel CL, Wojtukiewicz MZ, Carroll RR, et al. First and subsequent cycle use of pegfilgrastim prevents febrile neutropenia in patients with breast cancer: a multicenter, double-blind, placebo-controlled phase III study. J Clin Oncol 2005;23:1178-84. Crossref
27. Lyman GH. Guidelines of the National Comprehensive Cancer Network on the use of myeloid growth factors with cancer chemotherapy: a review of the evidence. J Natl Compr Canc Netw 2005;3:557-71. Crossref
28. Papaldo P, Lopez M, Marolla P, et al. Impact of five prophylactic filgrastim schedules on hematologic toxicity in early breast cancer patients treated with epirubicin and cyclophosphamide. J Clin Oncol 2005;23:6908-18. Crossref
29. Lyman GH, Dale DC, Culakova E, et al. The impact of the granulocyte colony-stimulating factor on chemotherapy dose intensity and cancer survival: a systemic review and meta-analysis of randomized trials. Ann Oncol 2013;24:2475-84. Crossref
30. Early Breast Cancer Trialists' Collaborative Group (EBCTCG). Increasing the dose intensity of chemotherapy by more frequent administration or sequential scheduling: a patient-level meta-analysis of 37 298 women with early breast cancer in 26 randomized trials. Lancet 2019;393:1440-52. Crossref
31. Ozer H, Armitage JO, Bennett CL, et al. 2000 Update of recommendations for the use of hematopoietic colony-stimulating factors: evidence-based, clinical practice guidelines. American Society of Clinical Oncology Growth Factors Expert Panel. J Clin Oncol 2000;18:3558-85. Crossref
32. Nitz U, Gluz O, Clemens M, et al. West German Study PlanB Trial: adjuvant four cycles of epirubicin and cyclophosphamide plus docetaxel versus six cycles of docetaxel and cyclophosphamide in HER2-negative early breast cancer. J Clin Oncol 2019;37:799-808. Crossref
33. Caparica R, Bruzzone M, Poggio F, Ceppi M, de Azambuja E, Lambertini M. Anthracycline and taxane-based chemotherapy versus docetaxel and cyclophosphamide in the adjuvant treatment of HER2-negative breast cancer patients: a systematic review and meta-analysis of randomized controlled trials. Breast Cancer Res Treat 2019;174:27-37. Crossref

Neonatal mortality in singleton pregnancies: a 20-year retrospective study from a tertiary perinatal unit in Hong Kong

Hong Kong Med J 2022;28(6):430-7 | Epub 29 Nov 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Neonatal mortality in singleton pregnancies: a 20-year retrospective study from a tertiary perinatal unit in Hong Kong
Genevieve PG Fung, MB BChir, MRCPCH1; SL Lau, MB, ChB, MRCOG2; Annie SY Hui, MB, ChB, MRCOG2; Sani TK Wong, MB, ChB2; WT Tse, MB, ChB, MRCOG2; PC Ng, MD, FRCPCH1; DS Sahota, PhD2; HS Lam, MD, FRCPCH1; TY Leung, MD, FRCOG2
1 Department of Paediatrics, The Chinese University of Hong Kong, Hong Kong
2 Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Hong Kong
 
Corresponding author: Prof TY Leung (tyleung@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: The global neonatal death (NND) rate has been declining in recent decades, but there are no comprehensive data concerning the characteristics of NNDs in Hong Kong. This study investigated the trends and aetiologies of NNDs among singleton pregnancies in Hong Kong.
 
Methods: This study included all cases of NND from singleton pregnancies in a tertiary hospital in Hong Kong between 2000 and 2019. The rates, clinical characteristics, and aetiologies of NND were compared between the first (2000-2009) and the second (2010-2019) decades.
 
Results: The NND rate decreased from 1.66/1000 livebirths (97 cases) in the first decade to 1.32/1000 livebirths (87 cases) in the second decade. Congenital or genetic abnormalities (82 cases) caused 44.6% of all NNDs. There was a significant reduction from 0.82/1000 livebirths in the first decade to 0.52/1000 livebirths in the second decade (P=0.037). Other causes of NND were prematurity (69 cases; 37.5%), sepsis (16 cases; 8.7%), hypoxic-ischaemic encephalopathy (15 cases; 8.2%), and sudden infant death syndrome (2 cases; 1.1%). Gestational age-specific neonatal mortality for moderately preterm neonates (31-33 weeks of gestation) significantly decreased from 34.73/1000 in 2000-2009 to 8.63/1000 in 2010-2019 (P=0.001), but there were no significant changes in neonatal mortality for other gestations.
 
Conclusions: The NND rate in Hong Kong is among the lowest worldwide. Neonatal deaths in our centre declined over the past two decades, mainly because of improvements in the prenatal diagnosis and treatment of congenital or genetic abnormalities, as well as an improved survival rate among moderately preterm neonates.
 
 
New knowledge added by this study
  • The rate of neonatal mortality among singleton pregnancies in Hong Kong decreased from 1.66/1000 livebirths in 2000-2009 to 1.32/1000 livebirths in 2010-2019.
  • The decline in the neonatal mortality rate mainly resulted from improvements in the prenatal diagnosis and treatment of congenital or genetic abnormalities, as well as an improved survival rate among moderately preterm neonates (31-33 weeks of gestation).
Implications for clinical practice or policy
  • Future improvements in the neonatal mortality rate should focus on in utero treatment, expanded carrier screening of genetic abnormalities, and the prevention of preterm birth and pre-eclampsia.
 
 
Introduction
Perinatal and neonatal mortality rates are important measures of the quality of medical care during pregnancy, childbirth, and the neonatal period. Although the global neonatal death (NND) rate has demonstrated a decreasing trend over the past 30 years, from 37/1000 livebirths to 17/1000 livebirths between 1990 and 2020,1 NND rates considerably vary among regions. According to the 2020 report by the World Health Organization, the NND rates were the highest in Africa (27/1000 livebirths), Eastern Mediterranean (25/1000 livebirths) and South-East Asia (18/1000 livebirths); the NND rates were the lowest in Americas (7/1000 livebirths), Western Pacific (5/1000 livebirths) and Europe (4/1000 livebirths).1 In Hong Kong, territory-wide statistics indicated NND rates of 1.2/1000 and 1.0/1000 livebirths in 2004 and 2014, respectively2; these rates are lower than the rates in most regions, according to the above report by the World Health Organization. However, there have been few in-depth studies concerning the trends and underlying causes of NND in Hong Kong. Our group recently published two epidemiological studies regarding singleton pregnancies in Hong Kong, which revealed a decreasing trend in the rate of stillbirths among singleton pregnancies from 3.61/1000 in 2000-2009 to 3.09/1000 in 2010-2019.3 The rate of perinatal mortality in multiple pregnancies also decreased from 5.52/1000 to 4.59/1000 during the same period.4 These improvements in mortality rates have mainly occurred because of advances in the prenatal diagnosis and management of fetal malformations and genetic diseases, as well as improvements in the antenatal management of multiple pregnancies. The present study investigated the trends of NNDs among singleton pregnancies in the largest tertiary perinatal centre in Hong Kong, as well as changes in the characteristics and aetiologies of NND over the past two decades, with the goal of improving perinatal care in Hong Kong.
 
Methods
Study setting
This retrospective study included all singleton pregnancies that delivered at the Prince of Wales Hospital from 1 January 2000 to 31 December 2019. The STROBE reporting guideline was followed when writing this manuscript. The Prince of Wales Hospital is affiliated with The Chinese University of Hong Kong and serves a large population of 1.7 million in the New Territories East region of Hong Kong; the hospital’s annual delivery rate is 6000 to 7000 (approximately one-sixth of the total births in all public hospitals in Hong Kong, and one-ninth of the total births in Hong Kong). Both the obstetric unit and the neonatal unit are the largest in Hong Kong. The neonatal unit is a Level III centre that consists of a 22-bed neonatal intensive care unit (NICU). The staff of the neonatal unit worked closely with the staff of the obstetric unit to manage high-risk deliveries from complicated pregnancies, as well as pregnancies that required fetal intervention after referral from other hospitals.
 
Perinatal and neonatal management
Complicated pregnancies were discussed in weekly perinatal meetings attended by the staff of both the obstetric unit and the neonatal unit; discussions of these pregnancies focused on management plans and the optimal timing of delivery. Relevant disciplines (eg, paediatric surgery, cardiology, neurosurgery, radiology, or otolaryngology) were included as appropriate. In cases where a specialist service outside of Prince of Wales Hospital (eg, cardiothoracic surgery) was anticipated after delivery, specialists from other centres were invited to participate in management planning. Active resuscitation was provided for all viable neonates delivered at ≥24 weeks. For extremely premature neonates with borderline viability (ie, delivered at 22-23 weeks of gestation), considering the high risks of mortality and long-term morbidity, comprehensive counselling was provided to affected families, which allowed them to select active resuscitation or no resuscitation at birth. In accordance with the departmental protocol, the paediatric unit was requested to prepare for rapid management of deliveries that involved specific maternal or fetal conditions (eg, prematurity, fetal distress, instrumental deliveries, and antenatally diagnosed severe congenital abnormalities). Neonates were managed in the NICU in accordance with the standard unit protocols and guidelines. These protocols were updated regularly according to the latest evidence-based consensus guidelines and recommendations established by clinicians in Hong Kong and other nations. In accordance with departmental guidelines, comprehensive investigations were performed to determine the cause of death in all cases of NND. All NNDs were referred for autopsy unless the cause of death was clearly identified (eg, trisomy 13 or 18 confirmed by genetic tests). If the cause of NND could not be identified, the case was reported to the coroner. If NND was caused by multiple pathologies, the most clinically significant pathology that contributed to death was selected for analysis.
 
Data collection and analysis
All cases of NND in livebirths of singleton pregnancies during the study period were retrieved using the Hospital Authority’s Clinical Data Analysis and Reporting System. Cases of NND in livebirths of multiple pregnancies were excluded. The included cases of NND were divided into two groups according to the decade of birth. The first group (ie, first decade) included cases of NND among singleton pregnancies that delivered between 1 January 2000 and 31 December 2009. The second group (ie, second decade) included cases of NND among singleton pregnancies that delivered between 1 January 2010 and 31 December 2019. Obstetric data including maternal demographics (maternal age, maternal illnesses, antenatal complications, and treatment) and birth history (gestation, mode of delivery, sex, birth weight, Apgar scores, and neonatal resuscitation) were collected from the Obstetric Specialty Clinical Information System. Neonatal data comprising neonatal diagnoses, interventions, and length of survival were retrieved from the Hospital Authority’s Clinical Management System. When further details were needed, individual case records were retrieved for analysis.
 
All NNDs were categorised as early NND (within 7 days after birth) or late NND (within 8-28 days after birth). Early, late, and total rates of NND, as well as baseline demographics, were compared between the two groups. Causes of death were divided into four main categories: prematurity, hypoxic-ischaemic encephalopathy (HIE), congenital abnormalities, and sepsis. Congenital abnormalities were defined by characteristic features on physical examination, confirmed by either genetic tests, diagnostic investigations, or autopsy. Sepsis was defined on the basis of positive cultures established using samples of blood, urine, cerebrospinal fluid, or tissue from the affected neonate. Hypoxic-ischaemic encephalopathy was diagnosed in accordance with criteria derived from international guidelines.5
 
Statistical analysis
The analysis was performed using data that overlapped with a previous study.3 Categorical variables were compared by the Chi squared test or Fisher’s exact test. The threshold of statistical significance was defined as a two-sided P value of <0.05. Data analysis was performed with the SPSS software (Windows version 22.0; IBM Corp, Armonk [NY], United States).
 
Results
Overall and gestational age-specific neonatal mortality
There were 124 281 livebirths from singleton pregnancies between 2000 and 2019 (Table 1). The number of livebirths increased by 12.7% from 58 442 in the first decade (2000-2009) to 65 839 in the second decade (2010-2019). There were 184 NNDs (1.48/1000 livebirths) between 2000 and 2019, including 97 in the first decade (1.66/1000 livebirths) and 87 in the second decade (1.32/1000 livebirths). Overall, there were 136 cases (73.9%) of early NND and 48 cases (26.1%) of late NND.
 

Table 1. Neonatal mortality in singleton pregnancies during 2000-2009 and 2010-2019
 
The maternal demographic characteristics of all singleton pregnancies during the study period were reported in our previous paper.3 The distribution of gestational age among all livebirths did not differ between the two decades (P=0.237) [Table 2]. The highest rate of NND (195.12/1000 livebirths) was observed in extremely preterm neonates (≤27 weeks of gestation) [Table 3]. The rate of NND decreased with increasing gestational age, such that NND rates were 48.42/1000, 20.13/1000, 4.98/1000, and 0.37/1000 for neonates delivered at gestational ages of 28-30 weeks, 31-33 weeks, 34-36 weeks, and ≥37 weeks, respectively. Compared with the first decade, there was a significant reduction (75.2%) in the rate of NND among neonates delivered at 31-33 weeks of gestation during the second decade (34.73/1000 vs 8.63/1000; P=0.001); however, there were no significant differences in the rates of NND among neonates in other gestational groups.
 

Table 2. Distribution of gestational age among all livebirths (including neonatal deaths) at time of delivery during 2000-2009 and 2010-2019
 

Table 3. Gestational age-specific neonatal mortality† during 2000-2009 and 2010-2019
 
Causes of neonatal death
The primary causes of NND in the two decades are shown in Table 4. Congenital or genetic abnormalities was the most common cause of NND (82 of 184; 44.6%) during the 20-year study period. Other common causes of NND were prematurity (69 cases; 37.5%), sepsis (16 cases; 8.7%), and HIE (15 cases; 8.2%) [Supplementary Fig].
 

Table 4. Causes of neonatal death in singleton pregnancies during 2000-2009 and 2010-2019
 
Chromosomal abnormalities caused 18.3% (15 of 82) of NNDs related to congenital or genetic abnormalities; all of these abnormalities were trisomy 13 or 18. Structural abnormalities caused 63.4% (52 of 82) of NNDs related to congenital or genetic abnormalities, and respiratory system abnormalities were the most common causes in both decades (22 cases). These respiratory system abnormalities included congenital diaphragmatic hernia (13 cases), pulmonary hypoplasia (5 cases), alveolar capillary dysplasia (2 cases), and tracheal stenosis or atresia (2 cases). The next most common causes were congenital cardiac abnormalities (8 cases), including transposition of the great arteries (2 cases), total anomalous pulmonary venous drainage (2 cases), endocardial cushion defect (2 cases), hypoplastic left heart syndrome (1 case), and congenital heart block (1 case); central nervous system abnormalities (8 cases), including anencephaly (3 cases), central nervous system malformation (4 cases), and brain tumour (1 case); and musculoskeletal abnormalities (7 cases), including fetal akinesia syndrome with arthrogryposis (3 cases), spinal muscular atrophy (2 cases), and skeletal dysplasia (2 cases). There were two cases of gastrointestinal abnormalities (volvulus and bowel atresia with meconium peritonitis), two cases of sacrococcygeal teratoma, two cases of multiple abnormalities, and one case of bilateral renal agenesis (a urogenital abnormality). There were also nine cases of haemoglobin Barts disease and six cases of idiopathic hydrops. There was a statistically significant decline in the rate of NND caused by congenital or genetic abnormalities, from 0.82/1000 livebirths in the first decade to 0.52/1000 livebirths in the second decade (P=0.037).
 
There were no significant differences in the rates of NND caused by prematurity, sepsis, or HIE between the two decades. Cases of NND due to sepsis were mainly caused by Group B Streptococcus in the first decade and Escherichia coli in the second decade. The majority of HIE cases (67.7%) were related to acute intrapartum events, including placenta abruption (5 cases), uterine rupture (2 cases), vasa praevia (1 case), cord accident (1 case), and chorioamnionitis (1 case).
 
Discussion
The NND rate in our tertiary centre is consistent with the rate of 1.2/1000 livebirths in the territory-wide report2 and lower than the rates in many developed countries (eg, the United States, Australia, and nations located in Europe; neonatal mortality rates of 2-3/1000 livebirths).1 2 The global NND rate has been decreasing over the past two decades because of advances in perinatal care.2 Our overall NND rate decreased by 20%, from 1.66/1000 in the first decade to 1.32/1000 in the second decade. This decrease is mainly the result of a decrease in NNDs related to congenital or genetic disorders, as well as a decrease in NNDs among neonates delivered at 31-33 weeks of gestation.
 
Neonatal death due to congenital abnormalities
Similar to our previous report, which showed a reduction in the rate of congenital or genetic abnormality-related stillbirths,3 the present study showed that the rate of congenital or genetic abnormality-related NNDs decreased from 0.82/1000 livebirths in the first decade to 0.52/1000 livebirths in the second decade. This decline was presumably because of improvements in antenatal screening and the early detection of lethal congenital abnormalities, which resulted in termination of pregnancy before 24 weeks of gestation. Universal first trimester combined screening for Down syndrome was implemented by the Hospital Authority in 2010.6 In 2011, non-invasive cell-free fetal DNA tests for common trisomies, as well as chromosomal microarrays for the diagnosis of chromosomal microdeletion syndromes, became available in the private sector.7 8 Expanded antenatal screening of inborn errors of metabolism was launched in the private sector in 2013; this expanded screening has gradually become available in the public sector since 2018.9 Although we expected a decline in the rate of trisomy-related NNDs after universal aneuploidy screening became available in 2011, there was an increase in the rate of trisomy 18–related NNDs (from 2 cases to 7 cases). A review of the individual cases revealed that the rate of trisomy 13–related NNDs decreased from six cases in the first decade to none in the second decade. Conversely, five of the seven cases of trisomy 18–related NND in the second decade were in pregnancies that had not received any screening; all of these five cases occurred during the period from 2010 to 2013. The other two cases of trisomy 18–related NND were diagnosed during prenatal screening, but the parents chose conservative management rather than termination of pregnancy. To further reduce mortality associated with hereditary genetic disorders such as spinal muscular atrophy and fetal akinesia syndrome (which caused NND in 5 cases), there is a need for expanded carrier screening of parents, particularly in families with a history of consanguineous marriage.10 11
 
The other main congenital abnormalities that caused NND in our cohort were cardiorespiratory and neuromusculoskeletal disorders, among which congenital diaphragmatic hernia was the most common. Although survival was common among neonates with mild to moderate congenital diaphragmatic hernia, neonates with severe congenital diaphragmatic hernia had a survival rate of 10% to 20% because of pulmonary hypoplasia. A recent large randomised controlled trial showed that fetoscopic endoluminal tracheal occlusion can improve the survival rate to 40% to 50%.12 In our unit, a baby survived after treatment with fetoscopic endoluminal tracheal occlusion in 2020.13 Pulmonary hypoplasia caused by hydrothorax or lung tumours can also be effectively and safely treated before birth with newly designed instruments such as the Somatex® shunt for pleuro-amniotic shunting, and radiofrequency ablation of the tumour feeding artery, respectively.14 15 Fetal tumours such as sacrococcygeal teratoma, placental chorioangioma, and lung tumours remain challenging to manage because the rapid growth of tumours in utero increases the risk of preterm birth and leads to impaired neonatal cardiac function. We have demonstrated improvements in survival after in utero embolisation of chorioangioma using cyanoacrylate, and after in utero radiofrequency ablation of lung sequestration.15 16 Although spinal muscular atrophy has no cure, it can be prevented by accurate parental carrier screening using genomic technology and prenatal diagnosis.10
 
Neonatal death due to hydrops fetalis
The rate of idiopathic hydrops fetalis–related NND decreased from 5.2% in the first decade to 1.1% in the second decade. Advances in antenatal diagnostic techniques in recent years have identified the underlying causes of many conditions which may have previously been regarded as ‘idiopathic hydrops fetalis’.17 The early diagnosis of treatable conditions in the antenatal period can prevent the development of severe hydrops fetalis and subsequent NND.17 18 Intrauterine blood transfusion for fetal anaemia and anti-arrhythmic treatment19 has significantly reduced the rate of hydrops fetalis, resulting in improved survival and long-term outcomes.
 
Neonatal death due to prematurity
Our study showed a significant (75.2%) decrease in the rate of NND among moderately preterm neonates (31-33 weeks of gestation) from 34.73/1000 in 2000-2009 to 8.63/1000 in 2010-2019; however, the rate of NND did not change in other gestational groups (Table 3). The decrease in mortality among moderately preterm neonates could be attributed to the implementation of multiple approaches for the management of such neonates since 2010, including improved ventilation strategies with early extubation to non-invasive ventilation, new methods for surfactant administration (eg, the ‘less invasive surfactant administration’ method), and improvements in NICU care through continuous quality improvement programmes. The rate of NND among extremely preterm neonates (24-27 weeks of gestation) was 175-211/1000, which is comparable with the rates in other developed countries (139-326/1000).20 It is difficult to reduce the rate of NND among extremely preterm neonates. Research is ongoing regarding artificial placenta and womb technology, and the results may improve the survival of extremely preterm neonates in the future.21
 
The rate of prematurity-related NND can be reduced by preventing preterm delivery; however, this prevention remains a challenging goal. Although our overall preterm delivery rate of 7% is lower than the rates in other developed countries,1 22 it has remained at this level for the past two decades, and there has been no variations in gestation age-specific neonatal mortality among preterm categories. In a previous study, we demonstrated that measurements of cervical length can help to identify pregnant women who are at higher risk of preterm delivery, although the risk prediction values for Chinese women in Hong Kong are lower than the corresponding values for women in non-Asian countries.23 Additional methods to predict the onset of labour (eg, cervical elastography, immune markers, and genetic markers) should be explored to improve accuracy.24 25 Prophylactic progesterone is effective in reducing the risk of preterm delivery among women who have a short cervix.26 Although the use of a cervical ring pessary reportedly had a similar effect in a Spanish study,27 this result was not confirmed by a randomised controlled trial in Hong Kong28 or by subsequent meta-analysis.29 Pre-eclampsia is a common complication that requires medically induced preterm delivery. First trimester screening of pre-eclampsia, followed by prophylactic aspirin treatment in high-risk cases, is a proven strategy to effectively delay the onset of pre-eclampsia and the associated preterm births.29 Our recent study confirmed the accuracy of a screening programme for pre-eclampsia.30 Reductions in pre-eclampsia–related preterm births and mortality may be achieved by the implementation of a universal screening programme in the future.
 
Neonatal death due to hypoxic ischaemic encephalopathy
Despite advances in NICU management of HIE and the use of therapeutic hypothermia since 2011, the rate of HIE-related NND did not improve during the study period. Approximately 67% of HIE-related NNDs were caused by acute and unpredictable perinatal events such as cord prolapse, uterine rupture, vasa praevia, or placental abruption. We previously reported an infant death secondary to severe cerebral palsy as a result of prolonged shoulder dystocia, which occurred during the first study decade.31 Therefore, team-based training for the above perinatal events is needed to ensure that the obstetric team can respond appropriately and efficiently so that the risk of HIE and associated perinatal mortality can be reduced. During these situations that involved irreversible peripartum hypoxia, we showed that umbilical cord arterial pH decreased as the length of the bradycardia-to-delivery interval increased.31 32 33 With appropriate training, we were able to achieve a median bradycardia-to-delivery interval of 10 minutes and a median decision-to-delivery interval of 11 minutes,32 which was effective in preventing peripartum mortality. Furthermore, we showed that during umbilical cord prolapse, the knee-chest position is the most effective approach for relieving fetal compression of the prolapsed cord34; we also formulated an algorithm for acute resolution of cord prolapse.35 Shoulder dystocia is associated with macrosomia, but the optimal fetal weight cut-off for prophylactic elective caesarean delivery has not been established. Our previous study suggested a cut-off of 4.2 kg may help to prevent shoulder dystocia.36 With effective training and correct use of manoeuvres such as posterior arm delivery, we recently showed that the head-to-delivery interval can be shortened and the Apgar scores can be improved.37 38 We also proposed a modified posterior axillary sling technique to relieve severe shoulder dystocia.39
 
Neonatal death due to sepsis
The rate of severe sepsis-related NND is low and has been decreasing over the past two decades. Since the implementation of universal Group B Streptococcus screening and peripartum antibiotic prophylaxis in 2012, the rate of early onset Group B Streptococcus infection has significantly decreased from 1/1000 to 0.24/1000 births.40 Despite the reduced risk of neonatal Group B Streptococcus infection, recent reports have shown an increase in Escherichia coli–related early-onset neonatal sepsis.41 Clinicians should remain vigilant concerning the presence of chorioamnionitis and risk factors for sepsis.
 
To our knowledge, this is the largest and most comprehensive analysis of neonatal mortality during a 20-year period in Hong Kong. Nevertheless, there were a few limitations in this study. First, it was performed in a single large centre, rather than in a large segment of the population. Because the Prince of Wales Hospital is the main centre for fetal intervention in Hong Kong, many high-risk pregnancies are referred from adjacent hospitals, which may have led to an over-representation of complex cases and a bias towards worse outcomes. Second, some case details were not available for analysis because of the retrospective nature of the study. Third, our study excluded cases of NND among neonates with borderline viability (gestational age: 22-23 weeks and 6 days) because such NNDs are regarded as miscarriages based on the legal definition in Hong Kong. Although some parents of neonates with borderline viability requested resuscitation, the survival rate in this small group was zero according to a recent study in our centre.42 Finally, because the rate of NND is very low in Hong Kong, this study could have been strengthened by including data regarding the rates of major morbidities (eg, cerebral palsy). Nonetheless, our findings provide a basis for future territory-wide reviews of perinatal outcomes.
 
Conclusion
Hong Kong has one of the lowest rates of NND worldwide. The neonatal mortality in our centre has decreased from 1.66/1000 livebirths to 1.32/1000 livebirths over the past two decades, mainly because of improvements in the prenatal diagnosis and treatment of congenital or genetic abnormalities, as well as an improved survival rate among moderately preterm neonates. Future improvements should focus on in utero treatment, expanded carrier screening for genetic abnormalities, and the prevention of preterm birth and pre-eclampsia.
 
Author contributions
Concept or design: GPG Fung, TY Leung.
Acquisition of data: All authors.
Analysis or interpretation of data: GPG Fung, TY Leung.
Drafting of the manuscript: GPG Fung, TY Leung.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
Ethical approval was obtained from the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee (Ref No.: CRE 2017.442).
 
References
1. World Health Organization. Neonatal mortality rate (0 to 27 days per 1000 live births) (SDG 3.2.2). Available from: https://www.who.int/data/gho/indicator-metadata-registry/imr-details/67. Accessed 5 Oct 2022.
2. Hong Kong College of Obstetricians & Gynaecologists. Territory-wide audit in obstetrics & gynaecology 2014. Available from: https://www.hkcog.org.hk/hkcog/Download/Territory-wide_Audit_in_Obstetrics_Gynaecology_2014.pdf. Accessed 15 Jun 2022.
3. Wong ST, Tse WT, Lau SL, Sahota DS, Leung TY. Stillbirth rate in singleton pregnancies: a 20-year retrospective study from a public obstetric unit in Hong Kong. Hong Kong Med J 2022;28:285-3. Crossref
4. Lau SL, Wong ST, Tse WT, et al. Perinatal mortality rate in multiple pregnancies: a 20-year retrospective study from a tertiary obstetric unit in Hong Kong. Hong Kong Med J 2022;28:347-56. Crossref
5. Executive summary: Neonatal encephalopathy and neurologic outcome, second edition. Report of the American College of Obstetricians and Gynecologists’ Task Force on Neonatal Encephalopathy [editorial]. Obstet Gynecol 2014;123:896-901. Crossref
6. Sahota DS, Leung WC, Chan WP, To WW, Lau ET, Leung TY. Prospective assessment of the Hong Kong Hospital Authority universal Down syndrome screening programme. Hong Kong Med J 2013;19:101-8.
7. Chan YK, Leung WC, Leung TY, et al. Women’s preference for non-invasive prenatal DNA testing versus chromosomal microarray after screening for Down syndrome: a prospective study. BJOG 2018;125:451-9. Crossref
8. Hui AS, Chau MH, Chan YM, et al. The role of chromosomal microarray analysis among fetuses with normal karyotype and single system anomaly or nonspecific sonographic findings. Acta Obstet Gynecol Scand 2021;100:235-43. Crossref
9. Chong SC, Law LK, Hui J, Lai CY, Leung TY, Yuen YP. Expanded newborn metabolic screening programme in Hong Kong: a three-year journey. Hong Kong Med J 2017;23:489-96. Crossref
10. Chan OY, Leung TY, Cao Y, et al. Expanded carrier screening using next-generation sequencing of 123 Hong Kong Chinese families: a pilot study. Hong Kong Med J 2021;27:177-83. Crossref
11. Siong KH, Au Yeung SK, Leung TY. Parental consanguinity in Hong Kong. Hong Kong Med J 2019;25:192-200. Crossref
12. Deprest JA, Nicolaides KH, Benachi A, et al. Randomized trial of fetal surgery for severe left diaphragmatic hernia. N Engl J Med 2021;385:107-18. Crossref
13. TOPick. 3次流產40歲婦終懷孕胎兒27周時橫膈膜穿洞威爾斯婦產科團隊施宮內手術力保胎兒順利出世. Available from: https://topick.hket.com/article/2757416. Accessed 15 Jun 2022.
14. Chung MY, Leung WC, Tse WT, et al. The use of Somatex Shunt for fetal pleural effusion: a cohort of 8 procedures. Fetal Diagn Ther 2021;48:440-7. Crossref
15. Tse WT, Poon LC, Wah YM, Hui AS, Ting YH, Leung TY. Bronchopulmonary sequestration successfully treated with prenatal radiofrequency ablation of the feeding artery. Ultrasound Obstet Gynecol 2021;58:325-7. Crossref
16. Cheng YK, Yu SC, So PL, Leung TY. Ultrasound-guided percutaneous embolisation of placental chorioangioma using cyanoacrylate. Fetal Diagn Ther 2017;41:76-9. Crossref
17. Swearingen C, Colvin ZA, Leuthner SR. Nonimmune hydrops fetalis. Clin Perinatol 2020;47:105-21. Crossref
18. Songdej D, Babbs C, Higgs DR; BHFS International Consortium. An international registry of survivors with Hb Bart’s hydrops fetalis syndrome. Blood 2017;129:1251-9. Crossref
19. Donofrio MT, Moon-Grady AJ, Hornberger LK, et al. Diagnosis and treatment of fetal cardiac disease: a scientific statement from the American Heart Association. Circulation 2014;129:2183-242. Crossref
20. Ancel PY, Goffinet F; EPIPAGE-2 Writing Group, et al. Survival and morbidity of preterm children born at 22 through 34 weeks’ gestation in France in 2011: results of the EPIPAGE-2 cohort study. JAMA Pediatr 2015;169:230-8. Crossref
21. De Bie FR, Davey MG, Larson AC, Deprest J, Flake AW. Artificial placenta and womb technology: past, current, and future challenges towards clinical translation. Prenat Diagn 2021;41:145-58. Crossref
22. Hui AS, Lao TT, Leung TY, Schaaf JM, Sahota DS. Trends in preterm birth in singleton deliveries in a Hong Kong population. Int J Gynaecol Obstet 2014;127:248-53. Crossref
23. Leung TN, Pang MW, Leung TY, Poon CF, Wong SM, Lau TK. Cervical length at 18-22 weeks of gestation for the prediction of spontaneous preterm delivery in Hong Kong Chinese women. Ultrasound Obstet Gynecol 2005;25:713-7. Crossref
24. Feng Q, Chaemsaithong P, Duan H, et al. Screening for spontaneous preterm birth by cervical length and shear-wave elastography in the first trimester of pregnancy. Am J Obstet Gynecol 2022;227:500.e1-14. Crossref
25. Chim SS, Lee WS, Ting YH, Chan OK, Lee SW, Leung TY. Systematic identification of spontaneous preterm birth-associated RNA transcripts in maternal plasma. PLoS One 2012;7:e34328. Crossref
26. Romero R, Nicolaides KH, Conde-Agudelo A, et al. Vaginal progesterone decreases preterm birth ≤34 weeks of gestation in women with a singleton pregnancy and a short cervix: an updated meta-analysis including data from the OPPTIMUM study. Ultrasound Obstet Gynecol 2016;48:308-17. Crossref
27. Goya M, Pratcorona L, Merced C, et al. Cervical pessary in pregnant women with a short cervix (PECEP): an open-label randomised controlled trial. Lancet 2012;379:1800-6. Crossref
28. Hui SY, Chor CM, Lau TK, Lao TT, Leung TY. Cerclage pessary for preventing preterm birth in women with a singleton pregnancy and a short cervix at 20 to 24 weeks: a randomized controlled trial. Am J Perinatol 2013;30:283-8. Crossref
29. Conde-Agudelo A, Romero R, Nicolaides KH. Cervical pessary to prevent preterm birth in asymptomatic high-risk women: a systematic review and meta-analysis. Am J Obstet Gynecol 2020;223:42-65.e2. Crossref
30. Chaemsaithong P, Pooh RK, Zheng M, et al. Prospective evaluation of screening performance of first trimester prediction models for preterm preeclampsia in Asian population. Am J Obstet Gynecol 2019;221:650.e1-16. Crossref
31. Leung TY, Stuart O, Sahota DS, Suen SS, Lau TK, Lao TT. Head-to-body delivery interval and risk of fetal acidosis and hypoxic ischaemic encephalopathy in shoulder dystocia: a retrospective review. BJOG 2011;118:474-9. Crossref
32. Leung TY, Chung PW, Rogers MS, Sahota DS, Lao TT, Chung TK. Urgent cesarean delivery for fetal bradycardia. Obstet Gynecol 2009;114:1023-8. Crossref
33. Wong L, Tse WT, Lai CY et al. Bradycardia-to-delivery interval and fetal outcomes in umbilical cord prolapse. Acta Obstet Gynecol Scand 2021;100:170-7. Crossref
34. Kwan AH, Chaemsaithong P, Wong L, et al. Transperineal ultrasound assessment of fetal head elevation by maneuvers used for managing umbilical cord prolapse. Ultrasound Obstet Gynecol 2021;58:603-8. Crossref
35. Wong L, Kwan AH, Lau SL, Sin WT, Leung TY. Umbilical cord prolapse: revisiting its definition and management. Am J Obstet Gynecol 2021;225:357-66. Crossref
36. Cheng YK, Lao TT, Sahota DS, Leung VK, Leung TY. Use of birth weight threshold for macrosomia to identify fetuses at risk of shoulder dystocia among Chinese populations. Int J Gynaecol Obstet 2013;120:249-53. Crossref
37. Chan EH, Lau SL, Leung TY. Changes in the incidence and management of shoulder dystocia over 20 years from a tertiary obstetric unit in Hong Kong. Hong Kong Med J. In press.
38. Leung TY, Stuart O, Suen SS, Sahota DS, Lau TK, Lao TT. Comparison of perinatal outcomes of shoulder dystocia alleviated by different type and sequence of manoeuvres: a retrospective review. BJOG 2011;118:985-90. Crossref
39. Kwan AH, Hui AS, Lee JH, Leung TY. Intrauterine fetal death followed by shoulder dystocia and birth by modified posterior axillary sling method: a case report. BMC Pregnancy Childbirth 2021;21:672. Crossref
40. Ma TW, Chan V, So CH, et al. Prevention of early onset group B streptococcal disease by universal antenatal culture-based screening in all public hospitals in Hong Kong. J Matern Fetal Neonatal Med 2018;31:881-7. Crossref
41. Stoll BJ, Puopolo KM, Hansen NI, et al. Early-onset neonatal sepsis 2015 to 2017, the rise of Escherichia coli, and the need for novel prevention strategies. JAMA Pediatr 2020;174:e200593. Crossref
42. Hon KL, Liu S, Chow JC, et al. Mortality and morbidity of extremely low birth weight infants in Hong Kong, 2010-2017: a single-centre review. Hong Kong Med J 2018;24:460-5. Crossref

THIRD bedside ultrasound protocol for rapid diagnosis of undifferentiated shock: a prospective observational study

Hong Kong Med J 2022 Oct;28(5):383-91 | Epub 29 Sep 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE (HEALTHCARE IN MAINLAND CHINA)
THIRD bedside ultrasound protocol for rapid diagnosis of undifferentiated shock: a prospective observational study
P Geng, MD1 #; B Ling, MM1 #; Y Yang, MM1; Joseph Harold Walline, MD2; Y Song, MM1; M Lu, MD1; H Wang, MM1; Q Zhu, MM1; D Tan, MD1; J Xu, MD3
1 Department of Emergency Medicine, Clinical Medical College of Yangzhou University, Northern Jiangsu People’s Hospital, Yangzhou, China
2 Accident and Emergency Medicine Academic Unit, Prince of Wales Hospital, The Chinese University of Hong Kong, Hong Kong
3 Department of Emergency Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences, Beijing, China
# The first two authors contributed equally to this work
 
Corresponding author: Dr D Tan (tandingyu1981@163.com)
 
 Full paper in PDF
 
Abstract
Introduction: It is clinically challenging to differentiate the pathophysiological types of shock in emergency situations. Here, we evaluated the ability of a novel bedside ultrasound protocol (Tamponade/tension pneumothorax, Heart, Inferior vena cava, Respiratory system, Deep venous thrombosis/aorta dissection [THIRD]) to predict types of shock in the emergency department.
 
Methods: An emergency physician performed the THIRD protocol on all patients with shock who were admitted to the emergency department. All patients were closely followed to determine their final clinical diagnoses. The kappa index, sensitivity, specificity, positive predictive value, and negative predictive value were calculated for the initial diagnostic impression provided by the THIRD protocol, compared with each patient’s final diagnosis.
 
Results: In total, 112 patients were enrolled in this study. The kappa index between initial impression and final diagnosis was 0.81 (95% confidence interval=0.73-0.89; P<0.001). For hypovolaemic, cardiogenic, distributive, and obstructive types of shock, the sensitivities of the THIRD protocol were 100%, 100%, 93%, and 100%, respectively; the sensitivity for a ‘mixed’ shock aetiology was 86%. The negative predictive value of the THIRD protocol for all five types of shock was ≥96%.
 
Conclusion: Initial diagnostic judgements determined using the THIRD protocol showed favourable agreement with the final diagnosis in patients who presented with undifferentiated shock. The THIRD protocol has great potential for use as a bedside approach that can guide the rapid management of undifferentiated shock in emergency settings, particularly for patients with obstructive, hypovolaemic, or cardiogenic shock.
 
 
New knowledge added by this study
  • Differentiating shock types in emergency situations is clinically challenging. We sought to assess the ability of a novel bedside ultrasound protocol (Tamponade/tension pneumothorax, Heart, Inferior vena cava, Respiratory system, Deep venous thrombosis/aorta dissection [THIRD]) in predicting shock aetiology. Shock aetiology determined using the THIRD protocol showed acceptable agreement with the final diagnosis of shock in critically ill emergency department patients.
  • The THIRD protocol demonstrated very high negative predictive values when it was used to evaluate patients with hypovolaemic, cardiogenic, distributive, and obstructive shock.
  • The THIRD protocol showed least sensitivity for evaluation of mixed aetiology shock.
Implications for clinical practice or policy
  • Our findings support incorporation of the THIRD protocol into routine emergency department assessment of patients with undifferentiated shock to help guide early treatment.
  • Additional clinical assessments should be conducted to confirm a diagnosis of mixed shock made using the THIRD protocol.
 
 
Introduction
Undifferentiated shock is a common presenting condition in the emergency department (ED) which requires timely and effective interventions. The rapid and accurate differentiation of possible shock aetiologies is essential for reducing morbidity and mortality in critically ill patients with shock.1 Patients with undifferentiated shock in the ED often have an acute onset of severe illness, unstable vital signs, a limited medical history, and sparse physical examination findings.2 Point-of-care ultrasound (POCUS) has a crucial role in the management of undifferentiated shock because it is the only visual imaging tool that can provide real-time information concerning the key elements of haemodynamics.3 4 The use of POCUS in the ED has been rapidly increasing because it is safe, reliable, non-invasive, rapid, and repeatable at the bedside.5
 
The first ultrasound protocol for undifferentiated shock was published in 20046; since then, several additional protocols have been developed. The results of multiple studies have provided evidence that POCUS can help to differentiate the cause of hypotension, identify the most immediate life-threatening conditions, improve diagnostic certainty, and optimise treatment.7 8 The ‘Tamponade/tension pneumothorax, Heart, Inferior vena cava, Respiratory system, Deep venous thrombosis/aorta dissection’ (THIRD) bedside ultrasound protocol was published in 2017; it is the first POCUS protocol for undifferentiated shock in emergency medicine in mainland China.9 Compared with the ‘Rapid Ultrasound for Shock and Hypotension’ (RUSH) protocol, the THIRD protocol has been reported to significantly increase physician trainee self-confidence when diagnosing undifferentiated shock.10
 
The THIRD protocol is now widely accepted and regularly used by emergency physicians in China; to our knowledge, the protocol has not been validated in any studies thus far. This study was conducted to examine the effectiveness and accuracy of the THIRD protocol as an early and rapid bedside approach for the investigation of undifferentiated shock in emergency settings. We hypothesised that THIRD early diagnostic predictions would not demonstrate significant inconsistency with the final clinical diagnosis.
 
Methods
Enrolment
This single-centre prospective observational study enrolled patients with shock who presented to the emergency intensive care unit (EICU) section of the ED at a large, urban, tertiary teaching hospital between October 2017 and May 2019. The ED of the hospital has approximately 240 000 visits per year, and >800 patients annually are admitted to the 15-bed EICU for extended management. Shock was defined as acute circulatory failure which led to inadequate cellular oxygen utilisation.11 We established the enrolment criteria for this study based on clinical feasibility and previous literature12 13: (1) age >18 years and <95 years; (2) systolic blood pressure <90 mmHg or shock index (pulse/systolic blood pressure) >1.0, confirmed after ≥3 measurements during the first assessment; and (3) at least one of the following symptoms or signs of hypoperfusion: altered mental status (eg, syncope, delirium, or unresponsiveness), respiratory distress, oliguria, severe fatigue or discomfort, skin mottling, elevated blood lactate, and severe chest pain or abdominal pain. Patients with the following conditions were excluded from the study: (1) a pre-existing ‘hypotensive’ state recorded in past medical history or reported by the patient; (2) transfer from another hospital with a known diagnosis of shock type; and (3) no definite diagnosis of shock established during hospitalisation, despite plenary discussion of their clinical data.
 
Point-of-care ultrasound technique
The POCUS is routinely performed in all hemodynamically unstable patients in our EICU. In this study, an independent emergency physician with specific competence in emergency ultrasound performed the THIRD protocol evaluation upon patient arrival (Fig 1). The physician had completed a 20-hour emergency ultrasound workshop including the THIRD protocol and had 3 years of experience with >300 ultrasound examinations per year. The physician was unaware of the history of present illness or any other diagnostic test results; the pathophysiological diagnosis of shock was made based on ultrasound findings (Table 1). Ultrasound evaluation was performed with a Philips® Sparq ultrasound device routinely used in the EICU. This ultrasound system contains a high-frequency 4-12 MHz linear probe, a 2-6 MHz curvilinear probe, and a 2-4 MHz cardiac probe.
 

Figure 1. THIRD protocol for bedside ultrasound evaluation of undifferentiated shock
 

Table 1. THIRD protocol: possible ultrasound findings in shock
 
The THIRD protocol is divided into the following five parts.
 
Tamponade or tension pneumothorax: First, the subcostal cardiac view is used to determine the presence of any pericardial effusion; then, evidence of right atrial or right ventricular diastolic collapse or cardiac oscillation is assessed to identify signs of pericardial tamponade-related shock. Second, the bilateral anterior thorax in the mid-clavicular lines is explored to identify pleural sliding, the ‘seashore’ sign, A lines, and B lines. If the above signs are not found and a ‘stratosphere’ sign or lung point is identified, tension pneumothorax–related obstructive shock is suspected.
 
Heart: The SMART (Size, Motivation, Aorta, Rhythm/rate, Tricuspid regurgitation) procedure is used to assess the heart size, shape, and wall motion; aortic diameter; presence of an aortic intimal flap; cardiac rhythm and rate; and presence of tricuspid regurgitation on the parasternal long-axis, parasternal short-axis, and apical four-chamber views. These assessments help to clarify the cause and type of shock with respect to cardiac function (Table 2).
 

Table 2. SMART procedure for focused cardiac ultrasound assessment
 
Inferior vena cava: The subcostal longitudinal acoustic window is used to localise the inferior vena cava. The diameter and respiratory variation rate of the inferior vena cava are measured to estimate central venous pressure, assess right heart function and overall blood volume, and evaluate indirect evidence of shock caused by hypovolaemia, right heart failure, pulmonary embolism, or pulmonary hypertension.
 
Respiratory system: Lung ultrasound assessment is performed using a symmetrical three-point technique to identify common lung ultrasound signs (eg, pleural fluid, pleural sliding, A lines, B lines, shred sign, and lung rockets). These signs are indicators of shock caused by lung consolidation, massive pleural effusion/haemorrhage, or other aetiologies such as pulmonary oedema.
 
Deep venous thrombosis or aortic dissection: The acoustic windows of the bilateral symmetrical inguinal areas and popliteal fossae are used to detect and assess the compressibility of the femoral veins and popliteal veins; these assessments facilitate the identification of deep vein thromboses. Because pulmonary emboli are commonly associated with deep venous thrombosis from the lower extremities, this ultrasound technique is an indirect test for shock caused by pulmonary emboli. Scans of the abdominal aorta in horizontal sections of the peritoneal trunk, superior mesenteric artery, and renal artery are then conducted to determine whether aortic dissection or aneurysm is present.
 
Clinical evaluation and final diagnosis
Upon admission to the EICU, the following information was recorded for all enrolled patients: demographic data, co-morbidities, APACHE II (acute physiology and chronic health evaluation II) score, need for mechanical ventilation, and physiological data (eg, mean arterial pressure, shock index, lactate level, and central venous oxygen saturation). All patients were closely followed to confirm their final diagnosis of shock. A panel of three board-certified physicians (D Tan, emergency physician; J Ye, radiologist; and J Zhang, cardiologist) established the diagnosis of shock type based on all relevant clinical data including history of present illness, signs, auxiliary examination results. Disagreements concerning diagnosis were resolved using a majority vote approach. Patients were excluded if their diagnosis could not be agreed upon by at least two physicians.
 
Statistical analysis
Statistical analysis was performed using SPSS (Windows version 21; IBM Corp., Armonk [NY], United States). Sample size calculations were performed prior to enrolment; to detect a protocol accuracy of >90% using the kappa method and considering an anticipated 10% rate of exclusion or dropout,14 at least 100 patients were required. Thus, we planned a sample size of >110 patients in this study. We calculated the kappa index between the diagnosis of shock type according to the THIRD protocol and the final diagnosis of shock. Additionally, we separately assessed the kappa agreement and reliability indices (sensitivity, specificity, positive predictive value [PPV] and negative predictive value [NPV])of the THIRD protocol for each type of shock. For this analysis, we excluded patients without a definite final diagnosis of shock type.
 
Results
Patient characteristics and final clinical diagnoses
In total, 120 patients were enrolled; eight patients were excluded prior to the analysis (two patients had a history of hypotension in their previous medical records, three patients were transferred from another hospital with a known diagnosis of shock type, and three patients did not have a definite final diagnosis of shock type) [Fig 2]. In the final sample size of 112 patients, 54% were men, with a mean age of 66.5 ± 13.5 years and a mean arterial pressure of 51.2 ± 10.9 mmHg at presentation to the EICU. The mean duration of a complete THIRD protocol evaluation was 9.1 ± 1.5 minutes. The baseline characteristics of enrolled patients are shown in Table 3. The final charted clinical diagnoses of the 112 patients are reported in Table 4.
 

Figure 2. Patient enrolment flowchart
 

Table 3. Baseline characteristics of enrolled patients (n=112)
 

Table 4. Final charted clinical diagnoses of 112 patients with symptomatic undifferentiated shock at presentation
 
The final clinical diagnoses of the 112 patients, based on chart assessment by the three auditors and the THIRD protocol evaluation results, are shown in Table 5. The most common type of shock in our study was distributive shock (36 patients, 32.1%). The kappa index for general agreement between final clinical diagnosis and the type of shock identified using the THIRD protocol was 0.81 (95% confidence interval=0.73-0.89; P<0.001) for all patients. Table 6 shows the kappa index, sensitivity, specificity, PPV, and NPV of the THIRD protocol for determining each type of shock among patients with definite final diagnoses.
 

Table 5. Prevalences of types of shock based on final clinical diagnosis and THIRD protocol evaluation
 

Table 6. Reliability indices and kappa agreement values of the THIRD protocol for each individual shock subtype
 
Hypovolaemic shock
Using the THIRD protocol, 32 patients had a diagnosis of hypovolaemic shock. The causes of hypovolaemic shock were traumatic bleeding (n=23), sepsis (n=2), acute gastrointestinal bleeding (n=3), cachexia (n=1), and pancreatitis (n=1). The remaining two patients included in the 32 patients were misdiagnosed with hypovolaemic shock based on their ultrasound findings, but the final clinical diagnoses were mixed shock (n=1) and distributive shock secondary to sepsis (n=1) [97.3% specificity and 95% PPV].
 
Cardiogenic shock
Using the THIRD protocol, 21 patients had a diagnosis of cardiogenic shock. Of them, 19 patients had a final clinical diagnosis of cardiogenic shock due to decompensated heart failure (n=8), acute myocardial infarction (n=7), and intoxication (n=4). The final clinical diagnosis for the remaining two patients was mixed aetiology shock (97.83% specificity and 90.9% PPV). The agreement between ultrasound findings and the final diagnosis was 92% (P<0.001) for cardiogenic shock.
 
Distributive shock
Using the THIRD protocol, 33 patients were diagnosed with distributive shock. Of them, 30 patients had a final clinical diagnosis of sepsis (concurrent pneumonia [n=26], concurrent cholangitis [n=2], concurrent urinary tract infections [n=2]) and three patients had a final clinical diagnosis of neurogenic aetiologies. Three other patients had a final clinical diagnosis of distributive shock, who were initially misdiagnosed using the THIRD protocol with hypovolaemic (n=1) and mixed aetiology shock (n=2). The agreement between ultrasound findings and the final diagnosis was 89% (P<0.001) for distributive shock.
 
Obstructive shock
Using the THIRD protocol, five patients were diagnosed with obstructive shock, and all five of them had a final clinical diagnosis of obstructive shock (cardiac tamponade [n=3], large, acute pulmonary embolism [n=2]). The agreement between ultrasound findings and the final diagnosis was 100% (P<0.001) for obstructive shock.
 
Mixed aetiology shock
Using the THIRD protocol, 21 patients were diagnosed with mixed aetiology shock. Of them, 19 had a final clinical diagnosis of mixed aetiology shock (sensitivity of 90.4%). The remaining two patients were misdiagnosed with mixed aetiology shock; the final clinical diagnosis was distributive shock (n=2). Three other patients had a final clinical diagnosis of mixed aetiology shock, who were initially misdiagnosed using the THIRD protocol with hypovolaemic (n=1) or cardiogenic shock (n=2). The THIRD protocol had the lowest agreement (82%, P<0.001) with the final diagnosis for mixed aetiology shock.
 
Discussion
In this prospective study, the primary diagnosis after implementation of the THIRD protocol in patients with undifferentiated shock was highly concordant with the final clinical diagnosis. The protocol was highly effective in guiding the rapid bedside management of undifferentiated shock in emergency settings, particularly for patients with obstructive, hypovolaemic, or cardiogenic shock.
 
Point-of-care ultrasound is the only tool available at the bedside that can rapidly reveal acute pathophysiology and establish key diagnoses to guide targeted interventions.15 As in other disciplines, POCUS has been commonly used in emergency medicine; it is an essential skill for emergency physicians. In mainland China, bedside ultrasound has been used in EDs since 2006, mainly for guidance during vascular puncture procedures and for the assessment of free intraperitoneal fluid.16 After a decade of rapid development, nearly half of EDs in China have dedicated bedside ultrasound equipment.17 The applications of POCUS have gradually expanded to undifferentiated hypotension, shortness of breath, chest pain, sepsis, and cardiac arrest, as well as other clinical manifestations.
 
To our knowledge, the THIRD protocol is the first ultrasound protocol for assessment of undifferentiated shock in mainland China, and this is the first study to validate the effectiveness and accuracy of the protocol. This study demonstrated favourable general agreement between the final clinical diagnosis of shock and the results of this early bedside ultrasound assessment (kappa=0.81). Similar to the RUSH protocol,13 the highest agreements were observed in patients with hypovolaemic, cardiogenic, and obstructive shock (kappa values of 0.92, 0.92, and 1.00, respectively). The NPVs for these shock types were all 100%, suggesting that the THIRD protocol can reliably exclude these types of shock. Clinically significant hypovolaemia, cardiac dysfunction, cardiac tamponade, pulmonary embolism, and tension pneumothorax are readily identifiable on ultrasound; the corresponding signs facilitate rapid diagnosis and ensure minimal delays in life-saving interventions for these conditions.4 18
 
The sensitivity, NPV, and kappa values of the THIRD protocol were lower for distributive or mixed shock than for the other three types of shock. Sepsis was the main cause of distributed shock in our study; high cardiac output with reduced vascular resistance is the main pathophysiological feature of this type of shock.19 A plausible explanation for this pathophysiological feature is that a hyperdynamic heart is not specific to distributive shock, and a decrease in vascular resistance lacks specific ultrasound signs. Our protocol had the least sensitivity and agreement for mixed aetiology shock. Considering this increased uncertainty, caution is advised when making a diagnosis of a ‘mixed’ type of shock.
 
Since its initial use in 2001 by Rose et al,20 POCUS has been increasingly used in the management of patients with undifferentiated shock in the ED. Furthermore, >15 ultrasound protocols for hypotension have been developed since 2001.7 These protocols consist of items such as echocardiography, transthoracic scanning, evaluation of the inferior vena cava and aorta, assessment of free fluid in the abdominal cavity, and detection of deep vein thrombosis. The overall goal of these protocols is to provide a comprehensive and practical approach for the classification of clinical syndromes that involve circulatory failure—syndromes which lack specificity and may have substantially different possible treatments—into four specific and manageable types of shock.
 
The RUSH protocol is one of the most frequently used POCUS protocols for undifferentiated shock.21 Multiple studies have shown that the kappa index of the RUSH protocol–based ultrasound diagnosis and the final clinical diagnosis is approximately 0.7.13 14 22 The RUSH protocol is used to find ultrasound abnormalities in three major aspects of a patient’s physiology, including ‘pump, tank, and pipe’.23 Unlike the RUSH protocol or other existing POCUS shock protocols (eg, ACES,24 UHP,20 or FATE25), each letter of the THIRD protocol represents a specific ultrasound assessment. The THIRD protocol is easy for clinicians to remember and can be used as a practical checklist for ultrasound examination during the management of patients with shock. This might explain why trainees had greater confidence and performance when using the THIRD protocol than when using the RUSH protocol in a training curriculum.10
 
There were some limitations in this study. First, we did not exclude certain patients, such as patients with traumatic injuries or gastrointestinal bleeding. However, trauma and gastrointestinal bleeding are common among patients who present to our ED, and the inclusion of these patients ensures that our shock assessment is consistent with real-world emergency settings. Second, we did not compare the THIRD protocol with other protocols, such as the RUSH protocol; thus, we cannot conclusively state whether the THIRD and RUSH protocols are equally effective. Third, we did not assess the impact of the THIRD protocol on subsequent treatment. In a previous study, 24.6% of patients had a statistically significant change in their management after a POCUS protocol examination.12
 
In conclusion, this study demonstrated that the initial diagnostic judgements obtained using the THIRD protocol in the ED are consistent with the final diagnosis in patients who present with undifferentiated shock. The findings in this study encourage the incorporation of the THIRD protocol into routine ED assessment of patients with undifferentiated shock to help guide early interventions. The impact of the THIRD protocol on the outcomes of patients with shock, as well as comparisons of the effectiveness and accuracy of the THIRD protocol with other POCUS protocols, should be the focus of future studies.
 
Author contributions
Concept or design: P Geng, B Ling, J Xu, D Tan.
Acquisition of data: B Ling, Y Song, H Wang, Q Zhu, Y Yang, M Lu.
Analysis or interpretation of data: Y Song, H Wang, Q Zhu, Y Yang, M Lu.
Drafting of the manuscript: B Ling, JH Walline, D Tan, P Geng.
Critical revision of the manuscript for important intellectual content: D Tan, JH Walline, J Xu, P Geng.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
The authors declare that they have no competing interests.
 
Acknowledgement
We are grateful to J Ye and J Zhang for their contributions to establish the diagnosis of shock type in this study.
 
Funding/support
This work was supported by the Rui E Special Fund for Emergency Medicine Research (R2017003), the Yangzhou Science and Technology Development Plan (YZ2018090), the Yangzhou Phase III ‘Talent Cultivation Program’ Support Project (2018034), the Hospital-Level Support Project of Northern Jiangsu People’s Hospital (yzucms2018943), and the Hospital-Level Support Project of Northern Jiangsu People’s Hospital (fcjs201708, fcjs201842).
 
Ethics approval
This study protocol was approved by the Institutional Ethics Committee of Northern Jiangsu People’s Hospital (Ref: fcjs2017008). Written informed consent was obtained from all patients (or next of kin, if the patient was unable to provide informed consent). The study was registered in the Chinese Clinical Trial Registry (Ref: ChiCTR2000031072).
 
References
1. Hiemstra B, Eck RJ, Keus F, van der Horst IC. Clinical examination for diagnosing circulatory shock. Curr Opin Crit Care 2017;23:293-301. Crossref
2. Goodwin M, Ito K, Gupta AH, Rivers EP. Protocolized care for early shock resuscitation. Curr Opin Crit Care 2016;22:416-23. Crossref
3. Sasmaz MI, Gungor F, Guven R, Akyol KC, Kozaci N, Kesapli M. Effect of focused bedside ultrasonography in hypotensive patients on the clinical decision of emergency physicians. Emerg Med Int 2017;2017:6248687. Crossref
4. Koster G, van der Horst IC. Critical care ultrasonography in circulatory shock. Curr Opin Crit Care 2017;23:326-33. Crossref
5. Whitson MR, Mayo PH. Ultrasonography in the emergency department. Crit Care 2016;20:227. Crossref
6. PRISM Investigators; Rowan KM, Angus DC, Bailey M, et al. Early, goal-directed therapy for septic shock—a patient-level meta-analysis. N Engl J Med 2017;376:2223-34. Crossref
7. Mok KL. Make it SIMPLE: enhanced shock management by focused cardiac ultrasound. J Intensive Care 2016;4:51. Crossref
8. Alpert EA. The ABCDs of ResUS—Resuscitation ultrasound. Cureus 2019;11:e4616. Crossref
9. Unexplained shock emergency ultrasound clinical practice expert consensus group. Expert consensus on emergency ultrasound practice for unexplained shock [in Chinese]. Chin J Emerg Med 2017;26:498-506.
10. Shi D, Liu J, Xu J, Zhu H, Yu X. Evaluation of a new goal-directed training curriculum for point-of-care ultrasound in the emergency department: impact on physician self-confidence and ultrasound skills. Eur J Trauma Emerg Surg 2021;47:435-44. Crossref
11. Cecconi M, De Backer D, Antonelli M, et al. Consensus on circulatory shock and hemodynamic monitoring. Task force of the European Society of Intensive Care Medicine. Intensive Care Med 2014;40:1795-815. Crossref
12. Shokoohi H, Boniface KS, Pourmand A, et al. Bedside ultrasound reduces diagnostic uncertainty and guides resuscitation in patients with undifferentiated hypotension. Crit Care Med 2015;43:2562-9. Crossref
13. Ghane MR, Gharib MH, Ebrahimi A, et al. Accuracy of rapid ultrasound in shock (RUSH) exam for diagnosis of shock in critically ill patients. Trauma Mon 2015;20:e20095.
14. Volpicelli G, Lamorte A, Tullio M, et al. Point-of-care multiorgan ultrasonography for the evaluation of undifferentiated hypotension in the emergency department. Intensive Care Med 2013;39:1290-8. Crossref
15. Sun JT. New advances in emergency ultrasound protocols for shock. J Med Ultrasound 2017;25:191-4. Crossref
16. Vincent JL, De Backer D. Circulatory shock. N Engl J Med 2013;369:1726-34. Crossref
17. Shi D, Walline JH, Yu X, Xu J, Song PP, Zhu H. Evaluating and assessing the prevalence of bedside ultrasound in emergency departments in China. J Thorac Dis 2018;10:2685-90. Crossref
18. Bernier-Jean A, Albert M, Shiloh AL, Eisen LA, Williamson D, Beaulieu Y. The diagnostic and therapeutic impact of point-of-care ultrasonography in the intensive care unit. J Intensive Care Med 2017;32:197-203. Crossref
19. Russell JA. Vasopressor therapy in critically ill patients with shock. Intensive Care Med 2019;45:1503-17. Crossref
20. Rose JS, Bair AE, Mandavia D, Kinser DJ. The UHP ultrasound protocol: a novel ultrasound approach to the empiric evaluation of the undifferentiated hypotensive patient. Am J Emerg Med 2001;19:299-302. Crossref
21. Keikha M, Salehi-Marzijarani M, Soldoozi Nejat R, Sheikh Motahar Vahedi H, Mirrezaie SM. Diagnostic accuracy of rapid ultrasound in shock (RUSH) Exam; a systematic review and meta-analysis. Bull Emerg Trauma 2018;6:271-8.
22. Ghane MR, Gharib M, Ebrahimi A, et al. Accuracy of early rapid ultrasound in shock (RUSH) examination performed by emergency physician for diagnosis of shock etiology in critically ill patients. J Emerg Trauma Shock 2015;8:5-10. Crossref
23. Perera P, Mailhot T, Riley D, Mandavia D. The RUSH exam: Rapid Ultrasound in SHock in the evaluation of the critically ill. Emerg Med Clin North Am 2010;28:29-56, vii. Crossref
24. Atkinson PR, McAuley DJ, Kendall RJ, et al. Abdominal and Cardiac Evaluation with Sonography in Shock (ACES): an approach by emergency physicians for the use of ultrasound in patients with undifferentiated hypotension. Emerg Med J 2009;26:87-91. Crossref
25. Sordillo LA, Pu Y, Pratavieira S, Budansky Y, Alfano RR. Deep optical imaging of tissue using the second and third near-infrared spectral windows. J Biomed Opt 2014;19:056004. Crossref

Vaginal delivery of second twins: factors predictive of failure and adverse perinatal outcomes

Hong Kong Med J 2022 Oct;28(5):376-82 | Epub 13 Sep 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Vaginal delivery of second twins: factors predictive of failure and adverse perinatal outcomes
SL Mok, MB, BS, FHKAM (Obstetrics and Gynaecology); TK Lo, MB, BS, FHKAM (Obstetrics and Gynaecology)
Department Obstetrics and Gynaecology, Princess Margaret Hospital, Hong Kong
 
Corresponding author: Dr SL Mok (juliaslmok@yahoo.com.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study was performed to explore factors associated with adverse perinatal outcomes for second twins and to identify predictive factors for successful vaginal delivery of the second twin after vaginal delivery of the first twin.
 
Methods: This 10-year retrospective study included 231 cases of twin pregnancies in which vaginal delivery of the second twin was attempted after vaginal delivery of the first twin. The relationships of obstetric characteristics with the composite adverse perinatal outcome of the second twin were analysed. Predictive factors for successful vaginal delivery of the second twin were also explored.
 
Results: Gestational age <32 weeks was the only independent risk factor for the composite adverse perinatal outcome and neonatal intensive care unit admission for the second twin. A longer inter-twin delivery interval was associated with greater risk of caesarean delivery of the second twin, but it did not increase the risk of an adverse perinatal outcome. Non-vertex presentation of the second twin at delivery was independently associated with caesarean delivery (9.0% vs 2.0%, P=0.03). For second twins in breech presentation, caesarean delivery was associated with the presence of less experienced birth attendants.
 
Conclusion: Among second twins born to mothers who had attempted vaginal delivery, adverse perinatal outcomes were mainly related to prematurity. The presence of more experienced birth attendants may contribute to successful vaginal delivery of the second twin, particularly for twins in non-vertex presentation.
 
 
New knowledge added by this study
  • Among second twins born to mothers who had attempted vaginal delivery, adverse perinatal outcomes were mainly related to prematurity, rather than actual mode of delivery.
  • An inter-twin delivery interval of >30 minutes alone did not increase the risk of an adverse perinatal outcome, although it increased the risk of caesarean delivery of the second twin.
  • For second twins in breech presentation, caesarean delivery was independently associated with a longer intertwin delivery interval (>30 minutes) and the presence of less experienced birth attendants.
Implications for clinical practice or policy
  • Our findings support vaginal delivery of the second twin when the first twin is delivered in cephalic presentation.
  • If monitoring of the second twin is possible and the findings are reassuring, obstetricians may consider a conservative approach, even 30 minutes after delivery of the first twin; emergency caesarean delivery should be readily available if necessary.
 
 
Introduction
Selection of the mode of delivery in a twin pregnancy is always challenging for obstetricians, although vaginal delivery is theoretically feasible for diamniotic twins if the first twin is in cephalic presentation.1 In the past 15 years, two cohort studies2 3 and a multicentre randomised trial4 concluded that when the first twin was in cephalic presentation, planned caesarean delivery did not significantly decrease or increase the risk of fetal/neonatal death or serious neonatal morbidity, compared with planned vaginal delivery. These findings suggest that vaginal delivery of twins is a safe and reasonable mode of delivery. However, attempts to deliver vaginally are not always successful, and the intrapartum risks of adverse outcomes for second twins should be carefully considered.
 
In a study of factors that were predictive of successful vaginal delivery, Easter et al5 found that the vaginal delivery rates of second twins in non-vertex presentation were comparable with the vaginal delivery rates of second twins in vertex presentation. Successful vaginal delivery was associated with higher parity. In the subgroup of second twins in non-vertex presentation, successful vaginal delivery was associated with the presence of more experienced birth attendants. The rates of neonatal morbidity and mortality were low in both groups, and they did not differ between groups. However, that study only included twins with gestational ages of ≥32 weeks.
 
In a study that examined caesarean delivery of the second twin after successful vaginal delivery of the first twin, Breathnach et al6 found that the most common indication for caesarean delivery of the second twin was malpresentation (transverse/shoulder/brow) or compound presentation. Second twins who were delivered by emergency caesarean section after vaginal delivery of the first twin had a perinatal morbidity rate of 29%, but there were only 14 such twins; thus, the sample size was insufficient for robust statistical analysis.
 
There is a need for additional information concerning factors predictive of successful vaginal delivery of the second twin, which will allow better case selection and avoid combined vaginal-caesarean delivery (ie, failed vaginal delivery of the second twin). To our knowledge, there have been few studies of these factors in Asian populations. Here, we examined the medical records of second twins born to mothers who had attempted vaginal delivery of twins in Hong Kong; we sought to identify factors that could affect the perinatal outcomes and predict failure of vaginal delivery in a predominantly Asian population. We also included deliveries of preterm gestations (23-32 weeks), which were not extensively investigated in previous studies.
 
Methods
This retrospective study focused on twin pregnancies that were delivered between 1 January 2006 and 31 December 2015 in Princess Margaret Hospital, a regional public hospital in Kowloon, Hong Kong. Inclusion criteria were vaginal delivery of the first twin at gestational viability or beyond. Exclusion criteria were miscarriage (delivery before gestational viability) or delivery of the first twin by elective or emergency caesarean section. Under Hong Kong law, 24 full weeks of gestation is generally regarded as the threshold of gestational viability. In exceptional cases, the threshold may be reduced to 23 weeks if, after full discussion with the obstetric and neonatal care teams, the parents demonstrate a strong preference for earlier initiation of active neonatal management.
 
Eligible cases were identified from the Obstetric Clinical Information System (OBSCIS); for each case, the mother’s demographic and clinical data were retrieved. The OBSCIS is a territory-wide electronic database that contains the prenatal, intrapartum, and postpartum information of all mothers who receive care in public hospitals in Hong Kong. Clinical information in the system is updated in a timely manner by each patient’s midwives and physicians before the patient is discharged from the hospital. Data entry integrity is continuously monitored by a dedicated information technology team within the Hospital Authority, and each obstetrics unit is asked to provide missing data promptly. Each infant’s clinical information was retrieved from the Electronic Patient Record, a comprehensive system that contains all health information (except obstetric records) of patients from birth to death and is shared by all public hospitals and out-patient clinics under the Hong Kong Hospital Authority.
 
The following maternal data were retrieved: age, parity, gestational age at delivery, chorionicity, and mode of delivery of the second twin. The following infant data were retrieved: birth weight, Apgar score, cord blood pH, delivery time, inter-twin delivery interval, presentation at delivery, and neonatal intensive care unit (NICU) admission status.
 
The primary outcome of the study was a composite adverse perinatal outcome that included any of the following: Apgar score <6 at 5 minutes after birth, cord blood pH <7, NICU admission, birth trauma, and presence of neonatal complications. For infants with a hospital stay of >28 days, complications until hospital discharge were included. The following complications were considered: respiratory morbidity, intracranial haemorrhage, hypoxic ischaemic encephalopathy, sepsis, metabolic disturbance, birth defects, and neonatal death. The secondary outcome was mode of delivery. Gestational age was established by the patient’s last menstrual period and verified by ultrasound in the first or early second trimester. Chorionicity was established by prenatal ultrasound and confirmed by placental histology after delivery. The likelihood of vaginal delivery may be adversely impacted by considerably larger second twin size, compared with the first twin. Breathnach et al6 found that the rate of caesarean section was higher if the first twin had ≥20% lower weight than the second twin. Therefore, clinically significant weight discordance was regarded as ≥20% in the present study, where weight discordance was defined as the weight difference between the second and first twin divided by the weight of first twin.
 
Vaginal deliveries of twins were managed in accordance with our labour ward protocol, which does not regard estimated fetal weight discordance as a contra-indication to vaginal delivery. All deliveries were attended by two physicians (as described below) and assisted by ≥2 midwives. Specialist supervision was recommended. In this context, a specialist is an obstetrician who has completed ≥6 years of postgraduate residency training and received accreditation as a Fellow of the Hong Kong College of Obstetricians and Gynaecologists (FHKCOG). Membership in the Royal College of Obstetricians and Gynaecologists (MRCOG) is a prerequisite for FHKCOG accreditation. When a specialist was unavailable (particularly at night), deliveries were conducted or supervised by an MRCOG-qualified physician. Paediatricians were present for all deliveries of second twins. Prenatal steroids (either betamethasone or dexamethasone depending on pharmacy availability and initial treatment at the referral unit) were administered in cases of delivery before 34 weeks of gestation. If necessary, oral nifedipine was used as a first-line tocolytic. Intravenous salbutamol was used as a second-line tocolytic until 2012; since 2013, atosiban has been used as a second-line tocolytic.
 
Statistical analysis was carried out using SPSS software (Windows version 17.0; SPSS Inc., Chicago [IL], United States). Categorical data were analysed by the Chi squared test or Fisher’s exact test, as appropriate. Among the factors that showed statistical significance in univariate analysis, binary logistic regression was used to identify factors that were independently predictive of vaginal delivery and adverse perinatal outcomes. P values <0.05 were considered statistically significant.
 
Results
During the 10-year study period, 47 595 deliveries were performed in Princess Margaret Hospital; 718 twin pairs were delivered. Among these twin pairs, 182 and 305 were delivered by elective and emergency caesarean section, respectively; they were excluded from the study. In the remaining 231 cases, the mothers delivered the first twin vaginally and intended to deliver the second twin vaginally. The second twins in this group of patients were included for analysis.
 
Table 1 shows the demographic and obstetric characteristics of the 231 cases, stratified according to the mode of delivery of the second twin. Emergency caesarean delivery was required in 10 cases (4.3%). Among the three second twins in vertex presentation, two were delivered by caesarean section because of second twin retention; the remaining twin was delivered by caesarean section because of fetal distress. Among the seven second twins in non-vertex presentation, the indications for caesarean delivery were second twin retention (two cases), fetal distress (four cases), and transverse lie (one case). Of the factors shown in Table 1, only an inter-twin delivery interval of >30 minutes and non-vertex presentation of the second twin at delivery were associated with the mode of delivery of the second twin. Logistic regression analysis showed that an inter-twin delivery interval of >30 minutes (odds ratio [OR]=26.952, 95% confidence interval [CI]=5.924-122.619) and non-vertex presentation of the second twin at delivery (OR=5.003, 95% CI=1.101-22.743) were independently associated with caesarean delivery of the second twin.
 

Table 1. Demographic and obstetric characteristics of 231 cases of twin pregnancies in which vaginal delivery of the second twin was attempted after vaginal delivery of the first twin
 
In subgroup analyses, we examined the relationships of the demographic and obstetric factors in Table 1 to determine their relationships with the mode of delivery for second twins in breech presentation. Univariate analysis revealed that only an inter-twin delivery interval >30 minutes and the presence of less experienced birth attendants were significantly associated with the mode of delivery. Logistic regression showed that an inter-twin delivery interval >30 minutes (OR=36.492, 95% CI=3.035-438.712) and the presence of less experienced birth attendants (OR=10.252, 95% CI=1.001-104.956) were independently associated with caesarean delivery of second twins in breech presentation.
 
Perinatal outcomes of second twins are shown in Table 2. Univariate analysis revealed that the composite adverse perinatal outcome was only associated with gestational age <32 weeks (P<0.001; OR=12.1, 95% CI=2.738-53.481) [Table 3]. Similarly, gestational age <32 weeks was the only factor significantly associated with NICU admission (P<0.001; OR=6.420, 95% CI=2.073-19.878) [Table 4].
 

Table 2. Outcomes of second twins (n=231)
 

Table 3. Factors associated with the composite adverse perinatal outcome of the second twin
 

Table 4. Subgroup analysis of risk factors for neonatal intensive care unit admission
 
Among the 47 cases with delivery before 34 weeks of gestation, 27 completed steroid treatment before delivery. In 17 cases, delivery occurred before the completion of steroid treatment because of rapid labour that did not respond to tocolytics. Steroid treatment was not administered in three cases; two of these cases involved delivery before 24 weeks of gestation, which is the threshold for beginning steroid treatment in our hospital. In the third case, the mother was admitted in advanced labour. Completion or non-completion of steroid treatment was not associated with the composite adverse perinatal outcome (25/27 vs 18/20, P=0.753).
 
Discussion
To our knowledge, this is the first study in Hong Kong concerning the short-term composite adverse perinatal outcomes of second twins in cases where vaginal delivery was attempted. Our approach enabled simultaneous consideration of multiple outcome parameters. The inclusion of additional clinical information until hospital discharge for infants with prolonged hospital stay (>28 days) allowed a more comprehensive assessment of outcomes. Notably, cases of gestation <32 weeks were included; there are minimal published data for this group of infants because they have been excluded from many large trials. Additionally, we examined the effects of birth attendant experience and birth timing.
 
Perinatal outcomes
Prior to this study, there were two analyses of twin deliveries in a predominantly Asia population, both from Hong Kong. The first analysis mainly focused on patient preference regarding the mode of delivery; it also included few vaginal deliveries (35 cases).7 The second analysis, reported by Tang et al,8 was performed in the same obstetric unit as the first analysis; it reviewed neonatal and maternal outcomes after an increase in the rate of vaginal delivery of twins. The authors did not find any significant differences in neonatal morbidities between the vaginal delivery group and the elective caesarean delivery group. However, there were fewer successful vaginal deliveries of ≥1 twin (72 cases) and the effect of inter-twin delivery interval was not evaluated.
 
In this study, the main factor that affected the composite adverse perinatal outcome was gestational age; complications were mainly related to prematurity. Similarly, NICU admission was mainly related to complications of prematurity, rather than complications of vaginal delivery. There were no statistically significant differences in adverse perinatal outcomes, even for twins who were not delivered in cephalic presentation. Thus, non-cephalic presentation alone should not be considered sufficient to recommend caesarean delivery for twin pregnancies.
 
A study in Hong Kong by Leung et al,9 published in 2002, showed that all umbilical cord blood gas parameters in the second twin were significantly associated with the inter-twin delivery interval. The risk of severe fetal acidosis was 27% if the second twin was not delivered ≤30 minutes after delivery of the first twin, but the outcomes of second twins were not analysed. In our study, an inter-twin delivery interval of >30 minutes alone did not increase the risk of short-term adverse perinatal outcomes, although it increased the risk of caesarean delivery of the second twin. Schneuber et al10 also reported similar findings in their series, which suggested that an increased inter-twin delivery interval was not associated with adverse fetal outcomes. If monitoring of the second twin is possible and the findings are reassuring, obstetricians may thus consider a conservative approach, even 30 minutes after delivery of the first twin; however, emergency caesarean delivery should be readily available if necessary.
 
Our study also showed no increase in adverse perinatal outcomes for infants who were delivered after midnight. In general, delivery of twins in daytime or early evening is preferable because additional staff are present, and those staff are often more experienced. Therefore, when there are no indications for urgent delivery, the usual practice in our unit is to begin labour induction for twin pregnancies in the early morning. Deliveries after midnight usually follow spontaneous labour and are thus unplanned. However, such deliveries are supervised by the most senior on-call obstetrician (MRCOG-qualified or FHKCOG-accredited) during the intrapartum period.
 
Delivery of non-cephalic second twin
The vaginal delivery of second twins in non-cephalic presentation is challenging. Our findings showed a higher rate of caesarean delivery for second twins in non-cephalic presentation (9.0% vs 2.0%, P=0.03). In a large cohort study using the World Health Organization Global Survey dataset, Vogel et al11 showed that caesarean rates were 6.2% and 0.9% for second twins in non-cephalic and cephalic presentation, respectively. Another study by Kong et al12 revealed the caesarean delivery rates of second twins were 4.7% in cephalic presentation, 11.1% in breech presentation, and up to 90% in transverse lie. In both of these studies, analyses were conducted based on the presentation of the second twin at the onset of labour; their findings were consistent with our results. The presence of more experienced obstetricians who are able to perform artful manoeuvres (ie, internal podalic version and external cephalic version) can increase the likelihood of successful vaginal delivery of the second twin. Regular training and rehearsal of the vaginal delivery of twins is important for obstetricians to maintain their skills.
 
Caesarean section of second twin
In our study, caesarean delivery of the second twin was necessary in 4.3% of cases, which is similar to or lower than the proportions in other series.6 8 13 14 Regardless of whether the second twin was delivered by caesarean section, there were no significant increases in short-term adverse perinatal outcomes; however, this mode of delivery is less favourable for mothers. These results are contrary to the findings by Breathnach et al,6 in which the perinatal morbidity rate was 29% among second twins delivered by emergency caesarean section after vaginal delivery of the first twin. A systematic review by Rossi et al15 also showed a higher rate of morbidity in second twins after caesarean delivery (19.8% vs 9.5% after vaginal delivery). Thus, combined vaginal-caesarean delivery of twins should be avoided whenever possible.
 
In the present study, the presence of a larger second twin (≥20% weight discordance) did not significantly increase the risk of caesarean delivery. The second twin was larger in only 12 cases (5.2%). We suspect that many other cases with a larger second twin were scheduled for caesarean delivery without a trial of vaginal delivery. Decisions concerning the mode of delivery are affected by the estimated fetal weight, fetal presentation, and whether the mother has a history of successful vaginal delivery. Various factors must be carefully considered in each case.
 
Limitations
There were some limitations in this study. First, the retrospective design may have resulted in missing data or incomplete data collection. This is not a large problem because clinical information in the OBSCIS and the Electronic Patient Record is required to be updated when each patient is discharged from the hospital; therefore, these systems are reliable sources of patient data. Nevertheless, some information was not retrievable, such as the presentation of the second twin at the time of first twin delivery and whether birth attendant manoeuvres were necessary to deliver the second twin. Second, the non-randomised analysis might have led to selection bias concerning the mode of delivery, such that low-risk cases were over-represented in the study. The number of second twins delivered by caesarean section was small; a larger trial is needed to more comprehensively evaluate such cases.
 
Conclusion
Among second twins born to mothers who had attempted vaginal delivery, we found that adverse perinatal outcomes were mainly related to prematurity, rather than actual mode of delivery. For all second twins, an inter-twin delivery interval <30 minutes was associated with a higher rate of vaginal delivery; for second twins in breech presentation, the presence of more experienced birth attendants was also associated with a higher rate of vaginal delivery. Overall, the risk of caesarean delivery of the second twin was low. Our findings in a predominantly Asian population in Hong Kong support vaginal delivery of the second twin when the first twin is delivered in cephalic presentation.
 
Author contributions
This study was planned and designed by both authors. Both authors also jointly performed the data analysis. TK Lo provided leadership and supervision, while SL Mok wrote and managed the manuscript. Both authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
Both authors have disclosed no conflicts of interest.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
This study was approved by the Kowloon West Cluster Research Ethics Committee (Ref KW/EX-17-154 (118-02)). The requirement for patient informed consent was waived because this was a retrospective review of medical records that did not involve patient participation.
 
References
1. Monson M, Silver RM. Multifetal gestation: mode of delivery. Clin Obstet Gynecol 2015;58:690-702. Crossref
2. Peaceman AM, Kuo L, Feinglass J. Infant morbidity and mortality associated with vaginal delivery in twin gestations. Am J Obstet Gynecol 2009;200:462.e1-6. Crossref
3. Fox NS, Silverstein M, Bender S, Klauser CK, Saltzman DH, Rebarber A. Active second-stage management in twin pregnancies undergoing planned vaginal delivery in a U.S. population. Obstet Gynecol 2010;115:229-33. Crossref
4. Barrett JF, Hannah ME, Hutton EK, et al. A randomized trial of planned cesarean or vaginal delivery for twin pregnancy. N Engl J Med 2013;369:1295-305.Crossref
5. Easter SR, Lieberman E, Carusi D. Fetal presentation and successful twin vaginal delivery. Am J Obstet Gynecol 2016;214:116.e1-10.Crossref
6. Breathnach FM, McAuliffe FM, Geary M, et al. Prediction of safe and successful vaginal twin birth. Am J Obsetet Gynecol 2011;205:237.e1-7. Crossref
7. Liu AL, Yung WK, Yeung HN, et al. Factors influencing the mode of delivery and associated pregnancy outcomes for twins: a retrospective cohort study in a public hospital. Hong Kong Med J 2012:18:99-107.
8. Tang HT, Liu AL, Chan SY, et al. Twin pregnancy outcomes after increasing the rate of vagina twin delivery: retrospective cohort study in a Hong Kong regional obstetrics unit. J Maternal Fetal Neonatal Med 2016;29:1094-100. Crossref
9. Leung TY, Tam WH, Leung TN, Lok IH, Lau TK. Effect of twin-to-twin delivery interval on umbilical cord blood gas in the second twins. BJOG 2002;109:63-7. Crossref
10. Schneuber S, Magnet E, Haas J, et al. Twin-to-twin delivery time: neonatal outcome of second twin. Twin Res Hum Genet 2011;14:573-9. Crossref
11. Vogel JP, Holloway E, Cuesta C, Carroli G, Souza JP, Barrett J. Outcomes of non-vertex second twins, following vertex vaginal delivery of first twin: a secondary analysis of the WHO Global Survey on maternal and perinatal health. BMC Pregnancy Childbirth 2014;14:55. Crossref
12. Kong CW, To WW. The predicting factors and outcomes of caesarean section of the second twin. J Obstet Gynaecol 2017;37:709-13. Crossref
13. Yang Q, Wen SW, Chen Y, Krewski D, Fung KF, Walker M. Neonatal death and morbidity in vertex-nonvertex second twins according to mode of deliverya and birth weight. Am J Obstet Gynecol 2005;192:840-7. Crossref
14. Persad VL, Baskett TF, O’Connell CM, Scott HM. Combined vaginal-cesarean delivery of twin pregnancies. Obstet Gynecol 2001;98:1032-7. Crossref
15. Rossi AC, Mullin PM, Chmait RH. Neonatal outcomes of twins according to birth order, presentation and mode of delivery: a systematic review and meta-analysis. BJOG 2011;118:523-32. Crossref

Behavioural adaptations and responses to obstetric care among pregnant women during an early stage of the COVID-19 pandemic in Hong Kong: a cross-sectional survey

Hong Kong Med J 2022 Oct;28(5):367-75 | Epub 1 Aug 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Behavioural adaptations and responses to obstetric care among pregnant women during an early stage of the COVID-19 pandemic in Hong Kong: a cross-sectional survey
PW Hui, MD, FRCOG; Mimi TY Seto, MB, BS, MRCOG; KW Cheung, MB, BS, MRCOG
Department of Obstetrics and Gynaecology, Queen Mary Hospital, Hong Kong
 
Corresponding author: Dr PW Hui (apwhui@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: This study evaluated behavioural adaptations and responses to obstetric care among pregnant women during an early stage of the coronavirus disease 2019 (COVID-19) pandemic.
 
Methods: This cross-sectional survey included pregnant women who received obstetric care from 27 May 2020 to 16 June 2020 in a university-affiliated hospital in Hong Kong. Responses were collected with respect to obstetric appointment scheduling, workplace changes, mask-wearing practices, travel and quarantine experiences, obstetric service adjustments, and visiting arrangements. Regression analysis was used to compare the effects of patient characteristics on their responses.
 
Results: In total, 1000 surveys were distributed; 733 pregnant women provided complete survey responses. Among obstetric-related appointments in public hospitals, 16% were postponed or cancelled by pregnant women; such changes were most frequent among women beyond 24 weeks of gestation, women who had previous deliveries, and women who had a history of mental illness. The practice of working from home imposed psychological stress and negatively impacted the pregnancy experience in 4.5% of women. Childbirth companionship was regarded as an important service by 88.1% of women; only 4.2% agreed with its suspension. Obstetric service adjustments had the greatest impact on Chinese women and nulliparous women.
 
Conclusions: The findings provide an overview of how pregnant women adapted during an early stage of the COVID-19 pandemic. Women adjusted obstetric service attendance, began working from home, and wore masks. Women’s expectations did not match changes in childbirth companionship and peripartum services. Hospital administrators should consider psychological impacts on pregnant women when implementing service adjustments.
 
 
New knowledge added by this study
  • Pregnant women, especially women who had previous deliveries and a history of mental illness, were more likely to postpone or cancel obstetric appointments during an early stage of the coronavirus disease 2019 (COVID-19) pandemic.
  • While working from home improved the overall pregnancy experience for most women, it caused psychological stress and had a negative influence in 4.5% of respondents.
  • Childbirth companionship was considered important by 88.1% of the respondents; only 4.2% of respondents fully accepted its suspension.
Implications for clinical practice or policy
  • Obstetricians and policy makers should be aware of mismatches in the expectations of pregnant women concerning childbirth companionship and peripartum services; infection control should be balanced with peripartum needs.
  • Obstetric service adjustments had the greatest impact on Chinese women and nulliparous women.
 
 
Introduction
Coronavirus disease 2019 (COVID-19) has had substantial psychosocial impacts worldwide and caused major behavioural changes. In 2020, increased stress and anxiety levels were reported in countries with major disease spread.1 2 3 4 5 6 Pregnancy is considered a risk factor for COVID-19 because of relative maternal immunosuppression; there is also a risk of vertical transmission.7 8 9 10 11 Importantly, behavioural changes have been recognised among pregnant women.4 The pandemic situation could potentially disrupt obstetric care for pregnant women.8 Thus, it is important to study how the pandemic has affected obstetric care and pregnancy experiences.
 
Considering the severe adult respiratory syndrome (SARS) outbreaks in 2002 to 2003 in Hong Kong, a serious alert level was announced on 4 January 2020 in response to the emergence of novel coronavirus pneumonia in Wuhan, China.12 13 This was escalated to an emergency alert level on 25 January 2020. Corresponding policies were imposed in public hospitals with each alert announcement. In obstetric units, husbands and partners were no longer allowed to accompany pregnant women for labour and delivery. No visiting was allowed for mothers or babies staying with their mothers in postnatal wards. All antenatal exercise classes, antenatal seminars, hospital tours, and postnatal classes were suspended. Many workplaces for women and their partners shifted to working from home. These changes were coupled with suspensions of schools and non-urgent community services. The infection continued to spread worldwide; the COVID-19 pandemic was recognised by the World Health Organization on 11 March 2020. On 20 March 2020, the first case of COVID-19 in a pregnant woman was confirmed in Hong Kong. The local government subsequently restricted travel with additional quarantine measures and mandated social distancing in late March 2020.12 This study was conducted in the middle of 2020 to examine how pregnant women responded to changes in obstetric care and alterations in the workplace during an early stage of the COVID-19 pandemic; it also investigated their adaptations to the practices of mask wearing and social distancing.
 
Methods
This prospective questionnaire survey was conducted in the obstetric unit of a university-affiliated tertiary public hospital in Hong Kong from 27 May 2020 to 16 June 2020 in English (online supplementary Table 1) and Traditional Chinese (online supplementary Table 2). Pregnant women were invited to participate in an online questionnaire upon admission to obstetric wards or attendance to obstetric clinics; each invitation was provided by a midwife (in an obstetric ward) or a designated research assistant (in an obstetric clinic). Clinic sessions included an antenatal check-up, ultrasound scan, and screenings for gestational diabetes and Group B streptococcus. The survey was administered to all women who could read either Chinese or English. Each woman received an information leaflet containing an introduction of the project, a description of key events related to COVID-19 from January 2020 to March 2020, and a QR code linked to an online survey. The participants could begin the survey by scanning the QR code, selecting the language, and providing their consent.
 
The survey was developed by the authors and tailored to address issues related to the impacts of COVID-19 on obstetric services. Prior to this study, the survey content was validated by local consultant obstetricians and midwives; it consisted of demographic data collection and questions that involved five domains. These domains were related to obstetric appointment scheduling, workplace changes, mask-wearing practices, travel and quarantine experiences, and adjustments to birth companionship and visiting hours since the first novel coronavirus alerts were announced in January 2020. Concerning obstetric appointment scheduling, participants were asked whether their appointments had been postponed or rescheduled from a public hospital to a private hospital. Concerning workplace changes, participants were asked whether they and/or their partners had begun to work from home; they were then asked to describe the impact of the change on their pregnancy experience. Concerning mask-wearing practices, participants were asked about their pattern and type of mask use. With respect to travel and quarantine, participants were asked whether they had travelled because of COVID-19 risk; they were also asked about their experiences with COVID-19 testing and quarantine. Regarding the importance of birth companionship and visiting hours, as well as the acceptance of service adjustments and relief measures, participants were asked to rate their opinions of these factors using a visual analogue scale of 0 to 100, with 100 as very important or strongly accepted.
 
Women with gestational age ≤24 weeks were regarded as the early gestational group, while women with gestational age >24 weeks and women in the postnatal period were regarded as the late gestational group. The COVID-19 alert was announced by the Hong Kong government on 4 January 2020, slightly more than 20 weeks prior to the commencement of this study. Women in the early gestational group conceived after the date of COVID-19 alert announcement, while women in the late gestational group were already pregnant on the announcement date. Statistical analysis was performed using SPSS software (Windows version 26.0; IBM Corp, Armonk [NY], United States). The distributions of continuous variables were checked for normality. Analysis of variance and t test assessments were used for normally distributed variables, while the Mann-Whitney U test was used for non-normally distributed variables. Categorical variables were evaluated by the Chi squared test or Fisher’s exact test. Regression analysis was performed to examine the effects of marital status, ethnic background, parity, and mental illness on the behaviours of pregnant women regarding antenatal appointment rescheduling and their opinions of obstetric service adjustments. A value of P<0.05 was considered significant.
 
Results
In total, 1000 information leaflets were distributed to 200 women in obstetric wards and 800 women in out-patient clinics. In all, 890 responses were registered online, including 878 women who consented to participate and 12 women who did not consent. Among women who agreed to participate, 145 did not finish the survey; thus, 733 completed responses were available for analysis.
 
Table 1 shows the basic demographic characteristics of the participants. Women aged 31 to 35 years constituted nearly half (48.8%) of the respondents. The largest gestational age-group was 25 to 30 weeks (24.3%). With the exception of influenza and pertussis vaccination histories, other background characteristics were comparable between early and late gestational groups.
 

Table 1. Participant characteristics
 
Obstetric appointment scheduling
Among 2583 patient appointments, 417 (16.1%) were postponed or cancelled by pregnant women. Over half (56.1%) of these were rescheduled to a private hospital. The rate of postponement or cancellation was higher for regular antenatal visits (20.3%) and lower for foetal anomaly scans (13.6%) [Table 2]. Multivariate analysis showed that women in the late gestational group (odds ratio [OR]=2.66; 95% confidence interval [CI]=1.68-4.19; P<0.001) and women with mental illness (OR=2.20; 95% CI=1.09-4.43; P=0.03) were more likely to postpone or cancel regular antenatal appointments, while nulliparous women (OR=0.67; 95% CI=0.46-0.99; P=0.04) were less likely to make such changes (Table 3). No significant associations of demographic characteristics with ultrasound and investigation appointments (blood test, screening of Down’s syndrome, or Group B streptococcus colonisation) were identified.
 

Table 2. Arrangement of obstetric appointments after COVID-19 alert
 

Table 3. Multivariate analysis of factors affecting the antenatal appointment scheduling pattern
 
Working from home
As shown in Table 4, there were 542 (73.9%) working women in this study; more than half of them began working from home after the COVID-19 alert announcement. Compared with their husbands/partners, significantly more women were working from home (29.6% vs 52.8%; P<0.05). Among women working from home, this work pattern facilitated obstetric appointment attendance for 46.5% (133/286) of the women and 43.5% (87/200) of their husbands/partners. There was a tendency for decreased omission of antenatal appointments among women working at home, compared with women working in usual workplaces, although the differences were not statistically significant for any type of appointment (antenatal check-up 6.6% vs 6.7%, P=0.10; anomaly scan 3.7% vs 8.0%, P=0.09; obstetric scan 3.5% vs 6.6%, P=0.22; obstetric investigations 5.1% vs 8.3%, P=0.30).
 

Table 4. Work patterns, and effect of work patterns on antenatal appointment attendance
 
Overall pregnancy experience
Among the 131 women who reported that both they and their husbands/partners worked from home, 107 (81.7%) reported a better overall pregnancy experience. Among the 224 women who reported that they or their husbands/partners worked from home, 139 (62.1%) felt this work arrangement had made their overall pregnancy experience better while 13 (5.8%) felt this had made their experience worse. A significantly greater proportion of respondents reported a much better overall pregnancy experience when both they and their husbands/partners were working from home than when either they or their husbands/partners were working from home (50.4% vs 31.7%; P=0.001). In contrast, suspension of school and community services had more negative impacts on pregnancy experience (Table 5).
 

Table 5. Effect of work pattern, school suspension, and community service suspension on overall pregnancy experience
 
More time to spend at home was selected by 80.1% (197/246) of the respondents as a beneficial effect of working from home on their pregnancy experience (online supplementary Table 3). Among 18 women who had a worse pregnancy experience because of working from home, more psychological stress was chosen by 13 (72.2%) women as one of the underlying reasons. Five (27.8%) women reported greater conflict with their husbands/partners because of working from home (online supplementary Table 4).
 
Mask-wearing practices
The mean proportion of mask-wearing time was significantly greater in clinical areas (97.2% for hospitals and 97.0% for clinics) than in outdoor areas (89.3%) and at home (4.1%, P<0.05). Over 90% of respondents always wore masks in clinical areas; 63.8% always wore masks outdoors, and 0.8% always wore masks at home. Among all women in the study, surgical masks were most commonly used; N95 masks were used by 55 (7.5%) women in hospitals and 32 (4.4%) women in clinics (online supplementary Table 5).
 
Travel and quarantine experiences
Since the announcement of the COVID-19 alert, 6.8% (50/733) of respondents had travelled abroad because of COVID-19 risk in Hong Kong; 13.4% (98/733) of respondents had returned to Hong Kong because of COVID-19 risk abroad. Additionally, 31 (4.2%) women had been quarantined and 26 (3.5%) women had lived with household members during home quarantine. Coronavirus disease 2019 testing had been performed in 3.7% of all respondents. Moderate to marked emotional disturbance related to personal quarantine experience was reported by 64.5% (20/31) of the women (online supplementary Table 6).
 
Adjustments of birth companionship and visiting hours
Husband/partner companionship during childbirth was regarded as the most important obstetric service, followed by visiting hours for pregnant women and neonates. Childbirth companionship was considered important by 88.1% of the respondents; only 4.2% of the respondents fully accepted its suspension. In contrast, suspension of hospital tours was fully accepted by 27.0% of the respondents (online supplementary Fig). Univariate analysis showed that marital status, ethnicity, parity, and history of mental illness were factors that influenced opinions of obstetric service importance and acceptance of service suspension. Regression analysis showed that being married was strongly associated with greater perceived importance of childbirth companionship (B=10.51; 95% CI=5.77-15.24) and visiting hours for mothers (B=5.14; 95% CI=0.35-9.94). Chinese women had the greatest perceived importance of visiting arrangements for both mothers and babies; they had the least acceptance of suspension of those services. Nulliparity was only factor significantly associated with the perceived importance of antenatal exercise (B=23.41; 95% CI=19.00-27.82), antenatal seminars (B=28.72; 95% CI=24.41-33.03), hospital tours (B=20.03; 95% CI=14.97-25.09), and postnatal breastfeeding classes (B=25.96; 95% CI=21.59-30.33) [Table 6].
 

Table 6. Multivariate analysis of obstetric service importance and acceptance of service suspension
 
Discussion
Summary
To our knowledge, this is the first study of the behavioural adaptations and responses to obstetric care among pregnant women during an early stage of the COVID-19 pandemic in Hong Kong, a city which previously experienced SARS outbreaks in 2002-2003. Approximately 16% of obstetric-related appointments in public hospitals were postponed or cancelled by pregnant women because of COVID-19, but only 56% of these appointments were rescheduled in private hospitals. Women who had previous deliveries and a history of mental illness were more likely adjust their appointments. Working from home during the COVID-19 pandemic improved the overall pregnancy experience in most respondents. However, approximately 5% of women reported negative impacts on their pregnancy experiences, primarily because of psychological stress. Concerning obstetric services, nearly 90% of the women considered childbirth companionship to be important; <5% of the women fully accepted its suspension. More than 80% of the respondents regarded visiting for mothers and newborns as very important aspect of the overall pregnancy experience. Obstetric service adjustments had the greatest impact on Chinese women and nulliparous women.
 
Antenatal care
Delays in seeking medical attention for acute medical conditions such as cardiac and cerebrovascular events were reported in 2020.14 15 16 Importantly, failure to attend scheduled antenatal care can lead to adverse outcomes.17 18 Women in the late gestational group were already pregnant on the date of the COVID-19 alert announcement; they might have reported more adjustments to obstetric appointments. Additionally, their shifts in obstetric care and avoidance of in-hospital stays in public hospitals might be reflected by the reduced delivery rate.19 Because of their previous pregnancy experience, multiparous women might have been more likely to modify antenatal appointments. In contrast, women with mental illness require greater antenatal care and psychosocial support.20 The establishment of virtual clinics for online assessment without exposing pregnant women to COVID-19 risk in clinical areas offers an important alternative.21 To establish such clinics, antenatal protocols must be revised to incorporate virtual visits when ultrasounds, physical examinations, and obstetric investigations are unnecessary. Pregnant women would also require stable internet access, as well as foetal doppler and blood pressure monitoring equipment.
 
Working from home
Prior to and during the survey period, no complete lockdowns were instituted in Hong Kong, although working from home was encouraged. In this study, slightly more than half of working women were working from home after the COVID-19 alert. There is a need to consider safety for women who reported greater conflict with their husband/partner while working from home. Increased domestic violence was observed during the early stages of the COVID-19 pandemic; greater relationship friction and household conflict could be contributing factors.22 Public policy should be revised to facilitate the identification of women in need of conflict assistance when physical and psychosocial support may be limited because of physical isolation and the suspension of community services.9 23
 
Behavioural adaptations
In this study, >90% of pregnant women reported wearing a mask in clinical areas, although <10% reported wearing an N95 mask in hospitals. Our finding of 90% mask usage in clinical areas was much greater than the 31.8% observed among the general public in Taiwan in 2020.3 While the high rate of mask use could represent compliance with hospital policies regarding mandatory mask use and heightened awareness of self-protection in pregnant women, the use of N95 masks might also indicate a fear of contacting COVID-19 in public hospitals where confirmed cases were managed. Additionally, >20% of the women either travelled abroad or returned to Hong Kong because of COVID-19 risk. The history of SARS outbreaks in Hong Kong might have led to increased caution from the initial announcement of the COVID-19 alert. Travel during pregnancy and changes in delivery plans are important decisions. In 2020, a study in China showed that women were generally more anxious than men with respect to COVID-19; greater perceived susceptibility and severity of COVID-19 were also associated with greater anxiety.5 Obstetric decision-making and the implementation of preventive measures have been associated with antenatal anxiety secondary to the COVID-19 pandemic.3 8 Quarantine can lead to widespread and long-lasting adverse psychological sequelae.24 In 2020, anxiety levels were significantly higher among people who personally knew at least one person with COVID-19.1 In our cohort, moderate to marked emotional disturbance was reported by two-thirds of women who had undergone quarantine and one-third of women who had been living with household members during home quarantine. There is a need for supportive counselling to be provided to this susceptible group of women.
 
Expectations of childbirth companionship and peripartum services
Women’s expectations did not match changes in peripartum services and childbirth companionship; these mismatches were greatest in married women. Childbirth companionship provides multiple types of physical and psychological support.25 Women of Chinese ethnicity exhibited the greatest disagreement with suspension of visiting hours. The principle of “doing the month” in Chinese culture promotes maternal rest with nutritious supplements; thus, visits during the postpartum period are regarded as essential convalescence for mothers and babies.26 In Hong Kong, a greater proportion of women had a higher Edinburg Postpartum Depression Scale score upon suspension of childbirth companionship and visiting hours after announcement of the COVID-19 alert.19 In 2020, a similar effect on the Edinburg Postpartum Depression Scale score was observed in a Turkish population.27 Importantly, the Comprehensive Child Development Service in public obstetric units provides a programme for the identification, follow-up, and counselling of women at risk of postpartum depression; this programme constitutes critical support during stressful periods, such as the COVID-19 pandemic.
 
Strengths and limitations
Likely because many women of reproductive age living in Hong Kong remember the SARS outbreaks in 2002-2003, a notable strength was that the present study provided a useful assessment of adaptations and responses to a similar disease (COVID-19). Such valuable information can improve the understanding of behaviour among pregnant women in places that encounter further waves of COVID-19 transmission.
 
The merit of this survey was that the online questionnaire format allowed respondents to complete the questionnaire remotely and at their preferred speed. The responses were automatically captured in a database, which minimised entry errors and potential transmission of COVID-19. However, this questionnaire format is limited to patients with electronic access and does not permit the involvement of an interviewer to explain the questions. The use of convenience sampling in a single centre might also have introduced bias and limited the generalisability of the findings to the general population.
 
An additional limitation was that only women who continued antenatal follow-up or delivered in our public hospital were included in the present study. The delivery rate for January to April decreased by 13% in 2020, compared with the same period in 2019, despite a similar number of delivery bookings.17 This phenomenon was observed across all public hospitals in Hong Kong, indicating that pregnant women might have chosen to deliver in private hospitals instead. There is no standalone maternity hospital in Hong Kong; all maternity units are housed within general hospitals that admit patients with COVID-19. We suspect that this situation might have led some pregnant women to deliver in private hospitals where they perceived the risk of COVID-19 to be lower.
 
The final limitation was that the survey was conducted during a non-peak period of COVID-19 transmission in 2020. Childbirth companionship was resumed 2 days prior to the survey period; companions were required to complete an assessment of fever, travel, occupational exposure, contact history, and clustering phenomenon. Thus, the practices might have differed and the overall fear of disease might have been less intense, compared with a peak period of COVID-19 transmission. Furthermore, the retrospective nature of this study might have introduced recall bias, which we attempted to minimise by providing a timeline of key events concerning COVID-19 in the information leaflet. However, the initial response of the general public to COVID-19 might have been exaggerated because accurate disease information was limited during the early stages of the pandemic; the performance of a questionnaire study during a non-peak period might have helped to gather less exaggerated data concerning the behaviour of pregnant women. Further prospective longitudinal studies can address how women respond in different phases of the COVID-19 pandemic.
 
Conclusion
This study demonstrated the adaptations and responses of pregnant women to the COVID-19 pandemic in Hong Kong. The women in this study adjusted their obstetric appointments, began to work from home, and practised protective measures to reduce their risk of disease. While the overall pregnancy experience was mostly improved by working from home, women reported emotional disturbance because of the pandemic. Expectations of obstetric services remained high, particularly for Chinese women and nulliparous women. Obstetricians and policymakers should attempt to balance infection control and the peripartum needs of pregnant women when modifying childbirth companionship policies. Particular attention to nulliparous women is needed because they demonstrated higher levels of disagreement with the suspension of antenatal and postnatal educational programmes.
 
Author contributions
Concept or design: All authors.
Acquisition of data: All authors.
Analysis or interpretation of data: PW Hui.
Drafting of the manuscript: PW Hui.
Critical revision of the manuscript for important intellectual content: All authors.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
The authors thank Ms WK Choi for her assistance with creation of the online survey and Mr G Chu for his assistance with participant recruitment.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
The research has been approved by Institutional Review Board of The University of Hong Kong/Hospital Authority West Cluster (Ref UW 20-387).
 
References
1. Moghanibashi-Mansourieh A. Assessing the anxiety level of Iranian general population during COVID-19 outbreak. Asian J Psychiatr 2020;51:102076. Crossref
2. Wang C, Pan R, Wan X, et al. Immediate psychological responses and associated factors during the initial stage of the 2019 coronavirus disease (COVID-19) epidemic among the general population in China. Int J Environ Res Public Health 2020;17:1729. Crossref
3. Wong LP, Hung CC, Alias H, Lee TS. Anxiety symptoms and preventive measures during the COVID-19 outbreak in Taiwan. BMC Psychiatry 2020;20:376. Crossref
4. Corbett GA, Milne SJ, Hehir MP, Lindow SW, O’Connell MP. Health anxiety and behavioural changes of pregnant women during the COVID-19 pandemic. Eur J Obstet Gynecol Reprod Biol 2020;249:96-7. Crossref
5. Lin Y, Hu Z, Alias H, Wong LP. Knowledge, attitudes, impact, and anxiety regarding COVID-19 infection among the public in China. Front Public Health 2020;8:236. Crossref
6. Ozamiz-Etxebarria N, Dosil-Santamaria M, Picaza-Gorrochategui M, Idoiaga-Mondragon N. Stress, anxiety, and depression levels in the initial stage of the COVID-19 outbreak in a population sample in the northern Spain [in Spanish]. Cad Saude Publica 2020;36:e00054020.
7. Kotlyar AM, Grechukhina O, Chen A, et al. Vertical transmission of COVID-19: a systematic review and meta-analysis. Am J Obstet Gynecol 2020;224:35-53.e3.
8. Liu X, Chen M, Wang Y, et al. Prenatal anxiety and obstetric decisions among pregnant women in Wuhan and Chongqing during the COVID-19 outbreak: a cross-sectional study. BJOG 2020;127:1229-40. Crossref
9. Thapa SB, Mainali A, Schwank SE, Acharya G. Maternal mental health in the time of the COVID-19 pandemic. Acta Obstet Gynecol Scand 2020;99:817-8. Crossref
10. Wu Y, Zhang C, Liu H, et al. Perinatal depressive and anxiety symptoms of pregnant women along with COVID-19 outbreak in China. Am J Obstet Gynecol 2020;223:240.e1-9.
11. Westgren M, Pettersson K, Hagberg H, Acharya G. Severe maternal morbidity and mortality associated with COVID-19: the risk should not be downplayed. Acta Obstet Gynecol Scand 2020;99:815-6. Crossref
12. Leung GM, Cowling BJ, Wu JT. From a sprint to a marathon in Hong Kong. N Engl J Med 2020;382:e45.
13. To KK, Yuen KY. Responding to COVID-19 in Hong Kong. Hong Kong Med J 2020;26:164-6.
14. Metzler B, Siostrzonek P, Binder RK, Bauer A, Reinstadler SJ. Decline of acute coronary syndrome admissions in Austria since the outbreak of COVID-19: the pandemic response causes cardiac collateral damage. Eur Heart J 2020;41:1852-3. Crossref
15. Teo KC, Leung WC, Wong YK, et al. Delays in stroke onset to hospital arrival time during COVID-19. Stroke 2020;51:2228-31. Crossref
16. Marijon E, Karam N, Jost D, et al. Out-of-hospital cardiac arrest during the COVID-19 pandemic in Paris, France: a population-based, observational study. Lancet Public Health 2020;5:e437-43.
17. Vogel JP, Habib NA, Souza JP, et al. Antenatal care packages with reduced visits and perinatal mortality: a secondary analysis of the WHO Antenatal Care Trial. Reprod Health 2013;10:19. Crossref
18. Mohamed Shaker El-Sayed Azzaz A, Martínez-Maestre MA, Torrejón-Cardoso R. Antenatal care visits during pregnancy and their effect on maternal and fetal outcomes in pre-eclamptic patients. J Obstet Gynaecol Res 2016;42:1102-10. Crossref
19. Hui PW, Ma G, Seto MT, Cheung KW. Effect of COVID-19 on delivery plan and postnatal depression score of pregnant women. Hong Kong Med J 2021;27:113-7.
20. Berthelot N, Lemieux R, Garon-Bissonnette J, Drouin-Maziade C, Martel É, Maziade M. Uptrend in distress and psychiatric symptomatology in pregnant women during the coronavirus disease 2019 pandemic. Acta Obstet Gynecol Scand 2020;99:848-55. Crossref
21. Aziz A, Zork N, Aubey JJ, et al. Telehealth for high-risk pregnancies in the setting of the COVID-19 pandemic. Am J Perinatol 2020;37:800-8. Crossref
22. Roesch E, Amin A, Gupta J, García-Moreno C. Violence against women during COVID-19 pandemic restrictions. BMJ 2020;369:m1712.
23. Usher K, Bhullar N, Durkin J, Gyamfi N, Jackson D. Family violence and COVID-19: increased vulnerability and reduced options for support. Int J Ment Health Nurs 2020;29:549-52. Crossref
24. Brooks SK, Webster RK, Smith LE, et al. The psychological impact of quarantine and how to reduce it: rapid review of the evidence. Lancet 2020;395:912-20. Crossref
25. Bohren MA, Berger BO, Munthe-Kaas H, Tunçalp Ö. Perceptions and experiences of labour companionship: a qualitative evidence synthesis. Cochrane Database Syst Rev 2019;(3):CD012449.
26. Liu YQ, Maloni JA, Petrini MA. Effect of postpartum practices of doing the month on Chinese women’s physical and psychological health. Biol Res Nurs 2014;16:55-63. Crossref
27. Durankuş F, Aksu E. Effects of the COVID-19 pandemic on anxiety and depressive symptoms in pregnant women: a preliminary study. J Matern Fetal Neonatal Med 2020;35:205-11. Crossref

Models to predict prognosis in older patients with heart failure complicated by pre-frailty and frailty: a pilot prospective cohort study

Hong Kong Med J 2022 Oct;28(5):356-66 | Epub 22 Sep 2022
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Models to predict prognosis in older patients with heart failure complicated by pre-frailty and frailty: a pilot prospective cohort study
Takuya Umehara, PT, PhD1,2; Nobuhisa Katayama, PT3; Akinori Kaneguchi, PT, PhD1Yoshitaka Iwamoto, PT, PhD4; Miwako Tsunematsu, RN, PhD5; Masayuki Kakehashi, PhD, DSc5
1 Department of Rehabilitation, Faculty of Rehabilitation, Hiroshima International University, Higashi-Hiroshima, Japan
2 Department of Rehabilitation, Saiseikai Kure Hospital, Kure, Japan
3 Department of Rehabilitation, Kure Kyosai Hospital, Kure, Japan
4 Department of Neuromechanics, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, Japan
5 Department of Health Informatics, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, Japan
 
Corresponding author: Dr T Umehara (start.ume0421@gmail.com)
 
 Full paper in PDF
 
Abstract
Introduction: There are no clinical prediction models to predict the prognosis of pre-frailty or frailty in patients with heart failure. We aimed to develop prediction models for the prognosis of pre-frailty and frailty in older patients with heart failure using the classification and regression tree (CART) method; we then tested the predictive accuracies of the developed models.
 
Methods: Patients with pre-frailty or frailty at admission were divided into improved and non-improved groups. The CART method was used to establish two models: A, which predicted the presence or absence of pre-frailty improvement during hospitalisation; and B, which predicted the presence or absence of frailty improvement during hospitalisation.
 
Results: Patients with heart failure complicated by pre-frailty (n=28) or frailty (n=156) were included. In model A, the accuracy of predicting pre-frailty improvement was high; the best predictor was single-leg standing time at admission, followed by left ventricular ejection fraction at admission. In model B, the accuracy of predicting frailty improvement was moderate; the best predictor was hand grip strength at admission, followed by estimated glomerular filtration rate at admission, haemoglobin level at admission, and change in single-leg standing time during hospitalisation. The areas under the receiver operating characteristic curves of the CART models were 0.96 and 0.84 in models A and B, respectively.
 
Conclusion: Although conditions at admission may predict the improvement of pre-frailty and frailty during hospitalisation, cardiac rehabilitation that improves single-leg standing time may help to improve frailty, particularly when conditions at admission are poor.
 
 
New knowledge added by this study
  • We developed prediction models for the prognosis of pre-frailty and frailty in older patients with heart failure using classification and regression tree methods.
  • Single-leg standing time at admission was the best predictor of pre-frailty improvement, whereas hand grip strength at admission was the best predictor of frailty improvement.
  • Change in single-leg standing time during hospitalisation was also a predictor of frailty improvement.
Implications for clinical practice or policy
  • Improvements in physical function can help to manage frailty in older patients with heart failure, particularly when conditions at admission are poor.
  • Cardiac rehabilitation to prolong single-leg standing time is necessary to improve frailty, particularly when conditions at admission are poor.
 
 
Introduction
Heart failure is a major public health problem that has been shown to increase with age, such that incidence rates rapidly increase after 80 years of age.1 Among older patients with heart failure, 18% to 54% show signs of frailty, a state of reduced physical and cognitive function that results in weakness.2 There are some overlapping symptoms between heart failure and frailty; they interact to accelerate the vicious cycle of frailty.3 One study showed that frail patients with cardiovascular disease had a 7-year survival rate of 12%, which was much lower than the survival rate in non-frail patients with cardiovascular disease (43%).4 Moreover, frailty among patients with heart failure has been associated with poor prognosis5 and reduced cardiac output capacity.6 Pre-frailty is the first step towards frailty; approximately 34.6% to 46.1% of individuals with pre-frailty progress to frailty in Japan.7 Therefore, improvements in frailty and pre-frailty are important considerations in the care of older patients with heart failure.
 
Age,8 nutrition,9 walking speed,9 heart function,8 and grip strength10 have been shown to influence frailty improvement among older patients with heart failure. However, the predictive accuracies of such factors remain unknown. Moreover, a combination of the predictors has been suggested to increase their predictive accuracy,11 although this hypothesis has not yet been tested. Additionally, nutrition,12 physical function,12 and quality of life12 have been reported to influence pre-frailty improvement among community-dwelling older individuals. To our knowledge, there are no reports of factors that influence pre-frailty improvement among patients with heart failure.
 
In this study, we used the classification and regression tree (CART) method,13 which facilitates the establishment of clinical prediction models that can identify the best combinations of medical signs, symptoms, and other findings to predict prognoses or treatment outcomes.13 Several clinical prediction models have been developed using the CART method to predict mortality in patients with heart failure.14 15 However, no clinical prediction models have been established to predict the prognosis of patients with heart failure complicated by pre-frailty and frailty.
 
Here, we aimed to use the CART method to develop models that could predict the prognosis of older patients with heart failure complicated by pre-frailty and frailty, then confirm the predictive accuracies of those models.
 
Methods
Study design
This pilot prospective cohort study followed the STROBE reporting guidelines. All included patients provided written informed consent to participate in the study. Identifying information was not collected to protect each patient’s privacy. This study was performed in accordance with the Declaration of Helsinki.
 
Setting
Recruitment, follow-up, and data collection were performed at two acute hospitals (Saiseikai Kure Hospital, Kure, Japan; Kure Kyosai Hospital, Kure, Japan) between July 2018 and December 2019. Potential participants were recruited by therapists at the rehabilitation department.
 
Patients
This study included patients who met the following inclusion criteria: age ≥65 years; hospitalisation for the treatment of heart failure; and presence of pre-frailty or frailty. The exclusion criteria were complications during hospitalisation and/or severe dementia (defined as a revised Hasegawa’s Dementia Scale score ≤9).
 
Intervention
Cardiac rehabilitation was performed by physical or occupational therapists to improve physical condition, restore walking ability during hospitalisation, and expand the activities of daily living. Rehabilitation programmes were established by physical or occupational therapists in accordance with physicians’ orders. Initially, aerobic exercises and resistance training programmes were provided according to each patient’s physical condition. Exercise intensity was determined using multiple indices, including target heart rate (convenient method: resting heart rate + 20 bpm), talk test, and Borg scale (11-13) for the chest and lower limbs. The type of exercise was modified (ie, duration extended and load increased) with consideration of each patient’s symptoms and haemodynamics. If necessary, functional exercises (eg, neuromuscular facilitation, joint range of motion, and muscle strengthening exercises), exercises for activities of daily living, and psychological support were implemented for patients and their families. Exercises for activities of daily living were customised according to the functions that each patient needed for discharge. Overall, the duration and frequency of intervention were 30 minutes to 1 hour per day and 5 days per week, respectively.
 
Variables
Variables included demographic and clinical characteristics, frailty assessment results, and physical function. Demographic characteristics included age, sex, body mass index, living arrangement (with family members or alone), New York Heart Association class, medical history (eg, heart failure, coronary artery disease, valvular disease, hypertension, diabetes mellitus, dyslipidaemia, atrial fibrillation, chronic renal dysfunction, and/or stroke), cognitive function assessed using the revised Hasegawa’s Dementia Scale, Life-Space Assessment score, interval from admission to initiation of cardiac rehabilitation, interval from admission to rehabilitation room entry, and length of hospital stay. Hasegawa’s Dementia Scale scores of 21-30, 15-20, 10-14, and ≤9 were regarded as normal, suspected dementia, mild to moderate dementia, and severe dementia, respectively.16 The Life-Space Assessment developed by Baker et al17 was used to evaluate life-space mobility. Up to 120 points were assigned based on the degree of independence in each life-space level during the month prior to the assessment; higher scores were considered indicative of broader life-space and/or greater independence.
 
Clinical characteristics included blood data, cardiac function, and pharmacotherapy. Blood data included the Geriatric Nutritional Risk Index, brain natriuretic peptide level, estimated glomerular filtration rate (eGFR), and haemoglobin (Hb) level. Cardiac function was evaluated using left ventricular ejection fraction (LVEF), as determined by echocardiography. Pharmacotherapy data included whether patients were receiving dopamine, dobutamine, noradrenaline, phosphodiesterase III inhibitor, or diuretics.
 
Frailty was assessed using the following five conditions based on the Cardiovascular Health Study (CHS) Index: slow gait speed, weakness, exhaustion, low activity, and weight loss.18 Slow gait speed was defined as <1.0 m/s. Weakness was assessed using maximum grip strength according to sex-specific cut-offs (<26 and <18 kg for men and women, respectively). Exhaustion was assessed using the question “During the past 2 weeks, have you felt tired without a specific reason?” A positive response to this question (ie, “yes”) was considered indicative of exhaustion. Physical activity was evaluated using the question “Do you engage in low levels of physical activity to improve your health?” A negative response to this question (ie, “no”) was considered indicative of a low activity level. Weight loss was assessed using the question “Have you lost 2 kg or more in the past 6 months?” A positive answer to this question was considered indicative of weight loss. There are various criteria for assessing frailty; Fried’s frailty phenotype model18 and the accumulated deficit model established by Mitnitski et al19 are well known. The CHS criteria and the Frailty Index were developed based on the above frailty phenotype model and accumulated deficit model, respectively. Furthermore, a Japanese version of the CHS criteria (J-CHS) has been established,20 and its validity has been confirmed.21 Thus, we selected the J-CHS to assess frailty. Patients with none of the above conditions were considered non-frail (robust), patients with one to two conditions were considered pre-frail, and patients with three conditions or more were considered frail.22
 
Physical function was assessed using the Short Physical Performance Battery score, 10-metre walk time, single-leg standing time, and hand grip strength. Patients performed the Short Physical Performance Battery test in the following sequence, in accordance with the National Institute on Aging protocol: standing balance tests, gait test (4 m), and chair stand test (five repetitions). The sum of the three test components comprised the final Short Physical Performance Battery score, which ranged from 0 to 12; a score of 12 indicated optimal lower extremity function.23 24 Moreover, 10-m walk time was measured at a comfortable walking speed to assess walking ability. The 10-m walk test has demonstrated high validity and reliability in multiple populations, including healthy older individuals and patients with stroke, neurological disorders, or orthopaedic dysfunction.25 26 Measurements were performed twice at an interval of 30 s; the smaller value was used to indicate walking ability. Standing balance was evaluated by measuring single-leg standing time, which reflects the ability to maintain the body’s centre of gravity within its base of support. A stopwatch was used to measure the duration that a patient could stand on one leg with their eyes open and hands on their waist, without any assistance or falling. A second trial was performed if the result of the first trial was <60 s.27 Hand grip strength (kg) was measured using a digital hand grip strength dynamometer (TKK-5101; Takei Scientific Instruments, Tokyo, Japan or 12B3X00030; Tsutsumi Works, Chiba, Japan). Accordingly, patients were asked to squeeze the dynamometer with maximum effort during two trials for each hand. The maximum value (rounded to the nearest 0.1 kg) for either the left or right hand was used for subsequent analyses.28
 
Demographic and clinical characteristics were assessed upon admission or each time, whereas cognitive function and Life-Space Assessment were assessed during the first physical therapy session. Frailty and physical function were assessed upon initial entry into the rehabilitation room and at discharge. The amount of change in physical function (discharge−initial) was calculated.
 
Bias
To reduce selection bias, outcomes were selected based on the methods of previous studies.16 17 18 19 20 21 22 23 24 25 26 27 28 To reduce measurement bias, the first author (T Umehara) was not involved in participant enrolment or data collection. Patients received explanations about the purpose of the study but did not receive information concerning the hypothesis tested.
 
Statistical analysis
Classification and regression tree analysis29 was used to predict the primary outcomes. Patients with pre-frailty at admission were categorised into an improved group and a non-improved group: patients who were non-frail at discharge were assigned to the improved group, whereas patients who were pre-frail or frail were assigned to the non-improved group. Similarly, patients with frailty at admission were categorised into an improved group and a non-improved group: patients who were non-frail or pre-frail at discharge were assigned to the improved group, whereas patients who were frail were assigned to the non-improved group. Binary trees were used to recursively split predictor variables based on answers to yes/no questions for each variable. All statistical distributions were considered without limitation to linear relationships between outcome variables and predictor variables. These algorithms have been used to develop prediction models in various fields.30 31 32 The CART method with the Gini index was used for the following models: A, which predicted the presence or absence of pre-frailty improvement during hospitalisation; and B, which predicted the presence or absence of frailty improvement during hospitalisation. Pre-frailty or frailty improvement was the dependent variable, whereas demographic and clinical characteristics, pharmacotherapy, and amount of change in physical function were the independent variables. The accuracies of the CART models were evaluated using the area under the receiver operating characteristic curve (AUROC). The method proposed by Delong et al33 was used to identify optimal cut-off points. Subsequently, the sensitivity, specificity, positive likelihood ratio (PLR), and negative likelihood ratio (NLR) were calculated. To assess model validity, cross-validation was performed as follows: the sample was divided into 10 subgroups and the model developed from nine subgroups was used to test the 10th subgroup; this was repeated for all 10 combinations and the rates of misclassification were averaged. Model validity was considered high when the misclassification rates were similar before and after cross-validation. The accuracies of the CART models were evaluated using the AUROC developed from each method. The maximum Youden index (sensitivity + specificity − 1) was defined as the optimal cut-off point. All statistical analyses were performed using SPSS (Windows version 23.0; IBM Corp, Armonk [NY], United States) and the significance level was set at 5%.
 
Sample size
Sample size calculations were conducted using MedCalc statistical software, version 19.2 (MedCalc Software bvba, Ostend, Belgium). Before plotting an AUROC, the following values were established: statistical significance (P<0.05), alpha (0.05), statistical power (0.80), and the AUROC value to be included in the null hypothesis (0.5). The AUROC could distinguish between non-predictive (AUROC<0.5), less predictive (0.5<AUROC^lt;0.7), moderately predictive (0.7<AUROC<0.9), highly predictive (0.9<AUROC<1), and perfectly predictive (AUROC=1).34 In this study, an AUROC value of 0.7 was considered indicative of superior statistical discrimination. The frailty improvement ratio (ie, positive/negative ratio) considerably varied among previous studies.35 36 37 Therefore, the positive/negative ratio used here was set at 1:1-5. Moreover, a large sample size was needed to allow for the possibility of stratified analysis. Thus, 62 to 120 patients with frailty were required: 20-31 and 31-100 in the improved and non-improved groups, respectively.
 
Results
Figure 1 shows the flowchart of patient recruitment. Among the 30 patients with pre-frailty, two with severe dementia were excluded; 28 patients were included in the analysis. Among the 161 patients with frailty, five with severe dementia were excluded; 156 patients were included in the analysis. The patient characteristics are summarised in Tables 1 and 2 (patients with pre-frailty and patients with frailty, respectively).
 

Figure 1. Flowchart depicting the recruitment of older patients with heart failure complicated by pre-frailty and frailty
 

Table 1. Characteristics of patients with pre-frailty
 

Table 2. Characteristics of patients with frailty
 
Figure 2 shows CART model A, which predicted the presence or absence of pre-frailty improvement during hospitalisation. Among the 28 patients with pre-frailty, seven experienced improvements. Single-leg standing time at admission was identified as the best single discriminator for pre-frailty improvement (≤4.4 or >4.4 s). Among patients with single-leg standing time of >4.4 s at admission, the next predictor was LVEF (≤36.8% or >36.8%). Our CART analysis resulted in the establishment of three terminal nodes. The terminal node with the highest probability of a favourable outcome (pre-frailty improvement) was defined as rank 1, whereas the terminal node with the lowest probability of a favourable outcome was defined as rank 3. Based on the AUROC (95% confidence interval [CI]), this CART model had an accuracy of 0.96 (95% CI=0.89-1.00), with an optimal cut-off point of rank 1 (sensitivity: 85.7%, specificity: 95.2%, PLR: 18.0, and NLR: 0.15). The misclassification rates before and after cross-validation were 7.1% and 28.6%, respectively.
 

Figure 2. Model to predict the presence or absence of pre-frailty improvement during hospitalisation
 
Figure 3 shows CART model B, which predicted the presence or absence of frailty improvement during hospitalisation. Among the 156 patients with frailty, 57 experienced improvements. Hand grip strength at admission was identified as the best single discriminator for frailty improvement (≤16.8 or >16.8 kg). Among patients with hand grip strength of >16.8 kg at admission, the next predictor was eGFR (≤27.0 or >27.0 mL/min/1.73 m2). Among patients with hand grip strength of ≤16.8 kg at admission, the next predictor was eGFR (≤83.5 or >83.5 mL/min/1.73 m2). Among patients with eGFR ≤83.5 mL/min/1.73 m2, the next predictor was Hb level (≤14.3 or >14.3 g/dL). Among patients with Hb level ≤14.3 g/dL, the next predictor was change in single-leg standing time (≤4.9 or >4.9 s). Our CART analysis resulted in the establishment of six terminal nodes. The terminal node with the highest probability of a favourable outcome (frailty improvement) was defined as rank 1, whereas the terminal node with the lowest probability of a favourable outcome was defined as rank 6. Based on the AUROC (95% CI), this CART model had an accuracy of 0.84 (95% CI=0.78-0.91), with an optimal cut-off point of rank 4 (sensitivity: 70.1%, specificity: 86.9%, PLR: 5.3, and NLR: 0.3). The misclassification rates before and after cross-validation were 19.2% and 35.3%, respectively.
 

Figure 3. Model to predict the presence or absence of frailty improvement during hospitalisation
 
Discussion
Model A predicted pre-frailty improvement with high accuracy; it identified single-leg standing time at admission as the best predictor, followed by LVEF. Notably, changes in physical function during hospitalisation were not identified as predictors. These results suggest that conditions at admission strongly influence pre-frailty improvement during hospitalisation; moreover, improvement can be expected among patients with good physical function (single-leg standing time >4.4 s) and cardiac function (LVEF >36.8%) at admission. To our knowledge, no study has examined the factors that influence pre-frailty improvement among patients with heart failure. However, one study found that physical function, nutrition, and quality of life were factors that influenced pre-frailty improvement among community-dwelling older individuals.12 The above findings suggest that although physical function is a common factor that influences pre-frailty improvement among older patients with heart failure and community-dwelling older individuals, cardiac function specifically influences pre-frailty improvement among patients with heart failure.
 
Model B predicted frailty improvement with moderate accuracy; it identified hand grip strength at admission as the best predictor, followed by eGFR, Hb level, and change in single-leg standing time during hospitalisation. Thus, frailty improvement can be expected among patients with good hand grip strength and/or renal function at admission. However, the cut-off values for eGFR, an index of renal function, considerably differed between patients with good (>16.8 kg) and poor (≤16.8 kg) hand grip strength (27 and 83.5 mL/min/1.73 m2, respectively). A previous study also showed that older patients with heart failure who had higher hand grip strength were more likely to experience frailty improvement.10 Although no association has been identified between renal function and frailty, renal function is known to influence improvements in exercise capacity among patients with heart failure.38 Moreover, our results showed that, despite the presence of poor physical and renal function at admission, patients with a high Hb level (>14.3 g/dL) were likely to improve from frailty. Although there is no published literature concerning an association between Hb level and frailty, a low Hb level has been shown to cause fatigability,39 40 which is one of the criteria for assessing frailty using the CHS. Furthermore, the Hb level reportedly influences improvements in exercise capacity among patients with heart failure.38
 
Despite poor hand grip strength, weak renal function, and a low Hb level at admission, frailty improvement was observed in most patients whose single-leg standing time during hospitalisation was >4.9 s. In previous studies that sought to predict frailty improvement among older patients with heart failure, investigators mostly focused on conditions at admission without considering changes during hospitalisation.41 The present results indicate that, despite the presence of poor conditions at admission, patients can recover from frailty by improving physical function during hospitalisation; therefore, rehabilitation is essential during hospitalisation. Resistance training is known to improve single-leg standing time among older individuals.42 Thus, we recommend cardiac rehabilitation, including resistance training, to improve frailty among older patients with heart failure.
 
The clinical implications of our findings are as follows. Thus far, no algorithms have been established concerning pre-frailty and frailty improvements in older patients with heart failure. Using the CART method, we developed models that could predict the prognosis of older patients with heart failure complicated by pre-frailty and frailty. These models will provide useful information for patients and caregivers. Many of the factors extracted in this study were only assessed at the time of initial rehabilitation. Thus, the prognoses of patients with pre-frailty and patients with frailty can be inferred (to some extent) at the time of initial rehabilitation. Additionally, an increase in single-leg standing time during hospitalisation was associated with frailty improvement. For older patients with heart failure who show signs of frailty, interventions to increase single-leg standing time may help to improve frailty. The PLR and NLR were used to assess the diagnostic performances of the CART models; these parameters revealed that both model A (pre-frailty) and model B (frailty) had good performance. However, our data should be interpreted cautiously because of the small number of patients with pre-frailty.
 
There were several notable weaknesses and limitations in this study. First, the sample size was limited; the numbers of patients with pre-frailty and frailty were 28 and 156, respectively. Therefore, this study should be regarded as a pilot prospective cohort study. Despite the moderate to high accuracies of models A and B, more large-scale studies are needed to enhance the generalisability of our results. Second, three aspects of frailty exist, namely physical, social, and mental frailty; mental frailty was not considered in this study. A previous study reported that patients with multifaceted frailty, including physical, mental, and social frailty, had worse prognoses compared with patients who had physical frailty alone.5 Therefore, additional studies are needed to develop models that predict improvements in multifaceted frailty, including mental frailty. Third, interventions were customised for each patient; they were not uniform. However, physical therapists in both study hospitals received 2 weeks of training in cardiac rehabilitation, and the methods of cardiac rehabilitation were standardised as much as possible. Fourth, we used the J-CHS to measure frailty; the use of this tool to assess patients with heart failure is potentially controversial. The J-CHS was developed for older adults and has been used in multiple studies.7 21 Additionally, the reliability and validity of this tool have been confirmed.21 We thus consider this use of the J-CHS to be appropriate. Fifth, this study was performed in an unblinded manner, and we could not completely rule out the potential for bias. However, to reduce the measurement bias, the first author (T Umehara) was not involved in participant enrolment or data collection. Sixth, a selection bias might have been present because patients were only recruited at two hospitals in Japan. Caution is needed when generalising our results, particularly to patients in other countries.
 
Conclusion
By using the CART method, we developed moderately to highly accurate prediction models for pre-frailty and frailty improvement among older patients with heart failure. Model A, which predicted pre-frailty improvement, showed that patients with good single-leg standing time and cardiac function at admission are likely to experience pre-frailty improvement. Furthermore, Model B, which predicted frailty improvement, showed that patients with good hand grip strength, excellent renal function, and/or a high Hb level at admission are likely to experience frailty improvement. Notably, despite the presence of poor conditions at admission, frailty improvement may occur in patients who show improvement in single-leg standing time during hospitalisation. Overall, our results suggest that cardiac rehabilitation to prolong single-leg standing time is necessary to improve frailty, particularly when conditions at admission are poor.
 
Author contributions
Concept or design: All authors.
Acquisition of data: T Umehara and N Katayama.
Analysis or interpretation of data: T Umehara, M Tsunematsu, M Kakehashi.
Drafting of the manuscript: T Umehara, A Kaneguchi and Y Iwamoto.
Critical revision of the manuscript for important intellectual content: T Umehara, A Kaneguchi, Y Iwamoto.
 
All authors had full access to the data, contributed to the study, approved the final version for publication, and take responsibility for its accuracy and integrity.
 
Conflicts of interest
All authors have disclosed no conflicts of interest.
 
Acknowledgement
We thank Y Yamamoto, M Ota, Y Nakashima, W Kawakami, and M Iwanaga (Kure Kyosai Hospital, Kure, Japan), as well as S Nagao, Y Kawabata, and D Tomiyama (Saiseikai Kure Hospital, Kure, Japan) for their support in data collection.
 
Funding/support
This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
 
Ethics approval
The study was approved by the Ethics Committees of Saiseikai Kure Hospital (Ref 127) and Kure Kyosai Hospital (Ref 31-17). Patients provided written informed consent to participate.
 
References
1. Bleumink GS, Knetsch AM, Sturkenboom MC, et al. Quantifying the heart failure epidemic: prevalence, incidence rate, lifetime risk and prognosis of heart failure The Rotterdam Study. Eur Heart J 2004;25:1614-9. Crossref
2. Smart N, Marwick TH. Exercise training for patients with heart failure: a systematic review of factors that improve mortality and morbidity. Am J Med 2004;116:693-706. Crossref
3. Joyce E. Frailty in advanced heart failure. Heart Fail Clin 2016;12:363-74. Crossref
4. Newman AB, Gottdiener JS, Mcburnie MA, et al. Associations of subclinical cardiovascular disease with frailty. J Gerontol A Biol Sci Med Sci 2001;56:158-66. Crossref
5. Rodríguez-Pascual C, Paredes-Galán E, Ferrero-Martínez AI, et al. The frailty syndrome is associated with adverse health outcomes in very old patients with stable heart failure: A prospective study in six Spanish hospitals. Int J Cardiol 2017;236:296-303. Crossref
6. Denfeld QE, Winters-Stone K, Mudd JO, Hiatt SO, Chien CV, Lee CS. Frequency of and significance of physical frailty in patients with heart failure. Am J Cardiol 2017;8:1243-9. Crossref
7. Kojima G, Taniguchi Y, Iliffe S, Jivraj S, Walters K. Transitions between frailty states among community-dwelling older people: A systematic review and meta-analysis. Ageing Res Rev 2019;50:81-8. Crossref
8. Takabayashi K, Ikuta A, Okazaki Y, et al. Clinical characteristics and social frailty of super-elderly patients with heart failure—the Kitakawachi clinical background and outcome of heart failure registry. Circ J 2016;81:69-76. Crossref
9. Sze S, Zhang J, Pellicori P, Morgan D, Hoye A, Clark AL. Prognostic value of simple frailty and malnutrition screening tools in patients with acute heart failure due to left ventricular systolic dysfunction. Clin Res Cardiol 2017;106:533-41. Crossref
10. Bekfani T, Pellicori P, Morris DA, et al. Sarcopenia in patients with heart failure with preserved ejection fraction: impact on muscle strength, exercise capacity and quality of life. Int J Cardiol 2016;222:41-6. Crossref
11. Umehara T, Tanaka R, Tsunematsu M, et al. Can the amount of interventions during the convalescent phase predict the achievement of independence in activities of daily living in patients with stroke? A retrospective cohort study. J Stroke Cerebrovasc Dis 2018; 27:2436-44. Crossref
12. Gené Huguet L, Navarro González M, Kostov B, et al. Pre Frail 80: multifactorial intervention to prevent progression of pre-frailty to frailty in the elderly. J Nutr Health Aging 2018;22:1266-74. Crossref
13. McGinn TG, Guyatt GH, Wyer PC, Naylor CD, Stiell IG, Richardson WS. Users’ guides to the medical literature: XXII: how to use articles about clinical decision rules. Evidence-Based Medicine Working Group. JAMA 2000;284:79-84. Crossref
14. Fonarow GC, Adams KF Jr, Abraham WT, Yancy CW, Boscardin WJ, ADHERE Scientific Advisory Committee, Study Group, and Investigators. Risk stratification for in-hospital mortality in acutely decompensated heart failure: classification and regression tree analysis. JAMA 2005;293:572-80. Crossref
15. Arenja N, Breidthardt T, Socrates T, et al. Risk stratification for 1-year mortality in acute heart failure: classification and regression tree analysis. Swiss Med Wkly 2011;141:w13259. Crossref
16. Imai Y, Hasegawa K. The revised Hasegawa’s Dementia Scale (HDS-R)—Evaluation of its usefulness as a screening test for dementia. J Hong Kong Coll Psychiatr 1994;4:20-4.
17. Baker PS, Bodner EV, Allman RM. Measuring life-space mobility in community-dwelling older adults. J Am Geriatr Soc 2003;51:1610-4. Crossref
18. Fried LP, Tangen CM, Walston J, et al. Frailty in older adults: evidence for a phenotype. J Gerontol A Biol Sci Med Sci 2001;56:146-56. Crossref
19. Mitnitski AB, Mogilner AJ, Rockwood K. Accumulation of deficits as a proxy measure of aging. ScientificWorldJournal 2001;1:323-36. Crossref
20. Satake S, Arai H. The revised Japanese version of the Cardiovascular Health Study criteria (revised J-CHS criteria). Geriatr Gerontol Int 2020;20:992-3. Crossref
21. Makizako H, Shimada H, Doi T, Tsutsumimoto K, Suzuki T. Impact of physical frailty on disability in community-dwelling older adults: a prospective cohort study. BMJ Open 2015;5:e008462. Crossref
22. Satake S, Shimada H, Yamada M, et al. Prevalence of frailty among community-dwellers and outpatients in Japan as defined by the Japanese version of the Cardiovascular Health Study criteria. Geriatr Gerontol Int 2017;17:2629-34. Crossref
23. Bernabeu-Mora R, Medina-Mirapeix F, Llamazares-Herrán E, García-Guillamón G, Giménez-Giménez LM, Sánchez-Nieto JM. The Short Physical Performance Battery is a discriminative tool for identifying patients with COPD at risk of disability. Int J Chron Obstruct Pulmon Dis 2015;10:2619-26. Crossref
24. Guralnik JM, Simonsick EM, Ferrucci L, et al. A short physical performance battery assessing lower extremity function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol 1994;49:85-94. Crossref
25. Bohannon RW. Comfortable and maximum walking speed of adults aged 20-79 years: reference values and determinants. Age Ageing 1997;26:15-9. Crossref
26. van Hedel HJ, Wirz M, Dietz V. Assessing walking ability in subjects with spinal cord injury: validity and reliability of 3 walking tests. Arch Phys Med Rehabil 2005;86:190-6. Crossref
27. Michikawa T, Nishiwaki Y, Takebayashi T, Toyama Y. One-leg standing test for elderly populations. J Orthop Sci 2009;14:675-85. Crossref
28. Huang C, Niu K, Kobayashi Y et al. An inverted J-shaped association of serum uric acid with muscle strength among Japanese adult men: a cross-sectional study. BMC Musculoskelet Disord 2013;14:258. Crossref
29. Breiman L, Friedman J, Olshen R, Stone C. Classification and Regression Tree Analysis. Boston University; 2014: 1-16.
30. Aguiar FS, Almeida LL, Ruffino-Netto A, Kritski AL, Mello FC, Werneck GL. Classification and regression tree (CART) model to predict pulmonary tuberculosis in hospitalized patients. BMC Pulm Med 2012;12:40.Crossref
31. Takahashi O, Cook EF, Nakamura T, Saito J, Ikawa F, Fukui T. Risk stratification for in-hospital mortality in spontaneous intracerebral haemorrhage: a Classification and Regression Tree analysis. QJM 2006;99:743-50. Crossref
32. Westreich D, Lessler J, Funk MJ. Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression. J Clin Epidemiol 2010;63:826-33. Crossref
33. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837-45. Crossref
34. Swets JA. Measuring the accuracy of diagnostic systems. Science 1988;240:1285-93. Crossref
35. Espinoza SE, Jung I, Hazuda H. Lower frailty incidence in older Mexican Americans than in older European Americans: the San Antonio Longitudinal Study of Aging. J Am Geriatr Soc 2010;58:2142-8. Crossref
36. Schuurmans H, Steverink N, Lindenberg S, Frieswijk N, Slaets JP. Old or frail: what tells us more? J Gerontol A Biol Sci Med Sci 2004;59:962-5. Crossref
37. Collard RM, Boter H, Schoevers RA, Oude Voshaar RC. Prevalence of frailty in community-dwelling older persons: a systematic review. J Am Geriatr Soc 2012;60:1487-92. Crossref
38. Silverberg DS, Wexler D, Blum M, Iaina A. The cardio renal anemia syndrome: correcting anemia in patients with resistant congestive heart failure can improve both cardiac and renal function and reduce hospitalizations. Clin Nephrol 2003;60 Suppl 1:S93-102.
39. Roy CN. Anemia in frailty. Clin Geriatr Med 2011;27:67-78. Crossref
40. Chaves PH, Ashar B, Guralnik JM, Fried LP. Looking at the relationship between hemoglobin concentration and prevalent mobility difficulty in older women. Should the criteria currently used to define anemia in older people be reevaluated? J Am Geriatr Soc 2002;50:1257-64. Crossref
41. Reeves GR, Whellan DJ, O’Connor CM, et al. A novel rehabilitation intervention for older patients with acute decompensated heart failure: The REHAB-HF pilot study. JACC Heart Fail 2017;5:359-66. Crossref
42. Gonzalez AM, Mangine GT, Fragala MS, et al. Resistance training improves single leg stance performance in older adults. Aging Clin Exp Res 2014;26:89-92. Crossref

Pages