Surgical outcome of daytime and out-of-hours surgery for elderly patients with hip fracture

Hong Kong Med J 2018 Feb;24(1):32–7 | Epub 4 Aug 2017
DOI: 10.12809/hkmj165044
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Surgical outcome of daytime and out-of-hours surgery for elderly patients with hip fracture
YM Chan, BSc, MSc1; N Tang, MB, ChB, FRCSEd2; Simon KH Chow, PhD2
1 Physiotherapy Department, Pok Oi Hospital, Yuen Long, Hong Kong
2 Department of Orthopaedics and Traumatology, Prince of Wales Hospital, The Chinese University of Hong Kong, Shatin, Hong Kong
 
Corresponding authors: Dr N Tang (ntang@ort.cuhk.edu.hk), Dr Simon KH Chow (skhchow@ort.cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Surgery for hip fracture may be performed out-of-hours to avoid surgical delay. There is, however, a perception that this may constitute less-than-ideal conditions and result in a poorer outcome. The aim of this study was to evaluate the surgical outcome of elderly patients with hip fracture who underwent daytime versus out-of-hours surgery in Hong Kong. This will help make decisions about whether to operate out-of-hours or to delay surgery until the following day.
 
Methods: This retrospective study included all elderly patients with hip fracture who were operated on and discharged from the Prince of Wales Hospital in 2014. Patients were divided into groups according to the time of surgical incision. Records were examined for 30-day mortality and postoperative surgical complications, and their potential associations with surgeon characteristics.
 
Results: Overall, 367 patients were selected in this study with 242 patients in the daytime group and 125 in the out-of-hours group. Demographic characteristics were comparable between the two groups. The overall 30-day mortality rate was 2.0% and the surgical complication rate was 24.2%. Compared with the daytime group, there was no increase in 30-day mortality or surgical complications for out-of-hours group. Fewer surgeons were involved in out-of-hours surgery but the number of surgeons and their qualifications did not affect the outcomes.
 
Conclusions: The two groups were homogeneous in terms of demographic characteristics. Outcomes for 30-day mortality and postoperative surgical complications were comparable between the two groups. Surgeons’ qualifications and number of surgeons involved were also not associated with the outcomes. Out-of-hours surgery remains a viable option in order to facilitate early surgery.
 
 
New knowledge added by this study
  • Time of surgery for hip fracture did not affect the outcome.
  • Surgeon’s qualification was not associated with postoperative outcomes.
Implications for clinical practice or policy
  • Out-of-hours repair of hip fracture is safe.
  • Hip operations by junior surgeons are practical.
 
 
Introduction
With the ageing population in Hong Kong, the number of elderly people aged 65 years or above is projected to rise most rapidly in the next 20 years, with a projected increase from 15% in 2014 to 30% in 2034.1 With this surge in the elderly population, and as one of the most common injuries in the elderly, hip fracture is also projected to double its numbers in 20 years.2 This places a huge financial burden on health care resources. The sum of HK$310 million allocated to elderly patients with hip fracture in 2011 will rise in the next few years.2
 
Early surgical repair is a key element both for pain management and restoration of bone integrity after hip fracture.3 4 5 Systematic reviews show that surgery beyond 48 hours significantly increases 30-day and 1-year mortality and complication rates.6 7 8 9 Early surgical stabilisation and mobilisation has become the standard of care. As a result, and due to congested operating theatre schedules, non–life-threatening orthopaedic surgery may be performed at night. However, there is a perception that out-of-hours surgery may result in poorer outcomes due to insufficient technical support and surgeon fatigue or inexperience.
 
Studies that investigated the effect of out-of-hours surgery in different specialties have shown increased morbidity and mortality risk.10 11 12 Scant literature on the effect of time of the day of operation on hip surgery outcome shows controversial results. A German study in 200313 and a study by Chacko et al14 in 2011 showed no significant differences in mortality or complication rate 6 months after surgery when it was performed at night. Other studies, however, have shown that night-time surgeries for hip fracture may be associated with increased operating time and surgical complication rate.15 16
 
Owing to the controversial outcomes of these limited studies, this retrospective study aimed to evaluate the surgical outcome of elderly patients with hip fracture who underwent surgery in Hong Kong during the day or out-of-hours. It was hypothesised that surgical outcomes of out-of-hours surgery would not differ significantly to those of daytime surgery. It was hoped that findings of this study would help surgeons in making a decision about whether to operate out-of-hours or to delay surgery until the following day.
 
Methods
The Hospital Authority (HA) in Hong Kong manages all public hospitals serving more than 90% of the population. The Clinical Data Analysis and Reporting System (CDARS) includes in-patient data from all hospitals and forms a huge database. The Clinical Management System (CMS) is another computerised system that records all aspects of clinical management in the HA.
 
Using these two systems, a retrospective case series study was conducted to review individual records of patients in the Prince of Wales Hospital (PWH) in Hong Kong. This study was approved by the New Territories East Cluster Ethics Committee (reference number: 2015.665). Preliminary screening was performed using CDARS. All patients discharged in 2014 with a diagnosis of hip fracture (ICD-9 code: 820.00-820.03, 820.09, 820.20-820.23 820.8, 821.00 and 905.3) and who underwent surgical intervention (ICD-9 code: 79.15(0)-79.15(5), 79.15(7)-79.15(10)) were selected from CDARS. Records were also reviewed through the CMS for verification. Patients aged 65 years or older with an isolated hip fracture who underwent surgical intervention were included in the study. Those with high-energy trauma, periprosthetic fracture, bilateral hip fracture, or multiple lower limb fractures were excluded as well as those with a fracture as a result of primary or metastatic bone tumours.
 
Records of patients who fulfilled the criteria were divided into two groups based on the time of surgical incision. The daytime group included those with an operation between 08:00 and 16:59 (group 1). The out-of-hours group comprised patients of whom the procedure was commenced between 17:00 and 07:59. This group was further split into those having surgery before (group 2) or after midnight (group 3) to enable more detailed analysis.
 
Operation procedure was defined as either fixation or arthroplasty. Preoperative surgical risk was estimated by the American Society of Anesthesiologists (ASA) classification. Surgeon’s qualification was defined according to the list of specialist registration in Orthopaedics and Traumatology in the Medical Council of Hong Kong. Surgeons who qualified as a specialist in or before 2014 were considered a specialist in this study. Surgery performed by a non-specialist but in the presence of a specialist was classified as ‘non-specialist with supervision’.
 
Outcome measures were 30-day mortality and complications during hospital stay; 30-day mortality was chosen because a shorter period could include deaths directly related to the hip surgery. Surgical outcome was defined as complications related to surgical procedures only. General complications such as cardiovascular, respiratory, or cognitive complications were excluded.
 
Statistical analyses
Records were divided into groups based on the time of incision. The daytime group included patients operated on between 08:00 and 16:59 (group 1). The remaining patients were assigned to the out-of-hours group. More detailed comparison was performed with the out-of-hours group further split into those having surgery before (group 2: 17:00 to 23:59) or after midnight (group 3: 00:00 to 07:59).
 
For group comparisons, continuous variables were presented as means and standard deviations. Comparison between groups was performed by one-way analysis of variance with post-hoc Bonferroni test. Categorical data such as demographic data as well as mortality and complication rates were expressed as proportion and were compared by Pearson’s Chi squared test. Statistical analysis was performed using the SPSS (Windows version 20.0; IBM Corp, Armonk [NY], United States). The level of significance was set at P<0.05.
 
Results
Using International Classification of Disease, 9th revision and identified from CDARS, there were 379 hip fracture patients operated on and discharged from PWH in 2014. Review of the related medical records in CMS led to elimination of 12 patients according to the inclusion and exclusion criteria. Of the remaining 367 patients, 242 patients were operated on between 08:00 and 16:59 (daytime group; group 1), and 125 patients were operated on during out-of-hours after 16:59 and before 08:00. Among these 125 patients, 104 were operated on before midnight (group 2: 17:00 to 23:59), and 21 were operated on after midnight (group 3: 00:00 to 07:59). Patient selection and grouping are shown in the Figure.
 

Figure. Patient selection and grouping
 
Demographic characteristics
Demographic equivalency was assessed by comparing the daytime and out-of-hours group and revealed no difference in terms of age, sex, or type of fracture. Detailed comparison was performed with the out-of-hours group further divided into before and after midnight as shown in the Table. There remained no differences in terms of age, sex, or fracture type among the groups. The mean age of the three groups ranged from 83.2 to 84.3 years and there were more females than males in all groups, more intertrochanteric fractures in group 1 and group 2, and more femoral neck fractures in group 3.
 

Table. Comparison of demographic characteristics and intra-operative variables between groups
 
Intra-operative variables
Intra-operative variables were compared between the daytime and out-of-hours groups and revealed no significant differences in ASA class, type of surgery performed, or surgeon’s qualification. Again, a more complete comparison was made with the three groups.
 
The ASA class was comparable among the groups, with almost two thirds of the patients categorised as ASA class 3. Fixation was more common in all the groups but the number of fixation and arthroplasty cases was not statistically significant. There was no difference in surgeon’s qualification among the groups, with most surgeries (>95%) performed in the presence of a specialist. Chi squared test revealed that significantly fewer surgeons were involved in the out-of-hours group, especially after midnight (P=0.02).
 
Regarding surgical outcome, the 30-day mortality rate and postoperative complication rate during hospital stay were obtained. There were eight deaths among 367 patients, accounting for 2.2% of the study population. The cause of death included chest infection and cardiac arrest. The mortality rates were 2.1% and 2.4% in the daytime and out-of-hours groups, respectively (P=0.84).
 
Surgical outcome was defined as complications related to surgical procedure only. The overall complication rate was 24.3% in the study population with a similar rate between daytime and out-of-hours groups. Comparable results were obtained when the out-of-hours group was further divided into two subgroups (P=0.53). A total of 89 patients among all groups had postoperative complications. Fall in haemoglobin level in 89 patients required blood transfusion in 96.7% of cases. Wound infection or implant infection occurred in only four patients. Because all patients with implant infection had revision surgery, rate of revision surgery was the same as implant infection. No patient had fixation failure, prosthetic dislocation, or peri-prosthetic fracture.
 
Comparison of surgical time revealed no significant difference in surgical outcome, or in surgeon’s qualification (P=0.21). For type of surgery performed, the fixation group showed a significantly higher surgical complication rate than the arthroplasty group (P=0.03), although mortality rate was similar.
 
Discussion
Bone density insufficiency is the leading cause of major musculoskeletal trauma following a fall in the aged population.17 In 2000, the number of hip fractures worldwide was about 1.6 million. By 2050, the projected number will reach 4.5 million, and more than 50% of osteoporotic hip fractures will occur in Asia.18
 
Encouragement of early surgery after hip fracture will result in unavoidable out-of-hours surgery because of busy daytime operating room schedules. Safety of surgery performed outside routine daytime working hours, however, has long been a controversial issue. Surgery performed after-hours may be under less-ideal conditions with consequent poorer outcomes. This study was designed to assess if surgical outcomes for out-ofhours surgery significantly differ to those of daytime surgery.
 
In this study, patients were grouped according to the time of surgical incision. The normal shift in the operating theatre is 08:00 to 17:00. Surgeries performed after 17:00 and before 08:00 were considered out-of-hours. The time period correlates with the typical working hours and allows analysis based on a surgeon’s routine practice. Demographic characteristics were comparable among the groups.
 
Outcomes of daytime and out-of-hours surgery
Mortality and complication rates were comparable between the daytime and night-time groups. Even after midnight, when a surgeon is thought to be most affected by fatigue, there was no significant increase in complication rate or mortality. This was supported by a study in 2013 that showed no significant difference in postoperative complication rate or mortality rate after reviewing 220 dynamic hip screw surgeries in terms of their operating time.19 It concluded that out-of-hours surgery offers the benefit of early fixation and mobilisation, and hence may shorten the length of stay and reduce cost of treatment.19 Chacko et al14 also reported similar findings in 171 hip fracture patients with surgical intervention where mortality rate within 1 month and complication rate were comparable between the daytime and night-time groups. Switzer et al20 studied the relationship between surgical time of day and outcome after hip fracture fixation. They identified more than 1400 hip fracture patients with surgical intervention. Time of surgery was treated as a continuous variable and showed no association with complication rate at any time period. The authors concluded that there was no difference in 30-day mortality or complications based on the time of surgery and suggested that early operation after normal operating room hours was safe and reasonable.20
 
In addition, complex cases are generally scheduled for surgery during the daytime when more support can be obtained when needed. This may help explain the similar surgical outcomes among the groups. The comparable results for daytime and out-of-hours surgery shown in this study are supported by the literature suggesting that out-of-hours surgery is safe.
 
Mortality rate
The overall 30-day mortality rate was 2.2% in this study, lower than the 3.5% to 10% reported in the UK,17 as well as the 4.96% in a 1997 local study.21 The lower mortality rate in this study may be attributed to advancements in surgical technique and design of prostheses. The introduction of an ortho-geriatrician in managing hip fracture patients has also been proven to decrease mortality and complication rates.22
 
Postoperative complication rate
Postoperative complications included chest infection and acute coronary syndrome. The effect of surgeon aspects on outcomes, however, was the main factor under investigation in this study. Thus, surgical outcome was defined as complications related to surgical procedure only. General complications were excluded. For surgical outcome, fall in haemoglobin level with the need for blood transfusion, wound infection, and implant infection were analysed.
 
The overall surgical complication rate was 24.2% in this study compared with previous reports of 5% to 32% in hip fracture fixation.15 19 20 23 24 Nonetheless, different analyses and definitions of complication rate were used in these studies. Some studies defined complications as medical complications or unplanned return to the operating room,15 24 whereas others reported only wound infection, urinary tract infection, and deep vein thrombosis.19 Thus direct comparison with these studies was not possible. Further comparison of blood transfusion rate with previous studies was performed, as it represented the most common complication. The blood transfusion rate was 23.4% in this study, similar to the results in previous studies where transfusion after hip fixation ranged from 19% to 69%.25 This may be due to incomplete reporting in the CMS as blood transfusion was not always noted in the discharge summary. Despite the difficulties in direct comparison of the complication rate with previous study, we suggest that the rate in this study was reasonable.
 
Number and qualification of surgeons
Significantly fewer surgeons were involved in out-of-hours surgery. This may be because training of junior staff commonly occurs during the daytime. Although fewer surgeons were involved in out-of-hours surgeries, this may be compensated by the experience of the surgeon since a larger proportion of out-of-hours surgeries was performed by a specialist. Nonetheless, the difference was not significant.
 
Furthermore, the qualification of the surgeon had no association with surgical outcomes in this study. This may be because cases were screened prior to allocation. Difficult and more complex cases would likely be operated on by a more experienced surgeon. Holt et al26 showed comparable results in their study of the Scottish Hip Fracture Audit Database published in 2008. They studied more than 18 000 patients and concluded that grade of surgeon did not significantly affect surgical outcome.26
 
Strengths and limitations
This is the first local study based in a major hospital in Hong Kong to analyse the effect of operating time on surgical outcome. The out-of-hours group was split into before and after midnight so as to focus on surgeon fatigue. Analysis of surgeon expertise revealed that surgical outcome was not compromised by surgeon’s qualification.
 
There are several limitations in this study. First, this was a retrospective study with no functional outcomes. Information on complications was retrieved from the CMS only which might not have recorded all complications. A fracture registry or prospective study with more representative complications including prosthetic dislocation, peri-prosthetic fracture, implant loosening, fixation failure, malreduction, malfixation, and implant malposition is suggested in future. Data collection was performed by the authors who were not blinded so this might have introduced bias. Blood transfusion, the most common complication reported, was believed to be related to the operative procedure. Fall in haemoglobin level due to other causes, however, could not be excluded simply from details in the CMS. Second, the overall population size and the relatively smaller number of cases in the after-midnight group might not have the statistical power to show any difference. Further study with a larger sample size is suggested. Finally, several potential confounders were not investigated, for example, fractures were not classified according to stability and time to surgery. These factors may be associated with poorer outcome.
 
Conclusion
This study demonstrates similar outcomes of elderly patients with hip fracture in terms of mortality and postoperative complications for daytime and out-of-hours surgery. Qualification and number of surgeons involved were not associated with outcome. To facilitate better outcome with early operation, out-of-hours surgery remains a safe option and the only means to overcome limited resources.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Hong Kong population projections 2015-2064. Census and Statistics Department, the Government of the Hong Kong Administrative Region. Available from: http://www.statistics.gov.hk/pub/B1120015062015XXXXB0100.pdf. Accessed Dec 2015.
2. Ngai WK. Fragility Fracture Registry in Hong Kong. Proceedings of the Hospital Authority Convention 2014; 2014 May 7-8; Hong Kong.
3. Mak JC, Cameron ID, March LM; National Health and Medical Research Council. Evidence-based guidelines for the management of hip fractures in older persons: an update. Med J Aust 2010;192:37-41.
4. Australian & New Zealand Hip Fracture Registry. Australian and New Zealand guideline for hip fracture care. September 2014.
5. Evidence update—Hip fracture. London: National Institute for Health and Clinical Excellence. March 2013.
6. Shiga T, Wajima Z, Ohe Y. Is operative delay associated with increased mortality of hip fracture patients? Systematic review, meta-analysis, and meta-regression. Can J Anaesth 2008;55:146-54. Crossref
7. Sircar P, Godkar D, Mahgerefteh S, Chambers K, Niranjan S, Cucco R. Morbidity and mortality among patients with hip fractures surgically repaired within and after 48 hours. Am J Ther 2007;14:508-13. Crossref
8. Siegmeth AW, Gurusamy K, Parker MJ. Delay to surgery prolongs hospital stay in patients with fractures of the proximal femur. J Bone Joint Surg Br 2005;87:1123-6.Crossref
9. Simunovic N, Devereaux PJ, Sprague S, et al. Effect of early surgery after hip fracture on mortality and complications: systematic review and meta-analysis. CMAJ 2010;182:1609-16.Crossref
10. Desai V, Gonda D, Ryan SL, et al. The effect of weekend and after-hours surgery on morbidity and mortality rates in pediatric neurosurgery patients. J Neurosurg Pediatr 2015;16:726-31. Crossref
11. Kelz RR, Freeman KM, Hosokawa PW, et al. Time of day is associated with postoperative morbidity: an analysis of the national surgical quality improvement program data. Ann Surg 2008;247:544-52. Crossref
12. Scott SW, Bowrey S, Clarke D, Choke E, Bown MJ, Thompson JP. Factors influencing short- and long-term mortality after lower limb amputation. Anaesthesia 2014;69:249-58. Crossref
13. Dorotka R, Schoechtner H, Buchinger W. Influence of nocturnal surgery on mortality and complications in patients with hip fractures [in German]. Unfallchirurg 2003;106:287-93. Crossref
14. Chacko AT, Ramirez MA, Ramappa AJ, Richardson LC, Appleton PT, Rodriguez EK. Does late night hip surgery affect outcome? J Trauma 2011;71:447-53; discussion 453. Crossref
15. Bhattacharyya T, Vrahas MS, Morrison SM, et al. The value of the dedicated orthopaedic trauma operating room. J Trauma 2006;60:1336-40. Crossref
16. Wixted JJ, Reed M, Eskander MS, et al. The effect of an orthopedic trauma room on after-hours surgery at a level one trauma center. J Orthop Trauma 2008;22:234-6. Crossref
17. Giannoulis D, Calori GM, Giannoudis PV. Thirty-day mortality after hip fractures: has anything changed? Eur J Orthop Surg Traumatol 2016;26:365-70. Crossref
18. Poh KS, Lingaraj K. Complications and their risk factors following hip fracture surgery. J Orthop Surg (Hong Kong) 2013;21:154-7. Crossref
19. Rashid RH, Zubairi AJ, Slote MU, Noordin S. Hip fracture surgery: does time of the day matter? A case-controlled study. Int J Surg 2013;11:923-5. Crossref
20. Switzer JA, Bennett RE, Wright DM, et al. Surgical time of day does not affect outcome following hip fracture fixation. Geriatr Orthop Surg Rehabil 2013;4:109-16. Crossref
21. Ho ST, Chau YS, Wong WC. Short-term outcome of operated geriatric hip fracture. Hong Kong J Orthop Surg 1997;1:7-12.
22. Vidán M, Serra JA, Moreno C, Riquelme G, Ortiz J. Efficacy of a comprehensive geriatric intervention in older patients hospitalized for hip fracture: a randomized, controlled trial. J Am Geriatr Soc 2005;53:1476-82. Crossref
23. Zuckerman JD, Skovron ML, Koval KJ, Aharonoff G, Frankel VH. Postoperative complications and mortality associated with operative delay in older patients who have a fracture of the hip. J Bone Joint Surg Am 1995;77:1551-6. Crossref
24. Lawrence VA, Hilsenbeck SG, Noveck H, Poses RM, Carson JL. Medical complications and outcomes after hip fracture repair. Arch Intern Med 2002;162:2053-7. Crossref
25. Liodakis E, Antoniou J, Zukor DJ, Huk OL, Epure LM, Bergeron SG. Major complications and transfusion rates after hemiarthroplasty and total hip arthroplasty for femoral neck fractures. J Arthroplasty 2016;31:2008-12. Crossref
26. Holt G, Smith R, Duncan K, Finlayson DF, Gregori A. Early mortality after surgical fixation of hip fractures in the elderly: an analysis of data from the scottish hip fracture audit. J Bone Joint Surg Br 2008;90:1357-63. Crossref

Treatment of cutaneous angiosarcoma of the scalp and face in Chinese patients: local experience at a regional hospital in Hong Kong

Hong Kong Med J 2018 Feb;24(1):25–31 | Epub 12 Jan 2018
DOI: 10.12809/hkmj176813
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Treatment of cutaneous angiosarcoma of the scalp and face in Chinese patients: local experience at a regional hospital in Hong Kong
TL Chow, FRCS (Edin), FHKAM (Surgery)1; Wilson WY Kwan, FRCS (Edin), FHKAM (Surgery)1; CK Kwan, FRCR, FHKAM (Radiology)2
1 Head and Neck Division, Department of Surgery, United Christian Hospital, Kwun Tong, Hong Kong
2 Department of Oncology, United Christian Hospital, Kwun Tong, Hong Kong
 
Corresponding author: Dr TL Chow (chowtl@ha.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Angiosarcoma is a rare aggressive sarcoma that occurs mostly in the skin of the head and neck in the elderly population. The optimal management is dubious and most studies are from Caucasian populations. We aimed to examine the treatment and outcome of this disease in Chinese patients.
 
Methods: Data of patients with histopathologically verified cutaneous angiosarcoma of the head and neck during December 1997 to September 2016 were retrieved from our hospital cancer registry. The demographic data, clinicopathological information, modality of treatment, and outcomes were reviewed.
 
Results: During the study period, 17 Chinese patients were treated. Their median age was 81 years. The tumours were present in the scalp only (n=11), face only (n=4), or both scalp and face (n=2). Only two patients had distant metastases. The modalities of treatment were surgery (n=6), surgery and adjuvant radiotherapy (n=1), palliative radiotherapy (n=5), or palliative chemotherapy (n=3). The remaining two patients refused any treatment initially. Of the seven patients treated surgically, there were four local and two regional recurrences. The median time to relapse was 7.5 months. Overall, 16 patients had died; causes of death were disease-related in 12 whereas four other patients died of inter-current illnesses. One patient was still living with the disease. The median overall survival was 11.1 months and the longest overall survival was 42 months.
 
Conclusion: The outcome of angiosarcoma in our series is poor. A high index of suspicion is mandatory for prompt diagnosis. Adjuvant radiotherapy is recommended following surgery. The benefit and role of systemic treatment in various combinations with surgery or radiotherapy require further study.
 
 
New knowledge added by this study
  • Reports of angiosarcoma of the scalp and face in the Chinese population are limited. Patient survival in this local study was worse than that of other studies.
  • Literature review in this study supports the use of adjuvant radiotherapy to improve angiosarcoma control.
Implications for clinical practice or policy
  • An aggressive but not too radical surgery for head and neck cutaneous lesions is advocated.
  • Combination therapy (surgery, radiotherapy, and systemic treatment) in various combinations should be considered.
 
 
Introduction
Angiosarcoma is a rare form of sarcoma of vascular origin. It is notorious for its aggressive and relentless progression with frequent local recurrence and distant metastasis.1 Owing to its scarcity and innocuous appearance at an early stage mimicking an ordinary bruise or benign haemangioma, correct diagnosis is often delayed for several months. This problem is compounded further because most patients with angiosarcoma are elderly with frailty and co-morbidity, and prognosis after surgery as the definitive therapy is gloomy. The 5-year overall survival (OS) has been variously reported as 24% to 35%.1 2 When radiotherapy (RT) is given as the main treatment, median survival is only 8 months.3
 
Almost half (43%) of angiosarcomas originate from the skin of the head and neck.1 2 Compared with truncal and extremity angiosarcoma, the prognosis of cutaneous angiosarcoma (CAS) of the head and neck is even worse. Perez et al2 indicated a greater need for flap or graft reconstruction after tumour extirpation for head and neck CAS (HNCAS), a positive resection margin in 50%, and 5-year OS of 21.5%. Surgery was conventionally regarded as the mainstay of therapy for HNCAS. Because of the poor results and frequent margin involvement (78%),4 a multidisciplinary approach with adjuvant RT has been advocated.
 
To the best of our knowledge, most articles about HNCAS derived from a Caucasian population. We are not aware of any reported series from ethnic Chinese populations. We therefore conducted this retrospective study of HNCAS in the Chinese patients managed in our hospital over the past two decades. Demographic data, clinicopathological information, modality of treatment, and outcomes were reviewed. The latest approaches to the treatment of this devastating disease are also discussed.
 
Methods
Records of patients with histopathologically verified HNCAS from December 1997 to September 2016 were captured from the head and neck cancer registry of our department. Only Chinese patients were recruited. Patient data were prospectively collected and regularly updated in the registry.
 
Tumours were classified as a unifocal/localised versus multifocal/diffuse form. Unifocal/localised tumour was characterised by a discrete lesion without macroscopic satellitosis; it was considered operable and potentially curable. If gross satellite lesions were present or the main tumour was too extensive to be removed surgically, it was regarded as a multifocal/diffuse tumour that was probably incurable. Superficial tumours were those confined to the skin and subcutaneous tissue. Conversely, deep tumours were defined as transgression beyond the subcutaneous layer, for example encroaching on the underlying muscle or bone.
 
If the tumour was resectable, a wide excision with at least a 3-cm margin was performed. The defects were reconstructed by local scalp flap/ skin graft (scalp primary) or submental flap (face primary).5 Neck dissection was performed only in the presence of clinical or radiological evidence of nodal spread. Adjuvant RT was not routinely administered. When disease was deemed inoperable or the patient was unfit for surgery, palliative RT or chemotherapy would be considered.
 
The OS was calculated from the date of diagnosis to patient death or last follow-up, and is expressed in Kaplan-Meier curve. The data were computed using the SPSS (Windows version 20.0; IBM Corp, Armonk [NY], United States). The principles outlined in the 2013 version of the Declaration of Helsinki have been followed.
 
Results
A total of 17 patients with HNCAS were identified and managed in our institution from December 1997 to September 2016. Their demographic and clinicopathological information are shown in Table 1. In brief, males predominated and there were only two female patients. Their median age was 81 years (range, 67-92 years). Only one patient had a history of whole-scalp RT for a lateral scalp angiosarcoma more than 10 years ago in another institution. He had a new angiosarcoma on the opposite side of his scalp that was treated in our centre as a second primary angiosarcoma induced by past RT. The median duration from onset of presentation to diagnosis was 4 months (range, 2-33 months).
 

Table 1. Demographics, clinicopathological characteristics, and outcome
 
Patients presented with protean symptoms: bleeding (n=7), pain (n=4), nodule (n=3), ulceration (n=2), pigmentation (n=2), pruritus (n=1), and localised oedema (n=1). One patient presented with an asymptomatic purplish macule but no other symptoms.
 
The tumours were present in the scalp only (n=11), face only (n=4), or both scalp and face (n=2). No patient had neck CAS. Ten patients had a localised/unifocal tumour. Seven patients were inflicted by multiple/diffuse lesions that were considered inoperable and treated with palliative intent in four. Of the three remaining patients with multiple/diffuse lesions, the tumours were still amenable to potentially curative surgery although two (cases 7 and 15) declined any treatment initially. Deep invasion occurred in two patients. Of the 11 patients whose tumour dimension had been documented, the median diameter was 4 cm (range, 3-13 cm). In the other six patients in whom dimensions were not recorded, there was extensive involvement by the HNCAS.
 
At the time of diagnosis, the numbers of T1 (≤5 cm) and T2 (>5 cm) diseases were seven and 10, respectively. Regional nodal spread was present in six patients. Only two patients (cases 2 and 3) were found to have distant metastases: lung in one patient, and lung and spine in the other (Table 1). The modalities of therapy were surgery (n=6), surgery + adjuvant RT (n=1), palliative RT (n=5), and palliative chemotherapy (n=3; two of them also received palliative RT following chemotherapy). The remaining two patients (cases 7 and 15) refused any form of treatment initially (Table 2). Incorporating subsequent therapy, surgery, RT and chemotherapy were eventually offered to seven, 10, and five patients, respectively.
 

Table 2. Treatment and outcome
 
Of the seven patients treated surgically, the resection margin was positive in two. Tumour recurred in six of them: four local and two regional recurrences. The median time to relapse was 7.5 months (range, 2-32 months). Overall, 16 patients had died; the causes of death were HNCAS in 12 and inter-current diseases in four (Table 2). One patient (case 15) was still living with the disease 21 months after diagnosis. The median OS was 11.1 months and the longest OS in our series was 42 months (case 6) [Fig 1].
 

Figure 1. Kaplan-Meier curve for overall survival of patients with angiosarcoma
 
Discussion
The oncological outcome of this study is obviously far from satisfactory when compared with a reported median survival of 13.4 to 64 months as shown in Table 3,4 6 7 8 9 10 11 12 which summarises the postulated prognosticators. Only those studies performed over the past two decades and that focused on HNCAS with more than 10 cases were included. Patient age, tumour size, tumour differentiation, deep invasion, and margin status showed conflicting results. Of note, RT was the most promising and consistent prognosticator; only one study showed an adverse effect on survival.10 The disparity might well be due to selection bias for RT in a retrospective study—more advanced disease tends to receive adjuvant RT. The evidence lends credence to adjuvant RT for HNCAS. The suboptimal outcome in our study is multifactorial: detainment of diagnosis, inclusion of palliative cases for survival evaluation, advanced age precluding curative treatment, and unpopular adoption of RT or chemotherapy as multimodal therapy. In this series, adjuvant RT was administered to only one patient after surgery, either because others declined RT or wound complications precluded its application.
 

Table 3. Survival prognosticators from the literature
 
The diagnosis of HNCAS is often late as early lesions can simulate innocent violaceous macules masquerading as benign dermatological entities (Fig 2). From our experience, the median duration was 4 months prior to diagnostic confirmation with histopathological examination. Three of our patients (cases 7, 15, and 17) had their diagnosis made more than 12 months after onset of disease. The longest delay was 33 months (case 17) and is absolutely not acceptable. Interestingly, HNCAS manifested as pigmented lesions in two patients and thereby misled clinicians in decision making. Increased awareness of this rare disease by primary care clinicians is essential to expedite patient referral. For specialists, a low threshold to biopsy of newly developed purplish skin lesions in the elderly patients is pivotal to an early diagnosis.
 

Figure 2. Angiosarcoma of the scalp can mimic benign vascular lesions
 
In our series, patients who underwent palliative therapy were included in the OS calculation and this might have partially contributed to our poor results. This was echoed by Buschmann et al13 who also included patients with palliative resection in outcome evaluation; their longest survival reported was 36 months. This is similar to our experience where longest survival was 42 months. Conversely, Dettenborn et al10 reported 80 patients with HNCAS treated surgically (44 patients also received postoperative RT) with curative intent and 5-year OS of 54% and median OS of 64 months. Similarly, Suzuki et al12 described their experience of definitive RT as the principal curative treatment for HNCAS; a median OS survival of 31 months was attained. Nonetheless, the results of RT were compromised when palliative cases were incorporated: only 12% of patients survived more than 5 years in one study.14
 
Our patients were older (median age, 81 years) than those reported in the literature (median age, 63-77 years) [Table 3]. The prognostic significance of age on the outcome of HNCAS is controversial. Patel et al11 reported that patients younger than 70 years fared better with improved locoregional control and relapse-free survival than older patients. Although age is not a confirmed prognostic factor, advanced age often precludes patients from curative therapy due to concomitant chronic diseases or general debility. The elderly patients are also prone to dying of other disease as encountered in this series; four patients died of inter-current illnesses (Table 2). Nonetheless, effective systemic treatments can be well tolerated by some elderly patients with HNCAS. With taxane-based regimens, a response rate of 83% (10 out of 12 patients) was achieved and the progression-free survival was approximately 7 months.15
 
Surgery is historically the main treatment for HNCAS. The latest approach to optimal management of HNCAS is combined treatment encompassing surgery, RT, or chemotherapy. Adjuvant RT should be liberally offered to maximise the oncological outcomes following surgery. As shown in Table 3, five of six studies support the beneficial role of RT.4 6 7 8 10 11 Guadagnolo et al7 advocated simple resection of the gross tumour to facilitate non-complicated reconstruction and thus expedite RT. Two-staged surgery was discouraged. The resection margin status was not critical to survival if timely adjuvant RT was administered. From their experience, the 5-year OS was 43% and 5-year disease-specific survival was 46%. In a review article, Hwang et al16 also concluded that positive margin was common (64%) but did not impair the ultimate outcome. They recommended that surgeons should not be too obsessive about removing each and every cancer cell if RT was to be pursued.
 
Definitive treatment with RT and/or chemotherapy has also been reported to be effective for HNCAS.9 Of 17 patients treated in that study, complete and partial responses were accomplished in none and five patients, respectively. The median OS was 26 months. Multimodality treatment in various combinations with surgery, RT, and chemotherapy has been asserted by Patel et al11 to be effective in improving locoregional control, relapse-free survival, and OS. In another study, survival (37 months) following combined therapy (RT and chemotherapy) was better than either modality alone: 23 months for RT and 15 months for chemotherapy.16 Docetaxel is the preferred agent due to its antiangiogenic and radio-sensitising effects.9 Other agents have also been successfully used and include doxorubicin, ifosfamide, bevacizumab, and interleukin-2. Systemic treatment may be used in a neoadjuvant setting, adjuvant setting, and as concurrent treatment with RT.8 17 18 Conversely, RT plus chemotherapy was not shown to have any prognostic value in a meta-analysis by Shin et al.19 Further studies should be carried out to elucidate the benefit of combined modality treatment.
 
Ip and Lee20 reported a smaller local series of CAS that was not confined to the head and neck. A total of seven patients were enrolled from three clinics of the Social Hygiene Service in Hong Kong. Only six patients had HNCAS. Similarly, poor prognosis was demonstrated in these patients. Our bigger series focusing on HNCAS provides a more updated and detailed strategy for the management of this rare disease in ethnic Chinese.
 
Our study has some limitations. First, the sample size was small as only 17 patients were included for evaluation. Nevertheless it is the largest series reported in our locality. The study spanned over 19 years (December 1997 to September 2016) and treatment has evolved over this period. Moreover, like many retrospective series, recall bias or selection bias are inherent limitations. Patient symptoms, signs, presentation duration, and criteria for treatment might not be completely accurate.
 
Conclusion
We present the first report of HNCAS in ethnic Chinese. The oncological outcome is far from satisfactory. A high index of suspicion is mandatory for prompt diagnosis of early disease. Adjuvant RT, as supported by evidence from the literature, is recommended following surgery that should aim at gross tumour extirpation to ensure uneventful reconstruction as well as timely implementation of RT. The benefit and role of systemic treatment in various combinations with surgery or RT require further study.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Mark RJ, Poen JC, Tran LM, Fu YS, Juillard GF. Angiosarcoma. A report of 67 patients and a review of the literature. Cancer 1996;77:2400-6. Crossref
2. Perez MC, Padhya TA, Messina JL, et al. Cutaneous angiosarcoma: a single-institution experience. Ann Surg Oncol 2013;20:3391-7. Crossref
3. Sasaki R, Soejima T, Kishi K, et al. Angiosarcoma treated with radiotherapy: impact of tumor type and size on outcome. Int J Radiat Oncol Biol Phys 2002;52:1032-40. Crossref
4. Pawlik TM, Paulino AF, Mcgini CJ, et al. Cutaneous angiosarcoma of the scalp: a multidisciplinary approach. Cancer 2003;98:1716-26. Crossref
5. Chow TL, Chan TT, Chow TK, Fung SC, Lam SH. Reconstruction with submental flap for aggressive orofacial cancer. Plast Reconstr Surg 2007;120:431-6. Crossref
6. Aust MR, Olsen KD, Lewis JE, et al. Angiosarcoma of the head and neck: clinical and pathologic characteristics. Ann Otol Rhinol Laryngol 1997;106:943-51. Crossref
7. Guadagnolo BA, Zagars GK, Araujo D, Ravi V, Shellenberger TD, Sturgis EM. Outcomes after definitive treatment for cutaneous angiosarcoma of the face and scalp. Head Neck 2011;33:661-7. Crossref
8. Ogawa K, Takahashi K, Asato Y, et al. Treatment and prognosis of angiosarcoma of the scalp and face: a retrospective analysis of 48 patients. Br J Radiol 2012;85:e1127-33. Crossref
9. Miki Y, Tada T, Kamo R, et al. Single institutional experience of the treatment of angiosarcoma of the face and scalp. Br J Radiol 2013;86:20130439. Crossref
10. Dettenborn T, Wermker K, Schulze HJ, Klein M, Schwipper V, Hallermann C. Prognostic features in angiosarcoma of the head and neck: a retrospective monocenter study. J Craniomaxillofac Surg 2014;42:1623-8. Crossref
11. Patel SH, Hayden RE, Hinni ML, et al. Angiosarcoma of the scalp and face: the Mayo Clinic experience. JAMA Otolaryngol Head Neck Surg 2015;141:335-40. Crossref
12. Suzuki G, Yamazaki H, Takenaka H, et al. Definitive radiation therapy for angiosarcoma of the face and scalp. In Vivo 2016;30:921-6.Crossref
13. Buschmann A, Lehnhardt M, Toman N, Preiler P, Salakdeh MS, Muehlberger T. Surgical treatment of angiosarcoma of the scalp: less is more. Ann Plast Surg 2008;61:399-403. Crossref
14. Holden CA, Spittle MF, Jones EW. Angiosarcoma of the face and scalp, prognosis and treatment. Cancer 1987;59:1046-57. Crossref
15. Letsa I, Benson C, Al-Muderis O, Judson I. Angiosarcoma of the face and scalp: effective systemic treatment in the older population. J Geriatr Oncol 2014;5:276-80. Crossref
16. Hwang K, Kim MY, Lee SH. Recommendations for therapeutic decisions of angiosarcoma of the scalp and face. J Craniofac Surg 2015;26:e253-6. Crossref
17. Wollina U, Fuller J, Graefe T, Kaatz M, Lopatta E. Angiosarcoma of the scalp: treatment with liposomal doxorubicin and radiotherapy. J Cancer Res Clin Oncol 2001;127:396-9. Crossref
18. Yang P, Zhu Q, Jiang F. Combination therapy for scalp angiosarcoma using bevacizumab and chemotherapy: a case report and review literature. Chin J Cancer Res 2013;25:358-61.
19. Shin JY, Roh SG, Lee NH, Yang KM. Predisposing factors for poor prognosis of angiosarcoma of the scalp and face: systemic review and meta-analysis. Head neck 2017;39:380-6. Crossref
20. Ip FC, Lee CK. Cutaneous angiosarcoma: a case series in Hong Kong. Hong Kong J Dermatol Venereol 2010;18:6-14.

Implications of nipple discharge in Hong Kong Chinese women

Hong Kong Med J 2018 Feb;24(1):18–24 | Epub 5 Jan 2018
DOI: 10.12809/hkmj154764
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE  CME
Implications of nipple discharge in Hong Kong Chinese women
WM Kan, FCSHK, FHKAM (Surgery)1; Clement Chen, FRCS, FHKAM (Surgery)2; Ava Kwong, FRCS, FHKAM (Surgery)2
1 Department of Surgery, Queen Elizabeth Hospital, Jordan, Hong Kong
2 Department of Surgery, Queen Mary Hospital, Pokfulam, Hong Kong
 
Corresponding author: Dr Ava Kwong (avakwong@hku.hk)
 
 Full paper in PDF
 
Abstract
Introduction: There are no recent data on nipple discharge and its association with malignancy in Hong Kong Chinese women. This study reported our 5-year experience in the management of patients with nipple discharge, and our experience of mammography, ultrasonography, ductography, and nipple discharge cytology in an attempt to determine their role in the management of nipple discharge.
 
Methods: Women who attended our Breast Clinic in a university-affiliated hospital in Hong Kong were identified by retrospective review of clinical data from January 2007 to December 2011. They were divided into benign and malignant subgroups. Background clinical variables and investigative results were compared between the two subgroups. We also reported the sensitivity, specificity, and positive and negative predictive values of the investigations that included mammography, ultrasonography, ductography, and cytology.
 
Results: We identified 71 and 31 patients in the benign and malignant subgroups, respectively. The median age at presentation for the benign subgroup was younger than that of the malignant subgroup (48 vs 59 years; P=0.003). A higher proportion of patients in the malignant subgroup than the benign subgroup presented with blood-stained nipple discharge (87.1% vs 47.9%; P=0.002). Mammography had a specificity of 98.4% and positive predictive value of 66.7%; ultrasonography had a specificity of 87.0% and negative predictive value of 75.0%. Cytology and ductography were sensitive but lacked specificity. Ductography had a negative predictive value of 100% but a low positive predictive value (14.0%). Clinical variables including age at presentation, duration of discharge, colour of discharge, presence of an associated breast mass, and abnormal sonographic findings were important in suggesting the underlying pathology of nipple discharge. Multiple logistic regression showed that blood-stained discharge and an associated breast mass were statistically significantly more common in the malignant subgroup.
 
Conclusions: In patients with non–blood-stained nipple discharge, a negative clinical breast examination combined with negative imaging could reasonably infer a benign underlying pathology.
 
 
New knowledge added by this study
  • Blood-stained nipple discharge and an associated breast mass at presentation could suggest a higher chance of malignancy.
Implications for clinical practice or policy
  • A period of watchful waiting is a reasonable alternative to surgical intervention in patients with inferred benign pathology.
 
 
Introduction
Nipple discharge is a relatively uncommon complaint in Hong Kong Chinese women. According to a study in 1997, nipple discharge constituted 1.5% of all presenting complaints for women who attended a breast clinic in Hong Kong.1 On the contrary, nipple discharge accounted for up to 4% to 7% of all presenting symptoms in other studies.2 3 This may be better explained by the unique Chinese culture and help-seeking pattern rather than a true disease pattern. With this understanding, any clinical survey will probably underestimate the prevalence of nipple discharge in Chinese women. When patients approach health care professionals because of nipple discharge, not only is it important to differentiate malignant from benign causes of nipple discharge, it is also a valuable opportunity to promote breast health awareness.
 
Numerous studies have demonstrated the relationship between breast cancer and nipple discharge, with malignancy reported in up to 9.3% to 21% of all patients who present with nipple discharge.4 5 The most challenging role of breast surgeons is to accurately identify these patients. Notwithstanding, controversy persists about the value and accuracy of individual investigative tools for nipple discharge.6
 
There are no recent data on nipple discharge and its association with malignancy in Chinese women in Hong Kong. The primary aim of this study was to report our recent experience in the management of patients with nipple discharge in a single surgical centre. The secondary aim was to report our experience of individual investigative tools in an attempt to determine their role in the management of nipple discharge.
 
Methods
We retrospectively reviewed the clinical data of patients who attended our Breast Clinic at the Queen Mary Hospital, a university-affiliated hospital in Hong Kong, for nipple discharge from January 2007 to December 2011. Potential subjects were identified when diagnosis coding 611.79 (other signs and symptoms in breast) was entered into our Clinical Management System, which is a territory-wide computer-based medical record system designed for use in public hospitals, and also from the prospective database of the Division of Breast Surgery, The University of Hong Kong.
 
Data extraction and coding were performed by the first author (WM Kan) and included duration of follow-up until December 2011, age at presentation, history of breast condition, and laterality and duration of nipple discharge before first consultation. Clinical variables included colour of nipple discharge, single- or multiple-duct discharge, associated symptoms, mammographic and ultrasonographic imaging results, as well as ductogram and cytology results. Pathology results were recorded for patients who underwent surgery or biopsy.
 
In order to make a meaningful comparison, we divided patients into malignant and benign subgroups. The malignant subgroup was defined by malignant pathology on a surgically resected specimen. The benign subgroup was defined by benign pathology of a surgically resected or biopsy specimen, or clinical non-progression after more than 2 years of follow-up. Patients who did not undergo surgery or biopsy and who were followed up for less than 2 years were excluded (Fig).
 

Figure. Algorithm for patient selection
 
In the first part of our study, we compared the background clinical variables and investigative results between the two subgroups. In the second part of our analysis, we reported the sensitivity, specificity, positive predictive value, and negative predictive value of individual investigative tools.
 
For the purpose of this analysis, we also classified the results of clinical examination, mammography, ultrasonography, and cytology as ‘test positive’ or ‘test negative’ for underlying malignancy. Presence of a palpable breast mass (regardless of mobility) was considered a positive result and no palpable breast mass a negative result. For mammographic findings, microcalcifications were considered a positive result. For ultrasonography, a detectable mass was ‘test positive’ for underlying malignancy; non-solitary dilated ducts, cysts, and normal ultrasonogram were regarded as ‘test negative’. For ductogram results, dilated ducts, irregularity, and the presence of ductal filling defects were considered positive. For cytology, atypical, suspicious, and malignant were considered ‘test positive’, and benign as ‘test negative’. This study was done in accordance with the principles outlined in the Declaration of Helsinki.
 
Statistical analysis
R version 3.0.2 (the R Foundation) and the SPSS (Windows version 14.0; SPSS Inc, Chicago [IL], United States) were used for data analysis. To determine the differences between subgroups, Wilcoxon rank sum test and Fisher’s exact test were used for numerical data and categorical data, respectively. Multiple logistic regression was performed to examine the odds ratios of the factors. Backward selection through likelihood ratio test with removal of P value of 0.1 was conducted for model selection. Variables in univariate analysis with a P value of <0.1 were included in the full model. A P value of <0.05 was considered statistically significant.
 
Results
Table 1 summarises the first part of our analysis. We identified 102 patients who presented to our Breast Clinic during the study period. They had either a tissue diagnosis or had been followed up for longer than 2 years without tissue diagnosis. There were 31 and 71 patients in the malignant and benign subgroups, respectively.
 

Table 1. Bivariate analysis of tumour type (benign or malignant) and other clinical variables
 
The median age at presentation of the benign subgroup was significantly younger than that of the malignant subgroup (48 vs 59 years; P=0.003). The median interval between onset of nipple discharge and first presentation was significantly longer in the benign subgroup than in the malignant subgroup (13 vs 4 weeks; P=0.002).
 
Comparing the two subgroups, a larger proportion of patients in the malignant subgroup presented with blood-stained discharge (87.1% vs 47.9%; P=0.002) and had a breast mass at presentation (46.7% vs 7.0%; P<0.001). For the individual investigative modalities, with the exception of ultrasonography, neither mammography, ductography nor cytology showed any statistically significant difference between the malignant and benign subgroups.
 
Table 2 summarises the second part of the study. We calculated the sensitivity, specificity, and positive and negative predictive values of mammographic, ultrasonographic, cytological, and ductographic findings. There were 83, 95, 27, and 46 patients who underwent mammography, ultrasonography, cytology, and ductography, respectively. The positive and negative predictive values of cytology were 41.2% and 80.0%, respectively. Ductography had a sensitivity of 100%, specificity of 7.5%, positive predictive value of 14.0%, and negative predictive value of 100%.
 

Table 2. Sensitivity, specificity, and positive/negative predictive values of different modalities
 
Multiple logistic regression analysis with backward selection was performed. Covariates with a P value of <0.1 were included in the full model (Table 1). By likelihood ratio test and removal of variables with a P value of >0.1, duration of nipple discharge, colour of nipple discharge, mastalgia, and associated mass remained in the final model (Table 3).
 

Table 3. Multiple logistic regression of factors associated with malignancy
 
Compared with serous, milky and brownish discharge, patients with blood-stained discharge had a significantly higher risk for malignancy (odds ratio=13.368; 95% confidence interval, 1.926-92.809). In addition, compared with patients having no symptoms, those with a breast mass had a significantly higher risk for malignancy (odds ratio=14.648; 95% confidence interval, 3.155-68.000) [Table 3].
 
Discussion
A methodologically ideal study of nipple discharge would require every patient to undergo the same investigations and also surgery for final pathology. This, however, would be unethical. For patients who opted for non-operative management of nipple discharge, our retrospective study considered 2-year clinical non-progression a reasonable surrogate for benign breast pathology.
 
Clinical variables
Women in the malignant subgroup were significantly older at presentation than their benign counterparts. This was in agreement with the fact that physiological nipple discharge is more common in younger premenopausal women. Caution should be exercised in postmenopausal women who present with nipple discharge and the possibility of malignancy investigated before concluding a benign pathology.
 
With respect to the colour of nipple discharge, underlying benign and malignant causes had a different pattern. Benign pathology was more likely to be associated with non–blood-stained discharge (n=37, 52.1%), whereas malignant pathology was more likely to be associated with blood-stained discharge (n=27, 87.1%). This is not pathognomonic but did reach statistical significance.
 
The differentiation between multiple-duct and single-duct discharge showed no association with underlying pathology.
 
Mammography and ultrasonography
As shown in Table 2, mammography had a higher specificity of 98.4% and positive predictive value of 66.7% but a disappointingly low sensitivity of 9.5%. Therefore, a normal mammogram did not confidently exclude malignancy. On the other hand, breast ultrasonography had a specificity and negative predictive value of 87.0% and 75.0%, respectively. Mammography was routinely offered to patients who presented with nipple discharge. Complementary breast ultrasonography was also arranged, especially for younger Asian women with denser breasts on mammography.7 In our experience, complementary ultrasonography increases the overall sensitivity and negative predictive value compared with mammography alone.
 
Nipple discharge cytology
Opinion is divided on the value of cytological examination. While some studies report a complementary diagnostic value and recommend its routine use,8 9 others report it has little such value and advise against its routine use.10
 
Of the 102 patients, 36 had demonstrable nipple discharge at consultation with a sample collected for examination. Of these 36 specimens, only 27 showed a sufficient number of cells to make a cytological diagnosis. Nonetheless, we attempted to analyse its accuracy. The sensitivity and specificity of cytological examination were 77.8% and 44.4%, respectively. Its positive predictive value was disappointingly low at 41.2% and its negative predictive value was 80.0%. The diagnostic value of this investigation was limited as not every patient had demonstrable nipple discharge and not every specimen contained adequate cells for testing. Nonetheless, this investigation is minimally invasive so was always performed if there was demonstrable nipple discharge, although it rarely affected the clinical decision or plan of management.
 
Ductography
The value of ductography is debatable. While some studies have validated the diagnostic value of preoperative ductography in differentiating benign and malignant pathology,11 12 others doubt its value.13 Rather than differentiating benign and malignant pathology, we used preoperative ductogram to aid in the localisation of non-palpable lesions.14 15 The sensitivity was 100% whereas the specificity was low at 7.5%, with a positive predictive value of 14.0% and a negative predictive value of 100%.
 
Magnetic resonance imaging
Magnetic resonance imaging was not included in our routine evaluation of patients with nipple discharge although we acknowledge its value in the detection of carcinoma in these patients. It has an exceptionally high sensitivity for both invasive and in-situ carcinoma.16 Its routine use in patients with a breast lesion is nonetheless limited by its relatively low specificity of 72% (95% confidence interval, 67%-77%).17 The role of magnetic resonance imaging in patients with nipple discharge has been extensively validated,18 19 20 21 suggesting that it may detect or exclude the presence of carcinoma with a high degree of certainty. Magnetic resonance imaging may be considered when all other available strategies are inconclusive.
 
Microdochectomy
Emerging evidence suggests that neither clinical variables nor preoperative investigations reliably distinguish benign and malignant pathology so duct excision should be offered to every patient with nipple discharge.22 23 24 25 26 We offered microdochectomy to patients with no palpable breast lesion based on two indications: clinical or radiological suspicion, or a patient’s wish to stop nipple discharge by surgery. It is likely that offering microdochectomy to all patients with nipple discharge would result in overtreatment as the final pathology was benign in most cases. In patients with negative clinical examination and negative imaging findings, a period of watchful waiting with regular follow-up is a reasonable alternative to surgical intervention.
 
The association of blood-stained discharge with malignancy is controversial. Morrogh et al24 reported that haemorrhagic discharge did not indicate malignancy or high risk, and non-haemorrhagic discharge did not exclude malignancy. In our study, we showed that blood-stained discharge was associated with malignancy but was not pathognomonic.
 
On the other hand, presence of an associated breast mass was a significant finding. This may be because it is the most common presenting symptom of breast cancer, and its incidence rises with age.
 
Limitations
Our study had several limitations. First, as data collection was retrospective, there might have been inconsistent or incomplete recording of clinical findings. Study subjects might not be representative and some data for importable variables might have been missing. No blinding during information extraction or coding could be achieved as it was performed by the first author. Second, the small sample size limited the power of our study although this could in part be due to the relatively conservative culture and help-seeking pattern of Hong Kong Chinese women. The unequal arm size also limited the interpretation of statistical significance of comparisons. Third, our assumption of 2-year clinical non-progression as benign pathology might have underestimated the true incidence of malignancy in our group of patients. Lastly, the small number of adequate cytology specimens limited meaningful analysis of this investigation. As the sample taken for cytology is usually small, it will affect the sensitivity.
 
Conclusions
Clinical variables including age at presentation, duration and colour of discharge, presence of an associated breast mass, and abnormal sonographic findings were important in suggesting the underlying pathology of nipple discharge. Only blood-stained nipple discharge and an associated breast mass remained in the multiple logistic regression model and were statistically significant. In patients with non–blood-stained nipple discharge, as well as a negative clinical breast examination and imaging, we may infer an underlying benign pathology. Further prospective studies with a larger sample size are advocated.
 
Declaration
All authors have disclosed no conflicts of interest.
 
Acknowledgements
The authors would like to thank Mr Wing-pan Luk and Mr Ling-hiu Fung, Medical Physics & Research Department, Hong Kong Sanatorium & Hospital, Hong Kong for their statistical contribution to this paper.
 
References
1. Cheung KL, Alagaratnam TT. A review of nipple discharge in Chinese women. J R Coll Surg Edinb 1997;42:179-81.
2. Murphy IG, Dillon MF, Doherty AO, et al. Analysis of patients with false negative mammography and symptomatic breast carcinoma. J Surg Oncol 2007;96:457-63. Crossref
3. Vargas HI, Vargas MP, Eldrageely K, Gonzalez KD, Khalkhali I. Outcomes of clinical and surgical assessment of women with pathological nipple discharge. Am Surg 2006;72:124-8.
4. Murad TM, Contesso G, Mouriesse H. Nipple discharge from the breast. Ann Surg 1982;195:259-64. Crossref
5. King TA, Carter KM, Bolton JS, Fuhrman GM. A simple approach to nipple discharge. Am Surg 2000;66:960-6.
6. Jain A, Crawford S, Larkin A, Quinlan R, Rahman RL. Management of nipple discharge: technology chasing application. Breast J 2010;16:451-2. Crossref
7. Kwong A, Cheung PS, Wong AY, et al. The acceptance and feasibility of breast cancer screening in the East. Breast 2008;17:42-50. Crossref
8. Pritt B, Pang Y, Kellogg M, St. John T, Elhosseiny A. Diagnostic value of nipple cytology: study of 466 cases. Cancer 2004;102:233-8. Crossref
9. Kalu ON, Chow C, Wheeler A, Kong C, Wapnir I. The diagnostic value of nipple discharge cytology: breast imaging complements predictive value of nipple discharge cytology. J Surg Oncol 2012;106:381-5. Crossref
10. Kooistra BW, Wauters C, van de Ven S, Strobbe L. The diagnostic value of nipple discharge cytology in 618 consecutive patients. Eur J Surg Oncol 2009;35:573-7. Crossref
11. Hou MF, Huang TJ, Liu GC. The diagnostic value of galactography in patients with nipple discharge. Clin Imaging 2001;25:75-81. Crossref
12. Hou MF, Huang CJ, Huang YS, et al. Evaluation of galactography for nipple discharge. Clin Imaging 1998;22:89-94. Crossref
13. Dawes LG, Bowen C, Venta LA, Morrow M. Ductography for nipple discharge: no replacement for ductal excision. Surgery 1998;124:685-91. Crossref
14. Peters J, Thalhammer A, Jacobi V, Vogl TJ. Galactography: an important and highly effective procedure. Eur Radiol 2003;13:1744-7. Crossref
15. Lamont JP, Dultz RP, Kuhn JA, Grant MD, Jones RC. Galactography in patients with nipple discharge. Proc (Bayl Univ Med Cent) 2000;13:214-6. Crossref
16. Heywang-Koebrunner SH. Diagnosis of breast cancer with MR—review after 1250 patients. Electromedica 1993;61:43-52.
17. Peters NH, Borel Rinkes IH, Zuithoff NP, Mali WP, Moons KG, Peeters PH. Meta-analysis of MR imaging in the diagnosis of breast lesions. Radiology 2008;246:116-24. Crossref
18. Orel SG, Dougherty CS, Reynolds C, Czerniecki BJ, Siegelman ES, Schnall MD. MR imaging in patients with nipple discharge: initial experience. Radiology 2000;216:248-54. Crossref
19. Nakahara H, Namba K, Watanabe R, et al. A comparison of MR imaging, galactography and ultrasonography in patients with nipple discharge. Breast Cancer 2003;10:320-9. Crossref
20. Hirose M, Otsuki N, Hayano D, et al. Multi-volume fusion imaging of MR ductography and MR mammography for patients with nipple discharge. Magn Reson Med Sci 2006;5:105-12. Crossref
21. Ballesio L, Maggi C, Savelli S, et al. Role of breast magnetic resonance imaging (MRI) in patients with unilateral nipple discharge: preliminary study [in English, Italian]. Radiol Med 2008;113:249-64. Crossref
22. Adepoju LJ, Chun J, El-Tamer M, Ditkoff BA, Schnabel F, Joseph KA. The value of clinical characteristics and breast-imaging studies in predicting a histopathologic diagnosis of cancer or high-risk lesion in patients with spontaneous nipple discharge. Am J Surg 2005;190:644-6. Crossref
23. Lanitis S, Filippakis G, Thomas J, Christofides T, Al Mufti R, Hadjiminas DJ. Microdochectomy for single-duct pathologic nipple discharge and normal or benign imaging and cytology. Breast 2008;17:309-13. Crossref
24. Morrogh M, Park A, Elkin EB, King TA. Lessons learned from 416 cases of nipple discharge of the breast. Am J Surg 2010;200:73-80. Crossref
25. Alcock C, Layer GT. Predicting occult malignancy in nipple discharge. ANZ J Surg 2010;80:646-9. Crossref
26. Foulkes RE, Heard G, Boyce T, Skyrme R, Holland PA, Gateley CA. Duct excision is still necessary to rule out breast cancer in patients presenting with spontaneous bloodstained nipple discharge. Int J Breast Cancer 2011;2011:495315. Crossref

Characteristics and clinical outcomes of living renal donors in Hong Kong

Hong Kong Med J 2018 Feb;24(1):11–7 | Epub 29 Dec 2017
DOI: 10.12809/hkmj176820
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Characteristics and clinical outcomes of living renal donors in Hong Kong
YL Hong, MSc1; CH Yee, FHKAM (Surgery)1; CB Leung, FHKAM (Surgery)2; Jeremy YC Teoh, FHKAM (Surgery)1; Bonnie CH Kwan, FHKAM (Medicine)2; Philip KT Li, FHKAM (Medicine)2; Simon SM Hou, FHKAM (Surgery)1; CF Ng, FHKAM (Surgery)1
1 SH Ho Urology Centre, Department of Surgery, Prince of Wales Hospital, The Chinese University of Hong Kong, Shatin, Hong Kong
2 Division of Nephrology, Department of Medicine and Therapeutics, Prince of Wales Hospital, The Chinese University of Hong Kong, Shatin, Hong Kong
 
Corresponding author: Prof CF Ng (ngcf@surgery.cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: In Asia, few reports are available on the outcomes for living renal donors. We report the short- and long-term clinical outcomes of individuals following living donor nephrectomy in Hong Kong.
 
Methods: We retrospectively reviewed the characteristics and clinical outcomes of all living renal donors who underwent surgery from January 1990 to December 2015 at a teaching hospital in Hong Kong. Information was obtained from hospital records and territory-wide electronic patient records.
 
Results: During the study period, 83 individuals underwent donor nephrectomy. The mean (± standard deviation) follow-up time was 12.0 ± 8.3 years, and the mean age at nephrectomy was 37.3 ± 10.0 years. A total of 44 (53.0%), four (4.8%), and 35 (42.2%) donors underwent living donor nephrectomy via an open, hand-port assisted laparoscopic, and laparoscopic approach, respectively. The overall incidence of complications was 36.6%, with most being grade 1 or 2. There were three (9.4%) grade 3a complications; all were related to open donor nephrectomy. The mean glomerular filtration rate was 96.0 ± 17.5 mL/min/1.73 m2 at baseline and significantly lower at 66.8 ± 13.5 mL/min/1.73 m2 at first annual follow-up (P<0.01). The latest mean glomerular filtration rate was 75.6% ± 15.1% of baseline. No donor died or developed renal failure. Of the donors, 14 (18.2%) developed hypertension, two (2.6%) had diabetes mellitus, and three (4.0%) had experienced proteinuria.
 
Conclusion: The overall perioperative outcomes are good, with very few serious complications. The introduction of a laparoscopic approach has decreased perioperative blood loss and also shortened hospital stay. Long-term kidney function is satisfactory and no patients developed end-stage renal disease. The incidences of new-onset medical diseases and pregnancy-related complications were also low.
 
 
New knowledge added by this study
  • The overall perioperative outcomes are good, with very few serious complications, among living renal donors. The introduction of a laparoscopic approach has decreased perioperative blood loss and also shortened hospital stay.
  • Long-term kidney function was satisfactory and no patients developed end-stage renal disease (ESRD).
  • The incidences of new-onset medical disease and pregnancy-related complications were also low.
Implications for clinical practice or policy
  • Medical practitioners should encourage relatives of patients with ESRD to consider the possibility of kidney donation.
 
 
Introduction
Chronic kidney disease (CKD) is the progressive loss of kidney function over a period of time. End-stage renal disease (ESRD) is the final stage of CKD. Patients with ESRD require renal replacement therapy that includes haemodialysis, peritoneal dialysis, and renal transplantation.
 
Currently, there are approximately 7000 patients on various forms of renal replacement therapy being cared for in the public sector in Hong Kong. As of 31 December 2016, 2047 patients were on the renal transplant waiting list. Nonetheless, between 2007 and 2016, only 58 to 87 cadaveric renal transplants were performed in Hong Kong each year.1 With the long waiting list and low number of cadaveric kidneys available, living donor renal transplant is the only possible alternative. It offers advantages over other renal replacement therapies, as it provides better long-term results, shortens the waiting time for an organ, lowers the risk of complications or rejection, and provides better quality of life after recovery. Despite these advantages, only seven to 15 living donor transplants were performed each year between 2007 and 2015 at the hospitals of the Hong Kong Hospital Authority.1
 
One of the major fears of an individual who is considering living organ donation concerns possible clinical outcomes. Although studies show that living donors have a similar to or better life expectancy than the general population, they are nevertheless at increased risk of developing ESRD, hypertension, gestational hypertension, and pre-eclampsia.2 3 4
 
In Hong Kong, few reports on the perioperative, short-term, and long-term clinical outcomes are available, especially those related to the minimally invasive surgical approach now employed for donor nephrectomy. This study reports our observation of characteristics of donors, and the short- and long-term clinical outcomes following living donor nephrectomy in Hong Kong.
 
Methods
Study design
We retrospectively reviewed the characteristics and short- and long-term clinical outcomes of all patients who underwent living donor nephrectomy at the Prince of Wales Hospital in Hong Kong between January 1990 and December 2015. Information was obtained from the Clinical Management System that includes the majority of electronic patient records—including consultation histories, operation records, radiology results, laboratory results, and medication records—collected and filed under the Hospital Authority since 2000. Medical records before 2000 and pregnancy-related information were reviewed manually by formally trained medical students and cross-checked by a urologist, and retrieved from the medical records of the involved patients.
 
The study was conducted in accordance with the principles outlined in the Declaration of Helsinki, and approved by the Joint Chinese University of Hong Kong–New Territories East Cluster Clinical Research Ethics Committee, with the requirement of patient informed consent waived because of its retrospective nature.
 
Study measures
Baseline demographics including sex, age at donation, ethnicity, relationship with recipient, diabetes mellitus status, hypertension status, body mass index, and serum creatinine level were obtained. Glomerular filtration rate (GFR) was derived from the serum creatinine level using a modified equation from the Modification of Diet in Renal Disease (MDRD) study.5 Operation details, including surgical approach, laterality of donated kidney, operating time, warm ischaemia time, blood loss, and need for transfusion were retrieved.
 
Short-term complications within 30 days of surgery were classified according to the Clavien-Dindo classification of surgical complications.6 Long-term outcomes were also assessed, with particular reference to development of hypertension, diabetes mellitus, renal stones, proteinuria, and renal failure. Serial changes in GFR were also assessed.
 
For female donors, pregnancy-related variables were recorded and included any pregnancy after surgery, records of pregnancy-related hydronephrosis, pregnancy-related urinary tract infection, pre-eclampsia, gestational diabetes mellitus, gestational hypertension, and any fetal loss.
 
Statistical analyses
All statistical analyses were performed using the SPSS (Windows version 23.0; IBM Corp, Armonk [NY], United States). Categorical variables were presented in counts and percentages while continuous variables were presented as mean ± standard deviation. Outcomes following open and laparoscopic techniques were compared by Chi squared test or Fisher’s exact test for categorical variables, and independent t test or Mann-Whitney U test for continuous variables. Paired t test or Wilcoxon rank sum test, whichever was appropriate, was used to evaluate the pre- and post-difference in GFR. A P value of <0.05 was considered statistically significant. Missing data were excluded from analysis.
 
Results
Donor characteristics
Between 1 January 1990 and 31 December 2015, a total of 83 donors underwent unilateral nephrectomy at the Prince of Wales Hospital. In one donor, records could not be traced, with only information about the sex, age at nephrectomy, and type of surgical technique.
 
Of the 83 donors, 56 (67.5%) were female. The mean age at nephrectomy was 37.3 ± 10.0 years. The majority were Chinese (97.6%) and a first-degree relative of the recipient (79.3%). None had hypertension or diabetes mellitus. The mean preoperative GFR was 96.0 ± 17.5 mL/min/1.73 m2. Nine (11.0%) donors had thalassaemia trait, four (4.9%) had hepatitis B, and two (2.4%) had asthma (Table 1).
 

Table 1. Baseline characteristics of kidney donors
 
Operation details and short-term outcomes
Around half (n=44, 53.0%) of the donors underwent open living donor nephrectomy, as this was the only technique used at our centre until 2002. After 2002, a hand-port assisted laparoscopic approach (n=4, 4.8%) and later a laparoscopic approach (n=35, 42.2%) were adopted. In most instances, the left kidney was donated (n=77, 93.9%) [Table 2].
 

Table 2. Comparison of operation details and short-term outcomes by operation techniques
 
Comparing laparoscopic or hand-port assisted laparoscopic living donor nephrectomy (LDN) with open donor nephrectomy (ODN), LDN was associated significantly with longer warm ischaemia time (309.0 ± 113.0 s vs 62.0 ± 17.9 s; P<0.01), less blood loss (55.3 ± 33.7 mL vs 532.2 ± 270.0 mL; P<0.01), and shorter hospital stay (5.7 ± 2.0 days vs 8.1 ± 1.9 days; P<0.01). In addition, LDN was associated significantly with more short-term complications (53.8% vs 20.9%; P<0.01). The most commonly experienced complication was epigastric pain/nausea and vomiting (n=18, 56.3%), followed by fever requiring medication (n=4, 12.5%). Most complications were grade 1 on the Clavien-Dindo classification scale (n=16, 50.0%), only three (9.4%) were grade 3a and all were related to ODN. The grade 3a complications were wound dehiscence that required a second operation for re-suturing, persistent pancreatic fluid discharge that required insertion of a pancreatic stent, and pneumothorax with chest drain inserted.
 
Long-term outcomes
The mean follow-up time was 12.0 ± 8.3 years. The mean GFR was 96.0 ± 17.5 mL/min/1.73 m2 at baseline and it dropped significantly to 66.8 ± 13.5 mL/min/1.73 m2 at 1-year follow-up (P<0.01). The GFR then gradually improved until 8 years after surgery and became stable (Fig). Of 73 living donors with at least one follow-up (mean follow-up time, 12.0 ± 8.2 years) and baseline serum creatinine level available, the latest GFR was 75.6% ± 15.1% of baseline GFR with the mean latest GFR being 71.3 ± 14.2 mL/min/1.73 m2. The mean GFR was 70.4% ± 12.3% of baseline level 1 year after surgery. Comparison of latest GFR with that 1 year after surgery revealed that it was stable (± 10% change) in 23 (39.0%) of 59 patients and higher (>10% increment) in 29 (49.2%) patients. None of the donors had died or developed ESRD. Fourteen (18.2%) donors developed hypertension, two (2.6%) had diabetes mellitus, and three (4.0%) had experienced proteinuria (Table 3).
 

Figure. The annual mean glomerular filtration rate (GFR) after surgery
 

Table 3. Long-term outcomes for kidney donors
 
Pregnancy-related complications
Of 56 female donors, 11 (19.6%) became pregnant after kidney donation: 17 pregnancies were reported. None of the pregnant donors experienced gestational hydronephrosis or gestational hypertension. Three donors each had gestational diabetes mellitus, pre-eclampsia, and post-delivery urinary tract infection. Two donors had experienced fetal loss, one in the first trimester and another one at an unknown gestational age (Table 4).
 

Table 4. Pregnancy-related complications
 
Discussion
Postoperative morbidity and mortality are the prime concerns when making a decision about kidney donation. Our results confirm that living donor nephrectomy is a relatively safe procedure, with a low incidence of major complications and mortality. In addition, the incidence of developing any other major disease was not particularly high in our series. This form of renal replacement therapy should be further promoted in Hong Kong to benefit more people with ESRD.
 
Results from previous studies have shown that living renal donors have a similar to or better life expectancy than the general population.7 8 9 10 Mjøen et al,11 however, reported that compared with healthy matched individuals, living renal donors had an increased risk of death. In Hong Kong, Chu et al12 reported one death related to multiple myeloma among 95 living renal donors with active follow-up and a mean follow-up period of 13.4 years. There were no deaths recorded in our study with a mean follow-up of 12 years.
 
Long-term renal function is another major concern of renal donors. Our results revealed that 1 year after living donor nephrectomy, the mean GFR of the kidney donors dropped significantly from 96.0 ± 17.5 mL/min/1.73 m2 at baseline to 66.8 ± 13.5 mL/min/1.73 m2. Nonetheless, it then gradually improved. This is probably partly related to the adaptation of the remaining kidney with hyper-filtration. From our series, the mean GFR was 70.4% ± 12.3% of baseline level 1 year after surgery but improved to 75.6% ± 15.1% of baseline level at the last follow-up. In the majority (88.2%) of donors, the last available GFR was static or higher than that 1 year after donation. This is comparable with the report of Rook et al13 in which GFR usually reached 64% ± 7% of the pre-donation level 1 year after donation.
 
Despite these changes in GFR, ESRD in renal donors is very rare, with an incidence of less than 0.5% in 15 years after donation.11 14 15 Ibrahim et al8 reported that survival and risk of ESRD in kidney donors appeared to be similar to those in the general population. Our study and that of Chu et al12 observed no ESRD in local kidney donors.
 
The effect of kidney donation on the development of hypertension is controversial. Although reports suggest that the incidence of hypertension among kidney donors increases,16 17 18 19 others have not confirmed this observation.20 21 22 23 24 In Hong Kong, the prevalence of hypertension in the general population was 12.6% in 2014,25 which is lower than our reported figure of 18.2%. With the progression of time after surgery, however, the prevalence of hypertension among living donors is expected to increase as age is a known influence in hypertension. Without a comparable control group, we cannot conclude if there is any actual discrepancy in the prevalence of hypertension among living donors compared with the general population.
 
Young female potential donors may have concerns about the impact of kidney donation on any future pregnancy. Garg et al4 reported that gestational hypertension or pre-eclampsia was more common among living donors than non-donors. Although our study showed an alarmingly high percentage (11%) of pre-eclampsia and absence of gestational hypertension, the small sample size (11 donors reported one or more pregnancies) undermines the ability to infer the actual percentage.
 
Perioperative complications may also deter potential living donors. Based on the US data, Lentine et al26 reported that 16.8% of donors experience a perioperative complication; most commonly gastrointestinal (4.4%). Our study showed a higher complication rate of 36.6%, with epigastric pain or nausea and vomiting being the major complication (56.3%). We further examined the techniques used and established that the complication rates of 20.9% or 53.8% respectively in donors who underwent ODN or LDN were significantly different (P<0.01). Despite the above mean complication rates, all complications of LDN were mild and of grade 1 or 2 according to the Clavien-Dindo classification, while three patients who underwent ODN had grade 3a complications. This is contrary to the majority of previous findings that suggest a lower perioperative complication rate for LDN and increased risk of more serious complications than during an ODN,27 although other indicators such as longer warm ischaemia time, less blood loss, and shorter hospital stays were still in line with previous findings. Further analysis of the differences between our local data and those of previous studies is warranted.
 
This study has several limitations. First, this was a retrospective study and the total number of living donors was restricted. Second, data quality could not be controlled and some data were incomplete, in particular for the obstetric records at other hospitals. Some data were also lost either because records were too old and pre-dated the electronic system or donors were no longer followed up at our centre. The oldest record included in the study was from 1990. At that time, record keeping was not always accurate, resulting in some baseline records from the early 1990s being missing. For example, the baseline GFR level of seven (8.4%) patients was not found, and might have affected the overall data quality as well as the analysis and conclusion. Third, although the urologist endeavoured to ensure accurate data entry, initial interpretation of the raw records was by medical students so certain inaccuracies might have occurred. Lastly, it is known that GFR might be underestimated when derived from the MDRD equation.
 
Conclusion
Living donor kidney transplantation is an important approach to improve the quality of life of patients with ESRD. Good short- and long-term outcome for kidney donors is important for promoting kidney donation. Our results suggest that the overall perioperative outcomes are good, with only very few serious (grade III) complications after surgery, occurring following an open approach. Long-term kidney function of donors was satisfactory and no patients developed ESRD. Although we had no control arm in our study, the overall incidences of new-onset medical diseases and pregnancy-related complications were low. The introduction of a laparoscopic approach for kidney harvesting has helped to decrease blood loss during surgery and also shorten hospital stay. Based on this encouraging result, relatives of patients with ESRD should be encouraged to consider the possibility of kidney donation.
 
Acknowledgements
Sincere thanks are given to Ms Karen Man-ting Chuk, Ms Tracy Lok-sze Chiu, Mr Wing-tung Leung, and Mr On-wa Ng for assisting with the data collection.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Statistics (Milestones of Hong Kong organ transplantation): organ donation. Available from: http://www.organdonation. gov.hk/eng/statistics.html. Accessed 27 Dec 2017.
2. Reese PP, Boudville N, Garg AX. Living kidney donation: outcomes, ethics, and uncertainty. Lancet 2015;385:2003-13. Crossref
3. Delanaye P, Weekers L, Dubois BE, et al. Outcome of the living kidney donor. Nephrol Dial Transplant 2012;27:41- 50. Crossref
4. Garg AX, Nevis IF, Mcarthur E, et al. Gestational hypertension and preeclampsia in living kidney donors. N Engl J Med 2015;372:124-33. Crossref
5. Levey AS, Greene T, Kusek JW, Beck GJ. A simplified equation to predict glomerular filtration rate from serum creatinine. J Am Soc Nephrol 2000;11:155A0828.
6. Dindo D, Demartines N, Clavien PA. Classification of surgical complications: a new proposal with evaluation in a cohort of 6336 patients and results of a survey. Ann Surg 2004;240:205-13. Crossref
7. Segev DL, Muzaale AD, Caffo BS, et al. Perioperative mortality and long-term survival following live kidney donation. JAMA 2010;303:959-66. Crossref
8. Ibrahim HN, Foley R, Tan L, et al. Long-term consequences of kidney donation. N Engl J Med 2009;360:459-69. Crossref
9. Okamoto M, Akioka K, Nobori S, et al. Short- and long-term donor outcomes after kidney donation: analysis of 601 cases over a 35-year period at Japanese single center. Transplantation 2009;87:419-23. Crossref
10. Garg AX, Meirambayeva A, Huang A, et al. Cardiovascular disease in kidney donors: matched cohort study. BMJ 2012;344:e1203. Crossref
11. Mjøen G, Hallan S, Hartmann A, et al. Long-term risks for kidney donors. Kidney Int 2014;86:162-7. Crossref
12. Chu KH, Poon CK, Lam CM, et al. Long-term outcomes of living kidney donors: a single centre experience of 29 years. Nephrology (Carlton) 2012;17:85-8. Crossref
13. Rook M, Hofker HS, van Son WJ, Homan van der Heide JJ, Ploeg RJ, Navis GJ. Predictive capacity of pre-donation GFR and renal reserve capacity for donor renal function after living kidney donation. Am J Transplant 2006;6:1653-9. Crossref
14. Muzaale AD, Massie AB, Wainwright J, McBride MA, Wang M, Segev DL. Long-term risk of ESRD attributable to live kidney donation: matching with healthy non-donors. Am J Transplant 2013;13:204-5.
15. Fehrman-Ekholm I, Nordén G, Lennerling A, et al. Incidence of end-stage renal disease among live kidney donors. Transplantation 2006;82:1646-8. Crossref
16. Kasiske BL, Ma JZ, Louis TA, Swan SK. Long-term effects of reduced renal mass in humans. Kidney Int 1995;48:814-9. Crossref
17. Gossmann J, Wilhelm A, Kachel HG, et al. Long-term consequences of live kidney donation follow-up in 93% of living kidney donors in a single transplant center. Am J Transplant 2005;5:2417-24. Crossref
18. Garg AX, Prasad GV, Thiessen-Philbrook HR, et al. Cardiovascular disease and hypertension risk in living kidney donors: an analysis of health administrative data in Ontario, Canada. Transplantation 2008;86:399-406. Crossref
19. Doshi MD, Goggins MO, Li L, Garg AX. Medical outcomes in African American live kidney donors: a matched cohort study. Am J Transplant 2012;13:111-8. Crossref
20. Fehrman-Ekholm I, Dunér F, Brink B, Tydén G, Elinder CG. No evidence of accelerated loss of kidney function in living kidney donors: results from a cross-sectional follow-up. Transplantation 2001;72:444-9. Crossref
21. Macdonald D, Kukla AK, Ake S, et al. Medical outcomes of adolescent live kidney donors. Pediatric Transplant 2014;18:336-41. Crossref
22. Janki S, Klop KW, Dooper IM, Weimar W, Ijzermans JN, Kok NF. More than a decade after live donor nephrectomy: a prospective cohort study. Transpl Int 2015;28:1268-75. Crossref
23. Tavakol MM, Vincenti FG, Assadi H, et al. Long-term renal function and cardiovascular disease risk in obese kidney donors. Clin J Am Soc Nephrol 2009;4:1230-8. Crossref
24. El-Agroudy AE, Wafa EW, Sabry AA, et al. The health of elderly living kidney donors after donation. Ann Transplant 2009;14:13-9.
25. Census and Statistics Department, Hong Kong SAR Government. Thematic Household Survey Report No. 58. Available from: http://www.statistics.gov.hk/pub/ B11302582015XXXXB0100.pdf. Accessed 28 Oct 2016.
26. Lentine KL, Lam NN, Axelrod D, et al. Perioperative complications after living kidney donation: a national study. Am J Transplant 2016;16:1848-57. Crossref
27. Fonouni H, Mehrabi A, Golriz M, et al. Comparison of the laparoscopic versus open live donor nephrectomy: an overview of surgical complications and outcome. Langenbecks Arch Surg 2014;399:543-51. Crossref

Pathological outcome for Chinese patients with low-risk prostate cancer eligible for active surveillance and undergoing radical prostatectomy: comparison of six different active surveillance protocols

Hong Kong Med J 2017 Dec;23(6):609–15 | Epub 13 Oct 2017
DOI: 10.12809/hkmj166194
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Pathological outcome for Chinese patients with low-risk prostate cancer eligible for active surveillance and undergoing radical prostatectomy: comparison of six different active surveillance protocols
CF Tsang, MB, BS, FRCS (Edin); James HL Tsu, MB, BS, FRCS (Edin); Terence CT Lai, MB, BS, FRCS (Edin); KW Wong, MB, ChB, FRCS (Edin); Brian SH Ho, MB, BS, FRCS (Edin); Ada TL Ng, MB, BS, FRCS (Edin); WK Ma, MB, ChB, FRCS (Edin); MK Yiu, MB, BS, FRCS (Edin)
Division of Urology, Department of Surgery, Queen Mary Hospital, The University of Hong Kong, Pokfulam, Hong Kong
 
Corresponding author: Dr MK Yiu (pmkyiu@gmail.com)
 
 Full paper in PDF
 
Abstract
Introduction: Active surveillance is one of the therapeutic options for the management of patients with low-risk prostate cancer. This study compared the performance of six different active surveillance protocols for prostate cancer in the Chinese population.
 
Methods: Patients who underwent radical prostatectomy for prostate cancer from January 1998 to December 2012 at a university teaching hospital in Hong Kong were reviewed. Six active surveillance protocols were applied to the cohort. Statistical analyses were performed to compare the probabilities of missing unfavourable pathological outcome. The sensitivity and specificity of each protocol in identifying low-risk disease were compared.
 
Results: During the study period, 287 patients were included in the cohort. Depending on different active surveillance protocols used, extracapsular extension, seminal vesicle invasion, pathological T3 disease, and upgrading of Gleason score were present on final pathology in 3.3%-17.1%, 0%-3.3%, 3.3%-19.1%, and 20.6%-34.5% of the patients, respectively. The University of Toronto protocol had a higher rate of extracapsular extension at 17.1% and pathological T3 disease at 19.1% on final pathology than the more stringent protocols from John Hopkins (3.3% extracapsular extension, P=0.05 and 3.3% pathological T3 disease, P=0.03) and Prostate Cancer Research International: Active Surveillance (PRIAS; 8.0% pathological T3 disease, P=0.04). The Royal Marsden protocol had a higher rate of upgrading of Gleason score at 34.5% compared with the more stringent protocol of PRIAS at 20.6% (P=0.04). The specificities in identifying localised disease and low-risk histology among different active surveillance protocols were 59%-98% and 58%-94%, respectively. The John Hopkins active surveillance protocol had the highest specificity in both selecting localised disease (98%) and low-risk histology (94%).
 
Conclusions: Active surveillance protocols based on prostate-specific antigen and Gleason score alone or including Gleason score of 3+4 may miss high-risk disease and should be used cautiously. The John Hopkins and PRIAS protocols are highly specific in identifying localised disease and low-risk histology.
 
 
New knowledge added by this study
  • Active surveillance protocols based on prostate-specific antigen (PSA) and Gleason score only may miss high-risk prostate cancer.
  • Active surveillance protocols using PSA density as an inclusion criteria were highly specific in identifying localised disease and low-risk pathology.
Implications for clinical practice or policy
  • When adopting active surveillance in patients with prostate cancer, protocols with PSA density as an inclusion criteria are preferred.
 
 
Introduction
Prostate-specific antigen (PSA) plays a significant role in the early detection of prostate cancer in current practice.1 2 It is, however, a double-edged sword that leads to overdiagnosis, especially for clinically insignificant prostate cancer.3 4 Curative treatments for low-risk prostate cancer include radical prostatectomy and radiotherapy, both of which are associated with significant morbidities.5 6 7 In recent years, the concept of active surveillance (AS) has been adopted with the aim of monitoring clinically insignificant prostate cancer until disease progression, at which point radical prostatectomy or radiotherapy is considered. The ultimate objective is to delay or avoid the morbidities associated with radical treatments without compromising survival.8 9 10
 
Although AS is an established management option for low-risk prostate cancer, different AS protocols have been adopted.11 12 13 14 15 16 17 The most commonly used include those from the University of Toronto,11 Royal Marsden,12 John Hopkins,13 14 University of California San Francisco (UCSF),15 Memorial Sloan Kettering Cancer Center (MSKCC),16 and Prostate Cancer Research International: Active Surveillance (PRIAS).17 Most AS protocols select prostate cancer with a Gleason score of ≤6, PSA level of ≤10 ng/mL, and clinical stage of ≤T2. Other parameters that are considered by some protocols include PSA density, number of positive biopsy cores, and percentage of core involvement (Table 111 12 13 14 15 16 17).
 

Table 1. Inclusion criteria of six active surveillance protocols 11 12 13 14 15 16 17
 
Currently, there is no consensus regarding which AS protocol we should adopt for our patients. In addition, direct comparisons between different AS protocols are few. Before deciding to follow any particular AS protocol, urologists and oncologists should be aware of their individual strengths and limitations. Our study aimed to provide some insight into this issue by performing a head-to-head comparison of six AS protocols.
 
Methods
Patients who underwent radical prostatectomy for prostate cancer from January 1998 to December 2012 at a university teaching hospital in Hong Kong were reviewed. Indication for radical prostatectomy was localised prostate cancer in patients with a life expectancy exceeding 10 years. All patients underwent clinical assessment including clinical T staging by digital rectal examination, serum PSA level, and transrectal ultrasound-guided prostate biopsy. Sextant biopsies were performed from 1998 to 2002, but changed to 10-core biopsies from 2002 to 2011 and subsequently 12-core biopsies thereafter. Preoperative magnetic resonance imaging of the prostate was routinely performed from 2007. From 1998 to 2007, open or laparoscopic radical prostatectomies were performed. After November 2007, all prostatectomies at our institution were performed with the da Vinci robotic surgery system. Pathological assessment of transrectal ultrasound-guided biopsy and radical prostatectomy specimens was performed by a specialist pathologist in our institution. All patients attended a follow-up visit with physical examination 2 weeks after operation, and physical examination with serum PSA level checked every 3 months for the first year, every 6 months for the second year, and then annually thereafter. Data on patient demographics, clinical T stage, serum PSA level, transrectal ultrasound-guided biopsy results, and final pathology of radical prostatectomy specimen were retrospectively retrieved by an independent third party. Pathological assessment of the radical prostatectomy specimen was performed by independent specialist pathologists.
 
In our current study, we compared six different AS protocols, specifically from the University of Toronto,11 Royal Marsden,12 John Hopkins,13 14 UCSF,15 MSKCC,16 and PRIAS17 (Table 1). The six protocols were retrospectively applied to our cohort and patients were stratified accordingly based on clinical T stage, serum PSA level, PSA density, Gleason score on biopsy, number of positive biopsy cores, and percentage of positive core involvement. Data from the pathological assessment of radical prostatectomy specimens including extracapsular extension, seminal vesicle invasion, upgrading to pathological T3 disease, and upgrading of Gleason score were analysed. The clinical data used in the AS protocols were those available on diagnosis of prostate cancer and operations were performed within 12 weeks of diagnosis.
 
Statistical analyses to compare the rate of not diagnosing clinically significant prostate cancer—defined as extracapsular extension, seminal vesicle invasion, upgrading to T3 disease, and upgrading of Gleason score in the final prostatectomy specimens—were performed. The sensitivity and specificity of each protocol in selecting localised prostate cancer (defined as pathological stage <T3) and histological low-risk disease (defined as no upgrading of Gleason score on final pathology) were compared.
 
Statistical analysis was performed using the SPSS (Windows version 20.0; IBM Corp, Armonk [NY], US). Independent sample t test and Pearson Chi-squared test were used for continuous and categorical variables, respectively. A P value of <0.05 was considered statistically significant. This study was done in accordance with the principles outlined in the Declaration of Helsinki.
 
Results
A total of 287 patients were included in the cohort. The mean age was 66 years, mean serum PSA level was 10 ng/mL, mean number of positive cores during biopsy was 3, and mean Gleason sum at biopsy was 6. In the current cohort, 266 (93%) patients had clinical T1c or T2a prostate cancer—198 (69%) had clinical T1c disease and 68 (24%) had clinical T2a disease. Table 2 summarises the basic demographics of all patients.
 

Table 2. Basic demographics of patients (n=287)
 
When the six AS protocols were applied to the cohort, 30 to 152 patients were identified as low-risk; their mean serum PSA level ranged from 5.3 ng/mL to 7.7 ng/mL, and mean PSA density ranged from 0.12 ng/mL/mL to 0.25 ng/mL/mL. All six protocols had a mean biopsy Gleason sum of 6. Table 3 summarises the clinical characteristics of patients stratified according to different AS protocols.
 

Table 3. Clinical characteristics of patients stratified according to six active surveillance protocols
 
In the analyses of final pathological outcomes in patients stratified into different AS protocols, extracapsular extension rate varied from 3.3% to 17.1%. The incidence of seminal vesicle invasion was low in all six protocols, ranging from 0% to 3.3%. The rate of pathological T3 disease was lowest according to the John Hopkins criteria (3.3%), while the University of Toronto criteria had the highest incidence (19.1%). Regarding the upgrading of Gleason score in the radical prostatectomy specimens, all six protocols had a relatively high rate ranging from 20.6% to 34.5%. Table 4 summarises the pathological outcomes among the six AS protocols.
 

Table 4. Pathological outcomes of six active surveillance protocols
 
Comparative analyses of individual AS protocols against each other were also performed (Table 5). The University of Toronto protocol had a significantly higher rate of extracapsular extension at 17.1% and pathological T3 disease at 19.1% when compared with the more stringent protocol from John Hopkins (3.3% extracapsular extension, P=0.05 and 3.3% pathological T3 disease, P=0.03) and PRIAS (8.0% pathological T3 disease, P=0.04). In addition, the Royal Marsden protocol had a significantly higher rate of upgrading of Gleason score at 34.5% when compared with the more stringent protocol of PRIAS at 20.6% (P=0.04). There was no significant difference in the incidence of seminal vesicle invasion between the six protocols.
 

Table 5. Comparative analyses of pathological outcomes of six active surveillance protocols
 
In terms of the ability of each protocol to identify pathological localised disease (defined as pathological stage <T3) and histologically low-risk cancer (defined as no upgrading of Gleason score), sensitivity varied from 13%-61% and 14-71%, respectively. The John Hopkins criteria demonstrated highest specificity in identifying pathological localised disease (98%) and histological low-risk cancer (94%). Table 6 illustrates the sensitivity and specificity of identifying localised and histological low-risk disease for the six AS protocols.
 

Table 6. Sensitivity and specificity of six active surveillance protocols in predicting low-risk prostate cancer
 
Discussion
Prostate cancer screening has always been a controversial issue and evidence of improved survival is awaited.1 2 Nonetheless, PSA screening has undoubtedly led to overdiagnosis of insignificant prostate cancer.3 4 Active surveillance, with the purpose to delay or even avoid radical treatments and their associated morbidities, plays an important role in managing these patients. Unfortunately there are different AS protocols with various inclusion criteria, and urologists and oncologists may have difficulty deciding which protocol to adopt. The gold standard to answer this question will be a prospective randomised trial to compare overall survival following the application of different AS protocols. This, however, will require decades to observe low-risk prostate cancer patients before survival endpoints are reached. Our study provides data on pathological outcomes when different AS protocols were compared.
 
In our cohort, the proportion of patients eligible for active surveillance varied widely from approximately 11% to 58% according to different selection criteria (Table 3). Two recent series showed similar findings of a large discrepancy in the proportion of patients eligible for different AS protocols, varying from 16% to 63% and 28% to 69%.18 19 We demonstrated that although all AS protocols aim to select low-risk prostate cancer, the heterogeneity between them can be quite large. Clinicians need to be vigilant before adopting any of the AS protocols for their patients when further data from comparative analyses among different protocols are unavailable. The proportion of patients who were eligible for AS protocols in our study was lower than that in previous series.18 19 This may be because some patients with localised prostate cancer were treated with radiotherapy. The proportion of patients who can be selected in different AS protocols will be affected by the proportion of patients who undergo radiotherapy instead of surgery. In our centre, it is also possible that low-risk patients were selected to undergo a non-operative approach.
 
When the six protocols were compared after stratifying patients according to different AS criteria, the University of Toronto protocol had a significantly higher rate of extracapsular extension at 17.1% and pathological T3 disease at 19.1% than the John Hopkins protocol (3.3% extracapsular extension, P=0.05 and 3.3% pathological T3 disease, P=0.03) and PRIAS criteria (8.0% pathological T3 disease, P=0.04) [Table 5]. This observation can be explained by the difference in stringency of the two protocols. The University of Toronto criteria selected patients by two factors only: PSA of <10 ng/mL and Gleason score of ≤6; PSA density, number of positive biopsy cores, and percentage of core involvement were not considered. On the contrary, the John Hopkins criteria applied very strict criteria: a PSA density of 0.15 ng/mL/mL. In addition, only patients with T1 disease with at most two positive cores during biopsy and no more than 50% involvement of each core were selected (Table 1). Contrary to our findings, El Hajj et al19 found no significant difference in the rate of extracapsular extension, upgrading of Gleason score, or unfavourable disease when they compared the University of Toronto protocol with the John Hopkins protocol. The difference can be explained by the high rate of extracapsular extension (15%) and unfavourable disease (46%) within the John Hopkins criteria in their series, compared with 3% extracapsular extension and 3% pathological T3 disease in our cohort. This also implies that disease heterogeneity among different populations may influence the choice and results of different AS protocols.
 
In our study, analyses of final pathology revealed that the Royal Marsden protocol had a significantly higher rate of upgrading of Gleason score at 34.5% compared with the PRIAS criteria at 20.6% (P=0.04; Table 5). This result can be explained by the less-stringent selection criteria of the Royal Marsden protocol. First, it is the only protocol that allowed a Gleason score of 3+4 to be selected. Second, PSA level up to 15 ng/mL was permitted. These factors will invariably result in the inclusion of a proportion of patients with higher-risk disease. In the study by El Hajj et al,19 the Royal Marsden protocol were compared with the John Hopkins protocol and significantly more unfavourable disease was observed in the Royal Marsden group. Klotz et al11 also demonstrated that inclusion of Gleason score of 4 on biopsy into AS was a risk factor in predicting definitive treatment during active surveillance. These findings illustrate that active surveillance in patients with Gleason score of 3+4 is likely to miss higher-risk disease. It should be used cautiously and preferably not in young patients who are otherwise fit for radical treatments.
 
We have shown that less pathological T3 disease and Gleason score upgrading were present in the more-stringent John Hopkins and PRIAS protocols compared with the less stringent University of Toronto and Royal Marsden criteria. Nonetheless their sensitivity in identifying low-risk disease may be compromised by the more stringent selection criteria. More low-risk disease may therefore be excluded from surveillance by these stringent criteria. We addressed this issue in the last part of our analyses. The sensitivity and specificity in identifying localised disease (pathological stage <T3) and low-risk histology (no upgrading of Gleason score) among different AS protocols were compared (Table 6). The most stringent protocols of the John Hopkins and PRIAS had the highest specificity when selecting localised disease (94%-98%) and low-risk histology (91%-94%). However, inclusion of less pathological T3 disease and Gleason score upgrading by the more stringent protocols of John Hopkins and PRIAS should be cautious because it will, inevitably, be at the expense of low-risk patients who is excluded from AS and may receive unnecessary aggressive treatments. A recent study by Iremashvili et al18 showed that the PRIAS criteria had a better balance of sensitivity and specificity compared with the UCSF and MSKCC criteria. From our point of view, we tend to place more emphasis on high specificity since low specificity will include patients with high-risk tumours into active surveillance and thus patient survival may be jeopardised.
 
The present study had several limitations. First, the number of biopsy cores was not consistent throughout the study period. A proportion of patients had six-core biopsies in the early period of the cohort versus the current more recent standard of 10-12–core biopsies. Second, the sample size was relatively small due to the low incidence of prostate cancer in our population. Third, the tumour volume in prostatectomy specimens that might predict low-risk prostate cancer was not assessed. Lastly, the final prostatectomy pathology in this study was from patients who were operated on soon after diagnosis and not after a period of post-diagnosis surveillance. As a note of caution, it would be expected that the final pathology would show even worse pathological features if the patients were put on AS and operated on later. This should be noted when interpreting the results of the current study and counselling patients.
 
In conclusion, there is a wide range of variation in the selection criteria of different AS protocols. Active surveillance protocols based on PSA and Gleason score alone or including Gleason score of 3+4 may miss higher-risk disease and should be applied cautiously. The more stringent criteria of John Hopkins protocol and the PRIAS protocol were highly specific in identifying localised disease and low-risk histology.
 
Declaration
All authors have disclosed no conflicts of interest
 
References
1. Schröder FH, Hugosson J, Roobol MJ, et al. Screening and prostate-cancer mortality in a randomized European study. N Engl J Med 2009;360:1320-8. Crossref
2. Andriole GL, Crawford ED, Grubb RL 3rd, et al. Mortality results from a randomized prostate-cancer screening trial. N Engl J Med 2009;360:1310-9. Crossref
3. Ploussard G, Epstein JI, Montironi R, et al. The contemporary concept of significant versus insignificant prostate cancer. Eur Urol 2011;60:291-303. Crossref
4. Etzioni R, Penson DF, Legler JM, et al. Overdiagnosis due to prostate-specific antigen screening: lessons from U.S. prostate cancer incidence trends. J Natl Cancer Inst 2002;94:981-90. Crossref
5. Novara G, Ficarra V, Rosen RC, et al. Systematic review and meta-analysis of perioperative outcomes and complications after robot-assisted radical prostatectomy. Eur Urol 2012;62:431-52. Crossref
6. Boorjian SA, Eastham JA, Graefen M, et al. A critical analysis of the long-term impact of radical prostatectomy on cancer control and function outcomes. Eur Urol 2012;61:664-75. Crossref
7. Zaorsky NG, Harrison AS, Trabulsi EJ, et al. Evolution of advanced technologies in prostate cancer radiotherapy. Nat Rev Urol 2013;10:565-79. Crossref
8. Bastian PJ, Carter BH, Bjartell A, et al. Insignificant prostate cancer and active surveillance: from definition to clinical implications. Eur Urol 2009;55:1321-30. Crossref
9. Cooperberg MR, Carroll PR, Klotz L. Active surveillance for prostate cancer: progress and promise. J Clin Oncol 2011;29:3669-76. Crossref
10. Dall’Era MA, Albertsen PC, Bangma C, et al. Active surveillance for prostate cancer: a systematic review of the literature. Eur Urol 2012;62:976-83. Crossref
11. Klotz L, Zhang L, Lam A, Nam R, Mamedov A, Loblaw A. Clinical results of long-term follow-up of a large, active surveillance cohort with localized prostate cancer. J Clin Oncol 2010;28:126-31. Crossref
12. van As NJ, Norman AR, Thomas K, et al. Predicting the probability of deferred radical treatment for localised prostate cancer managed by active surveillance. Eur Urol 2008;54:1297-305. Crossref
13. Carter HB, Kettermann A, Warlick C, et al. Expectant management of prostate cancer with curative intent: an update of the Johns Hopkins experience. J Urol 2007;178:2359-64. Crossref
14. Tosoian JJ, Trock BJ, Landis P, et al. Active surveillance program for prostate cancer: an update of the Johns Hopkins experience. J Clin Oncol 2011;29:2185-90. Crossref
15. Dall’Era MA, Konety BR, Cowan JE, et al. Active surveillance for the management of prostate cancer in a contemporary cohort. Cancer 2008;112:2664-70. Crossref
16. Berglund RK, Masterson TA, Vora KC, Eggener SE, Eastham JA, Guillonneau BD. Pathological upgrading and up staging with immediate repeat biopsy in patients eligible for active surveillance. J Urol 2008;180:1964-7. Crossref
17. van den Bergh RC, Roemeling S, Roobol MJ, Roobol W, Schröder FH, Bangma CH. Prospective validation of active surveillance in prostate cancer: the PRIAS study. Eur Urol 2007;52:1560-3. Crossref
18. Iremashvili V, Pelaez L, Manoharan M, Jorda M, Rosenberg DL, Soloway MS. Pathologic prostate cancer characteristics in patients eligible for active surveillance: a head-to-head comparison of contemporary protocols. Eur Urol 2012;62:462-8. Crossref
19. El Hajj A, Ploussard G, de la Taille A, et al. Patient selection and pathological outcomes using currently available active surveillance criteria. BJU Int 2013;112:471-7. Crossref

Frameless stereotactic radiosurgery for brain metastases: a review of outcomes and prognostic scores evaluation

Hong Kong Med J 2017 Dec;23(6):599–608 | Epub 10 Nov 2017
DOI: 10.12809/hkmj166138
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Frameless stereotactic radiosurgery for brain metastases: a review of outcomes and prognostic scores evaluation
ST Mok, FHKCR, FHKAM (Radiology)1; Michael KM Kam, FHKCR, FHKAM (Radiology)1; WK Tsang, FHKCR, FHKAM (Radiology)1; Darren MC Poon, FHKCR, FHKAM (Radiology)1; Herbert H Loong, FHKCP, FHKAM (Medicine)2; WM Yeung, FHKCR, FHKAM (Radiology)1; TY Yeung, FHKCR, FHKAM (Radiology)1; Jimmy Yu, BSc (Hons), MPhil1; Carlos KH Wong, PhD3
1 Department of Clinical Oncology, Prince of Wales Hospital, Shatin, Hong Kong
2 Department of Clinical Oncology, The Chinese University of Hong Kong, Shatin, Hong Kong
3 Department of Family Medicine and Primary Care, The University of Hong Kong, Pokfulam, Hong Kong
 
This paper was presented at the MASCC/ISOO meeting 2016, 23-25 June 2016, Adelaide, Australia.
 
Corresponding author: Dr ST Mok (mst216@ha.org.hk)
 
  A video clip showing frameless stereotactic radiosurgery for brain metastases is available at www.hkmj.org
 
 
 Full paper in PDF
 
Abstract
Introduction: Stereotactic brain radiosurgery provides good local control in patients with limited brain metastases. A newly developed frameless system allows pain-free treatment. We reviewed the effectiveness of this frameless stereotactic brain radiosurgery and identified prognostic factors that may aid better patient selection.
 
Methods: Medical records of patients with brain metastases treated with linear accelerator–based frameless stereotactic brain radiosurgery between January 2010 and July 2015 in a university affiliated hospital in Hong Kong were reviewed. Outcomes including local and distant brain control rate, progression-free survival, and overall survival were analysed. Prognostic factors were identified by univariable and multivariable analyses. Association of outcomes with four common prognostic scores was performed.
 
Results: In this study, 64 patients with 94 lesions were treated with a median dose of 18 Gy (range, 12-22 Gy) in a single fraction. The median follow-up was 11.5 months. One-year actuarial local and distant brain control rates were 72% and 71%, respectively. The median overall survival was 13.0 months. On multivariable analysis, Karnofsky performance status score (>50 vs ≤50) and number of lesions (1-2 vs ≥3) were found to associate significantly with distinct brain progression-free survival (P=0.022, hazard ratio=0.20, 95% confidence interval 0.05-0.80 and P=0.003, hazard ratio=0.31, 95% confidence interval 0.14-0.68, respectively). Overall survival was associated significantly with Basic Score for Brain Metastases (P=0.031), Score Index for Radiosurgery in Brain Metastases (P=0.007), and Graded Prognostic Assessment (P=0.003). Improvement in overall survival was observed in all groups of different prognostic scores.
 
Conclusion: Frameless stereotactic brain radiosurgery is effective in patients with oligo-metastases of brain and should be increasingly considered in patients with favourable prognostic scoring.
 
 
New knowledge added by this study
  • Survival of patients with brain metastases has significantly improved over the past decade.
  • Frameless stereotactic brain radiosurgery is effective and has acceptable toxicities.
Implications for clinical practice or policy
  • Calculation of a prognostic score can aid clinicians in the identification of patients who will benefit most from stereotactic brain radiosurgery.
 
 
Introduction
Patients with brain metastases have previously had poor survival of only 3 to 4 months with non-surgical treatment.1 2 Substantial improvement has been achieved in recent years with the advance of systemic treatment and radiation techniques. Stereotactic radiosurgery (SRS) was first delivered with the Cobalt-60 Gamma Knife system by Leksell in 1951.3 Today, SRS can also be delivered via the linear accelerator (LINAC) system and proton beam system. It is usually indicated in patients with oligo-brain metastases (≤4) with a diameter of less than 4 cm.4 It is particularly advantageous for lesions in the deep brain parenchyma that are not easily accessible by surgery. A frame-based system was initially used to immobilise the patient. A frameless system was later developed to minimise patient suffering and was reported to have comparable outcomes to the framed-based system.5 Since the introduction of a frameless system, SRS or even fractionated stereotactic radiotherapy has been increasingly used to treat patients with oligo-brain metastases. Patients do not have to undergo painful frame placement. Rather, they undergo simple planning procedures over 2 consecutive days. The patient is required to return only for mould fitting and planning of computed tomography. Together with diagnostic fine-cut magnetic resonance imaging co-registration, oncologists can easily contour the target on the radiotherapy planning system. With the use of the ExacTrac system (Brainlab AG, Germany) to verify treatment position, the magnitude of error is reported to be only 0.7 mm, and the mean deviation between frame-based and image-guided initial positioning is just 1.0 mm (standard deviation, 0.5 mm).6 A frameless system became one of the choices of treatment in SRS and was included in the ASTRO policy.7 The recommended dosages according to the RTOG 9005 trial are 24 Gy, 18 Gy, and 15 Gy for tumours of ≤20 mm, 21-30 mm, and 31-40 mm in maximum diameter, respectively.8 For framed SRS, 1-year local progression-free survival (PFS) was reported to be up to 70% to 90%, and median overall survival (OS) of 6 to 12 months.9 10 11 12 13 14 15 The outcomes of frameless SRS have been reported only in limited series, with 1-year local control of 79% to 95%.5 16 17 18
 
Patient selection and tailor-made management are indeed challenging. Several scoring systems have been modelled to predict survival of patients with brain metastases, including the Radiation Therapy Oncology Group Recursive Partitioning Analysis (RTOG RPA),19 Basic Score for Brain Metastases (BSBM),20 the Score Index for Radiosurgery in Brain Metastases (SIR),21 Graded Prognostic Assessment (GPA),22 and Disease-Specific Graded Prognostic Assessment (DS-GPA).23 These scoring systems were developed at a time when treatment strategies were also rapidly evolving with the availability of more accurate diagnostic imaging, better radiotherapy techniques, and more effective systemic and targeted agents. A paradigm shift to more aggressive treatment of oligo-metastasis as a result of longer cancer survivorship now requires further validation of these scoring systems.
 
In this study, we reviewed the outcomes of patients who underwent LINAC-based frameless SRS and identified prognostic factors that affect survival. By doing so, we hope to gain a better understanding of which patients will benefit from SRS without jeopardising their quality of life.
 
Methods
Records of patients who underwent frameless SRS for limited brain metastases in a university-affiliated hospital between January 2010 and July 2015 were retrospectively reviewed. Patient data were extracted from paper records and the Clinical Management System of the Hospital Authority, Hong Kong by investigators in charge of the study. Data extracted included gender, age, type of primary malignancy, date of diagnosis of malignancy and brain metastases, extracranial disease status and control at treatment time, diagnostic and monitoring modalities, presence of convulsions, and steroid and anticonvulsant use before and after treatment period. Treatment details including immobilisation technique, number of lesions, dose and fractionation, and volume of lesions were reviewed from department records and the Brainlab iplan system (Brainlab AG, Germany). Prognostic scoring including RTOG RPA, BSBM, SIR, and GPA were calculated (Appendix 1 19 20 21 22 23).
 
Outcome parameters including local and distant brain control, PFS, and OS were generated using SPSS (Windows version 22.0; IBM Corp, Armonk [NY], US). Univariable analysis with Cox proportional hazards model was performed to generate prognostic factors for local and distinct brain PFS (defined as the time from treatment to documented local progression/distinct brain progression or death) and OS. For each outcome, statistically significant non-modifiable patient and disease factors in univariable analysis together with important treatment factors were included in respective multivariable analysis using Cox proportional hazards model. The enter method was used for variable selection process. Kaplan-Meier survival curve for OS was generated for different prognostic scoring groups and log rank significance was calculated. The study was approved by clinical research ethics committee of the NTEC-CUHK Cluster, Hospital Authority, Hong Kong, with patient informed consent waived.
 
Results
Demographics
A total of 68 patients were screened during the study period. Four patients who were treated with fractionated stereotactic radiotherapy and single-fraction SRS in the same treatment were excluded, and thus 64 patients were included. All patients were treated with frameless LINAC-based SRS with ExacTrac system verification, while contouring and dosimetry with the Brainlab iplan system. Dose administered was based on tumour diameter: 22 Gy to lesions of ≤2 cm, 18 Gy to lesions of 2.1-3.0 cm, and 15 Gy to lesions of 3.1-4.0 cm. A 1.5-mm margin was allowed from gross tumour volume to planning target volume. Deviation of dose prescription from departmental protocol was permitted at the individual physician’s discretion.
 
Among the 64 patients, there were 40 men and 24 women. The median age at the time of treatment was 58 years (range, 22-95 years). The median Eastern Cooperative Oncology Group performance status was 1 (range, 0-3), and Karnofsky Performance Status (KPS) score was 80 (range, 40-100). Primary disease included carcinoma of breast (n=7), lung (n=45), gastrointestinal (n=2), renal cell (n=6), thyroid (n=1), osteosarcoma (n=1), germ cell (n=1), and epithelioid haemangioendothelioma (n=1). Further details of demographics are summarised in Table 1.
 

Table 1. Demographics and treatment details
 
Treatment
A total of 94 lesions were treated with a dose of 12 Gy to 22 Gy according to size (12 Gy, n=6; 15 Gy, n=12; 16 Gy, n=2; 18 Gy, n=48; 20 Gy, n=14; 22 Gy, n=12). The median dose was 18 Gy. The median size of lesion treated was 19 mm (range, 3-43 mm).
 
Outcomes
The median follow-up time was 11.5 (range, 0.4-56.4) months. One-year actuarial local control rate was 72% (95% confidence interval [CI], 57%-83%) and distant control rate was 71% (95% CI, 56%-82%). The median local PFS was 11.2 (95% CI, 8.4-11.2) months. The median distinct brain PFS was 10.8 (95% CI, 8.4-13.1) months. The median OS was 13.0 (95% CI, 10.6-11.3) months.
 
Toxicities
Four (6.3%) patients had acute toxicities, mainly brain oedema, and one patient had a seizure for 3 days after treatment. Eight (12.5%) patients had delayed seizure after a median time of 10.5 months. One patient had radionecrosis confirmed pathologically after surgical resection. There were 43 (67.2%) patients who were prescribed steroid before treatment, and eight (12.5%) patients became steroid dependent until their demise. Steroid prescription was not found to affect OS significantly. Nonetheless among the steroid group, becoming steroid dependent was associated with poorer prognosis, with a median OS in the steroid-dependent group of 0.92 months versus 13.6 months in the non–steroid-dependent group (P<0.005, log rank; Appendix 2, Fig a). The worse survival of steroid-dependent patients was independent of volume of brain metastases.
 

Figure. Overall survival (OS) according to prognostic score grouping: (a) RTOG RPA, (b) BSBM, (c) SIR, and (d) GPA
 
Prognostic patient and disease factors
Potential prognostic factors of survival including patient factors such as gender, age, and performance status; and disease factors such as primary cancer, presence of extracranial disease, pre-existing convulsion, number of brain lesions, and size and volume of the largest lesion were examined with reference to decision for SRS treatment by univariable analysis using Cox proportional hazards model. It was found that OS was associated significantly with age (≤70 vs >70 years; P=0.011) and KPS score (>50 vs ≤50; P=0.008). Local PFS was associated significantly with age (≤70 vs >70 years; P=0.043) and KPS score (>50 vs ≤50; P=0.021). Distinct brain PFS was associated significantly with age (≤70 vs >70 years; P=0.02), presence of extracranial disease (presence vs absence; P=0.038), KPS score (>50 vs ≤50; P=0.009), and number of brain lesions (<1-2 vs ≥3; P=0.016). Results of univariable analysis are summarised in Table 2.
 

Table 2. Univariable analysis by categorical variables
 
Treatment factors
Dose relationship
Dose relationship for each lesion was analysed separately. Lesions prescribed >18 Gy had statistically significant superior time to progression (radiologically documented local progression) than those given ≤18 Gy, with a 1-year local control rate of 88% vs 60% (Appendix 2, Fig b). Some patients had more than one lesion treated with different doses. Nonetheless after taking into account the highest dose given in the same patient, dose did not affect local PFS or OS significantly (Table 2); dose was not analysed in distinct brain PFS as it should not affect distant brain progression.
 
Effect of whole-brain radiotherapy
With particular reference to the effect of whole-brain radiotherapy (WBRT), it was found that concomitant WBRT (within 3 months of treatment with SRS) did not have a statistically significant impact on OS, local PFS, or distant brain PFS (Table 2).
 
Multivariable analysis
Multivariable analysis using Cox proportional hazards model and taking patient, disease, and treatment factors into account identified that statistically significant factors associated with distinct brain PFS were KPS score (>50 vs ≤50; P=0.022, hazard ratio [HR]=0.20, 95% CI=0.05-0.80) and number of brain lesions (1-2 vs ≥3; P=0.003, HR=0.31, 95% CI=0.14-0.68) [Table 3].
 

Table 3. Multivariable analysis by categorical variables with Cox regression model
 
Primary lung cancer
Of note, a large number of patients in the group had primary lung cancer (n=45), most of which were non–small-cell lung cancer (NSCLC) [n=42]. Among NSCLC patients, a sensitive activating EGFR mutation (exon 19 deletion or exon 21 L858R mutation) was present in 14. Three other patients carried a less common mutation: exon 21 861Q (n=1), exon 18 missense (n=1), and exon 18 179S (n=1). Patients with an exon 19 deletion or exon 21 L858R mutation had superior OS compared with the non-mutational group (P=0.019, HR=0.281, 95% CI=0.097-0.814) but there was no statistically significant difference in local or distant brain control. Among the 14 patients with sensitive activating EGFR mutation, three patients who were diagnosed with brain metastases received WBRT before SRS treatment, and six patients were given SRS together with WBRT. Again, concomitant WBRT was not shown to affect local/distinct brain PFS or OS. For epidermal growth factor receptor (EGFR)–tyrosine kinase inhibitor (TKI) treatment, 12 of 14 patients had lifelong EGFR-TKI treatment, with a median survival of 19.5 months; one with exon 19 deletion and one with exon 18 missense deletion did not have EGFR-TKI treatment. There were seven patients who were prescribed EGFR-TKI before SRS treatment (range of duration, 5.7-21.4 months), and eight patients who were started on or continued on more lines of EGFR-TKI after SRS treatment.
 
Association with available prognostic scoring
Overall survival was significantly associated with BSBM (P=0.031, log-rank), SIR (P=0.007, log-rank), and GPA (P=0.003, log-rank) [Fig]. A comparison of median survival of the current study with the other original studies is shown in Table 4.19 20 21 22 Of note, DS-GPA was not analysed due to the small number of patients with breast, gastrointestinal, and renal cell primaries. The calculation of GPA and DS-GPA of lung primary was the same.
 

Table 4. Comparison of median overall survival in the present study according to prognostic scoring group with that of original studies19 20 21 22
 
Discussion
Brain metastasis has previously been considered an end-of-life event. With the development of new systemic therapies that are effective in both extracranial and intracranial diseases, together with a better understanding from clinical trials of the advantages of SRS, oncologists are more willing to offer SRS to patients with limited brain metastases.
 
At the other extreme, studies have compared the efficacy of WBRT with supportive care in patients with advanced brain metastases. The latest news from the QUARTZ trial, conducted by the UK Medical Research Council Group, presented at the American Society of Clinical Oncology Meeting in 201524 (full paper awaited) was striking for oncologists. They randomly allocated 538 NSCLC patients with brain metastases that were not amenable to surgery or SRS to either optimal supportive care (OSC) plus WBRT (20 Gy/5 fractions) or OSC alone. There was no significant difference in survival between the OSC+WBRT group and OSC-alone group, with the median survival being 65 and 57 days, respectively. Quality of life was also assessed in this study. The difference between the mean quality-adjusted life-years was -1.9 days only (OSC+WBRT 43.3 vs OSC-alone 41.4 days) and did not meet the initial defined criteria of significance. These data revealed that we are encountering a group of patients with very heterogeneous tumour behaviour and thus personalised treatment is required.
 
This retrospective study included patients who underwent frameless SRS during January 2010 to July 2015, after commencement of frameless SRS treatment in our centre. Limitations of this study including small number of patients and information bias are inevitable. Nonetheless, the outcomes of patients with brain metastases who underwent frameless SRS in our centre are compatible with those from other large clinical trials that used frame-based systems in terms of control rate, median OS and PFS, and toxicities. Approximately 13% of patients had a complication of steroid dependence that may have been due to treatment or natural disease progression. Steroid dependence was associated with poor survival, independent of volume of tumour. Prolonged use of steroid has been associated with decreased immunity that may underlie superimposed infection. Therefore, tailing down of steroid dose as early as possible in accordance with patient symptoms is strongly recommended.
 
This study revealed that OS was significantly associated with previously identified prognostic scoring group such as BSBM, SIR, and GPA. Among the three, BSBM and GPA are more convenient to use as only three or four factors are considered respectively, and the information should be easily available in a clinic (including age, KPS, control of primary cancer, presence of extracranial metastases, and number of brain metastases). Data relating to volume of the largest brain lesion included in SIR may not always be available as the reporting radiologist may only report lesion diameter. In terms of patient selection, for patients with GPA of 0-1.0, the median OS was 4.1 months in our study compared with 2.6 months in the original study, and similar to that of patients given WBRT alone. It may be more appropriate to prescribe WBRT alone or best supportive care for this group of patients in lieu of SRS. An important observation from the result of our study is that survival of patients was significantly improved compared with a previous cohort (Table 4). This reflects a significant improvement in systemic treatment over the last decade. Thus, the use of high technology radiation techniques such as SRS is increasingly considered by radiation oncologists to achieve the best outcomes.
 
Another important aim of this study was to identify prognostic factors of survival in order to avoid futile treatment in those patients who will have a poor outcome despite SRS. Due to the small number of patients in this study, we were not able to identify patients with superior survival among different primaries, similar to DS-GPA. It is of note that a large number of patients in our study had primary lung cancer. In the NSCLC subgroup, patients with an activating EGFR mutation had significantly better survival than those without mutation, and the majority of this group had EGFR-TKI lifelong. Of note, EGFR-TKI has been shown in various studies to have PFS and survival benefit in patients with EGFR-activating mutation.25 26 27 28 29 30 31 32 In a recent retrospective multi-institutional study with more than 300 patients, outcomes of patients with EGFR-activating mutation were analysed following treatment with upfront SRS followed by EGFR-TKI, upfront WBRT followed by EGFR-TKI, and upfront EGFR-TKI.33 Patients in the upfront SRS and upfront WBRT group had significantly superior OS and intracranial PFS compared with those with upfront EGFR-TKI.33 Therefore, in patients with oligo-brain metastases harbouring an EGFR-activating mutation, SRS followed by EGFR-TKI should be considered a standard treatment, and WBRT reserved until there is frank brain disease progression to conserve cognitive function. In addition, SRS combined with efficacious systemic treatment with good brain penetration while omitting WBRT should also be considered in other primaries, although further studies are awaited to validate the benefit.
 
The beneficial effect of WBRT in addition to SRS is controversial. Recent evidence shows it improves local control but not survival.34 35 Nonetheless, in view of toxicity of somnolence, malaise and cognitive impairment with WBRT, many clinicians may prefer delaying WBRT until there is frank disease progression after SRS. In a recent meta-analysis, the benefit of additional WBRT was not observed in patients who were 50 years old or younger in terms of survival or distant brain control.36 Initial omission of WBRT in this young age-group had no adverse effect on distant brain relapse rate. We were unable to replicate improvement in brain control with WBRT or demonstrate an interaction of age with benefit of concomitant WBRT, possibly due to the small size and retrospective nature of our current study. Number of brain metastases was identified as a significant prognostic factor of brain PFS. Patients with three or more brain metastases had worse PFS than those with one or two brain metastases (5.8 months vs 11.3 months). Again due to the small number of patients, we were unable to demonstrate whether concomitant WBRT could improve brain PFS in patients with three or more brain metastases. Further prospective studies are warranted to verify whether concomitant WBRT should be considered in patients with a higher disease load or age over 50 years.
 
Frameless SRS for oligo-brain metastases is painless and well tolerated, and should be increasingly considered in patients with good prognostic scores. Its combination with effective systemic treatment has significantly improved survival over the past decade. Nonetheless it is important to individualise treatment for patients with brain metastases according to their inherited prognostic risk factors. High precision treatment with SRS with or without WBRT should be offered to patients with oligo-brain metastases with good prognostic scores and favourable primary histology. For patients with EGFR-activating mutation, SRS followed by EGFR-TKI is a superior choice of treatment. Based on the latest evidence, it may be advisable to give SRS alone and reserve WBRT as salvage for patients with limited brain metastases who are 50 years or younger. Further, WBRT alone can be offered to patients with multiple symptomatic brain metastases and unfavourable prognostic scores. Best supportive care with dexamethasone alone may be considered for patients with very poor performance status.
 
Conclusions
Frameless SRS is effective and safe for patients with oligo-metastases of brain. Identification of patients with brain metastases who would benefit from SRS is important. Current available prognostic scoring systems provide a good estimation of survival. Frameless SRS should be increasingly considered in patients with favourable prognostic scores.
 
Acknowledgements
I would like to thank Dr SF Leung and Dr Kennis Ngar of Department of Clinical Oncology, Prince of Wales Hospital, Hong Kong for their professional opinion and support in this study.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Tsao MN, Lloyd NS, Wong RK, et al. Radiotherapeutic management of brain metastases: a systematic review and meta-analysis. Cancer Treat Rev 2005;31:256-73. Crossref
2. Nieder C, Spanne O, Mehta MP, Grosu AL, Geinitz H. Presentation, patterns of care, and survival in patients with brain metastases: what has changed in the last 20 years? Cancer 2011;117:2505-12. Crossref
3. Leksell L. The stereotaxic method and radiosurgery of the brain. Acta Chir Scand 1951;102:316-9.
4. Lo SS. Stereotactic radiosurgery. Available from: http://emedicine.medscape.com/article/1423298-overview#a2. Accessed 19 Mar 2015.
5. Minniti G, Scaringi C, Clarke E, Valeriani M, Osti M, Enrici RM. Frameless linac-based stereotactic radiosurgery (SRS) for brain metastases: analysis of patient repositioning using a mask fixation system and clinical outcomes. Radiat Oncol 2011;6:158. Crossref
6. Ramakrishna N, Rosca F, Friesen S, Tezcanli E, Zygmanszki P, Hacker F. A clinical comparison of patient setup and intra-fraction motion using frame-based radiosurgery versus a frameless image-guided radiosurgery system for intracranial lesions. Radiother Oncol 2010;95:109-15. Crossref
7. ASTRO Policies for Stereotactic radiosurgery (SRS). Available from: https://www.astro.org/uploadedFiles/ Main_Site/Practice_Management/Reimbursement/ASTROSRSModelPolicy.pdf. Accessed 2014.
8. Shaw E, Scott C, Souhami L, et al. Single dose radiosurgical treatment of recurrent previously irradiated primary brain tumors and brain metastases: final report of RTOG protocol 90-05. Int J Radiat Oncol Biol Phys 2000;47:291-8. Crossref
9. Sneed PK, Lamborn KR, Forstner JM, et al. Radiosurgery for brain metastases: is whole brain radiotherapy necessary? Int J Radiat Oncol Biol Phys 1999;43:549-58. Crossref
10. Pirzkall A, Debus J, Lohr F, et al. Radiosurgery alone or in combination with whole-brain radiotherapy for brain metastases. J Clin Oncol 1998;16:3563-9. Crossref
11. Andrews DW, Scott CB, Sperduto PW, et al. Whole brain radiation therapy with or without stereotactic radiosurgery boost for patients with one to three brain metastases: Phase III results of the RTOG 9508 randomised trial. Lancet 2004;363:1665-72. Crossref
12. Cho KH, Hall WA, Gerbi BJ, Higgins PD, Bohen M, Clark HB. Patient selection criteria for the treatment of brain metastases with stereotactic radiosurgery. J Neurooncol 1998;40:73-86. Crossref
13. Varlotto JM, Flickinger JC, Niranjan A, Bhatnagar AK, Kondziolka D, Lunsford LD. Analysis of tumor control and toxicity in patients who have survived at least one year after radiosurgery for brain metastases. Int J Radiat Oncol Biol Phys 2003;57:452-64. Crossref
14. Bhatnagar AK, Flickinger JC, Kondziolka D, Lunsford LD. Stereotactic radiosurgery for four or more intracranial metastases. Int J Radiat Oncol Biol Phys 2006;64:898-903. Crossref
15. Nieder C, Grosu AL, Gaspar LE. Stereotactic radiosurgery (SRS) for brain metastases: a systematic review. Radiat Oncol 2014;9:155. Crossref
16. Pham NL, Reddy PV, Murphy JD, et al. Frameless, real-time, surface imaging-guided radiosurgery: update on clinical outcomes for brain metastases. Transl Cancer Res 2014;3:351-7.
17. Breneman JC, Steinmetz R, Smith A, Lamba M, Warnick RE. Frameless image-guided intracranial stereotactic radiosurgery: clinical outcomes for brain metastases. Int J Radiat Oncol Biol Phys 2009;74:702-6. Crossref
18. Muacevic A, Kufeld M, Wowra B, Kreth FW, Tonn JC. Feasibility, safety, and outcome of frameless image-guided robotic radiosurgery for brain metastases. J Neurooncol 2010;97:267-74. Crossref
19. Gaspar L, Scott C, Rotman M, et al. Recursive partitioning analysis (RPA) of prognostic factors in three Radiation Therapy Oncology Group (RTOG) brain metastases trials. Int J Radiat Oncol Biol Phys 1997;37:745-51. Crossref
20. Lorenzoni J, Devriendt D, Massager N, et al. Radiosurgery for treatment of brain metastases: estimation of patient eligibility using three stratification systems. Int J Radiat Oncol Biol Phys 2004;60:218-24. Crossref
21. Weltman E, Salvajoli JV, Brandt RA, et al. Radiosurgery for brain metastases: a score index for predicting prognosis. Int J Radiat Oncol Biol Phys 2000;46:1155-61. Crossref
22. Sperduto PW, Berkey B, Gaspar LE, Mehta M, Curran W. A new prognostic index and comparison to three other indices for patients with brain metastases: an analysis of 1,960 patients in the RTOG database. Int J Radiat Oncol Biol Phys 2008;70:510-4. Crossref
23. Sperduto PW, Kased N, Roberge D, et al. Summary report on the graded prognostic assessment: an accurate and facile diagnosis-specific tool to estimate survival for patients with brain metastases. J Clin Oncol 2012;30:419-25. Crossref
24. Mulvenna PM, Nankivell MG, Barton R, et al. Whole brain radiotherapy for brain metastases from non-small lung cancer: Quality of life (QoL) and overall survival (OS) results from the UK Medical Research Council QUARTZ randomised clinical trial (ISRCTN 3826061). J Clin Oncol 2015;33(Suppl);abstract8005.
25. Mok TS, Wu YL, Thongprasert S, et al. Gefitinib or carboplatin-paclitaxel in pulmonary adenocarcinoma. N Engl J Med 2009;361:947-57. Crossref
26. Han JY, Park K, Kim SW, et al. First-SIGNAL: first-line single-agent iressa versus gemcitabine and cisplatin trial in never-smokers with adenocarcinoma of the lung. J Clin Oncol 2012;30:1122-8. Crossref
27. Mitsudomi T, Morita S, Yatabe Y, et al. Gefitinib versus cisplatin plus docetaxel in patients with non-small-cell lung cancer harbouring mutations of the epidermal growth factor receptor (WJTOG3405): an open label, randomised phase 3 trial. Lancet Oncol 2010;11:121-8. Crossref
28. Maemondo M, Inoue A, Kobayashi K, et al. Gefitinib or chemotherapy for non-small-cell lung cancer with mutated EGFR. N Engl J Med 2010;362:2380-8. Crossref
29. Jänne PA, Wang X, Socinski MA, et al. Randomized phase II trial of erlotinib alone or with carboplatin and paclitaxel in patients who were never or light former smokers with advanced lung adenocarcinoma: CALGB 30406 trial. J Clin Oncol 2012;30:2063-9. Crossref
30. Rosell R, Carcereny E, Gervais R, et al. Erlotinib versus standard chemotherapy as first-line treatment for European patients with advanced EGFR mutation–positive non-small-cell lung cancer (EURTAC): a multicentre, open-label, randomised phase 3 trial. Lancet Oncol 2012;13:239-46. Crossref
31. Zhou C, Wu YL, Chen G, et al. Erlotinib versus chemotherapy as first-line treatment for patients with advanced EGFR mutation–positive non-small-cell lung cancer (OPTIMAL, CTONG-0802): a multicentre, open-label, randomised, phase 3 study. Lancet Oncol 2011;12:735-42. Crossref
32. Yang JC, Wu YL, Schuler M, et al. Afatinib versus cisplatin-based chemotherapy for EGFR mutation–positive lung adenocarcinoma (LUX-Lung 3 and LUX-Lung 6): analysis of overall survival data from two randomised, phase 3 trials. Lancet Oncol 2015;16:141-51. Crossref
33. Magnuson WJ, Lester-Coll NH, Wu AJ, et al. Management of brain metastases in tyrosine kinase inhibitor-naïve epidermal growth factor receptor-mutant non-small-cell lung cancer: a retrospective multi-institutional analysis. J Clin Oncol 2017;35:1070-7. Crossref
34. Mehta MP, Tsao MN, Whelan TJ, et al. The American Society for Therapeutic Radiology and Oncology (ASTRO) evidence-based review of the role of radiosurgery for brain metastases. Int J Radiat Oncol Biol Phys 2005;63:37-46. Crossref
35. Patil CG, Pricola K, Sarmiento JM, Garg SK, Bryant A, Black KL. Whole brain radiation therapy (WBRT) alone versus WBRT and radiosurgery for the treatment of brain metastases. Cochrane Database Syst Rev 2012;(9):CD006121. Crossref
36. Sahgal A, Aoyama H, Kocher M, et al. Phase 3 trials of stereotactic radiosurgery with or without whole-brain radiation therapy for 1 to 4 brain metastases: individual patient data meta-analysis. Int J Radiat Oncol Biol Phys 2015;91:710-7. Crossref

Feasibility and safety of extended adjuvant temozolomide beyond six cycles for patients with glioblastoma

Hong Kong Med J 2017 Dec;23(6):594–8 | Epub 11 Aug 2017
DOI: 10.12809/hkmj165002
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Feasibility and safety of extended adjuvant temozolomide beyond six cycles for patients with glioblastoma
Sonia YP Hsieh, MB, BS, MSc; Danny TM Chan, FRCS, FHKAM (Surgery); Michael KM Kam, FRCR, FHKAM (Radiology); Herbert HF Loong, MB, BS, MRCP (UK); WK Tsang, FRCR, FHKAM (Radiology); Darren MC Poon, FRCR, FHKAM (Radiology); Stephanie CP Ng, PhD; WS Poon, FRCS, FHKAM (Surgery)
CUHK Otto Wong Brain Tumour Centre, The Sir Yue-kong Pao Centre for Cancer, Prince of Wales Hospital, Shatin, Hong Kong
 
Corresponding author: Dr Danny TM Chan (tmdanny@surgery.cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: Temozolomide is the first chemotherapeutic agent proven effective for patients with newly diagnosed glioblastoma. The drug is well tolerated for its low toxicity. The current standard practice is concomitant chemoradiotherapy for 6 weeks followed by 6 cycles of adjuvant temozolomide. Some Caucasian studies have suggested that patients might benefit from extended adjuvant cycles of temozolomide (>6 cycles) to lengthen both progression-free survival and overall survival. In the present study, we compared differences in survival and toxicity profile between patients who received conventional 6-cycle temozolomide and those who received more than 6 cycles of temozolomide.
 
Methods: Patients with newly diagnosed glioblastoma without progressive disease and completed concomitant chemoradiotherapy during a 4-year period were studied. Progression-free survival was compared using Kaplan-Meier survival curves. t Test, U test, and correlation were chosen accordingly to examine the impact of age, extent of resection, MGMT promoter methylation status and adjuvant cycles on progression-free survival. For factors with a P value of <0.05 in univariate analyses, Cox regression hazard model was adopted to determine the strongest factors related to progression-free survival.
 
Results: The median progression-free survival was 17.0 months for patients who received 6 cycles of temozolomide (n=7) and 43.4 months for those who received more than 6 cycles (n=7) [P=0.007, log-rank test]. Two patients in the former group and one in the latter group encountered grade 1 toxicity and recovered following dose adjustment. Cycles of adjuvant temozolomide were correlated with progression-free survival (P=0.016, hazard ratio=0.68).
 
Conclusion: Extended cycles of temozolomide are safe and feasible for Chinese patients with disease responsive to temozolomide.
 
 
New knowledge added by this study
  • Extended adjuvant temozolomide beyond 6 cycles is safe and feasible. The approach has improved progression-free survival.
Implications for clinical practice or policy
  • For glioblastoma patients with disease that is responsive to temozolomide, extended adjuvant cycles can be suggested.
 
 
Introduction
Glioblastoma multiforme (GBM) has been a conundrum for all clinicians. The standard approach includes maximal safe resection, irradiation with concurrent temozolomide (TMZ), and 6 cycles of adjuvant TMZ.1 Addition of chemotherapy to radiotherapy can prolong survival among GBM patients, with a median increase in survival of 2.5 months.1 Since then, no further breakthrough treatment has emerged.
 
Of note, there is still insufficient evidence to support 6 cycles as the optimal adjuvant amount of TMZ for GBM. Only few studies have suggested that extended use of TMZ is safe and beneficial for prolonged survival.2 The main concern of extended use of TMZ is haematological toxicity. It is attributed to the depletion of O6-methylguanine-DNA methyltransferase (MGMT) protein activity in both GBM cells and haematopoietic stem cells.3 Nonetheless, compared with other alkylating agents, the low toxicity profile of TMZ has motivated clinicians to try its extended use after balancing the benefits and side-effects for each patient.4
 
Our institution offers the option for GBM patients with at least stable disease to step up to adjuvant cycles of TMZ. In this study, we report the experience of extended use of TMZ and its impact on newly diagnosed GBM patients.
 
Methods
Study design
We retrospectively reviewed the brain tumour registry of the Chinese University of Hong Kong Otto Wong Brain Tumour Centre, and identified patients with primary GBM during January 2010 to December 2013. Those patients who received surgical intervention and standard concomitant chemoradiotherapy after surgery (60-Gy irradiation with concomitant TMZ for 6 weeks, then followed by at least 6 cycles of adjuvant TMZ) were chosen as candidates for the study.
 
Surgical intervention
An experienced neuroradiologist was responsible for determining the extent of resection (EOR) by reading the postoperative day-1 magnetic resonance imaging (MRI) scans. A total resection indicated that the entire preoperative contrast-enhanced lesion seen on T1-weighted images was excised. If there was residual enhancement on T1-weighted images as well as T1-subtraction sequence, the case would be labelled as subtotal resection.
 
Irradiation and temozolomide protocol
As a standard practice, a postoperative irradiation of 60 Gy was given to all patients within 4 weeks of surgery. Temozolomide was prescribed concurrently during radiotherapy at 75 mg/d/m2 for 6 weeks, followed by 6 or more cycles of adjuvant TMZ at a dosage of 150-200 mg/d/m2 for 5 consecutive days every 28 days. After completion of standard 6-cycle TMZ, all patients with at least stable disease were offered the chance of extended TMZ, regardless of individual prognostic factors. Whether or not they proceeded to extended TMZ was a decision made principally by patients and their relatives and also with neurosurgeons and clinical oncologists, on the basis of a detailed assessment of the patient’s clinical performance (neurological status and toxicity profile) and tumour response to TMZ. Anti-emetics were given during the 5 days. To achieve early detection of TMZ toxicity, haematological profile including complete blood picture with differential count, and liver and renal functions were assessed monthly on about day 21 to day 25. Toxicity was graded according to the National Cancer Institute Common Terminology Criteria for Adverse Events (CTCAE), version 4.0.5
 
Follow-up schedule
All patients were followed up regularly with both clinical and radiological assessments of disease status. They were seen by a neuro-oncologist every 2 weeks after surgery, daily during irradiation, and monthly when being given adjuvant TMZ. For radiological assessment, patients were subjected to a scanning protocol with contrast-enhanced MRI of the brain at postoperative day 1, 2 weeks after completion of radiation, and every 3 months thereafter. These standardised protocols ensured that disease progression of all patients could be monitored in a timely manner. Disease progression was determined according to Macdonald’s criteria. In short, deteriorating neurological status, increasing tumour size, and appearance of new enhancement were suggestive of disease progression.6
 
Statistical analysis
Progression-free survival (PFS) was calculated from the date of surgical intervention to the date of progression. As the aim of this study was to compare the therapeutic effect of standard 6-cycle TMZ with that of more than 6 cycles of TMZ, only patients with neither neurological deterioration nor radiological signs suggesting progression for more than 28 days upon completion of the sixth cycle of adjuvant TMZ were eligible. The MGMT promoter status and EOR were regarded as categorical factors while age and cycles of adjuvant TMZ were assigned as the continuous variables for which correlation was chosen as the tool for analysis. t Test, U test, and correlation were chosen accordingly to examine the impact on PFS of each factor. For those factors with a P value of <0.05 in univariate analyses, a Cox regression hazard model was adopted to determine those strongly related to PFS. All statistical analyses were done using the SPSS (Windows version 22.0; IBM Corp, Armonk [NY], US).
 
This audit review was conducted in accordance with the principles outlined in the Declaration of Helsinki.
 
Results
Basic demographics
From January 2010 to December 2013, there were 14 patients who fulfilled the inclusion criteria. Their mean age at presentation was 52 (range, 25-71) years with a male preponderance: 10 male versus 4 female patients. Total resection was achieved in seven patients. For the remaining seven, six underwent subtotal resection and one could only have tumour biopsy. The MGMT promoter status was available in all cases and was methylated in 12 cases and unmethylated in the remaining two. After completion of standard concomitant chemoradiotherapy in all cases, extended adjuvant TMZ was initiated in seven cases (Table 1).
 

Table 1. Demographics and basic characteristics of patients
 
In total, 134 cycles of adjuvant TMZ were given, with 92 cycles given to the seven patients who proceeded to extended maintenance TMZ treatment. The median number of cycles given was 13 (range, 8-26) in the latter group. With regard to TMZ-related toxicity, two patients in the 6-cycle group had grade 1 haematological toxicity (thrombocytopaenia and neutropaenia) and one patient in the >6-cycle group developed mildly deranged alanine aminotransferase (ALT; grade 1, defined as “more than upper limit of normal and less than three times the upper limit of normal by CTCAE”5) during the fifth cycle of adjuvant TMZ that subsequently subsided.
 
Survival and associated prognostic factor
Progression-free survival differed significantly between the two groups: 17.0 (95% confidence interval [CI], 14.4-19.6) months for 6-cycle versus 43.4 (95% CI, 17.8-69.0) months for the >6-cycle group (P=0.007, log-rank test; Fig).
 

Figure. Kaplan-Meier progression-free survival curves of patients with different treatments
 
Progression-free survival at 12 months was 85.7% for both groups, and that at 18 months declined to only 28.6% for the 6-cycle group compared with 71.4% for the >6-cycle group (Table 2Fig).
 

Table 2. Progression-free survival and overall survival in the two groups of patients
 
Three out of seven patients in the >6-cycle group were still alive at their last follow-up. Their median overall survival was 48.4 (95% CI, 24.3-72.4) months. In the 6-cycle group, the median overall survival was 30.9 (95% CI, 22.4-39.4) months. No statistical significance was observed by the end of the study (P=0.07, log-rank).
 
All factors including age, gender, and MGMT status were well balanced, except for the EOR. Despite the higher proportion of patients with subtotal resection who elected to receive extended TMZ, EOR was not predictive of longer PFS (P=0.482, Mann-Whitney U test). When subgrouping the cohort with MGMT promoter status, there was no evidence to suggest that methylated MGMT promoter status favoured patients with better PFS (P=0.882, Mann-Whitney U test). Age was also not correlated with PFS (P=0.09, Pearson correlation). Cox regression hazard model suggested that increased cycles of TMZ were associated with prolonged PFS (P=0.016; hazard ratio=0.68 per cycle; 95% CI, 0.48-0.94) [Table 3].
 

Table 3. Univariate analyses for progression-free survival
 
Discussion
Despite recent advances in its therapy, GBM is still an incurable disease, characterised by rapid and inevitable recurrence even with intensive treatment. Ample clinical research has been carried out with the intention of defeating the disease, but the prognosis of GBM remains dismal. Temozolomide is the first chemotherapeutic agent proven to be effective. The standard treatment after maximum safe resection has two components, irradiation with concomitant TMZ and adjuvant TMZ at a higher dosage for 6 cycles. Under this regimen, survival is favourably improved.1 Since then, no other encouraging milestones have been achieved.
 
Comparison of progression-free survival and toxicity with other studies
Our audit review showed a significant correlation between the number of cycles of TMZ and PFS. One patient in the >6-cycle group showed a continuous yet prominent shrinkage of the non-operable GBM (bilateral corpus callosum) after initiation of the seventh cycle of TMZ and finally achieved complete response after 12 cycles. The patient had only mildly deranged ALT during the fifth cycle of adjuvant TMZ and this subsided on its own.
 
Temozolomide was well-tolerated by most patients. One of our previous studies also demonstrated its satisfying anti-tumoural activity as well as its safety profile among ethnic Chinese population.7 Extended usage of TMZ upon completion of standard 6-cycle adjuvant courses has become common practice in many institutions.4 8 9
 
A literature search revealed only a few reports with similar settings and conclusions. Three non-randomised retrospective studies with decent sample sizes demonstrated an indispensable impact of extended adjuvant TMZ. The reported PFS ranged from 13 to 24.6 months; the overall survival was also improved.2 8 10 One very recent pooled analysis of four randomised clinical trials, however, showed a slightly different result—PFS was the only outcome that increased with the cumulative prescription of TMZ.11 Blumenthal et al11 reported that treatment with extended maintenance TMZ was significantly associated with better PFS with a hazard ratio of 0.77 (6 cycles vs >6 cycles). To conclude, the positive impact of long-term use of TMZ on PFS is supported by much evidence. Its influence on overall survival, however, needs further clarification.
 
Toxicity after long-term usage of temozolomide
By sacrificing its only alkyl component to the TMZ-induced lethal depletion of alkyl products on tumoural DNA, MGMT serves as a suicidal DNA repair enzyme. Theoretically, this irreversible depletion of the MGMT protein could be exploited by increasing tumoural exposure to TMZ. The effect might be even more prominent when MGMT promoter is hypermethylated, although the impact of MGMT promoter methylation could not be demonstrated in the present study. Nonetheless this mechanism also accounts for myelosuppression, the main concern of long-term use of TMZ, since MGMT protein in normal cells can also be depleted by TMZ. It is more common in haematopoietic stem cells contributing to toxicity for patients using this alkylating agent.3 In a cohort that comprised 114 patients, 39 (34%) were observed to have CTCAE grade 3 haematological toxicity during administration of TMZ. The study included all patients who received 1 to 57 cycles of TMZ.8 The French SV3 Study also evaluated the effect of prolonged TMZ and suggested that 39.6% of cases developed haematological toxicity beyond the second cycle.10 Toxicity to a certain degree discourages both clinicians and patients from increasing the dosage of TMZ during adjuvant therapy, and for extending use of TMZ beyond 6 cycles. In the current study, only 21.4% (3/14) of our patients encountered mild side-effects. Neuro-oncologists, however, remain reluctant to persuade patients to receive long-term TMZ. It is generally accepted by clinicians that long-term use of alkylating agents is unwise since they are likely to be the eventual cause of myelosuppression and secondary cancers.
 
Clinical and financial situation in Hong Kong
In our institute, all patients with at least stable disease for more than 28 days after completion of the sixth cycle of adjuvant TMZ will be offered the option of continuing TMZ beyond 6 cycles, after being given detailed information about possible future side-effects. Of note, TMZ is funded in Hong Kong only for the first six adjuvant cycles; patients need to pay thereafter, making the inherent socio-economic bias unavoidable.
 
This study had several limitations. The sample size was relatively small. The analyses presented may provide only limited and preliminary evidence. Moreover, due to the nature of this study, only patients with disease responsive to TMZ yet with no or mild TMZ-related toxicity were qualified for the study.
 
Conclusion
Extended treatment with TMZ is safe and effective in Chinese patients with disease that is responsive to it. Careful assessment and consideration of continuing adjuvant TMZ is feasible for this group of patients.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Stupp R, Mason WP, Van Den Bent MJ, et al. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. N Engl J Med 2005;352:987-96. Crossref
2. Barbagallo GM, Paratore S, Caltabiano R, et al. Long-term therapy with temozolomide is a feasible option for newly diagnosed glioblastoma: a single-institution experience with as many as 101 temozolomide cycles. Neurosurg Focus 2014;37:E4. Crossref
3. Wick W, Platten M, Weller M. New (alternative) temozolomide regimens for the treatment of glioma. Neuro Oncol 2009;11:69-79. Crossref
4. Mason WP, Maestro RD, Eisenstat D, et al. Canadian recommendations for the treatment of glioblastoma multiforme. Curr Oncol 2007;14:110-7. Crossref
5. Common Terminology Criteria for Adverse Events (CTCAE), Version 4.0. Available from: http://ctep.cancer.gov/protocolDevelopment/electronic_applications/docs/ctcaev3.pdf. Accessed 31 Mar 2016.
6. Macdonald DR, Cascino TL, Schold SC Jr, Cairncross JG. Response criteria for phase II studies of supratentorial malignant glioma. J Clin Oncol 1990;8:1277-80. Crossref
7. Chan DT, Poon WS, Chan YL, Ng HK. Temozolomide in the treatment of recurrent malignant glioma in Chinese patients. Hong Kong Med J 2005;11:452-6.
8. Seiz M, Krafft U, Freyschlag CF, et al. Long-term adjuvant administration of temozolomide in patients with glioblastoma multiforme: experience of a single institution. J Cancer Res Clin Oncol 2010;136:1691-5. Crossref
9. Hau P, Koch D, Hundsberger T, et al. Safety and feasibility of long-term temozolomide treatment in patients with high-grade glioma. Neurology 2007;68:688-90. Crossref
10. Rivoirard R, Falk AT, Chargari C, et al. Long-term results of a survey of prolonged adjuvant treatment with temozolomide in patients with glioblastoma (SV3 Study). Clin Oncol (R Coll Radiol) 2015;27:486-7. Crossref
11. Blumenthal DT, Stupp R, Zhang P, et al. ATCT-08. The impact of extended adjuvant temozolomide in newly-diagnosed glioblastoma: A secondary analysis of EORTC and NRG Oncology/RTOG. Neuro Oncol 2015;17(Suppl 5):v2. Crossref

Moderate iodine deficiency among pregnant women in Hong Kong: revisit the problem after two decades

Hong Kong Med J 2017;23(6):586–93 | Epub 10 Nov 2017
DOI: 10.12809/hkmj176841
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Moderate iodine deficiency among pregnant women in Hong Kong: revisit the problem after two decades
WH Tam, MD, FRCOG1; Ruth SM Chan, PhD2; Michael HM Chan, FRCPA, FHKCPath3; LY Yuen, BSc1; Liz Li, MSc2; Mandy MM Sea, PhD2; Jean Woo, MD, FRCP2
1 Department of Obstetrics and Gynaecology, The Chinese University of Hong Kong, Shatin, Hong Kong
2 Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Shatin, Hong Kong
3 Department of Chemical Pathology, The Chinese University of Hong Kong, Shatin, Hong Kong
 
Corresponding author: Dr WH Tam (tamwh@cuhk.edu.hk)
 
 Full paper in PDF
 
Abstract
Introduction: A survey conducted during 2005 to 2007 by the Centre for Food Safety in Hong Kong suggested that only 5% of the local population had a sufficient dietary intake of iodine. The study, however, was limited as biochemical data (ie urinary iodine concentration) were lacking. Pregnant women are vulnerable to iodine deficiency because of their increased requirement. Recent studies have shown that iodine deficiency in early pregnancy is associated with poorer cognitive development in early childhood. This study reports the iodine status of women during early gestation at an obstetric unit in Hong Kong.
 
Methods: Healthy pregnant women with no history of hyperemesis gravidarum were enrolled into a study when they first made a booking in an antenatal clinic of a public hospital to investigate their iodine status during early pregnancy. All subjects were asked to collect their morning urine for measurement of iodine and creatinine levels. Daily dietary intake of iodine was assessed in a subgroup of participants by structured interview using a standard food frequency questionnaire.
 
Results: A total of 600 pregnant women were enrolled at a median of 7.0 weeks of gestation. The median urinary iodine concentration and urinary iodine-to-creatinine ratio were 100 µg/L and 98 µg/g, respectively; 429 (71.5%) participants had iodine insufficiency according to the World Health Organization classification. Daily dietary intake of iodine was assessed in 146 participants. The median daily intake of iodine was 69.5 µg and 122 (83.6%) participants had an intake below the 250 µg recommended during pregnancy by the World Health Organization.
 
Conclusions: Local pregnant women continue to have an inadequate dietary intake of iodine and remain iodine-deficient.
 
 
New knowledge added by this study
  • This study confirmed the previous observation that dietary intake of iodine is inadequate in our local population, in particular in women during pregnancy.
  • The majority of the pregnant women were taking multivitamin supplements without iodine, suggesting inadequate knowledge about the importance of iodine.
Implications for clinical practice or policy
  • There is an urgent need to educate the general public, in particular women of childbearing age, about the importance of an adequate dietary intake of iodine and the option of iodine-containing multivitamin supplements before contemplating pregnancy.
  • We can consider a policy to monitor the iodine status of the population, especially in vulnerable groups including pregnant and lactating mothers, and their children.
  • World Health Organization recommends salt iodisation in countries and areas that are iodine-deficient. As less than 15% of table salt available in Hong Kong is iodised, it may be appropriate to consider this policy.
 
 
Introduction
The World Health Organization (WHO) considers iodine deficiency the single most important preventable cause of brain damage worldwide. Since 1993, WHO has recommended universal salt iodisation to eliminate iodine deficiency disorder.1 There is a common belief that the Hong Kong population residents in coastal regions should have sufficient iodine intake. Twenty years ago, however, 50% of children and adults were found iodine-deficient according to the WHO standards.2 At the same time, studies also reported that one third of local pregnant women were iodine-deficient based on their urinary iodine concentration (UIC), from the first through third trimester.3 4 Pregnant and lactating women are among the most vulnerable groups in the population as iodine plays an important role in early neuronal migration and maturation in the developing fetus and infants.
 
A local expert panel group was established in 2003 to encourage the monitoring of iodine status and rectify the problem.5 The panel concluded that dietary iodine intake of our population was borderline sufficient, and was also inadequate to meet any extra requirement at the time of thyroid stress, ie pregnancy, neonatal period, and in the first few years of life.5 In a local population survey conducted by the Centre for Food Safety during 2005 to 2007, the median iodine dietary intake was as low as 44 µg/day among 5000 adults, while 60% had an intake of <50 µg/day (the threshold for normal thyroid functioning).6 Only 5% had an iodine intake within the safe range. This report, however, also admitted a lack of clinical and biochemical data (ie UIC) that precluded a conclusion about iodine deficiency. Since this report in 2011, all mothers are enquired about their dietary intake of iodine and provided with iodine-containing multivitamin supplements when they book in at a Maternity and Child Health Centre in Hong Kong.
 
Nevertheless, the problem remains unresolved as two recent cohort studies in the UK and Australia consistently reported that low maternal first-trimester UIC was associated with poorer childhood cognitive development at the age of 8 to 9 years.7 8 The investigators of the Avon Longitudinal Study of Parents and Children in the UK also reported a dose-effect association; child neurocognitive scores were lower when maternal urinary iodine-to-creatinine ratio (UICr) was <50 µg/g.7
 
As no salt iodisation programme has been implemented since the first study was carried out, and with the changes in social circumstances and dietary trends over the last two decades, it was deemed necessary and important to reassess the iodine status of pregnant and lactating women in Hong Kong to guide future policy. The objective of this paper was to report the iodine status of women during early gestation at an obstetric unit in Hong Kong.
 
Methods
Between July 2014 and November 2015, healthy pregnant women were enrolled at the antenatal clinic of Prince of Wales Hospital into a study of gestational age-specific thyroid function test reference intervals. Women without a history of thyroid dysfunction, hyperemesis gravidarum, autoimmune disease, or any other major medical conditions were eligible. Consecutive cases were recruited at their first attendance for antenatal booking. We estimated that 250 blood samples were required for each block of gestational age range (<6, 6-10, >10-14, >14-18, >18-24, >24-32, >32-38, and >38 weeks) to generate a nomogram; 600 subjects were required to provide 2000 samples, each had an average of four blood samples across the gestation, with an estimated dropout rate of 15%. All subjects underwent early ultrasound scan for dating with gestational age adjusted according to the ultrasound date if it differed to that calculated from the last menstrual period. The study was approved by the Chinese University of Hong Kong Clinical Research Ethics Committee (CRE-2013.500), and written informed consent was obtained from all women.
 
As part of the study design, UIC was measured at the time of recruitment to determine the participant’s iodine status. Women were asked to collect a morning urine sample into an acid-washed trace element urine bottle within 1 week of recruitment for the assay. In this study, UIC was measured by an inductively coupled plasma mass spectrometer (ICP-MS 7700; Agilent Technologies, US). The intra- and inter-assay coefficients of variation for UIC were 3.3% and 7.4% at 11.9 µg/L, and 2.7% and 4.9 % at 49.6 µg/L, respectively. Urinary creatinine concentration was also measured by IDMS-traceable Jaffe kinetic reaction (cobas 8000; Roche Diagnostics Inc, US) to calculate the UICr.
 
All participants were interviewed by a research nurse who asked about the duration, frequency, and brand of any multivitamin supplements taken to quantify daily iodine supplementation. Due to the limited funding, dietary intake of iodine-containing foods was only assessed in a conveniently sampled subgroup of the population by another research assistant using a food frequency questionnaire (FFQ; Appendix). Education level and occupation were recorded for this subgroup. The FFQ was based on the literature and previously validated FFQs that have been used in the local population.6 9 10 Eleven main food groups that contribute to iodine intake in the local population were incorporated into the FFQ (Table 1). Portions were reported based on standard reference sizes, eg cups, grams, centilitres. To facilitate understanding and recall, food photo albums and eating utensils of standard portions were presented when completing the FFQ. Use of iodised salt was also documented. Daily dietary intake of iodine was calculated using Food Processor Nutrition Analysis and Fitness software, version 8.0 (ESHA Research, Salem, US). The iodine content of local foods was also programmed into the software, extracted from food composition tables from Hong Kong SAR and Mainland China.11 We used the WHO definition to categorise insufficient, adequate, above requirements, and excessive by UIC of <150, 150-249, 250-499, and ≥500 µg/L, respectively12; UICr of <150 µg/g was also used to define insufficiency according to Bath et al.7 An average daily iodine intake of 250 µg/day was considered adequate for pregnancy.1 Data are expressed as mean ± standard deviation (SD), median and interquartile range (IQR), or counts with proportion. Between-group differences were compared using the Student’s t test and χ2/Fisher’s exact tests, as appropriate. Multivariate logistic regression analyses were used to obtain adjusted odds ratios with 95% confidence intervals, with the forced entry of covariates, namely maternal age, parity, education level, occupation, body mass index, and gestational age at recruitment, to assess factors associated with low UIC (<150 µg/L), low UICr (<150 µg/g) and iodine-containing supplementation. Multivariate linear regression analyses were used to assess the association with UIC and UICr. Statistical analysis was performed using SPSS (Windows version 20.0; IBM Corp, Armonk [NY], US). A P value of <0.05 was used to indicate significance for two-tailed statistical test results. The gestational age-specific thyroid function test reference intervals are not included in this article but will be published elsewhere.
 

Table 1. Contribution to daily dietary intake of iodine from eleven main food groups
 
Results
A total of 600 healthy pregnant women were enrolled during the study period at a median gestational age of 7.0 (IQR, 5.9-8.6) weeks. The mean (± SD) age and body mass index at recruitment were 31.3 ± 3.9 years and 21.9 ± 3.1 kg/m2, respectively. Of the subjects, 382 (63.7%) women were nulliparous and two (0.3%) were twin pregnancies. The median (IQR) UIC and UICr were 100 (58-165) µg/L and 98 (67-150) µg/g, respectively (Table 2). According to the WHO definition, 429 (71.5%) participants were regarded as iodine deficient (Fig a). Moreover, 450 (75.0%) participants had UICr of <150 µg/g (Fig b).7
 

Table 2. Characteristics of all 600 participants and the subgroup of participants who completed the FFQ
 

Figure. Subjects classified according to the WHO definition on the basis of (a) UIC, and (b) UICr in the study population
 
Daily dietary intake of iodine was assessed in 146 participants. They were slightly younger in age and had a higher rate of nulliparity; their UIC and UICr were no different to all participants (Table 2). The median (IQR) daily dietary intake of iodine was 69.5 (47.3-152.4) µg. The greatest source of iodine among the studied sample was from seaweeds that were consumed by 76% of women (Table 1). The next most common source of iodine was non-alcoholic beverages such as soy milk, soup, soft drinks, and water (21.5% of the daily iodine intake). Of all the 146 women surveyed, only four (2.7%) regularly used iodised salt in their diet. Several food sources were commonly eaten by all women including fish, cereal and grain products, vegetables and legumes, and condiments. Nonetheless these combined food sources only contributed to 15.5% of daily dietary iodine intake. There were 44 (30.1%), 92 (63.0%) and 122 (83.6%) participants who had a daily iodine intake below 50, 100, and 250 µg respectively.
 
Nutritional supplementation was reported by 414 (69.0%) of all participants at the time of enrolment but only 163 (39.4%) of these supplements contained iodine; the daily iodine supplement was between 140 and 290 µg if taken according to the prescription. The subgroup of participants who had taken iodine-containing supplements for 2 weeks or more had an adequate iodine status both by their median UIC and UICr (Table 3). In 122 subjects who completed the FFQ and had urinary iodine level measured within 1 week of each other, total daily iodine intake (iodine supplementation included) had a weak correlation with UICr (r=0.20, P=0.03). In multivariate regression analyses, no covariates except gestational age at recruitment were associated with low UIC or UICr, or iodine supplementation; women recruited at a later gestational age had significantly higher UIC and were more likely to have started taking an iodine-containing supplement (Table 4).
 

Table 3. Median UIC and UICr among participants who did not take iodine supplements and those who took it for more than 1 or 2 weeks
 

Table 4. Association of UIC, UICr, and iodine supplementation with gestational age at recruitment
 
Discussion
The findings from this study suggest that local pregnant women might still be iodine-deficient when the WHO definition is applied. In the subgroup of participants who completed the FFQ, more than 80% did not have an adequate daily iodine intake as recommended by the WHO and other authorities.13 14 This is in keeping with the survey conducted by the Centre for Food Safety 10 years ago. A recent local study of 95 lactating mothers showed that the mean daily dietary intake of iodine was 62.6 µg with only 2% having a sufficient iodine intake.15 The result also showed that only 48% of breast milk samples from 39 women with an infant younger than 6 months had an adequate level of iodine (85 µg per day) as recommended by the Chinese Dietary Reference Intake.15 The revised guideline from the Department of Health in 2016 suggested that mothers consider taking iodine-containing prenatal supplements during pregnancy and breastfeeding in view of the difficulties in obtaining sufficient iodine from food alone.16 The policy appears to be in agreement with our findings that mothers taking iodine-containing supplements were iodine sufficient according to their median UIC. Nonetheless universal iodine supplementation remains controversial. The American Thyroid Association, Endocrine Society, and New Zealand Ministry of Health recommend that women who are planning a pregnancy or who are currently pregnant or lactating should receive 150 µg per day of iodine supplementation in the form of potassium iodide-containing supplements.17 18 On the contrary, Cochrane systematic review did not support routine iodine supplementation in women before, during or after pregnancy, while WHO does not recommend iodine supplementation in regions where the median UIC indicates iodine sufficiency, or where a salt iodisation programme is in place.1 19 All eight iodine-containing supplements that were reportedly taken by participants in this study also contained iron, and frequently resulted in varying degrees of constipation. Most women did not start taking supplements prior to their first antenatal visit. Even in those who did, 60% of the supplements contained no iodine, similar to that reported in the US.20 21 Our results also show that mothers were more likely to have commenced iodine supplementation and have an iodine adequate status at a later gestation; this suggests mothers might acquire the knowledge as pregnancy progresses.
 
Although the UIC is designated for assessment of a population, not individual iodine status, our results suggest that three quarters of the participants may have been iodine insufficient during early pregnancy according to their UIC and UICr. Similarly, iodine deficiency in pregnancy has re-emerged in several developed countries.22 Subjects enrolled into the Avon study in the UK were mildly-to-moderately iodine deficient with a median UIC of 91 µg/L.7 A more recent study among schoolgirls in the UK also reported a similarly low UIC suggesting that mild-to-moderate iodine deficiency remains a problem.23 Such deficiency was probably caused by a poor availability of iodised salt, few UK recommendations for increased iodine intake in pregnancy, and insufficient use of iodine-containing prenatal supplements. The International Council for Control of Iodine Deficiency Disorders global network now places the UK on the list of mildly deficient nations.24 The Department of Health in the UK has also added iodine to its National Diet and Nutrition Survey that checks the nutrient intake of adults and children in the UK.25 In Australia, mandatory iodine fortification of salt used in bread was introduced in 2009 in order to tackle mild iodine deficiency in the population; iodine status of women has improved since then but only those taking iodine-containing supplements have UIC indicative of sufficiency.26 According to data from the National Health and Nutrition Examination Survey 2005-2010, more than 55% of pregnant women had UIC that suggested inadequate iodine intake.27 Mainland China has led the way in sustaining the elimination of iodine deficiency through iodised salt since the early 1990s.28 Nonetheless dietary iodine intake remains insufficient among pregnant women in Shanghai, Zhejiang, and other coastal cities where iodine is considered sufficient for the general population.29 30 A national survey from the Mainland reported that the median UIC levels of pregnant and lactating women were 123-224 µg/L and 109-224 µg/L, respectively; median UIC was higher among those in inland cities, because of a higher iodine level in the salt and greater household coverage.31 In contrast, the progress in tackling iodine insufficiency in Hong Kong has been far from satisfactory.
 
Due to the limitations in study design, this study can only provide a snapshot overview of the iodine status of local pregnant women during early pregnancy from a single obstetric unit. Given that the main objective of the study was to ensure our study group had sufficient iodine intake in order to establish a thyroid function test reference range for the local population, our results cannot draw any conclusions about iodine status during the second and third trimesters. Moreover, dietary iodine intake could only be assessed in a subgroup of the population due to limited funding. The WHO has considered iodine deficiency the single most important preventable cause of brain damage worldwide. Recent study also highlighted the impact of iodine deficiency in the first 1000 days of life especially among breast-fed infants.32 As the Hong Kong SAR Government is actively promoting breastfeeding, it is also important to ensure adequate iodine in lactating mothers and breast-fed infants. It is time to systematically revisit the iodine status of our local women at pre-conception, and during pregnancy and lactation.
 
Conclusions
Results of our study are in line with those from the survey by the Centre for Food Safety, suggesting that our local pregnant women are borderline iodine-deficient and have an inadequate dietary iodine intake during early pregnancy. There is a need to educate the public and to advise women of childbearing age to maintain sufficient dietary iodine before contemplating pregnancy. A policy of salt iodisation and regular monitoring of iodine status with UIC in our population should be considered.
 
Acknowledgements
This study was supported by the Hospital Authority of Hong Kong on a project to derive a gestational age-specific thyroid function reference interval for the local pregnant population. Ms Sharon Lai-kwai Chan, research nurse, recruited all subjects, and collected and organised the clinical data. Ms Macy Kwan conducted face-to-face interviews for the dietary questionnaire for the subgroup of subjects.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. World Health Organization; United Nations Children's Fund; International Council for the Control of Iodine Deficiency Disorders. Assessment of iodine deficiency disorders and monitoring their elimination: A guide for programme managers. 3rd ed. Available from: http://apps.who.int/iris/bitstream/10665/43781/1/9789241595827_eng.pdf. Accessed 22 Mar 2017.
2. Kung AW, Chan LW, Low LC, Robinson JD. Existence of iodine deficiency in Hong Kong—a coastal city in southern China. Eur J Clin Nutr 1996;50:569-72.
3. Kung AW, Lao TT, Low LC, Pang RW, Robinson JD. Iodine insufficiency and neonatal hyperthyrotropinaemia in Hong Kong. Clin Endocrinol (Oxf) 1997;46:315-9. Crossref
4. Kung AW, Lao TT, Chau MT, Tam SC, Low LC. Goitrogenesis during pregnancy and neonatal hypothyroxinaemia in a borderline iodine sufficient area. Clin Endocrinol (Oxf) 2000;53:725-31. Crossref
5. But B, Chan CW, Chan F, et al. Consensus statement on iodine deficiency disorders in Hong Kong. Hong Kong Med J 2003;9:446-53.
6. Centre for Food Safety, Hong Kong SAR Government. Risk Assessment Studies Report No. 45: Dietary iodine intake in Hong Kong adults. Jul 2011. Available from: http://www.cfs.gov.hk/english/programme/programme_rafs/programme_rafs_n_01_12_Dietary_Iodine_Intake_HK.html. Accessed 20 Apr 2015.
7. Bath SC, Steer CD, Golding J, Emmett P, Rayman MP. Effect of inadequate iodine status in UK pregnant women on cognitive outcomes in their children: results from the Avon Longitudinal Study of Parents and Children (ALSPAC). Lancet 2013;382:331-7. Crossref
8. Hynes KL, Otahal P, Hay I, Burgess JR. Mild iodine deficiency during pregnancy is associated with reduced educational outcomes in the offspring: 9-year follow-up of the gestational iodine cohort. J Clin Endocrinol Metab 2013;98:1954-62. Crossref
9. Combet E, Lean ME. Validation of a short food frequency questionnaire specific for iodine in U.K. females of childbearing age. J Hum Nutr Diet 2014;27:599-605. Crossref
10. Condo D, Makrides M, Skeaff S, Zhou SJ. Development and validation of an iodine-specific FFQ to estimate iodine intake in Australian pregnant women. Br J Nutr 2015;113:944-52. Crossref
11. 楊月欣. 食物營養成分速查 [in Chinese]. 人民日報出版社; 2006.
12. World Health Organization. Urinary iodine concentrations for determining iodine status in populations. 2013. Available from: http://www.who.int/vmnis/indicators/urinaryiodine/en/. Accessed 22 Apr 2015.
13. Institute of Medicine (US) Panel on Micronutrients. Dietary reference intakes for vitamin A, vitamin K, arsenic, boron, chromium, copper, iodine, iron, manganese, molybdenum, nickel, silicon, vanadium, and zinc. Washington, DC: National Academies Press; 2001. Available from: https://www.ncbi.nlm.nih.gov/books/NBK222310/. Accessed 22 Mar 2017.
14. American Thyroid Association. Iodine deficiency. 4 Jun 2012. Available from: http://www.thyroid.org/iodine-deficiency/. Accessed 25 Mar 2015.
15. PolyU discovers inadequate calcium, iron and iodine intakes of Hong Kong lactating women. 26 Jul 2016. Available from: https://www.polyu.edu.hk/web/en/media/media_releases/index_id_6237.html. Accessed 25 Jun 2017.
16. Family Health Service, Department of Health, Hong Kong SAR Government. Healthy eating during pregnancy and breastfeeding. Revised Nov 2016. Available from: http://www.fhs.gov.hk/english/health_info/woman/20036.html. Accessed 21 Jun 2017.
17. Alexander EK, Pearce EN, Brent GA, et al. 2017 Guidelines of the American Thyroid Association for the Diagnosis and Management of Thyroid Disease During Pregnancy and the Postpartum. Thyroid 2017;27:315-89. Crossref
18. Ministry for Primary Industries; Ministry of Health, New Zealand. Mandatory iodine fortification in New Zealand: Supplement to the Australian Institute of Health and Welfare 2016 report—Monitoring the health impacts of mandatory folic acid and iodine fortification. MPI Technical—Paper No: 2016/32. Available from: https://www.mpi.govt.nz/dmsdocument/12786-mandatory-iodine-fortification-in-new-zealand-supplement-to-the-australian-institute-of-health-and-welfare-2016-report-monitoring-the-health-impacts-of-mandatory-folic-acid-and-iodine-fortification. Accessed 22 Mar 2017.
19. Harding KB, Peña-Rosas JP, Webster AC, et al. Iodine supplementation for women during the preconception, pregnancy and postpartum period. Cochrane Database Syst Rev 2017;(3):CD011761. Crossref
20. Gregory CO, Serdula MK, Sullivan KM. Use of supplements with and without iodine in women of childbearing age in the United States. Thyroid 2009;19:1019-20. Crossref
21. Leung AM, Pearce EN, Braverman LE. Iodine content of prenatal multivitamins in the United States. N Engl J Med 2009;360:939-40. Crossref
22. Li M, Eastman CJ. The changing epidemiology of iodine deficiency. Nat Rev Endocrinol 2012;8:434-40. Crossref
23. Vanderpump MP, Lazarus JH, Smyth PP, et al. Iodine status of UK schoolgirls: a cross-sectional survey. Lancet 2011;377:2007-12. Crossref
24. Iodine global network. Global Scorecard 2014: Number of iodine deficient countries more than halved in past decade. IDD Newsletter 2015;Feb:5-7. Available from: http://www.ign.org/cm_data/IDD_feb15_mail.pdf. Accessed 25 Mar 2015.
25. Whitton C, Nicholson SK, Roberts C, et al. National Diet and Nutrition Survey: UK food consumption and nutrient intakes from the first year of the rolling programme and comparisons with previous surveys. Br J Nutr 2011;106:1899-914. Crossref
26. Charlton KE, Yeatman H, Brock E, et al. Improvement in iodine status of pregnant Australian women 3 years after introduction of a mandatory iodine fortification programme. Prev Med 2013;57:26-30. Crossref
27. Caldwell KL, Pan Y, Mortensen ME, Makhmudov A, Merrill L, Moye J. Iodine status in pregnant women in the National Children's Study and in U.S. women (15-44 years), National Health and Nutrition Examination Survey 2005-2010. Thyroid 2013;23:927-37. Crossref
28. Sun D, Codling K, Chang S, et al. Eliminating iodine deficiency in China: achievements, challenges and global implications. Nutrients 2017;9.pii:E361. Crossref
29. Zou S, Wu F, Guo C, et al. Iodine nutrition and the prevalence of thyroid disease after salt iodization: a cross-sectional survey in Shanghai, a coastal area in China. PLoS One 2012;7:e40718. Crossref
30. Wang Z, Zhu W, Mo Z, et al. An increase in consuming adequately iodized salt may not be enough to rectify iodine deficiency in pregnancy in an iodine-sufficient area of China. Int J Environ Res Public Health 2017;14:piiE206.
31. National Food Safety and Risk Assessment Expert Committee. Salt iodization and risk assessment of iodine status in Chinese population [in Chinese]. Technical report No. 2010-002. May 2010. Available from: http://www.nhfpc.gov.cn/cmsresources/mohwsjdj/cmsrsdocument/doc9250.pdf. Accessed 21 Jun 2017.
32. Stinca S, Andersson M, Herter-Aeberli I, et al. Moderate-to-severe iodine deficiency in the “first 1000 days” causes more thyroid hypofunction in infants than in pregnant or lactating women. J Nutr 2017;147:589-95. Crossref

The first case series of Chinese patients in Hong Kong with familial Alzheimer’s disease compared with those with biomarker-confirmed sporadic late-onset Alzheimer’s disease

Hong Kong Med J 2017 Dec;23(6):579–85 | Epub 10 Nov 2017
DOI: 10.12809/hkmj176845
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
The first case series of Chinese patients in Hong Kong with familial Alzheimer’s disease compared with those with biomarker-confirmed sporadic late-onset Alzheimer’s disease
YF Shea, MRCP (UK), FHKAM (Medicine)1; LW Chu, MD, FRCP (Lond)1; SC Lee, BHS(Nursing)1; Angel OK Chan, MD, FRCPA2
1 Division of Geriatric Medicine, Department of Medicine, LKS Faculty of Medicine, The University of Hong Kong, Queen Mary Hospital, Pokfulam, Hong Kong
2 Division of Clinical Biochemistry, Department of Pathology & Clinical Biochemistry, Queen Mary Hospital, Pokfulam, Hong Kong
 
Corresponding author: Dr YF Shea (elphashea@gmail.com)
 
 Full paper in PDF
 
Abstract
Introduction: Patients with familial Alzheimer’s disease are being increasingly reported in Hong Kong. The objectives of this study were to report the clinical features of these patients, and to compare them with those with biomarker-confirmed sporadic late-onset Alzheimer’s disease.
 
Methods: All symptomatic Chinese patients with familial Alzheimer’s disease who attended Queen Mary Hospital, Memory Clinic between January 1998 and December 2016 were included. Information about clinical features, baseline Mini-Mental State Examination score, and presenting cognitive symptoms or atypical clinical features were collected. Their clinical features were compared with those of 12 patients with sporadic late-onset Alzheimer’s disease with cerebrospinal fluid biomarker evidence of Alzheimer’s disease and 14 patients with late-onset Alzheimer’s disease and positive amyloid loading on Pittsburgh compound B imaging.
 
Results: There were three families with familial Alzheimer’s disease among whom eight family members were affected. The mean (± standard deviation) age of onset and the Mini-Mental State Examination score were 48.4 ± 7.7 years and 7.9 ± 9.2, respectively. Compared with the sporadic late-onset Alzheimer’s disease patients, those with familial Alzheimer’s disease had an earlier age of onset and presentation (both P<0.001) and received the correct diagnosis later (median [interquartile range], 7.5 [5.3-14.5] vs 2 [1.0-3.3] years; P<0.001). Patients with familial disease had a lower Mini-Mental State Examination score at presentation than those having late-onset Alzheimer’s disease (mean, 7.9 ± 9.2 vs 17.6 ± 7.2; P=0.01). They also had fewer delusions, and less dysphoria and irritability (0% vs 41.7%, 0% vs 50% and 0% vs 54.2%; P=0.04, 0.01 and 0.01, respectively). There was a trend of less frequent amnesia among patients with familial Alzheimer’s disease compared with those having late-onset Alzheimer’s disease (75% vs 100%; P=0.05).
 
Conclusion: Clinical features differ for patients with familial Alzheimer’s disease compared with those with late-onset Alzheimer’s disease. There is a delay in diagnosis. Promotion of public awareness of familial Alzheimer’s disease is much needed.
 
 
New knowledge added by this study
  • There is a significant delay in the diagnosis of familial Alzheimer’s disease (FAD) in Hong Kong.
  • Patients with FAD had fewer delusions and less dysphoria and irritability compared with patients with sporadic late-onset Alzheimer’s disease.
Implications for clinical practice or policy
  • Promotion of public awareness of FAD is much needed.
 
 
Introduction
Alzheimer’s disease (AD) is the most common cause of dementia. It is frequently classified as early-onset AD (EOAD) if onset is before the age of 65 years and thereafter as late-onset AD (LOAD). Familial AD (FAD) is a special form of EOAD with an autosomal dominant inheritance, and can be caused by mutations in presenilin (PSEN) 1 or 2 and amyloid precursor protein (APP) genes. Not all patients with EOAD have autosomal dominant FAD, which accounts for less than 1% of all AD.1 The first patient diagnosed with AD by Alois Alzheimer was called Auguste Deter; she was admitted to a psychiatric unit because of amnesia and hallucinations at the age of 51 years.2 Deoxyribonucleic acid was extracted from a histological section of Auguste Deter’s brain and 100 years later a heterozygous mutation p.Phe176Leu was discovered in the PSEN1 gene.2 Unlike reports of FAD in the western population, little has been written about this condition in the Chinese population.1
 
Apart from the difference in age of onset (AOO), EOAD shows a number of differences in clinical features when compared with LOAD. Patients with EOAD often have a non-amnestic presentation with visuospatial dysfunction and apraxias; neuropsychologically they exhibit dysexecutive function, and poor visuospatial and motor skills.3 Structural imaging also reveals that patients with EOAD exhibit more frontal or temporoparietal atrophy rather than the hippocampal atrophy seen in patients with LOAD.3 Patients with EOAD exhibit more hypometabolism in the temporoparietal cortex while those with LOAD exhibit more hypometabolism over the medial temporal lobe.3 Our previous systematic review revealed that FAD patients can present with atypical clinical features including myoclonus, seizures, cerebellar dysfunction, spastic paraparesis, and neuropsychiatric manifestations.1 These factors may contribute to under-recognition of EOAD or FAD among local Chinese population.
 
Diagnosis of FAD is clinically important for the affected family. Genetic counselling may be offered to potential asymptomatic carriers if desired, as they may benefit from prenatal diagnosis and planning of personal affairs.4 Identification of asymptomatic carriers can also identify potential candidates for future drug trials of disease-modifying agents. With respect to preventive therapies, two clinical trials—the DIAN-TU (Dominantly Inherited Alzheimer Network Trial Unit) and API (Alzheimer’s Prevention Initiative)—are ongoing to test the efficacy of passive immunotherapy among normal or mildly symptomatic FAD mutation carriers.5 6 Thus, it is important to enhance local doctors’ knowledge of FAD.
 
The objectives of this study were to report the clinical features of the first case series of Chinese FAD patients in Hong Kong, and to compare their clinical features with those of biomarker-confirmed sporadic LOAD patients. We hypothesised that patients with FAD had more atypical clinical features, and that this could contribute to a delay in correct diagnosis.
 
Methods
Patients with familial Alzheimer’s disease
This was a retrospective case series of FAD patients diagnosed between January 1998 and December 2016 in the memory clinic of Queen Mary Hospital, Hong Kong. The FAD patients were identified by reviewing the case records of all patients diagnosed with EOAD during the study period. The study was performed in accordance with the principles outlined in the Declaration of Helsinki. All symptomatic FAD patients with confirmed mutations in PSEN1 or APP genes were included. To date, no FAD family with PSEN2 has been identified in Hong Kong. All these patients are pure Chinese. Detailed histories were obtained from primary caregivers. All patients underwent a physical examination, laboratory blood tests (including vitamin B12, folic acid, and thyroid function), computed tomography (CT) or magnetic resonance imaging (MRI) of the brain, and completed the Cantonese version of Mini-Mental State Examination (MMSE).7 These patients fulfilled the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) diagnostic criteria of AD.8 In this study, AOO was defined as the age at first appearance of symptoms that interfered with social or occupational functioning. Age of correct diagnosis (AOCD) was defined as the age at which diagnosis of FAD was confirmed with genetic mutation. Duration of symptoms was defined as the difference between AOO and AOCD in years. Initial presenting cognitive symptoms and behavioural and psychological symptoms of dementia (BPSD) according to the Neuropsychiatric Inventory (NPI) were specifically collected from primary caregivers and were immediately recorded in the medical records.9 Of note, BPSD—including delusions, hallucinations, agitation, dysphoria, anxiety, euphoria, apathy, disinhibition, irritability, and aberrant motor behaviour—were recorded as binary variables (ie present or absent) as not all NPI scores could be retrieved.9 Authors (SYF and LSC), who were blinded to the hypothesis, retrieved the information related to initial presenting cognitive symptoms and BPSD.
 
Selection of patients
In summary, three families among whom eight patients were affected were included in this case series. Two families have been reported previously.10 11 For reference purposes, there were 18 patients with EOAD and no positive family history during the study period.
 
Two patients with familial Alzheimer’s disease
This family has not been reported in detail previously. The family was referred to our memory clinic more than 10 years ago (Fig). The first case (II3; patient No. 5) was a 52-year-old woman who complained of progressive short-term memory impairment with impaired daily function, occupational performance, and management of personal finances. Her father (I1) and eldest brother (II1) had been diagnosed with dementia at around 50 years of age by doctors in China. As a result of these symptoms, her husband had divorced her, and she received care from her friend. Single-photon emission CT of the brain showed bilateral hypoperfusion over the frontal and temporoparietal lobes. She consented to genetic testing and gene sequencing for known FAD mutations, which was subsequently performed by The Tanz Centre for Research in Neurodegenerative Diseases, University of Toronto. A heterozygous missense mutation p.His163Arg in the PSEN1 gene was detected. She received rivastigmine treatment. Five years later, because of her poor drug compliance and impaired ability to carry out cooking and housework, arrangements were made for her to live in an elderly care home. Another patient (II4; patient No. 4) was her 46-year-old brother who was diagnosed with dementia by another hospital. He also consented to have genetic testing. The same heterozygous missense mutation was found.
 

Figure. Pedigree of the FAD family of patients No.4 and 5
 
Late-onset Alzheimer’s disease with biomarker confirmation
Late-onset AD is defined as AD with AOO that occurs at or after the age of 65 years. During the study period, 12 patients with LOAD underwent cerebrospinal fluid (CSF) examination that revealed an AD pattern of CSF biomarkers (ie low amyloid-beta [Aβ42], and elevated total tau and phosphorylated tau [pTau]) within 1 month of clinical assessment.12 In addition, 14 patients with LOAD underwent 11C-Pittsburgh compound B (PIB) and 18F-2-fluoro-2-deoxy-d-glucose (FDG) positron emission tomography (PET) within 3 months of clinical assessment. Bilateral temporoparietal hypometabolism was evident on 18FDG PET and positive amyloid loading on 11C-PIB (ie binding occurred in more than one cortical brain region: frontal, parietal, temporal, or occipital).13 Clinically, these patients also fulfilled the NINCDS-ADRDA criteria for AD. These patients had no history of stroke and their CT or MRI brain showed no evidence of infarcts or extensive white matter changes. For these 26 patients with LOAD, similar clinical information including basic demographics, AOO, AOCD (based on the availability of biomarkers’ results), disease duration, Cantonese version of MMSE, initial presenting cortical symptoms, and BPSD was collected. For reference purposes, there were 2480 patients with LOAD without CSF biomarkers or FDG and PIB-PET examination during the same period of time.
 
Statistical analysis
Parametric variables are expressed as mean ± standard deviation (SD). Non-parametric variables are expressed as median with interquartile range (IQR). Chi squared test or Fisher’s exact test were used to compare categorical variables. Independent sample t test or Mann-Whitney U test was used to compare continuous variables when appropriate. Statistical significance was inferred by a two-tailed P value of <0.05. All statistical analyses were performed using the SPSS (Windows version 18.0; SPSS Inc, Chicago [IL], US).
 
Results
Case series of familial Alzheimer’s disease
There were three affected families with eight affected patients. Their clinical features are summarised in Table 1. The mean (± SD) AOO and MMSE score were 48.4 ± 7.7 years and 7.9 ± 9.2, respectively. The mean duration of symptoms before genetic diagnosis was 10.1 ± 7.1 years. Patients 1 and 3 were initially misdiagnosed with depression and Parkinson’s disease with dementia, respectively. The three most common presenting cognitive symptoms were amnesia (75%), disorientation (63%), and anomia (38%).
 

Table 1. A summary of all FAD patients attending the memory clinic of Queen Mary Hospital from January 1998 and December 2016
 
Comparison with late-onset Alzheimer’s disease
The comparison of demographics between FAD and LOAD patients is summarised in Table 2. The AOO and AOCD were much earlier for FAD than LOAD patients (48.4 ± 7.7 vs 77.9 ± 6.7 years and 57.9 ± 8.2 vs 80.7 ± 6.2 years; both P<0.001). The duration of symptoms was much longer for FAD patients than LOAD patients (median [IQR]: 7.5 [5.3-14.5] vs 2.0 [1.0-3.3] years; P<0.001). Patients with FAD had a lower presenting MMSE score than those with LOAD (7.9 ± 9.2 vs 17.6 ± 7.2; P=0.01). More patients with FAD had been educated to secondary level or above than LOAD patients (P=0.001).
 

Table 2. Comparison of clinical features between FAD and sporadic LOAD
 
The comparison of cognitive symptoms and BPSD between FAD and LOAD patients is summarised in Tables 3 and 4, respectively. There was a trend wherein patients with FAD were less likely to present with amnesia (75% vs 100%; P=0.05) than those with LOAD although it was still their main presenting cognitive symptom. Patients with LOAD more commonly presented with delusion, dysphoria, and irritability than FAD patients (0% vs 41.7%, 0% vs 50%, and 0% vs 54.2% respectively; P=0.04, 0.01, and 0.01, respectively).
 

Table 3. Comparison of cognitive symptoms between FAD and sporadic LOAD
 

Table 4. Comparison of BPSD between FAD and sporadic LOAD
 
Discussion
In this case series, there was significant delay in making a correct diagnosis of FAD among patients who presented at a late stage of dementia compared with patients with LOAD. Patients with LOAD more often presented with BPSD such as delusion, dysphoria, and irritability.
 
There are several factors that contribute to the delay in diagnosis and thus the late presentation of FAD patients to the memory clinic. First, the availability of genetic tests is not well known to local doctors. Currently doctors in public hospitals can consult with a clinical biochemist if they encounter a family with at least two generations having EOAD. Genetic tests can be arranged for PSEN1, APP, and PSEN2 sequentially. Second, patients with FAD may have atypical clinical features. In our case series, two patients were initially misdiagnosed as depression and Parkinson’s disease with dementia. Our previous systematic review indicated that patients with FAD and PSEN1 mutations can present with parkinsonism, seizures, spastic paraparesis, myoclonus, and cerebellar dysfunction.1 Chinese FAD patients with an APP mutation can present with atypical phenotypes with a prominent psychiatric manifestation, behavioural and language variants.1 Patients with FAD with a PSEN2 mutation can present with a later AOO even within the same family.1 It is important for local doctors to be aware of the possibility of these atypical clinical features in their EOAD patients, especially if there is a positive family history of EOAD. Third, since FAD is not treatable, genetic testing may not be considered. Nonetheless, genetic counselling is important for patients with FAD. Asymptomatic carriers are also potentially valuable for future clinical trials.4 5 6
 
In terms of cognitive symptoms, patients with FAD tended to present slightly less frequently with amnesia than those with LOAD, although amnesia remained their main presenting cognitive symptom. This is in agreement with previous studies that reported EOAD patients to have more prominent frontoparietal dysfunction than medial temporal dysfunction.3 14 15 Our study also identified that LOAD patients have more positive symptoms of BPSD including delusions and irritability. Table 5 summarises the differences in BPSD between FAD and LOAD patients in our study and in other reported studies between EOAD and LOAD patients.16 17 18
 

Table 5. A comparison between our study findings on BPSD with other studies comparing EOAD and LOAD patients 16 17 18
 
There are several potential reasons for the different clinical features between FAD and LOAD patients. First, patients with FAD have a genetic mutation that increases the production of Aβ42 from early on in life. This explains the much earlier AOO.19 Second, pathological studies of the brain of FAD patients seldom noted non-AD pathological changes. On the contrary, 42% of LOAD patients exhibited at least one other concurrent clinicopathological diagnosis such as vascular dementia, dementia with Lewy bodies, hippocampal sclerosis, or Pick’s disease.20 Third, amyloid plaques in LOAD patients are mostly compact or diffuse while those in FAD patients exhibit various morphologies associated with the specific PSEN mutation.20 Fourth, PSEN 1 and 2 form the catalytic subunit of γ-secretase and apart from amyloid beta precursor protein, there are over 90 other substrates upon which γ-secretase can act; this may explain the wide range of phenotypes for FAD patients with PSEN mutations.20
 
The strength of the study is that the diagnoses of FAD and LOAD were supported by genetic analyses and imaging/CSF biomarkers, respectively. There are a number of limitations in our study. First, FAD accounted for only a minority of EOAD cases and thus our results may not be generalised to sporadic EOAD patients. Second, the severity of BPSD could not be compared. In future, NPI scores should be compared. In addition, detailed neuropsychological tests were not performed because of the busy clinical setting in our memory clinic. Third, the sample size is small and our results must be treated as preliminary. Fourth, the presence or absence of symptoms depends on the recall of primary caregivers and is subject to recall bias. Fifth, apolipoprotein E status is an important genetic contributor to LOAD but it was not checked in all LOAD patients in this study.4 Despite these limitations, this is the first local study in Hong Kong to compare Chinese FAD and LOAD patients.
 
In summary, there are differences in clinical features between patients with FAD, who receive a correct diagnosis much later, and patients with LOAD. Promotion of public awareness of FAD in Hong Kong is much needed to help those families that are affected but not yet identified.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Shea YF, Chu LW, Chan AO, Ha J, Li Y, Song YQ. A systematic review of familial Alzheimer’s disease: differences in presentation of clinical features among three mutated genes and potential ethnic differences. J Formos Med Assoc 2016;115:67-75. Crossref
2. Müller U, Winter P, Graeber MB. A presenilin 1 mutation in the first case of Alzheimer’s disease. Lancet Neurol 2013;12:129-30. Crossref
3. Tellechea P, Pujol N, Esteve-Belloch P, et al. Early- and late-onset Alzheimer disease: are they the same entity [in English, Spanish]? Neurologia 2015;pii:S0213-4853(15)00210-8.
4. Bird TD. Genetic aspects of Alzheimer disease. Genet Med 2008;10:231-9. Crossref
5. Bateman RJ, Benzinger TL, Berry S, et al. The DIAN-TU Next Generation Alzheimer’s prevention trial: adaptive design and disease progression model. Alzheimers Dement 2017;13:8-19. Crossref
6. Reiman EM, Langbaum JB, Fleisher AS, et al. Alzheimer’s Prevention Initiative: a plan to accelerate the evaluation of presymptomatic treatments. J Alzheimers Dis 2011;26 Suppl 3:321-9.
7. Chiu FK, Lee HC, Chung WS, Kwong PK. Reliability and validity of the Cantonese version of the Mini-Mental State Examination—A preliminary study. J Hong Kong Coll Psychiatr 1994;4:25S-28S.
8. McKhann GM, Knopman DS, Chertkow H, et al. The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement 2011;7:263-9. Crossref
9. Cummings JL, Mega M, Gray K, Rosenberg-Thompson S, Carusi DA, Gornbein J. The Neuropsychiatric Inventory: comprehensive assessment of psychopathology in dementia. Neurology 1994;44:2308-14. Crossref
10. Shea YF, Chan AO, Chu LW, et al. Novel presenilin 1 mutation (p.F386I) in a Chinese family with early-onset Alzheimer’s disease. Neurobiol Aging 2017;50:168.e9-11.
11. Shea YF, Chu LW, Chan AO, Kwan JS. Delayed diagnosis of an old Chinese woman with familial Alzheimer’s disease. J Formos Med Assoc 2015;114:1020-1. Crossref
12. Shea YF, Chu LW, Zhou L, et al. Cerebrospinal fluid biomarkers of Alzheimer’s disease in Chinese patients: a pilot study. Am J Alzheimers Dis Other Demen 2013;28:769-75. Crossref
13. Shea YF, Ha J, Lee SC, Chu LW. Impact of (18)FDG PET and (11)C-PIB PET brain imaging on the diagnosis of Alzheimer’s disease and other dementias in a regional memory clinic in Hong Kong. Hong Kong Med J 2016;22:327-33. Crossref
14. Cavedo E, Pievani M, Boccardi M, et al. Medial temporal atrophy in early and late-onset Alzheimer’s disease. Neurobiol Aging 2014;35:2004-12. Crossref
15. Kaiser NC, Melrose RJ, Liu C, et al. Neuropsychological and neuroimaging markers in early versus late-onset Alzheimer’s disease. Am J Alzheimers Dis Other Demen 2012;27:520-9. Crossref
16. Mushtaq R, Pinto C, Tarfarosh SF, et al. A comparison of the Behavioral and Psychological Symptoms of Dementia (BPSD) in early-onset and late-onset Alzheimer’s disease—a study from South East Asia (Kashmir, India). Cureus 2016;8:e625.
17. Park HK, Choi SH, Park SA, et al. Cognitive profiles and neuropsychiatric symptoms in Korean early-onset Alzheimer’s disease patients: a CREDOS study. J Alzheimers Dis 2015;44:661-73.
18. Toyota Y, Ikeda M, Shinagawa S, et al. Comparison of behavioral and psychological symptoms in early-onset and late-onset Alzheimer’s disease. Int J Geriatr Psychiatry 2007;22:896-901. Crossref
19. Cacace R, Sleegers K, Van Broeckhoven C. Molecular genetics of early-onset Alzheimer’s disease revisited. Alzheimers Dement 2016;12:733-48. Crossref
20. Roher AE, Maarouf CL, Kokjohn TA. Familial presenilin mutations and sporadic Alzheimer’s disease pathology: is the assumption of biochemical equivalence justified? J Alzheimers Dis 2016;50:645-58.

Clinical utility of late-night and post-overnight dexamethasone suppression salivary cortisone for the investigation of Cushing’s syndrome

Hong Kong Med J 2017 Dec;23(6):570–8 | Epub 13 Oct 2017
DOI: 10.12809/hkmj176240
© Hong Kong Academy of Medicine. CC BY-NC-ND 4.0
 
ORIGINAL ARTICLE
Clinical utility of late-night and post-overnight dexamethasone suppression salivary cortisone for the investigation of Cushing’s syndrome
CM Ng, FRCP (Edin), FHKAM (Medicine)1; TK Lam, MB, BS, MRCP1; YC Au Yeung, MRCP, FHKAM (Medicine)1; CH Choi, FRCP (Edin), FHKAM (Medicine)1; YP Iu, BSc, MSc2; CC Shek, MB, BS, FHKAM (Pathology)2; SC Tiu, MD, FHKAM (Medicine)1
1 Department of Medicine, Queen Elizabeth Hospital, Jordan, Hong Kong
2 Department of Pathology, Queen Elizabeth Hospital, Jordan, Hong Kong
 
Corresponding author: Dr CM Ng (ngcm2@ha.org.hk)
 
 Full paper in PDF
 
Abstract
Introduction: There is a pressing need to identify diagnostic testing for Cushing’s syndrome that can be achieved with ease and at low cost. This study aimed to explore the usefulness of late-night and post-overnight 1-mg dexamethasone suppression salivary cortisone, as measured by liquid chromatography–tandem mass spectrometry, for investigation of hypercortisolism.
 
Methods: Salivary cortisone data of subjects were investigated according to a pre-specified protocol. Subjects were classified as having ‘hypercortisolism’ or ‘eucortisolism’ on the basis of histological or biochemical criteria. Receiver operating characteristic curves were drawn to identify the cut-off values and study their performance characteristics. We measured 24-hour urinary free cortisol; late-night salivary cortisol and cortisone; and post-overnight 1-mg dexamethasone suppression serum cortisol, and salivary cortisol and cortisone. Saliva and urine samples were assayed by liquid chromatography–tandem mass spectrometry.
 
Results: In this study, 21 subjects were classified as having hypercortisolism and 78 as having eucortisolism. A late-night salivary cortisone cut-off of 13.50 nmol/L had a sensitivity of 94.7% and a specificity of 87.2%. After taking 1-mg dexamethasone the night before, a salivary cortisol cut-off of 0.85 nmol/L had a sensitivity of 76.2% and a specificity of 96.2%; a salivary cortisone cut-off of 7.45 nmol/L had a sensitivity of 85.7% and a specificity of 94.9%, while a salivary cortisone cut-off of 3.25 nmol/L had a sensitivity of 95.2% and a specificity of 79.5%. Many salivary cortisol samples were below the detection limit of liquid chromatography–tandem mass spectrometry. In comparison with salivary cortisol, salivary cortisone had a better correlation with total serum cortisol and better diagnostic performance following dexamethasone suppression.
 
Conclusions: Both late-night and post-overnight dexamethasone suppression salivary cortisone levels are of diagnostic value in the investigation of hypercortisolism.
 
 
New knowledge added by this study
  • Compared with salivary cortisol, salivary cortisone has better diagnostic performance after dexamethasone suppression.
  • Both late-night and post-overnight dexamethasone suppression salivary cortisone levels are of diagnostic value in the investigation of hypercortisolism.
Implications for clinical practice or policy
  • Using the cut-off value generated from this study, late-night and post-overnight dexamethasone suppression salivary cortisone, measured by liquid chromatography–tandem mass spectrometry, can be added to the panel of diagnostic tests for hypercortisolism.
 
 
Introduction
The diagnosis of Cushing’s syndrome, especially when hypersecretion is mild, is plagued by uncertainties. Most clinical features—such as diabetes mellitus, hypertension, obesity, hyperlipidaemia, osteoporosis, or depression—are non-specific and highly prevalent in the general population. More specific features such as myopathy or easy bruising may be absent even in subjects with florid biochemical hypercortisolism. In addition, no single test can diagnose Cushing’s syndrome with 100% sensitivity and specificity.1 It is a common phenomenon that tests for hypercortisolism—for examples, 24-hour urinary free cortisol (UFC), late-night salivary cortisol (SalFLN) or serum cortisol level after 1-mg overnight dexamethasone suppression test (SerFDex)—often produce discordant results. Each test has its own caveats, affected by the level of binding proteins, completeness of urine collection, and absorption and metabolism of dexamethasone. In recent years, increased awareness of the metabolic and cardiovascular consequences of Cushing’s syndrome2 has led to a pressing need to identify tests that are easy to perform and can provide useful information at a low cost.
 
Collection of saliva samples is non-invasive and stress-free.3 4 Salivary cortisol is not affected by the levels of binding proteins so it provides a reliable indication of the biologically active free serum cortisol level.5 Significant advances have been made with the use of salivary cortisol in the investigation of hypercortisolism.3 6 7 8 9 The availability of liquid chromatography–tandem mass spectrometry (LC-MS/MS) also enables the measurement of other glucocorticoid analytes. Among these and of particular interest is cortisone that is present in the saliva at a higher concentration—the salivary cortisone–to–cortisol ratio is up to 6-8:110 11 12—due to conversion of cortisol to cortisone by the 11 beta-hydroxysteroid dehydrogenase 2 (11β-HSD2) enzyme in the salivary glands.13 This higher concentration makes it more detectable than salivary cortisol.10 Salivary cortisone has been shown by some investigators to have a better and more linear relationship with serum total cortisol14 and serum free cortisol10 than salivary cortisol.
 
In this study, we reviewed the late-night salivary cortisone (SalELN) and post-overnight dexamethasone suppression salivary cortisone (SalEDex) values of subjects being investigated for hypercortisolism in our centre, in an attempt to define cut-off values with reasonable sensitivity and specificity.
 
Methods
Subjects
All subjects referred to the Endocrine Unit of Queen Elizabeth Hospital in Hong Kong for suspected endogenous hypercortisolism were evaluated according to a pre-specified protocol. Written informed consent was obtained from all subjects. The results of subjects investigated during May 2013 (when salivary measurement by LC-MS/MS became available) to September 2016 were reviewed. This study was approved by the Hospital Ethics Committee. No patient was receiving medical treatment for Cushing’s syndrome at the time of assessment. Subjects who were taking medication (such as rifampicin, phenytoin, phenobarbital, and alcohol) that might interfere with dexamethasone metabolism, or were night or shift workers, were excluded.
 
Investigations performed
Detailed oral and written instructions were given to all subjects by an endocrine specialist nurse. For the collection of saliva sample, subjects were instructed to refrain from smoking, brushing teeth, and eating or drinking anything but water for at least 30 minutes before collection. Saliva samples were collected using a cotton swab in Salivette tubes (Sarstedt, Nümbrecht, Germany). Salivettes were kept at 4°C in a home refrigerator and sent to the laboratory within 24 hours.
 
According to the pre-specified protocol, on day 1, a 24-hour urine sample was collected for UFC measurement. On day 2, a 0900h saliva sample was collected at the Endocrine Centre, under nurse supervision. The patient was then instructed to collect a late-night (between 2300h and 2400h) saliva sample that evening, after which he/she was to take dexamethasone 1 mg orally. On day 3, subjects returned to the Endocrine Centre for a simultaneous blood and saliva sample at 0900h. The blood sample was sent for serum cortisol assay. Saliva samples were assayed for both cortisol and cortisone.
 
Laboratory assays
Serum cortisol was measured by competitive chemiluminescent microparticle immunoassay using the Abbott ARCHITECT i2000SR system (Abbott Diagnostics, Illinois, US). The coefficient of variation of the assay for serum cortisol was 4.0%-6.2% at low levels and 3.3%-4.3% at high levels. Salivary cortisol and salivary cortisone were measured by LC-MS/MS using the Waters Xevo TQ MS system (Waters Corporation, Milford [MA], US). Cortisol and cortisone were extracted from saliva using the organic solvent methyl tert-butyl ether after addition of a deuterium-labelled internal standard mixture of cortisol-d4 and cortisone-d8 (CDN isotopes). The organic supernatant was dried under nitrogen at a temperature below 45°C and dissolved in the initial mobile phase for LC-MS/MS analysis. The steroid analytes were separated from the matrix background in a reversed-phase chromatography that employed a sub-2 μm analytical column (ACQUITY UPLC HSS T3 Column, 2.1 x 100 mm, 1.8 mm; Waters Corporation, Milford [MA], US) and a 6-minute elution method consisting of a gradient mixture of 0.1% glacial acetic acid, 0.2 mM ammonium acetate in water and methanol. Negative electrospray mode was used for analyte ionisation. The charged acetate adducts were monitored by multiple reaction monitoring mode with two stable mass transitions for cortisol (421>331; 421>297) and cortisone (419>329; 419>301) and one multiple reaction monitoring for each of the corresponding deuterated internal standards (cortisol-d4: 425>335; cortisone-d8: 427>337). Quantitative measurement was derived using the linear least squares regression method with origin excluded and 1/x weighting for better accuracy at a low concentration level. The coefficient of variation of the assay for salivary cortisol and cortisone was 5%-7% and 7%-10%, respectively across the analyte reporting ranges up to 250 nmol/L. The lower limit of detection was 0.5 nmol/L for both salivary cortisol and cortisone. Urinary cortisol was also measured by LC-MS/MS. Adrenocorticotropic hormone (ACTH) was measured by Immulite 2000 XPi (Siemens Healthcare GmbH, Erlangen, Germany) chemiluminescent immunometric assay. The upper reference limit of ACTH was 10.2 pmol/L.
 
Definition of hypercortisolism
Subjects were classified as having hypercortisolism if either the biochemical or the histological criterion was fulfilled. The biochemical criterion was defined as having an abnormal value in at least two of the following three tests: (1) SerFDex >138 nmol/L, or >50 nmol/L in the context of adrenal incidentaloma15; (2) UFC >157 nmol/day; and (3) SalFLN ≥3 nmol/L. The categorisation of hypercortisolism was made without knowledge (ie blinded) of the three outcome parameters being evaluated for diagnostic accuracy, namely SalELN, post-overnight dexamethasone suppression salivary cortisol (SalFDex), and SalEDex. The reference range for UFC in our laboratory, established locally from the 2.5th to 97.5th percentile of 112 healthy adults, was 22-157 nmol/day. The reference level for SalFLN in our laboratory, derived from the 97.5th percentile of 61 normal individuals, was <3 nmol/L. The histological criterion was defined as histological proof and postoperative improvement in biochemical and clinical parameters. Subjects not fulfilling either of these criteria were classified as having eucortisolism.
 
Statistical analyses
For calculation purposes, results below the detection limit of the assay were set to the lowest detection value. Continuous data were expressed as mean ± standard deviation if parametric, and median (range) if non-parametric. Chi squared test was used to detect any difference between categorical data. Unpaired t test was used to compare continuous variables, and Mann-Whitney test was used for non-parametric data. A P value of <0.05 was considered statistically significant. Correlation between serum and salivary values were performed using Pearson correlation coefficients.
 
For estimation of the optimal diagnostic cut-off value, receiver operating characteristic (ROC) curves were drawn using data from the subjects classified as hypercortisolism or eucortisolism. The optimal cut-off was chosen where the Youden’s index (sensitivity + specificity–1) was maximal. The test performance characteristics were calculated to assess their utility. The quality of diagnostic tests was expressed as the area under ROC curve (AUC). For sample size requirement estimation, for an estimated AUC of 0.8, a minimum of nine positive cases was required.16 Statistical analysis was performed using the Statistical Package for Social Science (Windows version 20.0; IBM Corp, Armonk [NY], US).
 
Results
A total of 115 subjects were referred to our Endocrine Clinic for assessment of hypercortisolism during the study period. Of them, 14 subjects with a history of transsphenoidal surgery or adrenalectomy who had been referred for postoperative assessment of endocrine function were excluded. One patient with ongoing investigations and pending re-evaluation and another with end-stage renal failure were also excluded. No patient was taking exogenous steroids. All subjects had normal renal and liver function tests.
 
A total of 115 sets of biochemical investigations were performed in 99 subjects (40 males, 59 females; mean age, 55.3 ± 14.3 years; range, 19-81 years). The primary indications for testing were adrenal incidentaloma in 52 subjects, suspected secondary hypertension or diabetes mellitus in 25, Cushingoid features in 21, and pituitary mass in one. Eleven subjects had two or more sets of investigations performed (two patients had 4 sets, one patient had 3 sets, and eight patients had 2 sets). For these subjects, only the data set with the highest SerFDex was chosen for analysis. In two subjects, the volume of the late-night salivary sample was inadequate for measurement.
 
In this study, 21 subjects were found to have hypercortisolism according to the above criteria—20 subjects met the biochemical criterion; eight subjects met the histological criterion, including one whose set of tests did not fulfil the biochemical criterion (SerFDex 135 nmol/L; normal UFC and SalFLN, ACTH 1.1 pmol/L) and who underwent surgery because of an adrenal adenoma that enlarged from 1.6 cm to 2.5 cm over 2 years, postoperative spot cortisol was <28 nmol/L and hydrocortisone replacement was required for 6 months before axis recovery. Among the 21 subjects who had hypercortisolism, 7 had adrenal Cushing’s, 4 pituitary Cushing’s, 3 ectopic ACTH syndrome, 1 adrenocortical carcinoma, and 6 subclinical Cushing’s. Of these subjects, 14 (67%) had elevated UFC, 18 (86%) had non-suppressed SerFDex, and 17 (81%) had elevated SalFLN. Among the eight subjects with histological proof (6 adrenalectomies, 1 transsphenoidal surgery, 1 enucleation of pancreatic neuroendocrine tumour), all had clinical and biochemical improvement after operation. Eucortisolism was diagnosed in 78 subjects according to the aforementioned criteria.
 
The baseline characteristics of the subjects are shown in Table 1. There were no statistically significant differences between the hypercortisolism and the eucortisolism groups with respect to age, gender, and prevalence of obesity, diabetes mellitus, or hypertension. There was a statistically significantly higher prevalence of Cushingoid features and of proximal myopathy in the hypercortisolism group.
 

Table 1. Baseline characteristics of study subjects
 
 
The biochemical test results are shown in Table 2. The hypercortisolism group had statistically significantly higher levels of SerFDex, UFC, SalFLN, SalELN, SalFDex, and SalEDex. No SalFLN sample in the hypercortisolism group and 17 (21.8%) SalFLN samples in the eucortisolism group had levels below the detection limit of 0.5 nmol/L. No SalELN sample in the hypercortisolism group had levels below the detection limit of 0.5 nmol/L. Four (19.0%) SalFDex samples in the hypercortisolism group and 68 (87.2%) SalFDex samples in the eucortisolism group had a level below the detection limit of 0.5 nmol/L. One (1.3%) SalEDex sample in the eucortisolism group had a level below the detection limit of 0.5 nmol/L.
 

Table 2. Biochemical test results of study subjects
 
The ROC analysis (Fig 1) and Table 3 reveal that the optimal cut-off for SalELN was 13.50 nmol/L. Setting the specificity at a level of 95%, the cut-off for SalELN would be 20.50 nmol/L.
 

Figure 1. Receiver operating characteristic curves for late-night salivary cortisone, and post-overnight 1-mg dexamethasone suppression salivary cortisol and cortisone
 

Table 3. Performance characteristics of late-night salivary cortisone, and post-overnight 1-mg dexamethasone suppression salivary cortisol and cortisone
 
After overnight 1-mg dexamethasone suppression, the optimal cut-off for SalFDex was 0.85 nmol/L (Table 3). Nonetheless, these values should be interpreted with caution, since many SalFDex values in both the eucortisolism (87.2%) and hypercortisolism (19.0%) groups were below the detection limit of the LC-MS/MS assay and were consequently presumed to be equivalent to the lowest detection limit of 0.5 nmol/L.
 
After 1-mg overnight dexamethasone suppression, the optimal cut-off for SalEDex was 7.45 nmol/L. Setting the sensitivity at a level of 95%, the cut-off for SalEDex would be 3.25 nmol/L (Table 3).
 
The correlation between 0900h serum cortisol and salivary cortisol was 0.81 (P<0.01); and that between 0900h serum cortisol and salivary cortisone was 0.88 (P<0.01). The correlation between SerFDex and SalFDex was 0.90 (P<0.01); and that between SerFDex and SalEDex was 0.94 (P<0.01) [Fig 2].
 

Figure 2. Correlation plots between 0900h serum cortisol versus (a) salivary cortisol and (b) salivary cortisone; and between post-dexamethasone serum cortisol versus (c) salivary cortisol and (d) salivary cortisone
 
Discussion
All investigators who study Cushing’s syndrome are confronted with the conundrum of accurately diagnosing or excluding the condition with no gold standard test.1 In our study, in addition to the histological criterion, we considered it appropriate to include a set of biochemical criteria in which subjects with two positive results among the three commonly used tests—namely the SerFDex, the UFC, and the SalFLN—were considered to have hypercortisolism. This is in agreement with the Endocrine Society Clinical Practice Guideline17 that recommends performing one or two other tests if one of these is abnormal; and if results from two different tests are concordant, to proceed with investigations to establish the cause of Cushing’s syndrome. One well-recognised contentious point in the interpretation of the SerFDex is the optimal cut-off: <140 nmol/L is a widely cited normal response, but can lead to false-negative results in up to 15% of subjects with Cushing’s syndrome.18 19 The more stringent cut-off of <50 nmol/L sacrifices specificity for sensitivity.20 21 In this study, we adopted a double cut-off as proposed by the European Society of Endocrinology Clinical Practice Guideline in collaboration with the European Network for the Study of Adrenal Tumors15; the rationale being that a more sensitive cut-off should be employed in those with a higher pretest probability of Cushing’s syndrome, such as the presence of an adrenal adenoma on imaging studies.22 A more specific cut-off can be employed in general to avoid overdiagnosis.
 
The loss of circadian rhythm with absence of a late-night cortisol nadir is a well-established feature of Cushing’s syndrome. Midnight serum cortisol is, however, difficult to obtain. When SalFLN was shown to correlate well with serum cortisol levels, with sensitivity of 92%-100% and specificity of 93%-100% for the diagnosis of Cushing’s syndrome,17 it rapidly became one of the most popular tests in investigating endogenous hypercortisolism. In view of the theoretical advantages of salivary cortisone, we also attempted to explore the performance characteristics of SalELN. Our data showed that it had a good sensitivity of 94.7% and a specificity of 87.2% at the cut-off of 13.50 nmol/L, as measured by LC-MS/MS.
 
We could not compare the utility of SalFLN with cortisone in this study, since SalFLN was one of the criteria applied to define Cushing’s syndrome. Simultaneous measurement of salivary cortisol and salivary cortisone can nonetheless alert clinicians to certain caveats encountered when measuring salivary cortisol alone. When the salivary cortisol-to-cortisone ratio is exceptionally high, direct contamination of the oral sample by topical or oral hydrocortisone must be excluded. Ingestion of glycyrrhetinic acid (eg in licorice, carbenoxolone), which competitively inhibits 11β-HSD2, or rare cases of genetic 11β-HSD2 defect, can also lead to the same anomaly.
 
A number of other investigators have explored the utility of SalFDex in the diagnosis of Cushing’s syndrome. Apart from its convenience, salivary values are not affected by conditions that affect corticosteroid-binding globulin (CBG) or albumin levels, such as acute and chronic illness, pregnancy or oestrogen treatment, or genetic variants of CBG. A sensitivity varying between 97% and 100% and a specificity between 77% and 100% have been variously reported, with cut-off level varying between 1.7 nmol/L and 2 nmol/L.6 7 8 9 In the current study, we found that the sensitivity of SalFDex was only 76.2% at the optimal cut-off of 0.85 nmol/L. Although this discrepancy with other studies might be due to a number of factors, such as the means of defining normal ranges and the criteria for diagnosing Cushing’s syndrome, one important factor that is evident from our data is the method used for assaying salivary cortisol: some used electrochemiluminescence assay,9 others used radioimmunoassay6 7 8; but we measured SalFDex with LC-MS/MS.12 Unlike immunoassays, LC-MS/MS measurement of analytes is more specific, with less cross-reactivity among different cortisol precursors and metabolites.23 The concentration of SalFDex was very low: 19% of our patients with hypercortisolism and 87% of those with eucortisolism had SalFDex below the detection limit of 0.5 nmol/L, leading to uncertainty in establishing the cut-off, since all those with results of <0.5 nmol/L could only be considered to have salivary cortisol level equal to 0.5 nmol/L in the analysis. Immunoassays, by measuring other cortisol precursors or metabolites in varying degrees in addition, could have bypassed this problem. Other studies have also reported that SalFLN has poorer diagnostic performance characteristics if measured by LC-MS/MS, in comparison with the less-specific immunoassays such as chemiluminescent assays or radioimmunoassays.24 25
 
Nevertheless, LC-MS/MS is analytically more superior and is expected to become the steroid assay of choice in the future.26 Values generated by studies using LC-MS/MS have greater inter-centre and long-term generalisability in view of the lack of assay-specific steroid cross-reactivity. Adoption of cut-offs generated by studies in which salivary cortisol was assayed using antibody-based methods into clinical practice is known to be problematic.27 Individual centres are often advised to generate their own references and cut-offs although this is often not feasible. In a meta-analysis on the use of SalFLN for investigation of Cushing’s syndrome,25 the recommended cut-offs varied widely, from 3.59 to 15.17 nmol/L. To overcome the problem of lower performance characteristics due to low levels of salivary cortisol, instead of going for immunoassays, a better solution may be to measure salivary cortisone that is present in a much higher concentration than salivary cortisol. At a serum cortisol below 74 nmol/L, Debono et al28 showed that salivary cortisol could become undetectable by LC-MS/MS, while salivary cortisone was always detected. Similarly, our data showed that after dexamethasone suppression, when the salivary cortisol became too low to be measured with LC-MS/MS in many subjects, salivary cortisone could still be measured in all but one subject in the eucortisolism group.
 
Between salivary cortisol and salivary cortisone, this study showed that salivary cortisone would be the preferred test because it is present at a higher concentration in the saliva; and at comparable specificity levels, SalEDex appears to have better accuracy (as reflected by the higher AUC of the ROC curves), sensitivity, and negative LR than SalFDex.
 
Apart from the optimal cut-off, clinically it is often useful to have two cut-offs, one for ruling in a diagnosis (high specificity) and another one for excluding a diagnosis (high sensitivity), depending on clinicians’ preference. If we arbitrarily define an acceptable and useful cut-off as having a 95% level of either sensitivity or specificity, the two useful cut-offs for SalELN as derived from our study were 13.50 and 20.50 nmol/L, respectively; those for SalEDex were 3.25 nmol/L and 7.45 nmol/L, respectively.
 
Traditionally, in the algorithm for the workup for Cushing’s syndrome, late-night levels (serum or salivary cortisol) have been used for screening (excluding Cushing’s syndrome), whereas the post-dexamethasone level (serum cortisol) has been used for diagnosis (ruling in Cushing’s syndrome). When used as such, the cut-off of 13.50 nmol/L can be used for SalELN; whereas 7.45 nmol/L can be used for SalEDex. For the time being, SalEDex data can supplement serum cortisol measurement as a confirmatory test when concordant, or alert the clinician to the potential pitfalls with serum cortisol (eg variations in CBG levels) when discordant. With more experience, SalEDex may even ultimately replace the need to measure serum cortisol.
 
The strength of this study lies in the rigour with which a pre-specified protocol was adhered to. A high success rate of sample collection was achieved, with little missing data. Insufficient salivary volume collected in the Salivette tubes was the most common reason for unsuccessful salivary collection, because LC-MS/MS requires a larger saliva volume (100-250 µL) than immunoassay (40-50 µL).29 Two thirds of our study subjects were referred either because of an adrenal incidentaloma or common clinical conditions such as diabetes mellitus and hypertension, and had no clinical features of Cushing’s syndrome. This population was quite representative of cases referred to an endocrine centre for workup of Cushing’s syndrome.
 
A notable limitation of this study is the small number of subjects with hypercortisolism. The cut-off for the SerFDex was adopted from the literature rather than from data derived from our own healthy volunteers. The Endocrine Society Clinical Practice Guideline17 recommended two separate measurements of SalFLN or UFC. Only one sample for each was collected in our study. Although we might have consequently missed some cases of episodic hypercortisolism, we assumed that if less than two out of the relatively sensitive tests were positive at the time of sampling, the subjects would likely be in a phase of normal cortisol secretion even if they had episodic Cushing’s syndrome.
 
Conclusions
Our study showed that salivary cortisone can become the analyte of choice for investigating Cushing’s syndrome in the era of LC-MS/MS. Our data suggest using 13.50 nmol/L for SalELN, and either 7.45 nmol/L (more specific) or 3.25 nmol/L (more sensitive) for SalEDex, as cut-offs.
 
Declaration
All authors have disclosed no conflicts of interest.
 
References
1. Stewart PM. Is subclinical Cushing’s syndrome an entity or a statistical fallout from diagnostic testing? Consensus surrounding the diagnosis is required before optimal treatment can be defined. J Clin Endocrinol Metab 2010;95:2618-20. Crossref
2. Ferraù F, Korbonits M. Metabolic comorbidities in Cushing’s syndrome. Eur J Endocrinol 2015;173:M133-57. Crossref
3. Viardot A, Huber P, Puder JJ, Zulewski H, Keller U, Müller B. Reproducibility of nighttime salivary cortisol and its use in the diagnosis of hypercortisolism compared with urinary free cortisol and overnight dexamethasone suppression test. J Clin Endocrinol Metab 2005;90:5730-6. Crossref
4. Graham UM, Hunter SJ, McDonnell M, Mullan KR, Atkinson AB. A comparison of the use of urinary cortisol to creatinine ratios and nocturnal salivary cortisol in the evaluation of cyclicity in patients with Cushing’s syndrome. J Clin Endocrinol Metab 2013;98:E72-6. Crossref
5. Vining RF, McGinley RA, Maksvytis JJ, Ho KY. Salivary cortisol: a better measure of adrenal cortical function than serum cortisol. Ann Clin Biochem 1983;20(Pt 6):329-35. Crossref
6. Barrou Z, Guiban D, Maroufi A, et al. Overnight dexamethasone suppression test: comparison of plasma and salivary cortisol measurement for the screening of Cushing’s syndrome. Eur J Endocrinol 1996;134:93-6. Crossref
7. Cardoso EM, Arregger AL, Tumilasci OR, Contreras LN. Diagnostic value of salivary cortisol in Cushing’s syndrome (CS). Clin Endocrinol (Oxf) 2009;70:516-21.
8. Castro M, Elias PC, Quidute AR, Halah FP, Moreira AC. Out-patient screening for Cushing’s syndrome: the sensitivity of the combination of circadian rhythm and overnight dexamethasone suppression salivary cortisol tests. J Clin Endocrinol Metab 1999;84:878-82. Crossref
9. Deutschbein T, Broecker-Preuss M, Flitsch J, et al. Salivary cortisol as a diagnostic tool for Cushing’s syndrome and adrenal insufficiency: improved screening by an automatic immunoassay. Eur J Endocrinol 2012;166:613-8. Crossref
10. Perogamvros I, Keevil BG, Ray DW, Trainer PJ. Salivary cortisone is a potential biomarker for serum free cortisol. J Clin Endocrinol Metab 2010;95:4951-8. Crossref
11. Wood P. Salivary steroid assays—research or routine? Ann Clin Biochem 2009;46(Pt 3):183-96. Crossref
12. Antonelli G, Ceccato F, Artusi C, Marinova M, Plebani M. Salivary cortisol and cortisone by LC-MS/MS: validation, reference intervals and diagnostic accuracy in Cushing’s syndrome. Clin Chim Acta 2015;451(Pt B):247-51.
13. Smith RE, Maguire JA, Stein-Oakley AN, et al. Localization of 11 beta-hydroxysteroid dehydrogenase type II in human epithelial tissues. J Clin Endocrinol Metab 1996;81:3244-8. Crossref
14. Perogamvros I, Owen LJ, Newell-Price J, Ray DW, Trainer PJ, Keevil BG. Simultaneous measurement of cortisol and cortisone in human saliva using liquid chromatography-tandem mass spectrometry: application in basal and stimulated conditions. J Chromatogr B Analyt Technol Biomed Life Sci 2009;877:3771-5. Crossref
15. Fassnacht M, Arlt W, Bancos I, et al. Management of adrenal incidentalomas: European Society of Endocrinology Clinical Practice Guideline in collaboration with the European Network for the Study of Adrenal Tumors. Eur J Endocrinol 2016;175:G1-G34. Crossref
16. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982;143:29-36. Crossref
17. Nieman LK, Biller BM, Findling JW, et al. The diagnosis of Cushing’s syndrome: an Endocrine Society Clinical Practice Guideline. J Clin Endocrinol Metab 2008;93:1526-40. Crossref
18. Findling JW, Raff H, Aron DC. The low-dose dexamethasone suppression test: a reevaluation in patients with Cushing’s syndrome. J Clin Endocrinol Metab 2004;89:1222-6. Crossref
19. Görges R, Knappe G, Gerl H, Ventz M, Stahl F. Diagnosis of Cushing’s syndrome: re-evaluation of midnight plasma cortisol vs urinary free cortisol and low-dose dexamethasone suppression test in a large patient group. J Endocrinol Invest 1999;22:241-9. Crossref
20. Ceccato F, Barbot M, Zilio M, et al. Screening tests for Cushing’s syndrome: urinary free cortisol role measured by LC-MS/MS. J Clin Endocrinol Metab 2015;100:3856-61. Crossref
21. Nieman LK. Cushing’s syndrome: update on signs, symptoms and biochemical screening. Eur J Endocrinol 2015;173:M33-8. Crossref
22. Lopez D, Luque-Fernandez MA, Steele A, Adler GK, Turchin A, Vaidya A. “Nonfunctional” adrenal tumors and the risk for incident diabetes and cardiovascular outcomes: a cohort study. Ann Intern Med 2016;165:533-42. Crossref
23. Raff H, Auchus RJ, Findling JW, Nieman LK. Urine free cortisol in the diagnosis of Cushing’s syndrome: is it worth doing and, if so, how? J Clin Endocrinol Metab 2015;100:395-7. Crossref
24. Erickson D, Singh RJ, Sathananthan A, Vella A, Bryant SC. Late-night salivary cortisol for diagnosis of Cushing’s syndrome by liquid chromatography/tandem mass spectrometry assay. Clin Endocrinol (Oxf) 2012;76:467-72. Crossref
25. Carroll T, Raff H, Findling JW. Late-night salivary cortisol for the diagnosis of Cushing syndrome: a meta-analysis. Endocr Pract 2009;15:335-42. Crossref
26. Handelsman DJ, Wartofsky L. Requirement for mass spectrometry sex steroid assays in the Journal of Clinical Endocrinology and Metabolism. J Clin Endocrinol Metab 2013;98:3971-3. Crossref
27. Beko G, Varga I, Glaz E, et al. Cutoff values of midnight salivary cortisol for the diagnosis of overt hypercortisolism are highly influenced by methods. Clin Chim Acta 2010;411:364-7. Crossref
28. Debono M, Harrison RF, Whitaker MJ, et al. Salivary cortisone reflects cortisol exposure under physiological conditions and after hydrocortisone. J Clin Endocrinol Metab 2016;101:1469-77. Crossref
29. Raff H. Cushing’s syndrome: diagnosis and surveillance using salivary cortisol. Pituitary 2012;15:64-70. Crossref

Pages