Abstract
Modern multidisciplinary therapy for colorectal liver metastases (CRLM) is associated with significant morbidity and must be adapted to the patient’s relative risk. The tools currently available to risk-stratify patients are limited. This study assessed the prognostic utility of metabolic measurements derived from18F-FDG PET compared with previously proposed prognostic scoring systems. Methods: Preoperative 18F-FDG PET/CT studies from a series of 30 patients who underwent liver resection for CRLM after neoadjuvant chemotherapy were evaluated. Quantitative 18F-FDG PET analysis calculated the maximum and mean standardized uptake value, metabolic tumor volume (MTV), and tumor glycolytic volume (TGV) as measures of the metabolic activity of tumors. The predictive value of these parameters was compared with that of 4 prognostic scores developed by Fong, Iwatsuki, Nordlinger, and Rees. Results: High MTV and TGV in patients before metastasectomy were significantly associated with poorer overall survival (MTV: P = 0.001; TGV: P = 0.004) and recurrence-free survival (MTV: P = 0.001, TGV; P = 0.002). Maximum and mean standardized uptake value did not show any significant predictive ability. Of the prognostic scores, prediction of outcome was most accurate using the Basingstoke index (area under the curve, 0.898). Conclusion: Assessment of metabolic tumor burden with volumetric 18F-FDG PET parameters appears to be a valuable adjunct in determining the biology of CRLM before surgical resection and may enable better risk stratification of patients.
Colorectal cancer is among the most common malignancies affecting both men and women worldwide (1). Although screening initiatives have successfully enabled earlier detection of this cancer, metastatic disease to the liver remains the most frequent cause of mortality. Treatment of colorectal liver metastases (CRLM) with a combination of surgical resection and systemic chemotherapy results in 5-y survival rates of 30%–50% in patients undergoing the procedure (2,3). However, despite substantial survival benefit, surgical resection can be offered only to 20%–30% of patients (4) because of the presence of extrahepatic or bilobar disease or the lack of patient fitness for major surgery. Although technical advances have allowed major liver resections to be conducted safely, with perioperative mortality below 2%, liver resection remains a major undertaking carrying a postoperative morbidity as high as 35% (5). There is an urgent need for the treatment of CRLM to be risk-adapted such that patients with a predicted poor prognosis may be spared invasive surgery, and major interventions may be targeted at those who are likely to significantly benefit.
Previous investigators have developed clinicopathologic prognostic scoring systems (6–9) in an effort to delineate risk categories in patients before the onset of treatment (Table 1). The accuracy of these predictive models has, however, been questioned in follow-on studies when evaluated in independent patient cohorts (10,11). One potential reason for these inconsistencies may be that the clinicopathologic factors scored in these systems are an inadequate surrogate for the overall burden and biologic aggressiveness of the tumor (12), both of which impact heavily on patient outcomes.
PET has been widely used and has become an integral part of staging of CRLM (13). PET has progressively come to the forefront as a minimally invasive method of studying functional and metabolic processes in tumors. Increased glucose utilization is a feature of many malignancies and can be identified on PET using the radiotracer 18F-FDG. Recent studies have correlated high 18F-FDG uptake in tumors with poorer patient survival in a range of tumors including lung (14), breast (15), head and neck (16), and esophageal cancers (17). Despite growing interest in the biology of tumors as a predictor of outcome, there is a general lack of tools to assess tumor biology in CRLM. In this study, we investigated the predictive ability of quantitative measurements of tumor metabolic uptake using 18F-FDG PET as a prognostic tool before surgical resection for CRLM. Four measures of tumor glucose uptake were studied in this setting: the maximum and mean standardized uptake value (SUVmax and SUVmean), which are semiquantitative measures of 18F-FDG concentration in the tumor, and the metabolic tumor volume (MTV) and tumor glycolytic volume (TGV), which are 3-dimensional measurements that may enable more accurate representation of the metabolically active tumor burden.
MATERIALS AND METHODS
Patients
Patients admitted for surgical resection of CRLM between January 2004 and December 2007 were identified from the database maintained by the Hepato-Pancreato-Biliary and Transplant Unit at the Austin Hospital, Australia. Patient selection was standardized by including only those receiving neoadjuvant chemotherapy before surgery followed by adjuvant chemotherapy after surgery. The availability of 18F-FDG PET/CT scans after completion of chemotherapy and immediately before liver surgery was also a criterion for inclusion. Ethics approval from Austin Hospital was obtained for prospective analysis (approval no. H2011/04301).
Patient data were collected from the prospective database of the Hepato-Pancreato-Biliary and Transplant Unit, with additional information gathered from the hospital medical records and from discussions with treating clinicians. The data included patient demographics (age and sex), details of the liver and colorectal operations, histopathologic features of the resected liver metastasis and the primary colorectal tumor, perioperative blood transfusion, blood carcinoembryonic antigen levels, and chemotherapy received. From these factors, prognostic scores were calculated according to the systems developed by Fong et al. (Memorial Sloan-Kettering Cancer Center) (6), Iwatsuki et al. (Pittsburgh) (7), Nordlinger et al. (8), and Rees et al. (Basingstoke index) (9), which have been described in Table 1.
18F-FDG PET/CT Acquisition and Analysis
In preparation for the imaging study, patients fasted for 6 h before intravenous injection of approximately 5 MBq of 18F-FDG per kilogram of body weight according to the standard protocol. All patients were then scanned from base of skull to upper thighs on a PET/CT system (Philips) after a 60-min uptake period. The scan included a low-dose 30 mA/slice CT component for attenuation correction and anatomic localization. The duration of the 18F-FDG PET emission scan was 3 min/bed position, and the patients were scanned with their arms above their heads. The scans were processed with 3-dimensional iterative reconstruction.
The PET scans were reanalyzed by a nuclear medicine physician who was masked to the clinical and survival data of the patients. Software previously validated for assessing 3-dimensional glycolytic volume (18) was used to define a volume of interest for each metabolically active liver lesion. The software uses a semiautomated threshold-based 3-dimensional seed-growing algorithm that is determined by the current mean activity, the activity in neighboring pixels, and the maximum normal background level. A threshold of SUVmean plus 2 SDs was used, and separate seeds were applied for each noncontiguous liver metastasis. The MTV and TGV were then calculated from the volume of interest. SUVmax and SUVmean were also measured and recorded, using standard methods involving injected dose, blood glucose level, and normalization by the patient’s body weight. TGV, being a composite of metabolic activity and volume, was measured as SUV⋅mL. MTV was recorded as cubic centimeters because it is defined as the physical volume of metabolically active tumor.
Statistical Analysis
Statistical analysis was performed using SPSS Statistics, version 17 (IBM). Overall survival was calculated as the number of months from liver resection until death. Recurrence-free survival was defined as the interval from hepatectomy to the time of the first evidence of tumor recurrence found by either PET or CT. Patients were censored on the date of last follow-up if they were still alive or had no evidence of disease recurrence.
Receiver-operating-characteristic analysis was conducted to assess the discriminative performance of each PET parameter (SUVmax, SUVmean, MTV, and TGV) and the proposed prognostic score (Iwatsuki, Fong, Nordlinger, and Basingstoke) for death and tumor recurrence at 30 mo as endpoints (duration of minimum follow-up). Areas under the curve (AUCs) were derived from these plots and compared on the conventional scale ranging from 0.5 (discriminative ability no better than chance) to 1.0 (perfect discrimination). A separate AUC analysis was performed excluding patients with non–18F-FDG-avid tumors to investigate survival in only patients with tumors that had metabolic uptake observable on 18F-FDG PET. Survival curves were produced by the Kaplan–Meier method and compared using the log-rank test. The prognostic value of demographic, clinicopathologic, and tumor metabolic parameters were examined on univariate and multivariate analysis with the Cox proportional hazards regression model, which calculated the hazard ratio and a 95% confidence interval. Statistical significance was regarded as a P value of less than 0.05.
RESULTS
Patient Characteristics
A total of 30 patients with histologically proven CRLM were included in the study. The median follow-up time for surviving patients was 37 mo (range, 30–83 mo). Patient demographics and clinical characteristics are summarized in Table 2.
All patients had been assessed by a multidisciplinary team consisting of radiologists, hepatobiliary surgeons, oncologists, and nuclear physicians before the commencement of treatment. Standard indications for liver resection for CRLM were followed after exclusion of extrahepatic metastases by multidetector CT of the chest and triple-phase multidetector CT of the abdomen and pelvis in addition to whole-body 18F-FDG PET/CT. Patients received a combination chemotherapy regimen involving oxaliplatin (folinic acid/fluorouracil/oxaliplatin or capecitabine/oxaliplatin) or irinotecan (folinic acid/fluorouracil/irinotecan). It is well established that the efficacy of all these treatment regimens in advanced colorectal cancer is comparable (19).
Of the thirty 18F-FDG PET/CT scans evaluated, 4 involved tumors with no observable 18F-FDG uptake. Extrahepatic metastases involving the lung were found in 1 patient on preoperative 18F-FDG PET/CT staging, who underwent staged liver resection and subsequent lung resection. No perioperative deaths occurred in this series of patients, but surgical resection margins were proven histologically positive in 5 patients (16.7%). At the completion of follow-up, 20 (66.7%) of the 30 patients were found to have tumor recurrences, and the median overall survival was calculated to be 58 mo (95% confidence interval, 42.25–73.75) using the Kaplan–Meier method.
Receiver-Operating-Characteristic Analysis
SUVmean (AUC, 0.531) and SUVmax (AUC, 0.580) were poor predictors of patient mortality at 30 mo on receiver-operating-characteristic analysis. However, the volumetric parameters MTV (AUC, 0.760) and TGV (AUC, 0.753) exhibited high predictive power that showed further improvement when only patients with 18F-FDG-avid tumors were analyzed (MTV: AUC, 0.886; TGV: AUC, 0.876). Similarly, in the prediction of tumor recurrence as an outcome, MTV (AUC, 0.804) and TGV (AUC, 0.786) outperformed SUVmean (AUC, 0.577) and SUVmax (AUC, 0.673) (Fig. 1).
Prognostic scores were also good predictors of medium-term mortality. The most accurate prediction was seen with the Basingstoke index (AUC, 0.898), and the Iwatsuki (AUC, 0.824), Nordlinger (AUC, 0.884) and Fong (AUC, 0.787) scores achieved AUCs comparable to those of MTV and TGV. However, unlike volumetric PET parameters, prediction of tumor recurrence using the scoring systems was limited (Fig. 2), with low AUCs for the Iwatsuki (AUC, 0.563), Memorial Sloan-Kettering Cancer Center (AUC, 0.605), and Nordlinger (AUC, 0.619) scores. The Basingstoke index (AUC, 0.766) once again demonstrated the best predictive performance of all scoring systems, achieving an AUC only slightly lower than that of the volume-based PET parameters.
Cutoffs that maximized the sensitivity and specificity of each PET metabolic measurement were then established. There was again a noticeable increase in predictive power when patients with 18F-FDG–avid tumors were analyzed separately. MTV above a cutoff of 15.58 cm3 correctly identified all patients who would not survive up to 30 mo (sensitivity, 100%), and below this threshold approximately 7 of 10 surviving patients were classified appropriately (specificity, 73.3%). A similar level of performance was seen with TGV at a cutoff of 64.97 SUV⋅mL (sensitivity, 100%; specificity, 76.2%).
Survival Analysis
Using these cutoff thresholds, PET parameters were dichotomized to generate Kaplan–Meier survival plots (Fig. 3). The log-rank test demonstrated significant survival differences between patients with high and low MTV (P = 0.015) and TGV (P = 0.001), but not when SUVmean (P = 0.859) and SUVmax (P = 0.165) were used as measures of metabolic activity. For TGV, the 3-y survival rate was 40.0% in the group with high metabolic uptake, compared with 88.7% in the low-uptake group. Corresponding values for MTV were 45.5% in the high-uptake group and 88.0% in the low-uptake group.
Finally, the prognostic influence of demographic, clinicopathologic, and tumor metabolic factors on overall and recurrence-free survival was examined (Table 3). On univariate analysis, MTV, TGV, Basingstoke score, and the size of the largest metastases were all found to be predictive of both endpoints, and the Fong score predicted only recurrence-free survival at a statistically significant level. Multivariate analysis of these variables found that all prognostic factors remained significant even when adjusted for age and sex. Table 4 shows the hazard ratios calculated for each of these variables.
DISCUSSION
Management of CRLM has long suffered from a lack of viable prognostic tools. This drawback limits the communication of risk between clinicians and patients and hampers decision making regarding the most appropriate treatment strategy for patients in terms of their likely outlook. Accumulating evidence indicates the biology of the tumor to be a major determinant of clinical outcome (20,21). Traditionally, the biologic aggressiveness of the tumor has been assessed by radiologic and clinical staging and by histopathologic features. Several investigators have developed clinicopathologic scoring systems to predict the long-term outcomes of patients undergoing metastasectomy. However, there has been uncertainty about the prognostic value of these scoring systems when applied to the wider population. In a study at the Mayo clinic in which 3 proposed risk scores (Fong, Iwatsuki, and Nordlinger) were applied retrospectively to 662 patients, Zakaria et al. reported that none of these scores were able to successfully stratify patients into distinct risk categories (11). On the other hand, several studies have identified the Fong and Iwatsuki scores to be predictive of recurrence-free survival (10,22). In our own survival analysis, the Fong score accurately predicted the risk of recurrence after resection and was one of the simpler systems to score. However, its performance appeared to be outclassed by the more recently developed Basingstoke index, which was found to predict both overall and recurrence-free survival.
Although these prognostic scores have been fundamental in establishing clinical risk factors for CRLM, they are likely to face significant challenges in the future. Validation of many of these scoring systems in independent patient populations is still limited. Meanwhile, the growing trend toward molecular targeted therapies may require an entirely new repertoire of biomarkers directly related to novel treatments. Prognostic models will need to be flexible enough to accommodate these newer factors and concise enough to be applied in the clinical setting.
With these considerations in mind, the present paper describes an alternative approach to the prognostication of CRLM using 18F-FDG PET. In our analysis, we identified the volume-based metabolic parameters MTV and TGV as important predictors of overall and recurrence-free survival when measured before liver resection. This is the first study to our knowledge assessing such an association in CRLM. From our findings, an increase of MTV and TGV by 1 unit increased the hazard of death by 3.5% and 1%, respectively. Considering that MTV and TGV measurements ranged by up to 77.31 cm3 and 343.16 SUV⋅mL, respectively, the survival advantage of patients with a low metabolically active tumor burden appears to be quite substantial.
However, these survival differences were not observed when SUV variables were used as a measure of tumor metabolic activity. This finding is in keeping with earlier studies of other cancers in which authors found volumetric PET variables to be of prognostic value in lung and esophageal cancer but failed to demonstrate an association between SUV and survival (23,24). It is believed that use of the volumetric parameters MTV and TGV enables a more global representation of tumor metabolism, in contrast to SUVmax, which depicts only the most intense metabolic region of a tumor. However, SUV has been shown previously to be predictive of survival in a study conducted by de Geus-Oei et al., of both palliatively and curatively treated CRLM patients (25). Our failure to detect this association may be attributable to the narrower range of SUVs of this cohort, which consisted of only patients who underwent surgical resection and excluded those treated with a palliative intent. Nonetheless, the results of our receiver-operating-characteristic analysis suggest that volumetric parameters may be the more powerful prognostic tool. A single measurement of either MTV or TGV was able to predict outcome with accuracy similar to or better than the 4 prognostic scores analyzed. Furthermore, performance was consistently high for predicting both the risk of recurrence and mortality. Although a recent study (26) has shown that metabolic volumes should ideally be performed on 18F-FDG PET scans that were acquired in list mode using arterial input data, this method is not clinically feasible as it limits the scan to a single field of view. The method that we have used has been previously used and validated by prior studies (18,27).
Volumetric PET parameters have several features apart from their prognostic capacity that allow them to perform either as a simple stand-alone prognostic tool or as an addition to future prognostic models. In contrast to more complicated assessments of tumor biology such as tumor doubling time (28), the calculation of volumetric PET variables is less time-consuming and may be undertaken with routine preoperative staging studies. There are also many opportunities for this technique to be further enhanced and refined. For instance, assessing volumetric PET variables before the onset of chemotherapy may be an additional predictor of prognosis. As well, the differential change in volumetric PET variables in response to systemic chemotherapy may be an even better predictor of outcome than a single-time-point measurement. Finally, new radiotracers allow a whole array of biologic features to be studied. In particular, fluoromisonidazole is a marker for hypoxia that has garnered attention due to its possible relevance to tumor behavior and the prediction of response to chemoradiotherapy (29). Volumetric PET variables using such tracers may allow greater stratification of tumors in relation to long-term prognosis.
CONCLUSION
This pilot study demonstrated the predictive value of volumetric PET variables in a group of patients undergoing potentially curative liver resection. Although the retrospective nature of this study created limitations, many of our findings were consistent with prior studies investigating the role of PET parameters in other malignancies. A prospective study at our center to expand on these results is currently under way. Meanwhile, our current findings suggest that volumetric 18F-FDG PET parameters have the prognostic potential to guide the selection of therapy in order to optimize outcomes in patients with CRLM.
DISCLOSURE STATEMENT
The costs of publication of this article were defrayed in part by the payment of page charges. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.
Acknowledgments
We thank the members of the Hepato-Pancreato-Biliary and Transplant Unit, Austin Hospital, including Roger Berry, Michael Fink, Mehrdad Nikfarjam, Robert Jones, and Mark Goodwin from the Department of Radiology and Niall Tebbutt from GI Oncology, for their clinical contribution to this study, as well as the staff of the Centre for PET, Austin Hospital. This study was supported in part by NHMRC Program Grant 487922 and OIS funding from the Victorian Government. No other potential conflict of interest relevant to this article was reported.
Footnotes
Published online Jul. 13, 2012.
↵* Contributed equally to this work.
- © 2012 by the Society of Nuclear Medicine and Molecular Imaging, Inc.
REFERENCES
- Received for publication January 23, 2012.
- Accepted for publication April 2, 2012.