TY - JOUR T1 - Treatment Monitoring by <sup>18</sup>F-FDG PET/CT in Patients with Sarcomas: Interobserver Variability of Quantitative Parameters in Treatment-Induced Changes in Histopathologically Responding and Nonresponding Tumors JF - Journal of Nuclear Medicine JO - J Nucl Med SP - 1038 LP - 1046 DO - 10.2967/jnumed.107.050187 VL - 49 IS - 7 AU - Matthias R. Benz AU - Vladimir Evilevitch AU - Martin S. Allen-Auerbach AU - Fritz C. Eilber AU - Michael E. Phelps AU - Johannes Czernin AU - Wolfgang A. Weber Y1 - 2008/07/01 UR - http://jnm.snmjournals.org/content/49/7/1038.abstract N2 - Measurements of tumor glucose use by 18F-FDG PET need to be standardized within and across institutions. Various parameters are used for measuring changes in tumor glucose metabolic activity with 18F-FDG PET in response to cancer treatments. However, it is unknown which of these provide the lowest variability between observers. Knowledge of the interobserver variability of quantitative parameters is important in sarcomas as these tumors are frequently large and demonstrate heterogeneous 18F-FDG uptake. Methods: A total of 33 patients (16 men, 17 women; mean age, 47 ± 18 y) with high-grade sarcomas underwent 18F-FDG PET/CT scans before and after neoadjuvant chemotherapy. Two independent investigators measured the following parameters on the pretreatment and posttreatment scans: maximum standardized uptake value (SUVmax), peak SUV (SUVpeak), mean SUV (SUVmean), SUVmean in an automatically defined volume (SUVauto), and tumor-to-background ratio (TBR). The variability of the different parameters was compared by concordance correlation coefficient (CCC), variability effect coefficient, and Bland–Altman plots. Results: Baseline SUVmax, SUVpeak, SUVmean, SUVauto, and TBR averaged 10.36, 7.78, 4.13, and 6.22 g/mL and 14.67, respectively. They decreased to 5.36, 3.80, 1.79, and 3.25 g/mL and 6.62, respectively, after treatment. SUVmax, SUVpeak, and SUVauto measurements and their changes were reproducible (CCC ≥ 0.98). However, SUVauto poorly differentiated between responding and nonresponding tumors. The high intratumoral heterogeneity of 18F-FDG resulted in frequent failure of the thresholding algorithm, which necessitated manual corrections that in turn resulted in a higher interobserver variability of SUVmean (CCCs for follow-up and change were 0.96 and 0.91, respectively; P &lt; 0.005). TBRs also showed a significantly higher variability than did SUVpeak (CCCs for follow-up and change were 0.94 and 0.86, respectively; P &lt; 0.005). Conclusion: SUVmax and SUVpeak provided the most robust measurements of tumor glucose metabolism in sarcomas. Delineation of the whole-tumor volume by semiautomatic thresholding did not decrease the variability of SUV measurements. TBRs were significantly more observer-dependent than were absolute SUVs. These findings should be considered for standardization of clinical 18F-FDG PET/CT trials. ER -