PT - JOURNAL ARTICLE
AU - Linda M. Velasquez
AU - Ronald Boellaard
AU - Georgia Kollia
AU - Wendy Hayes
AU - Otto S. Hoekstra
AU - Adriaan A. Lammertsma
AU - Susan M. Galbraith
TI - Repeatability of 18F-FDG PET in a Multicenter Phase I Study of Patients with Advanced Gastrointestinal Malignancies
AID - 10.2967/jnumed.109.063347
DP - 2009 Oct 01
TA - Journal of Nuclear Medicine
PG - 1646--1654
VI - 50
IP - 10
4099 - http://jnm.snmjournals.org/content/50/10/1646.short
4100 - http://jnm.snmjournals.org/content/50/10/1646.full
SO - J Nucl Med 2009 Oct 01; 50
AB - 18F-FDG PET is often used to monitor tumor response in multicenter oncology clinical trials. This study assessed the repeatability of several semiquantitative standardized uptake values (mean SUV [SUVmean], maximum SUV [SUVmax], peak SUV [SUVpeak], and the 3-dimensional isocontour at 70% of the maximum pixel value [SUV70%]) as measured by repeated baseline 18F-FDG PET studies in a multicenter phase I oncology trial. Methods: Double-baseline 18F-FDG PET studies were acquired for 62 sequentially enrolled patients. Tumor metabolic activity was assessed by SUVmean, SUVmax, SUVpeak, and SUV70%. The effect on SUV repeatability of compliance with recommended image-acquisition guidelines and quality assurance (QA) standards was assessed. Summary statistics for absolute differences relative to the average of baseline values and repeatability analysis were performed for all patients and for a subgroup that passed QA, in both a multi- and a single-observer setting. Intrasubject precision of baseline measurements was assessed by repeatability coefficients, intrasubject coefficients of variation (CV), and confidence intervals on mean baseline differences for all SUV parameters.
Results: The mean differences between the 2 SUV baseline measurements were small, varying from −2.1% to 1.9%, and the 95% confidence intervals for these mean differences had a maximum half-width of about 5.6% across the SUV parameters assessed. For SUVmax, the intrasubject CV varied from 10.7% to 12.8% for the QA multi- and single-observer datasets and was 16% for the full dataset. The 95% repeatability coefficients ranged from −28.4% to 39.6% for the QA datasets and up to −34.3% to 52.3% for the full dataset. Conclusion: Repeatability results of double-baseline 18F-FDG PET scans were similar for all SUV parameters assessed, for both the full and the QA datasets, in both the multi- and the single-observer settings. Centralized quality assurance and analysis of data improved intrasubject CV from 15.9% to 10.7% for averaged SUVmax. Thresholds for metabolic response in the multicenter multiobserver non-QA settings were −34% and 52% and in the range of −26% to 39% with centralized QA. These results support the use of 18F-FDG PET for tumor assessment in multicenter oncology clinical trials.
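
The repeatability measures named in the abstract (mean baseline difference, intrasubject CV, and 95% repeatability coefficients) can be illustrated with a short sketch. The abstract does not state the exact statistical procedure, so this is an assumption: the asymmetric limits it reports (for example −28.4% to 39.6%) are consistent with an analysis on log-transformed SUVs, which is what the sketch uses. The function name `repeatability_summary` and the input values are hypothetical and are not data from the study.

```python
import math


def repeatability_summary(baseline1, baseline2):
    """Summarize double-baseline repeatability on the log scale.

    Assumption (not confirmed by the abstract): differences are taken
    between log-transformed SUVs and back-transformed to percentages,
    which yields asymmetric repeatability limits.
    """
    if len(baseline1) != len(baseline2) or not baseline1:
        raise ValueError("need two equally sized, non-empty SUV lists")

    # Per-patient difference of log SUVs (scan 2 minus scan 1).
    diffs = [math.log(b) - math.log(a) for a, b in zip(baseline1, baseline2)]
    n = len(diffs)
    mean_diff = sum(diffs) / n

    # Within-subject variance for paired replicates: sum(d^2) / (2n).
    within_var = sum(d * d for d in diffs) / (2 * n)
    within_sd = math.sqrt(within_var)

    # Intrasubject CV back-transformed from the log scale.
    within_cv = math.sqrt(math.exp(within_var) - 1.0)

    # 95% repeatability limits for the difference of two measurements.
    half_width = 1.96 * math.sqrt(2.0) * within_sd

    return {
        "mean_diff_pct": 100.0 * (math.exp(mean_diff) - 1.0),
        "within_cv_pct": 100.0 * within_cv,
        "rc_lower_pct": 100.0 * (math.exp(-half_width) - 1.0),
        "rc_upper_pct": 100.0 * (math.exp(half_width) - 1.0),
    }


if __name__ == "__main__":
    # Made-up SUVmax values for four hypothetical lesions, two baseline scans.
    scan1 = [5.0, 8.0, 3.2, 10.5]
    scan2 = [5.6, 7.1, 3.0, 11.4]
    for name, value in repeatability_summary(scan1, scan2).items():
        print(f"{name}: {value:.1f}")
```

Under this assumption the lower and upper limits are reciprocal on the ratio scale, which is why a decrease threshold is always smaller in magnitude than the matching increase threshold, as in the −34% and 52% pair reported above.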