PT - JOURNAL ARTICLE AU - Kurland, Brenda F. AU - Peterson, Lanell M. AU - Shields, Andrew T. AU - Lee, Jean H. AU - Byrd, Darrin W. AU - Novakova-Jiresova, Alena AU - Muzi, Mark AU - Specht, Jennifer M. AU - Mankoff, David A. AU - Linden, Hannah M. AU - Kinahan, Paul E. TI - Test–Retest Reproducibility of <sup>18</sup>F-FDG PET/CT Uptake in Cancer Patients Within a Qualified and Calibrated Local Network AID - 10.2967/jnumed.118.209544 DP - 2019 May 01 TA - Journal of Nuclear Medicine PG - 608--614 VI - 60 IP - 5 4099 - http://jnm.snmjournals.org/content/60/5/608.short 4100 - http://jnm.snmjournals.org/content/60/5/608.full SO - J Nucl Med2019 May 01; 60 AB - Calibration and reproducibility of quantitative 18F-FDG PET measures are essential for adopting integral 18F-FDG PET/CT biomarkers and response measures in multicenter clinical trials. We implemented a multicenter qualification process using National Institute of Standards and Technology–traceable reference sources for scanners and dose calibrators, and similar patient and imaging protocols. We then assessed SUV in patient test–retest studies. Methods: Five 18F-FDG PET/CT scanners from 4 institutions (2 in a National Cancer Institute–designated Comprehensive Cancer Center, 3 in a community-based network) were qualified for study use. Patients were scanned twice within 15 d, on the same scanner (n = 10); different but same model scanners within an institution (n = 2); or different model scanners at different institutions (n = 11). SUVmax was recorded for lesions, and SUVmean for normal liver uptake. Linear mixed models with random intercept were fitted to evaluate test–retest differences in multiple lesions per patient and to estimate the concordance correlation coefficient. Bland–Altman plots and repeatability coefficients were also produced. Results: In total, 162 lesions (82 bone, 80 soft tissue) were assessed in patients with breast cancer (n = 17) or other cancers (n = 6). Repeat scans within the same institution, using the same scanner or 2 scanners of the same model, had an average difference in SUVmax of 8% (95% confidence interval, 6%–10%). For test–retest on different scanners at different sites, the average difference in lesion SUVmax was 18% (95% confidence interval, 13%–24%). Normal liver uptake (SUVmean) showed an average difference of 5% (95% confidence interval, 3%–10%) for the same scanner model or institution and 6% (95% confidence interval, 3%–11%) for different scanners from different institutions. Protocol adherence was good; the median difference in injection-to-acquisition time was 2 min (range, 0–11 min). Test–retest SUVmax variability was not explained by available information on protocol deviations or patient or lesion characteristics. Conclusion: 18F-FDG PET/CT scanner qualification and calibration can yield highly reproducible test–retest tumor SUV measurements. Our data support use of different qualified scanners of the same model for serial studies. Test–retest differences from different scanner models were greater; more resolution-dependent harmonization of scanner protocols and reconstruction algorithms may be capable of reducing these differences to values closer to same-scanner results.