RT Journal Article
SR Electronic
T1 Test–Retest Reproducibility of <sup>18</sup>F-FDG PET/CT Uptake in Cancer Patients Within a Qualified and Calibrated Local Network
JF Journal of Nuclear Medicine
JO J Nucl Med
FD Society of Nuclear Medicine
SP 608
OP 614
DO 10.2967/jnumed.118.209544
VO 60
IS 5
A1 Brenda F. Kurland
A1 Lanell M. Peterson
A1 Andrew T. Shields
A1 Jean H. Lee
A1 Darrin W. Byrd
A1 Alena Novakova-Jiresova
A1 Mark Muzi
A1 Jennifer M. Specht
A1 David A. Mankoff
A1 Hannah M. Linden
A1 Paul E. Kinahan
YR 2019
UL http://jnm.snmjournals.org/content/60/5/608.abstract
AB Calibration and reproducibility of quantitative 18F-FDG PET measures are essential for adopting integral 18F-FDG PET/CT biomarkers and response measures in multicenter clinical trials. We implemented a multicenter qualification process using National Institute of Standards and Technology–traceable reference sources for scanners and dose calibrators, and similar patient and imaging protocols. We then assessed SUV in patient test–retest studies. Methods: Five 18F-FDG PET/CT scanners from 4 institutions (2 in a National Cancer Institute–designated Comprehensive Cancer Center, 3 in a community-based network) were qualified for study use. Patients were scanned twice within 15 d, on the same scanner (n = 10); different but same model scanners within an institution (n = 2); or different model scanners at different institutions (n = 11). SUVmax was recorded for lesions, and SUVmean for normal liver uptake. Linear mixed models with random intercept were fitted to evaluate test–retest differences in multiple lesions per patient and to estimate the concordance correlation coefficient. Bland–Altman plots and repeatability coefficients were also produced. Results: In total, 162 lesions (82 bone, 80 soft tissue) were assessed in patients with breast cancer (n = 17) or other cancers (n = 6). Repeat scans within the same institution, using the same scanner or 2 scanners of the same model, had an average difference in SUVmax of 8% (95% confidence interval, 6%–10%). For test–retest on different scanners at different sites, the average difference in lesion SUVmax was 18% (95% confidence interval, 13%–24%). Normal liver uptake (SUVmean) showed an average difference of 5% (95% confidence interval, 3%–10%) for the same scanner model or institution and 6% (95% confidence interval, 3%–11%) for different scanners from different institutions. Protocol adherence was good; the median difference in injection-to-acquisition time was 2 min (range, 0–11 min). Test–retest SUVmax variability was not explained by available information on protocol deviations or patient or lesion characteristics. Conclusion: 18F-FDG PET/CT scanner qualification and calibration can yield highly reproducible test–retest tumor SUV measurements. Our data support use of different qualified scanners of the same model for serial studies. Test–retest differences from different scanner models were greater; more resolution-dependent harmonization of scanner protocols and reconstruction algorithms may be capable of reducing these differences to values closer to same-scanner results.