Abstract
This article presents system performance studies for the Discovery MI PET/CT system, a new time-of-flight system based on silicon photomultipliers. System performance and clinical imaging were compared between this next-generation system and other commercially available PET/CT and PET/MR systems, as well as between different reconstruction algorithms. Methods: Spatial resolution, sensitivity, noise-equivalent counting rate, scatter fraction, counting rate accuracy, and image quality were characterized with the National Electrical Manufacturers Association NU-2 2012 standards. Energy resolution and coincidence time resolution were measured. Tests were conducted independently on two Discovery MI scanners installed at Stanford University and Uppsala University, and the results were averaged. Back-to-back patient scans were also performed between the Discovery MI, Discovery 690 PET/CT, and SIGNA PET/MR systems. Clinical images were reconstructed using both ordered-subset expectation maximization and Q.Clear (block-sequential regularized expectation maximization with point-spread function modeling) and were examined qualitatively. Results: The averaged full widths at half maximum (FWHMs) of the radial/tangential/axial spatial resolution reconstructed with filtered backprojection at 1, 10, and 20 cm from the system center were, respectively, 4.10/4.19/4.48 mm, 5.47/4.49/6.01 mm, and 7.53/4.90/6.10 mm. The averaged sensitivity was 13.7 cps/kBq at the center of the field of view. The averaged peak noise-equivalent counting rate was 193.4 kcps at 21.9 kBq/mL, with a scatter fraction of 40.6%. The averaged contrast recovery coefficients for the image-quality phantom were 53.7, 64.0, 73.1, 82.7, 86.8, and 90.7 for the 10-, 13-, 17-, 22-, 28-, and 37-mm-diameter spheres, respectively. The average photopeak energy resolution was 9.40% FWHM, and the average coincidence time resolution was 375.4 ps FWHM. Clinical image comparisons between the PET/CT systems demonstrated the high quality of the Discovery MI. Comparisons between the Discovery MI and SIGNA showed a similar spatial resolution and overall imaging performance. Lastly, the results indicated significantly enhanced image quality and contrast-to-noise performance for Q.Clear, compared with ordered-subset expectation maximization. Conclusion: Excellent performance was achieved with the Discovery MI, including 375 ps FWHM coincidence time resolution and sensitivity of 14 cps/kBq. Comparisons between reconstruction algorithms and other multimodal silicon photomultiplier and non–silicon photomultiplier PET detector system designs indicated that performance can be substantially enhanced with this next-generation system.
PET is a standard of care for cancer management that has been integrated with CT and MRI. PET/CT is commonly used to differentiate benign from malignant tumors, stage cancer, and plan for radiation therapy (1). Improvements in system performance such as in photon sensitivity, time resolution, and spatial resolution are expected to enhance lesion detection, low-dose patient imaging, and individualized treatment planning and evaluation (2). Many recent commercial PET-based systems have been evaluated using the National Electrical Manufacturers Association (NEMA) NU-2 standards, as previously reported (3–15).
This paper reports on studies performed with the new Discovery MI PET/CT system (GE Healthcare) (16), with the 4-ring PET configuration. The Discovery MI combines a 64- or 128-slice CT component with a 3- or 4-ring PET component providing a 15- or 20-cm axial field of view (FOV) (16). Each PET ring uses 136 detector blocks, each of which comprises a 4 × 9 array of lutetium-yttrium-oxyorthosilicate crystals coupled to a 3 × 6 array of silicon photomultipliers (SiPMs) with Anger multiplexing for crystal identification. A closed-loop water cooling system, with real-time local temperature measurement and SiPM gain adjustment capabilities, is used to keep the SiPM arrays at a stable temperature of around 19°C in a manner similar to that reported by Levin et al. (17). The crystal elements are 3.95 mm (transaxial) × 5.3 mm (axial) × 25 mm (length), and each Hamamatsu SiPM array has 2 × 3 pixels with an active area of 4 × 6 mm. A similar detector system design was used in the SIGNA PET/MR system (GE Healthcare) and achieved a coincidence time resolution of less than 400 ps FWHM, making it capable of advanced time-of-flight (TOF) performance (17).
MATERIALS AND METHODS
NEMA NU-2 2012 testing was performed independently on two Discovery MI systems, one installed at Stanford University and the other at Uppsala University. Before NEMA testing at either site, a well counter calibration scan was performed with 18F-FDG in a uniform cylindric phantom. Unless otherwise specified, the testing protocol followed the NEMA NU-2 2012 standards exactly, with results from both institutions reported separately and not averaged. Detailed procedures for each test can be found in the NEMA NU-2 2012 standards publication (18).
Because of some interesting technology in this next-generation system, such as use of SiPMs instead of photomultiplier tubes, use of the same detector in both PET/CT and PET/MR systems, a coincidence time resolution of under 400 ps, and the new reconstruction algorithms available, we have added to this paper unique comparisons between reconstruction algorithms and other SiPM-based and non–SiPM-based PET/CT and PET/MR detector designs.
Spatial Resolution
Spatial resolution was measured by creating 18F-FDG point sources using capillary tubes and suspending them at radial offsets of 1, 10, and 20 cm and axial offsets of 0 and 7.5 cm from the center of the FOV. Data were collected for 1 min at each position. The full width at half maximum (FWHM) and full width at tenth maximum of the point sources were quantified at all locations, using the NEMA-specified filtered backprojection algorithm, as well as non-TOF ordered-subset expectation maximization without point-spread function (non-TOF OSEM − PSF) modeling.
Sensitivity
At both institutions, plastic tubing (70 cm in length, 1 mm in inner diameter) was filled with an averaged calibrated activity of approximately 20 MBq of 18F-FDG, allowed to decay for 250 min to reach an activity that generates less than 5% randoms, and fixed both at the center of the FOV and at a vertical radial offset of 10 cm using positioning scans and a positioning apparatus. The line source was placed in an aluminum sleeve ensuring complete annihilation of all positrons. Five 1-min scans were taken, adding an additional aluminum sleeve after each measurement.
Counting Rate Statistics
The patient table was moved to its lowest setting, and the NEMA scatter phantom was propped up to the center of the FOV on stacks of low-density material placed outside the FOV. A line source (70 cm in length, 3.2 mm in inner diameter) was filled with a calibrated activity of roughly 817 MBq of 18F-FDG and inserted into the NEMA scatter phantom. Twenty-four frames of data were taken, with the first 17 frames taken as 15-min acquisitions and the last 7 frames taken as 25-min acquisitions followed by 25-min delays (i.e., at 50-min intervals). NEMA specifications were used to derive the trues, randoms, scatter, and noise-equivalent counting rate (NECR) from the prompts dataset in each frame. Randoms were estimated using singles rates.
Correction Accuracy
The system’s counting rate accuracy, which compares the measured activity with the expected activity and is dependent on the system corrections used, was found from a linear fit of the activity concentrations measured below peak NECR. In addition to attenuation and scatter corrections, randoms and dead-time corrections were performed using singles-based randoms subtraction and pileup correction, respectively. The dimensions of the reconstructed image matrix were 128 × 128, with a pixel size of 1.41 × 1.41 mm.
Image Quality
The background region of the NEMA image-quality phantom and the 10-, 13-, 17-, and 22-mm-diameter spheres were filled with 18F-FDG activity concentrations of, respectively, 4.7 and 18.8 kBq/mL at Stanford and 5.1 and 20.4 kBq/mL at Uppsala, yielding a 4:1 sphere-to-background concentration ratio. The 28- and 37-mm-diameter spheres were filled with nonradioactive water. The scatter line source used to measure NEMA counting rate statistics was filled with roughly 118 MBq of 18F-FDG and threaded through the body phantom. For reproducibility, 3 separate acquisitions of the image-quality phantom were taken with decay-adjusted acquisition times of 271, 279, and 282 s, consistent with a 151-mm axial step for each bed position, and reconstructed with the standard GE Healthcare clinical algorithm (TOF OSEM − PSF) and the recently released Q.Clear algorithm (block-sequential regularized expectation maximization + PSF (19)) (β-value = 50, yielding noise levels similar to TOF OSEM − PSF). The average and SD of the contrast recovery and background variability were quantified over the 3 sets of data replicates. Corrections for randoms, scatter, CT-based attenuation, dead time, and normalization were applied. The dimensions of the reconstructed image matrix were 384 × 384 × 71, with a pixel size of 1.042 × 1.042 mm and a slice thickness of 2.790 mm.
Energy and Timing Resolutions
A line source 70 cm in length was filled with 12 MBq of 18F-FDG solution and suspended at the center of the FOV in the axial direction inside the smallest aluminum sleeve used in the NEMA sensitivity measurement. To acquire the timing and energy spectra, 400 million coincident counts were taken. Measurement of the timing resolution FWHM was based on a 3-point fit of the peak for each crystal pair’s timing spectra after randoms were removed. For the energy resolution, the energy spectra were smoothed with a boxcar filter, and the percentage FWHM per channel was measured. The system energy and timing resolutions were quantified by averaging the values from all the detector crystals and channels in the system.
Comparison with Discovery 690
To study the potential benefits of SiPM technology, we compared the performance of the Discovery MI with that of a photomultiplier-tube–based PET/CT system, the Discovery 690 (GE Healthcare), at Stanford University. A patient with a history of melanoma who required restaging and metastatic evaluation was injected with 298 MBq of 18F-FDG, and a 33-min total-body scan was taken 96 min afterward on the Discovery 690. At roughly 130 min after injection, another 33-min total-body scan was taken, this time on the Discovery MI. Singles-estimate randoms correction, CT-based scatter and attenuation correction, and dead-time correction were applied. Image reconstruction for both systems used TOF OSEM − PSF with 3 iterations, 16 subsets, and a 5-mm postprocessing filter, as well as Q.Clear with a β-value of 350, which was chosen to match the noise level in TOF OSEM − PSF reconstructions.
Comparison with SIGNA
Because the same basic SiPM-based detector design is incorporated in both the Discovery MI system and the SIGNA PET/MR system (GE Healthcare), we are in the unique position of being able to compare the PET performance in CT versus MR environments. A neuroendocrine tumor patient was scanned with 68Ga-DOTATOC on the Discovery MI and SIGNA systems over 2 consecutive days at Uppsala University. For the Discovery MI, the scan began 72 min after injection of 181 MBq of 68Ga-DOTATOC and proceeded at a rate of 2 min per bed position. For the SIGNA, the scan began 69 min after injection of 148 MBq of 68Ga-DOTATOC and proceeded at a rate of 3 min per bed position. Image reconstruction for both systems used TOF OSEM with PSF (TOF OSEM + PSF) with 3 iterations, 16 subsets, and a 5-mm postprocessing filter. In addition, the Discovery MI images were reconstructed using Q.Clear with a β-value of 350. Attenuation correction was performed using a standard 2-point Dixon MR sequence.
RESULTS
Spatial Resolution
Table 1 shows the spatial resolution results for both filtered backprojection and non-TOF OSEM − PSF.
Sensitivity
At the center of the FOV, sensitivity was 14.0 cps/kBq at Stanford and 13.4 cps/kBq at Uppsala. At a radial offset of 10 cm, sensitivity was 13.8 cps/kBq at Stanford and 13.1 cps/kBq at Uppsala. Figure 1A shows the slice sensitivity profile at the center of the FOV, and Figure 1B shows the extrapolation process used to remove attenuation from the aluminum tubes in order to obtain attenuation-free sensitivity numbers.
Counting Rate Statistics
Figure 2A shows the trues, randoms, scatters, NECR, and total-event rate as a function of activity, and Figure 2B shows the system scatter fraction as a function of activity. Table 2 summarizes important counting rate metrics measured at both Stanford University and Uppsala University.
Correction Accuracy
The average counting rate error at both institutions, defined by the percentage deviation from the expected activity concentration, is shown in Figure 3 as a function of the activity in the line source. The error bars indicate the maximum positive and negative slice deviations from the average error at each activity level. The maximum deviation from expected activity at Uppsala was 3.86% at an activity of 1.19 kBq/mL, whereas at Stanford it was 2.43% at an activity of 35.44 kBq/mL.
Image Quality
The image contrast recovery and background variability of the image-quality phantom are shown in Table 3 and Figure 4. The average lung error for TOF OSEM − PSF was 4.4%± 0.1% at Stanford and 5.3%± 0.0% at Uppsala, whereas for Q.Clear it was 2.5%± 0.1% at Stanford and 3.3%± 0.1% at Uppsala. The reconstructed transverse and coronal images from the image-quality phantom are shown in Figure 5 and reveal the better noise and contrast recovery performance of Q.Clear.
Energy and Timing Resolutions
The average system photopeak energy resolution was 9.44%± 0.07% FWHM at Stanford and 9.35%± 0.05% FWHM at Uppsala. The average system coincidence time resolution was 374.1± 2.6 ps FWHM at Stanford and 376.7± 2.7 ps FWHM at Uppsala.
Comparison with Discovery 690
Figure 6 shows maximum-intensity-projection coronal PET slices, axial PET slices, and axial PET/CT slices for the Discovery MI and the Discovery 690 using TOF OSEM − PSF and Q.Clear. Because the two systems have different axial slice thicknesses, the displayed images were carefully chosen to ensure the best alignment possible.
Comparison with SIGNA
Figure 7 shows coronal-slice and maximum-intensity-projection images for the Discovery MI and the SIGNA using TOF OSEM + PSF and Q.Clear. The displayed images were carefully chosen to ensure the closest possible alignment and intensity scales between the two systems.
DISCUSSION
Intersystem Variance
Differences in the manufacturing process may explain general trends in variance between the Discovery MI at Stanford and the Discovery MI at Uppsala, such as the better counting rate performance and worse contrast recovery performance of the former. However, differences in the phantom-filling process might also have caused some of the variance in results. Imperfect filling of the 17-mm sphere at Stanford could explain its lower contrast recovery numbers, and differences in the axial confinement of the capillary tube could explain differences in axial spatial resolution. The lower sensitivity measured at Uppsala might also have been caused by variations in activity calibration.
Performance Measurement Observations
Because of the high photon sensitivity of the Discovery MI and the low clinical dosage recommended, NECR testing did not reach the system’s peak trues counting rate and activity. The amount of activity required would be so high as to require sweeping a large nonclinically relevant portion of the NECR curve. Similarly, the measured maximum relative counting rate error at both institutions occurred at activity levels that are not clinically relevant, whereas the relative counting rate error was less than 3% at both institutions for clinically relevant activity levels. A low relative counting rate error is clinically important for accurate lesion quantization, posttreatment follow-up scans, and longitudinal studies. The background activity concentration used in the image-quality phantom at Stanford, 0.6 kBq/mL, was lower than the NEMA specifications. This factor leads to a more conservative contrast recovery measurement due to a lower number of acquired counts and could explain why the contrast recovery measured for the smaller spheres was 8% lower for Stanford’s system than for Uppsala’s. A reduction in the system’s axial FOV from 20 to 15 cm (3-ring configuration) would result in a reduced sensitivity of 7.5 kcps/MBq and peak NECR of 100 kcps at 20.6 kBq/mL (16). The measured energy and timing resolutions can be improved using lower activity levels.
Clinical System Comparisons
Figure 6 shows that regardless of reconstruction method, several lesions were visible in the Discovery MI images but not in the Discovery 690 images. The likely cause is the Discovery MI’s improved sensitivity, NECR, timing resolution, and contrast recovery for small lesions, when compared with the Discovery 690, as shown in Table 4. However, direct comparisons cannot be made because of the differences in tracer uptake time and patient alignment between the two acquisitions. Comparing Q.Clear with the more traditional TOF OSEM − PSF, the intensity of lesion uptake is clearly higher for the former, with better-defined lesion boundaries. This improvement can help in surgery and radiotherapy planning or in tumor segmentation for quantitative studies on treatment effectiveness.
Likewise, direct comparisons cannot be made between the Discovery MI and the SIGNA because the patient was scanned on 2 consecutive days with slightly different amounts of activity and scanning times per bed position. However, it is clear from Figure 7 that, unsurprisingly, lesion identification capability and spatial resolution are similar between the Discovery MI and the SIGNA since they use similar block detector architectures (17). In addition, lesion visualization near the center of the body does not significantly differ between the CT-based and MR-based attenuation correction of the two systems. Comparison of the TOF OSEM + PSF coronal slices shows better noise performance for the Discovery MI than for the SIGNA, making lesion identification easier for the former. The use of Q.Clear reconstruction instead of OSEM–based reconstruction dramatically suppresses background noise while having minimal effects on lesion intensity.
Comparisons with Other Multimodal Systems
Table 4 compares published NEMA NU-2 measurements for some of the newest commercial multimodal PET scanners. The Discovery MI metrics in this table are averages of the NEMA NU-2 results from Stanford and Uppsala. The Discovery MI, SIGNA, Biograph mCT Flow (Siemens Healthcare), and Vereos Digital (Philips Healthcare) were tested using the NEMA NU-2 2012 standards, and the remaining systems were tested with the NEMA NU-2 2007 standards.
Comparisons of NEMA NU-2 2007 and NEMA NU-2 2012 must take into account slight differences between the two sets of standards. For spatial resolution tests, the 2007 standards do not report radial offsets of 20 cm or separate radial and tangential resolutions at the center of the FOV, and the 2007 standards average the resolutions of 2 sources at 10-cm radial offsets in orthogonal directions. For NECR tests, the 2007 standards do not allow shimming of the phantom, leading to larger table scatter contribution for systems with a larger range of table height adjustment. The counting rate accuracy measured by the 2007 standards uses extrapolation over the lowest 3 measurement points instead of over all points below peak NECR, and the 2007 standards also do not require corrections to be performed on reconstructed images—a requirement that can be challenging for scatter models that vary with counting rate. Most significantly, image-quality measurements in the 2007 standards use acquisitions that are twice as long as in the 2012 standards, leading to artificially favorable contrast recovery and background variability for the 2007 standards.
The spatial resolution testing showed that, taken as a whole over all 3 resolution directions and the different distances from the center of the FOV, the Discovery MI performs comparably to the other systems in Table 4.
The sensitivity of the Discovery MI is the highest of all the PET/CT systems although still lower than that of the PET/MR systems, with longer PET axial FOVs and smaller transaxial FOVs. A higher sensitivity allows for a better signal-to-noise ratio for a given acquisition time or the same signal-to-noise ratio for shortened acquisitions or reduced patient doses. The counting rate performance of the Discovery MI reinforces this observation.
Compared with the Discovery 690, the Discovery MI has a peak NECR that is 39% higher and a peak noise-equivalent-count activity concentration that is 24% lower. The higher peak NECR allows for the same signal-to-noise ratio in a 39% shorter acquisition, and even more so when accounting for the improved timing resolution and associated TOF signal-to-noise gain of the Discovery MI. The lower peak noise-equivalent-count activity concentration points to the viability of the Discovery MI as a low-dose imaging system.
The contrast recovery for small spheres is better for the Discovery MI than for any of the other commercially available systems in Table 4. Compared with the Discovery 690, which has the next best contrast recovery, that of the Discovery MI is 22% better for the 10-mm sphere, 14% better for the 13-mm sphere, 12% better for the 17-mm sphere, and 11% better for the 22-mm sphere. This better contrast recovery should lead to an improvement in the system’s ability to detect, visualize, and quantify smaller lesions.
CONCLUSION
NEMA NU-2 2012 testing of the SiPM-based Discovery MI PET/CT systems at Stanford University and Uppsala University points to improved diagnostic sensitivity for small lesions and a wide range of promising applications, from low-dose oncology studies to high-dose studies with short-lived isotopes. In addition, comparisons with other PET/CT and PET/MR systems demonstrate the substantial performance improvements possible with the new generation of SiPM-based TOF PET/CT systems.
DISCLOSURE
Craig Levin is participating in a sponsored research agreement with GE Healthcare on an unrelated topic. No other potential conflict of interest relevant to this article was reported.
Acknowledgments
For valuable contributions in the data collection process for this publication, we thank Adam Brown, Jakub Siennicki, and Tarik Cengiz from GE Healthcare; Monica Ranger and Vinh Nguyen from Stanford Healthcare; Dr. Fred Chin and the Radiochemistry Group at Stanford; and Lars Lindsjö and Prof. Anders Sundin from Uppsala University Hospital. We also thank Dr. Andrei Iagaru, for coordinating the clinical studies and the world's first installation of the Discovery MI system at Stanford University.
Footnotes
Published online Apr. 27, 2017.
- © 2017 by the Society of Nuclear Medicine and Molecular Imaging.
REFERENCES
- Received for publication January 5, 2017.
- Accepted for publication April 14, 2017.