Abstract
The aim of this work was to develop a rigorous evaluation methodology to assess performance of different acquisition and processing methods for variable patient sizes in the context of lesion detection in whole-body 18F-FDG PET. Methods: Fifty-nine bed positions were acquired in 32 patients in 2-dimensional (2D) and 3-dimensional (3D) modes 1–4 h after 18F-FDG injection (740 MBq) using a BGO PET scanner. Three spheres (1.0-, 1.3-, and 1.6-cm diameter) containing 68Ge were also imaged separately in air, at locations corresponding to possible lesion sites in 2D and 3D (590 targets per condition). Each bed position was acquired for 7 min in 2D and 6 min in 3D and corrected for randoms using delayed window randoms subtraction (DWS) or randoms variance reduction (RVR). Sphere sinograms were attenuated using the 2D or 3D attenuation map derived from the transmission scan of the patient, after scaling 2D and 3D sinograms with identical factors to ensure marginal detectability. Resulting 2D sinograms were reconstructed with filtered backprojection (FBP) and ordered-subsets expectation maximization (OSEM) without any scatter or attenuation correction (FBP-NATS and OSEM-NATS) or corrected for scatter and attenuation and reconstructed using FBP (FBP-ATT) or attenuation-weighted OSEM (AWOSEM). 3D sinograms were processed identically after Fourier rebinning. Next, reconstructed volumes were compared on the basis of performance of a 3-channel Hotelling observer (CHO-SNR [SNR is signal-to-noise ratio]) in detecting the presence of a sphere of unknown size on an anatomic background while modeling observer noise. The noise equivalent count (NEC) rate was computed in 2D and 3D for 3 different phantoms sizes (40, 60, and 95 kg) and compared with lesion detection SNR. Results: 3D imaging yielded better lesion detectability than 2D (P < 0.025, 2-tailed paired t test) in patients of normal size (body mass index [BMI] ≤ 31). However, 2D imaging yielded better lesion detectability than 3D in large patients (BMI > 31), as 3D performance deteriorated in large patients (P < 0.05). 2D and 3D yielded similar results for different lesion sizes. CHO-SNR were 40% greater for AWOSEM, FBP-ATT, and FBPNAT than for OSEM (P < 0.05), and AWOSEM yielded significantly better lesion detectability than did FBP. In all patients, RVR yielded a systematic improvement in CHO-SNR over DWS in both 2D and 3D. √NEC was characterized by a behavior similar to that of SNRCHO for the 3 different phantom sizes considered in this study.
Three-dimensional (3D) “septumless” whole-body (WB) 18F-FDG PET yields increased sensitivity to true coincidences at the expense of increased sensitivity to scattered and random coincidences. Several groups have assessed the relative merits of 3D and 2-dimensional (2D) WB-PET for BGO-based PET scanners using numeric simulations, phantom studies, or patient studies, applying various metrics. Raylman et al. (1), in a single-observer subjective visibility phantom study, reported equal performance for 2D and 3D PET, whereas Moore et al. (2) reported higher nonprewhitening signal-to-noise ratio for 3D than for 2D in a study using an anthropomorphic torso phantom. On the other hand, Farquhar et al. (3) reported significantly better performance in detecting simulated lesions on normal patient background images with 2D than with 3D WB 18F-FDG PET, with a 3D performance close to the line of chance. Lartizien et al. (4,5) reported, on the basis of human and numeric observer detectability studies using numeric simulations of the MCAT torso phantom (Mathematical Cardiac Torso phantom), better performance with 2D PET and a 740-MBq injection of 18F-FDG than with 3D PET with a 444-MBq injection. They found, however, that detectability with 3D PET was greater than or similar to that of 2D PET if a 444-MBq injection was simulated for both modes. Visvikis et al. (6) reported that human observers, asked to subjectively assess WB scans, rated 2D and 3D images similarly when overall acquisition time was held constant. LSO scanners are generally expected to perform better than BGO scanners in 3D mode due to higher count-rate capability, better timing resolution, and, therefore, better randoms rejection, as well as better energy resolution, and higher energy threshold, and, therefore, better scatter rejection. For LSO scanners, Lodge et al. found reduced image variability in patient images for 3D mode, under conditions of matched target-to-background ratio (7).
The aim of this work was to develop a rigorous evaluation methodology that allows objective assessment of the performance of different PET scanner geometries and processing schemes for variable patient sizes and that is based on a clinically meaningful task—that is, lesion detection in WB-PET. Actual patient data, rather than phantom or experimental data, acquired in multiple anatomic positions were used so that a large range of patient sizes was represented, as our hypothesis was that the relative merits of 2D and 3D imaging depend on patient size and that patient data, in contrast with phantom experiments, was crucial to capture the physiologic variability of 18F-FDG uptake in different parts of the body (8). Objective, rather than subjective or visual, assessment of image quality was performed. In addition to assessing the relative merits of 2D and 3D imaging, we assessed the impact of the processing scheme (randoms and attenuation correction, tomographic reconstruction) on image quality for different lesion sizes and patient sizes.
MATERIALS AND METHODS
Patient Studies
Thirty-two patients were included in this study, which was approved by the institutional review board of the Dana-Farber Cancer Institute, after having given informed consent consistent with institutional review board guidelines. Forty-four bed positions were acquired in 22 patients with various types of cancer randomly selected from the clinical population. In addition, 15 bed positions in both 2D and 3D modes were acquired in 10 patients participating in a pilot study of an antitumor agent for prostate cancer, which included 18F-FDG PET. Therefore, the total number of bed positions included in this study was 59, obtained from 32 patients. The body mass index (BMI = weight (kg)/height2 (m2)), as defined by the National Institutes of Health publication nos. 04-5283 and 98-4083 (9), was used to quantify patient size. In our population, BMI ranged from 18 to 50, distributed as follows: 15 bed positions were associated with patients with normal BMI (18.5 < BMI < 24.9), 21 bed positions were associated with overweight patients (25 < BMI < 29.9), 19 bed positions were associated with obese patients (30 < BMI < 39.9), and 4 bed positions were associated with extremely obese patients (BMI > 40). A weighted estimate of total attenuation volume in each bed position was also computed for each patient by summing the linear attenuation coefficients (in cm−1) over every voxel in the reconstructed transmission image; the correlation between these values and BMI was measured.
Patients were injected with 740 MBq of 18F-FDG 45 min before data were acquired in 2D mode for 60–70 min on an ECAT HR+ BGO-based WB-PET scanner (Siemens Medical Solutions, Inc.) using the WB-PET protocol used routinely in the clinic at the Dana-Farber Cancer Institute. An experienced nuclear medicine physician reviewed the reconstructed volumes and identified 1 or 2 disease-free bed positions to be subsequently imaged in both 2D and 3D. The patients were then removed from the scanner and allowed to rest for 1 h. At the end of this time (∼3 h after injection), they were encouraged to empty their bladders and were repositioned on the scanner with the first previously identified bed-position located in the scanner's FOV. 2D and 3D emission scans were interleaved (3 min-3D/3 min-2D/3 min-3D/3 min transmission/4 min-2D). A lower-level discriminator of 350 keV was used in 2D and 3D modes. After compensating for randoms and scatter (Material and Methods; Reconstruction Schemes), the time frames were summed into a 6-min 3D scan and a 7-min 2D scan for each bed position. The difference in imaging times compensates for the greater degree of bed overlap required in 3D WB-PET compared with 2D WB-PET.
Image Generation
Lesion Generation.
To model in a realistic manner “lesion-present” clinical studies with lesions present—ensuring perfect knowledge of the presence and location of each lesion—spheric lesions were added to disease-free bed positions, yielding synthetic “lesion-present” studies. To avoid errors associated with numeric simulation of lesions to be added (e.g., normalization errors, incomplete modeling of the scanner components, etc.), and to ensure that spheric lesions were added in a realistic manner to the normal images in both 2D and 3D, 3 plastic spheres filled with 74 kBq/mL of 68Ge were acquired separately in 2D (for 300 s) and 3D (for 258 s) modes at the exact location in the patient volume where a simulated lesion was to be added. To accomplish this, a grid was manufactured with 1,300 threaded holes that allowed positioning of the spheres at any desired location in the PET scanner FOV with 5-mm accuracy in all 3 directions. The spheres of 1.0-, 1.3-, and 1.6-cm internal diameter were individually acquired in air, in 2D and 3D, in 20 locations corresponding to patient anatomic regions of interest (e.g., lungs, soft tissues, bones) within the central third of the FOV (i.e., slices 21–40 of 63 slices) to ensure that the axial sensitivity profiles in 2D and 3D modes were flat (an overlap of 22 slices was used in 3D). We assumed that scatter associated with the sphere acquired in air was negligible as compared with the scatter associated with the patient. For each bed position, the most likely 10 lesion locations within the volume, out of the 20 possible ones, were identified by an experienced nuclear medicine physician. The 3D- (2D) sphere sinograms were attenuated by the 3D (2D) attenuation map generated for each patient, corrected for randoms and scaled to a contrast value that would yield the desired marginal detectability, as assessed subjectively by iterative trial and error of the whole process by a human observer, using the same factor (derived from the 2D images) in 2D and 3D. Because we determined the contrast in the 2D image, and then scaled the sphere data equivalently for 3D, the signal strength was the same in 3D—but the image contrast might well have been different (due to, for example, differences in scatter correction). Poisson noise deviates were then generated from the sphere sinograms and added to the patient sinograms to ensure marginal detectability. Figure 1 illustrates the schema used to generate the lesion-present studies.
To validate the realism of the lesion generation strategy in 2D and 3D modes, a phantom study was undertaken. An elliptic phantom (long axis = 30 cm, short axis = 22 cm) was filled with a solution of 4.63 MBq/mL of 18F-FDG, and a 1.3-cm-diameter sphere was filled with a solution of 9.0 MBq/mL of 18F-FDG. First, the elliptic phantom was positioned in the scanner and emission sinograms were acquired in 2D and 3D modes; a transmission scan was acquired as well. Next, the sphere was positioned inside the phantom at an off-centered position on the long axis of the elliptic cylinder, and a second acquisition (with the sphere inside the phantom) was performed in 2D and 3D under conditions that were identical to those of the first acquisition. Finally, the sphere was mounted on the positioning grid at a location that corresponded to its location inside the cylindric phantom and acquired alone in air. This allowed us to compare profiles as well as mean pixel noise and sphere-to-background contrast ratios in 2D and 3D in the reconstructed volumes of the spheric lesion present in the cylindric phantom with the synthetic 2D and 3D volumes obtained by adding the lesion's sinogram to that of the cylindric phantom using the same approach as that used in the patient studies. The mean pixel noise in 2D and 3D was computed as the ratio of the SD of reconstructed counts in a rectangular region of interest to the mean counts in the same region in 3 locations within 3 slices. The sphere-to-background ratio was computed in 2 steps: first, the maximum within the sphere in the central slice through the sphere was determined for each acquisition (maxsphere) as well as the mean reconstructed counts in a background region of interest in the sphere's vicinity (bkg); next, the sphere-to-background ratio was computed as: (maxsphere – bkg)/(maxsphere + bkg).
Reconstruction Schemes.
The acquisition (2D, 3D) and processing protocols (randoms correction, attenuation correction, reconstruction algorithm) were evaluated in 2 steps. First, 2D and 3D acquisition protocols were assessed for lesion detection using clinical reconstruction protocols to determine relative performance for several patient size ranges. Lesion detection SNR was calculated from each bed position, based on 30 samples per bed position (3 sphere sizes, 10 locations selected from 20 possible locations). Next, several randoms and attenuation compensation and reconstruction approaches were assessed using the same task-based figure of merit with all bed positions from all patient sizes pooled together to reduce the number of conditions explored.
In the first evaluation step, the acquired sinograms were reconstructed using the clinical reconstruction protocol. Prompt and random coincidence data were acquired separately, and random coincidences in each bed position were compensated using the delayed window randoms subtraction (DWS) approach (10). The 2D sinograms were corrected for scatter using a deconvolution with a double-sided exponential, whereas the 3D sinograms were corrected for scatter using a single-scatter, Klein–Nishina modeling technique (11). 3D data were reduced to 2D sinograms using Fourier rebinning (FORE) (12), and then 3D and 2D data were reconstructed iteratively using the same reconstruction algorithm and parameters (attenuation-weighted OSEM, 8 subsets, 2 iterations) recommended by the manufacturer at installation time for WB 18F-FDG studies performed in clinical routine on the scanner. Regularization was achieved using a 3D gaussian-kernel sieve of 6-mm full width at half maximum. The reconstructed pixel size in the 128 × 128 × 63 image volume was 0.51 × 0.51 cm2 in the transverse plane; the plane separation was 0.24 cm. The 2D and 3D volumes for each bed position were also reconstructed in the same manner as that described without adding any lesions, to yield the background “lesion-absent” volumes used in the observer study. In summary, 590 lesion-present images and 590 lesion-absent images were generated for each of 3 sphere sizes for both 2D and 3D, by combining 59 bed positions (normal patient images) with images of spheres at each of 10 locations (selected from 20 possible locations based on the clinical likelihood of tumor presence in each location) in patients with BMIs ranging from 18 to 50. The lesion-absent image at a given location was the same for the 3 sphere sizes. These images were used to evaluate the performance of 2D and 3D acquisition modes as a function of patient size. In the second evaluation step, 16 reconstruction/processing schemes were evaluated using all datasets (1,770 lesion-present datasets (59 bed positions × 10 locations × 3 sphere sizes), and 590 lesion-absent datasets (59 bed positions × 10 locations)):
Two random coincidence correction methods were considered: the standard delayed window subtraction approach (DWS) (10) and the randoms variance reduction approach (RVR) (13,14). In the latter, random sinograms were unmashed by interpolation, and Casey smoothing was performed within each plane (the solution to the randoms variance reduction problem is approximate in this case but yields very similar results to the exact solution (14)). Next, the data were remashed and subtracted from the prompts to yield true coincidence estimates. After random coincidence correction, the time frames were summed into a 6-min 3D scan and a 7-min 2D scan for each bed position.
The scatter correction methods used were the deconvolution using a double-sided exponential in 2D and the single-scatter, Klein–Nishina in 3D (11). Attenuation correction was performed either at the sinogram level or during iterative reconstruction. Therefore, sinograms obtained after summing the data into 6-min 3D or 7-min 2D scans were either reconstructed without scatter or attenuation correction using filtered backprojection (FBP-NATS) or ordered-subsets expectation maximization (OSEM-NATS, 8 subsets, 2 iterations), or corrected for scatter and reconstructed using FBP, after correcting for attenuation (FBP-ATT), or while modeling attenuation in the projector/backprojector of an attenuation-weighted iterative algorithm (AWOSEM). Postreconstruction smoothing was performed using a 3D gaussian-kernel of 6-mm full width at half maximum. In summary, 1,770 lesion-present and 590 lesion-absent datasets were considered for each of the 16 conditions evaluated that corresponded to different acquisition and processing schemes (2D, 3D) × (DWS, RVR) × (FBP-NATS, OSEM-NATS, FBP-ATT, AWOSEM).
Channelized Hotelling Observer
The acquisition and processing schemes were assessed on the basis of performance of a model observer in detecting the presence of a spheric lesion of unknown size on an anatomic background. The model observer was a 3-channel Hotelling observer (CHO) (15), by which the 32 × 32 pixel subimage data were processed through the frequency channels that are believed to exist in the human visual system. The 32 × 32 image corresponded to the slice with the greatest signal from a lesion located in pixel (16,16) at the center of the image (only 1 lesion was present in any 32 × 32 image). The CHO-SNR is given by:Eq. 1where Δf is the mean interclass channel output difference vector and S2 is the intraclass scatter matrix, calculated from the (channelized) covariance matrices M1 and M2 of the 2 classes (lesion-present and lesion-absent) being discriminated by We used a 3-channel difference-of-gaussians Hotelling observer (15) with (radially symmetric) channel profiles C0, C1, and C2 given by:Eq. 2where ρ is the spatial frequency, j = 1,3 indices the channels, σj = σ0 2j-1, and σ0 = 0.052. The parameters of this observer model are appropriate for a viewing distance of 60 cm and a displayed pixel size of 0.51 mm; the effect of the observer noise on task performance was incorporated by doubling the diagonal elements of the channel covariance matrix (15). The lesion detectability performance estimated with the CHO, dA, can be related to the corresponding area under the receiving-operator-characteristic curve (Az) by:Eq. 3where erf−1 is the inverse error function. Figure 2 shows the difference-of-gaussian channel profiles used in this study. The 590 lesion-present and 590 background (lesion-absent) images were used to compute the CHO detection SNR for the 3 sphere sizes, in each 32 × 32 subimage, yielding 1,770 lesion-present images for each of the 16 conditions considered in this study. Because the CHO-SNR were averaged over sets of subimages that were derived from various body locations, we were concerned that differences in background means among the locations might lead to spurious low-frequency covariance components in the pooled intraclass covariance matrices. To ensure that our results were not affected by this phenomenon, we examined the individual channel output values for evidence of clustering.
Noise Equivalent Count (NEC) Rate
The performance of 2D and 3D acquisition modes in the task of lesion detection as determined by CHO-SNR was compared with a widely used global metric, the NEC. This metric has the advantage that is easy to compute; it does not, however, account for the scanner's spatial resolution, variable sensitivity in the FOV, or noise correlation in the reconstructed tomographic volume. Three phantoms were considered: The first phantom was the National Electrical Manufacturers Association (NEMA) NU-2 standard phantom that consisted of a 70-cm-long polyethylene cylinder of 20-cm diameter with an off-center line source (16). The second phantom was obtained by fitting an external collar over the 20 × 70 cm cylinder, yielding an effective phantom diameter of 27 cm. The third phantom was obtained by fitting a third external collar yielding an effective phantom diameter of 35 cm (17). The 3 phantoms corresponded to patients weighing 40, 60, and 95 kg, respectively, with assumed BMIs of 18.5, 23.5, and 31, respectively. The line source inside the phantom was filled with 740 MBq of 18F-FDG, and the phantoms were scanned in 2D and 3D modes 3 h after filling to mimic the clinical conditions under which the detection study was performed in the patient population described in this work. The ratio of the square-root values of NEC in 2D and 3D for the 3 different phantoms sizes was plotted along with the lesion detection SNRCHO results obtained in the patient population for reference.
RESULTS
Validation of Image Generation
Thirty-eight (64%) bed positions were obtained in 21 normal-weight and overweight subjects (BMI < 30), whereas 21 bed positions (36%) were obtained from 11 obese or extremely obese subjects (BMI ≥ 30). Because different bed positions in the same patient (same BMI) can correspond to different levels of attenuation, we correlated the BMI to the total attenuation present in each bed position. Figure 3 shows that BMI and total attenuation correlated very well in the population studied in this work (r2 = 0.924). Therefore, we used BMI as a surrogate of total attenuation.
Adding synthetic lesions to real background images yielded realistic studies. We found very good agreement between the synthetic lesion-present study (sphere added to uniform cylinder) and the true lesion-present study (sphere acquired inside the cylinder), both in 2D and 3D based on the profiles shown in Figure 4 and the mean pixel noise and lesion-to-background contrast values. The mean pixel noise in the synthetic lesion-present study was 9.72% ± 0.41% in 2D (4.64% ± 0.24% in 3D) as compared with 8.03% ± 0.34% in the case of the 2D true lesion- present study (3.19% ± 0.21% in 3D) (not statistically significant Student paired t test). The sphere-to-background contrast was 0.264 in 2D in the case of the synthetic lesion-present study (0.305 in 3D) as compared with 0.276 in the case of the 2D true lesion-present study (0.293 in 3D).
Lesion Detection as a Function of Acquisition Mode
Figure 5 shows lesion-present (arrow) coronal slices through studies of patient with normal BMI (BMI = 18, 2 lesions), patients who are obese (BMI = 33), and extremely obese (BMI = 50) in 2D and 3D modes reconstructed using the clinical protocol (AWOSEM). In the patient with normal BMI, the lesion is clearly more visible in 3D than in 2D, whereas the lesion is more easily discernable in 2D than in 3D in the extremely obese patient. The visual assessment is confirmed by the objective assessment of lesion detectability. The channelized Hotelling SNR, computed over all lesion sizes and locations (30 conditions), is shown as a function of BMI in 2D and 3D (Fig. 6A). The ratio of SNRCHO (2D) to that in 3D (Fig. 6B) is <1 for low values of BMI but becomes >1 for high values of BMI. The best straight-line fit between the ratio of SNRCHO (2D) to SNRCHO (3D) and BMI intercepts the unity line at BMI ≈ 31. The mean SNRCHO, computed by pooling the 10 lesion locations and 3 sphere sizes and patients with BMI ≤ 31 was significantly higher in 3D mode (2.77 ± 0.89) compared with 2D mode (2.18 ± 074), P < 0.025 (Student paired t test). However, the mean SNRCHO pooled over all lesion sizes and locations and patients with BMI > 31 was significantly higher in 2D mode (2.65 ± 0.61) than in 3D mode (2.25 ± 0.53), P < 0.025 (Student paired t test). When all patients were pooled, the mean SNRCHO was not significantly different between 2D and 3D modes. The ratio of square-root NEC measured in 2D and 3D at 3 different BMI values was characterized by a behavior similar to that of SNRCHO (2D)/SNRCHO (3D), with values of 0.70, 0.80, and 0.88 for √NEC (2D)/√NEC (3D) for the 20 × 70, 27 × 70, and 35 × 70 cm cylinders, respectively. These values were well within the patient variability observed for SNRCHO (2D)/SNRCHO (3D) in our patient population.
Table 1 shows the SNRCHO calculated for the pooled population of 32 patients (59 bed positions) and 10 locations (10 locations per bed position) in 2D and 3D modes for 1.0-, 1.3-, and 1.6-cm lesions. Our results show that no statistically significant difference was seen between 2D and 3D as a function of lesion size. As expected, SNRCHO significantly increased as the lesion size increased from 1-cm diameter to 1.3 cm (17%) and 1.6 cm (37%), P < 0.05.
Figure 7 is a 3D scatter plot of the individual CHO output values. Channels 1, 2, and 3 are the spatial frequency channels, defined in Equation 2, through which the image data are processed, yielding the 3 channel outputs. It is evident that if values are clustered around different means for various body locations, the differences between such means are small compared with the overall dispersion. This gives us confidence that our approach, which involves averaging over disparate body locations, is valid. If there had been clustering, then we would have observed distinct clouds of points. The fact that we did not implies that our pooled covariance matrix was not affected by any differences in background means among various body locations.
Lesion Detection as a Function of Processing Scheme
Figure 8 shows coronal slices of a patient with normal BMI (BMI = 18) and an obese patient (BMI = 38) in 2D and 3D reconstructed with FBP with and without attenuation and scatter corrections, OSEM without attenuation and scatter corrections, and AWOSEM. The arrows point to synthetic lesions added in the mediastinal area (2 lesions are shown for BMI = 18, 1 lesion is shown for BMI = 38). The differences between the different processing protocols are clearly visible in images of the obese patient, where the lesion becomes extremely hard to see with OSEM-NATS compared with AWOSEM. Table 2 shows the SNRCHO computed over all 59 bed positions and 10 sphere locations and 3 sizes (1,770 conditions) for the 8 processing schemes evaluated in 2D and 3D. AWOSEM, FBP-ATT, and FBP-NATS yielded significantly better lesion detection SNRCHO (40%) than OSEM (P < 0.05) in both 2D and 3D. AWOSEM yielded also a significant improvement in SNRCHO compared with FBP-ATT and FBP-NATS (14% with RVR and 7% with DWS) (P < 0.05). In all patients, RVR yielded a systematic improvement in SNRCHO over DWS in both 2D (6%, not statistically significant [NS]) and 3D (13%, P < 0.05) acquisition modes. Furthermore, in patients of typical size (BMI < 33), SNRCHO was 20% higher with DWS in 3D than with DWS in 2D (P < 0.05) and 15% higher with RVR in 3D than with RVR in 2D. On the other hand, 2D significantly outperformed 3D in large patients for lesion detectability, both with DWS (17% higher SNRCHO) and RVR (13% higher SNRCHO) (P < 0.05).
DISCUSSION
The aim of this study was to develop an evaluation methodology for the assessment of different acquisition and processing schemes in WB 18F-FDG PET using a task-based metric—that is, lesion detection as measured by a channelized Hotelling observer. Our approach provides clinically realistic images in which ground truth regarding the presence or absence of a lesion is known. Because the aim of this work was to assess lesion detection and not the determination of lesion morphology or density, only spheric lesions were considered. This is a reasonable model for small metastatic lesions as seen routinely, for example, in lung, liver, or breast cancers. This approach was used to assess the performance of 2D and 3D acquisition modes, as well as different processing schemes, including random event variance reduction and delayed window subtraction, FBP, and OSEM reconstruction algorithms, with and without scatter and attenuation corrections. In contrast with several published studies that reported comparison of 2D and 3D for a given phantom size (1,2,5,18), this study included a population of patients with a large range of BMI to make possible the assessment of the impact of patient size. This is crucial, as our results show that the relative performance of 2D and 3D modes correlated with BMI. Furthermore, the use of clinical data, in contrast with numerically simulated data, ensured that all physical factors—such as scatter, attenuation, randoms, and dead time—were correctly modeled for all body sizes and physiologic activity distributions. This gives us confidence that any conclusions drawn from the data can be extrapolated to the clinical setting.
One of the major difficulties of using clinical data is the absence of a gold standard. Unlike previous studies (6,7), this study had the advantage of a gold standard, because the lesion-present cases were generated from artificial lesions that were added to the raw sinograms. Our methodology for adding the lesions was validated in cylindric phantom experiment, and the realism was verified by ensuring that likely locations of disease were chosen by an experienced physician for each bed position. Performing 2D and 3D imaging at the same time ensured imaging under identical physiologic (redistribution of the tracer, specific uptake) as well as physical (counting rate, scatter and randoms fractions, radioactive decay) conditions. The limitation of this approach, however, is that 2D and 3D imaging was performed at one time point that might not be optimal for both 2D and for 3D. Previous reports on phantom data have shown that optimal 2D and 3D imaging using BGO scanners correspond to different injected doses (18), implying that a given imaging time cannot be optimal for both acquisition geometries. In fact, it is most likely that the imaging time was close to optimal for 3D (injection of 740 MBq and 3-h delay) but not for 2D. As a result, we would expect that under optimal 2D imaging conditions, lesion detection performance would improve for 2D, yielding better lesion detection SNR than does 3D at lower BMI values.
In PET, the linear attenuation coefficient (LAC) at every point in the patient is proportional to the density of the tissue at that point. Therefore, one would expect the integral value of the LACs in the transmission images to be perfectly correlated to “partial patient weight”—that is, the weight of tissue in the FOV. In this work, we have considered BMI instead of partial patient weight and reported an r2 of 0.9244 (r = 0.961) between BMI and total attenuation. The reason for using BMI instead of partial patient weight is a practical one: when deciding whether to perform 2D or 3D imaging, one needs to know before starting the scan what protocol to use, and one cannot compute the sum of LACs before scanning the patient but can easily compute the patient BMI.
The task considered in this work was lesion detection, measured by SNRCHO (15,19,20). Although lesion detection performance is best assessed using human observer studies (e.g., receiver operating characteristic [ROC] (3), Localization ROC [LROC] (3), or Alternative Free-response ROC [AFROC] (5)), the numeric observer approach allows more rapid assessment. The numeric observer we used, the CHO, has been shown to be a good surrogate of human observers under a wide range of conditions (15,21). Because the 30 samples were 3 sphere sizes and 10 locations, these numbers multiply as each sphere size was placed at each of the 10 locations. Although every effort was made to ensure that the 3 sphere center planes with the highest intensity corresponded to at least 2 different slices, 3D gaussian regularization of the reconstructed volume will have introduced correlations between the backgrounds used for the 3 sphere sizes that were neglected in this work.
The background images were portions of the patient anatomy that did not overlap and, therefore, each bed position's CHO-SNR was based on a sample of 30 independent noise backgrounds. A good strategy for evaluating system performance would be to use a numeric observer to identify the optimal schemes of acquisition or processing and then validate these findings in a smaller study using human observers. Because we used the entire set of images both to estimate the scatter matrices and to obtain the performance estimates, it is possible that our results are affected by resubstitution bias. However, even if this is true, both modalities should be affected equally, and our comparisons should be valid.
In this work we found that AWOSEM yielded a significant improvement compared with FBP and that AWOSEM, FBP-ATT, and FBP-NAT yielded significantly better SNRCHO than did OSEM. These results are consistent with previously published results by Lartizien et al. (22), who reported region-dependent performance of the reconstruction algorithms with lower performance in lesion detection with OSEM than with AWOSEM and FBP in non-Monte Carlo simulated projections of the MCAT torso phantom that were corrected for scatter and attenuation in the hepatic region, and similar performance of FBP and AWOSEM in that region. The poor performance of OSEM, especially for large patients, is probably caused by inconsistencies between the model and the physics of the data acquisition. We are presently performing detailed comparisons of the different processing schemes as a function of the lesion location. We also reported significantly better values of SNRCHO in 2D mode for patients with a BMI > 31, but higher SNRCHO in 3D mode for patients with a BMI ≤ 31, whereas no statistical difference was observed between 2D and 3D when all patients were pooled. The latter finding could explain the results reported recently by Visvikis et al. (6), who found similar image quality in a qualitative assessment of 2D and 3D scans.
We found that random event variance reduction yielded a systematic improvement in SNRCHO over the delayed window subtraction of random events in 2D (6%, NS) and 3D (13%, P < 0.05) acquisition modes. Because the delayed channel in DWS is also subject to Poisson noise, a direct subtraction of the delayed coincidence measurement results in an increase of the effective noise contribution of random coincidences to the data, whereas a variance reduction approach similar to that currently used in normalization of PET cameras does not.
This work pertains to a PET scanner equipped with BGO crystal detectors, and our findings may not translate to scanners equipped with faster cerium-doped LSO, LYSO, GSO, or LaBr3 detectors. These scanners may have reduced sensitivity to random coincidences and, depending on the detector and electronics design, less dead time and even time-of-flight capability. In addition, such scanners may have improved energy resolution and, thus, reduced scatter fraction. These factors would all act to improve 3D imaging over 2D imaging, perhaps especially for obese patients. Our results may also not be applicable if other reconstruction techniques are used, as it is not clear that FORE followed by OSEM reconstruction represents the optimal treatment of 3D PET data. Our methodology can be easily applied to studies performed on any scanner and is not limited to comparisons of 2D and 3D mode imaging—it could also be used to compare other acquisition parameter choices or different reconstruction techniques. We are presently pursuing this avenue of research.
CONCLUSION
We have compared the relative merits of different acquisition (2D vs. 3D) and processing (data correction and tomographic reconstruction) schemes in 59 PET bed positions acquired in 32 patients with BMI ranging from 18 to 50. Our results show that 3D imaging yielded better lesion detectability than 2D in patients of typical size (BMI ≤ 31) on our BGO scanner. However, 2D imaging yielded better lesion detectability than 3D in large patients (BMI > 31) as 3D performance deteriorated in large patients. 2D and 3D yielded similar results for different lesion sizes. Furthermore, attenuation-weighted OSEM yielded significantly better lesion detectability than did filtered backprojection and OSEM without scatter and attenuation corrections.
Acknowledgments
This work was supported by a grant from the Society of Nuclear Medicine Education and Research Foundation and in part by grants RO1-EB005876, RO1-EB000802, and RO1-EB001989 from the National Institutes of Health (NIH). The contents of this article are solely the responsibility of the authors and do not represent the official views of the NIH. The authors thank Dr. Suleman Surti for providing the phantom collars and Dr. Craig Abbey for useful discussions.
Footnotes
-
COPYRIGHT © 2007 by the Society of Nuclear Medicine, Inc.
References
- Received for publication July 29, 2006.
- Accepted for publication August 24, 2007.