|
|
|||||||||
Basic Science Investigations |
1 Department of Diagnostic Radiology, University of Maryland, Baltimore, Maryland
2 Department of Biomedical Engineering, Lerner Research Institute, The Cleveland Clinic Foundation, Cleveland, Ohio
3 Biomedical Engineering Center, The Ohio State University, Columbus, Ohio
4 Department of Radiology, Lahey Clinic, Burlington, Massachusetts
5 Department of Radiology, Medina General Hospital, Medina, Ohio
6 Department of Molecular and Functional Imaging, The Cleveland Clinic Foundation, Cleveland, Ohio
| ABSTRACT |
|---|
|
|
|---|
Key Words: image registration elastic registration whole-body PET/CT multimodality image fusion
| INTRODUCTION |
|---|
|
|
|---|
Registration (alignment) of whole-body PET and CT images is a prerequisite for their meaningful fusion. Before the advent of combined PET/CT scanners, this registration could only be performed manually or using software methods. The manual approach is slow and tedious and generally corrects for rigid misalignment only. For higher accuracy, software methods implementing elastic image registration have been reported (68). Combined PET/CT scanners, on the other hand, perform image registration "mechanically" by performing the 2 scans successively while minimizing patient movement through the use of a common couch and then appropriately "sliding" one scan with respect to another.
The combined PET/CT scanners have undoubtedly simplified multimodality image registration and fusion. They have even sped up the overall scan time up to 40% (3) using the CT scan for PET attenuation correction. Nonetheless, the underlying registration mode remains rigid, which cannot compensate for involuntary nonrigid motion of thoracic and abdominal organs. Differences arise mainly due to varying breathing patterns. Whereas CT is generally performed with breath-hold or shallow breathing, longer PET scans are performed with the subject engaged in tidal breathing. The resulting misalignment in whole-body images is especially pronounced in the area of the diaphragm. In fact, up to 96% of combined PET/CT examinations have been reported to contain respiratory motion artifacts (9), with a target registration error as high as 11 mm (10). Developing and testing different breathing protocols to minimize motion artifacts in combined PET/CT images remain an area of active research (911). Therefore, despite the huge success of combined PET/CT scanners, it can be argued that the need for software-based (algorithmic) image registration persists and that its ability to correct for both rigid and nonrigid misalignments makes it applicable to whole-body CT and PET images, whether collected separately or on a combined scanner.
A software-based whole-body CT and PET image registration method must be automatic and fast and accommodate deformable anatomy to be clinically successful. These requirements, as we have argued before (12), rule out fiducial marker-based methods and make segmentation-based methods less desirable. Segmentation often requires manual intervention and can limit the overall accuracy of image registration. Even if it is automatic, the accuracy of nonrigid matching degrades away from the segmented features. For example, if the contour of the thorax is used for nonrigid registration of whole-body images as proposed by Slomka et al. (8), alignment of the abdominal features remains suspect. Voxel-based methods, especially those that maximize mutual information (MI) (1315) or, its variant, normalized mutual information (NMI) (16), have emerged as the most accurate and robust measures (17,18) currently available for image registration. Because MI (or NMI) is a statistical measure that does not expect a direct correlation of intensities among homologous regions, it is a perfect measure for multimodality registration such as that between PET and CT. Successful application of multimodality MI/NMI-based image registration has been reported for many organs and imaging modalities (17,1921). Our preliminary report (22), as well as works of others (8,23), demonstrates the potential of MI (or NMI) for whole-body elastic PET/CT registration.
We report here the development of a new fully automated algorithm for 3-dimensional (3D) elastic image registration. Following the description of the algorithm, we report its application to whole-body PET and CT images from separate as well as combined PET/CT scanners. We have used the opinion of 3 experts as a gold standard for validating the algorithm and found the algorithms performance on par with an experts ability to perform elastic registration using anatomic landmarks.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Fifteen image pairs consisted of PET and CT acquired on separate scanners. Contrast-enhanced CT images were acquired (Somaton Plus; Siemens) with the subject involved in breath-holding at maximum inspiration and arms stretched over the head. The subject was involved in normal tidal breathing with arms beside the body during PET image acquisition (ECAT HR+; Siemens) lasting approximately 4050 min. 18F-FDG was injected as radiotracer about 4560 min before scanning. Depending on the scanner settings (for optimum magnification of anatomy of clinical interest) at the time of image acquisition, CT images measured 512 x 512 x 7090 voxels with voxel dimensions in the range 0.91.2 x 0.91.2 x 5 mm. The typical image size for PET was 128 x 128 x 135160 voxels, with cubic voxels of dimensions 5.15 x 5.15 x 5.15 mm. The attenuation-corrected PET emission scans, reflecting functional information for the entire body, generally had a larger field of view than the corresponding CT images. The time difference between acquisition of CT and PET images varied from a few minutes for some cases to a few days for others, dictated by clinical need, availability of equipment, and clinical scheduling issues at the time of acquisition.
We also performed registration of 5 cases consisting of PET and CT images acquired on a combined PET/CT scanner (Biograph Sensation 16; Siemens). During CT and PET image acquisition using the combined scanner, the subject was instructed to keep the arms beside the body and perform shallow breathing. 18F-FDG was injected as the radiotracer about 4560 min before scanning. At the time of scanning, the low-dose noncontrastenhanced CT image was acquired first, followed by the PET image acquisition. No transmission PET scan was acquired, and the attenuation correction for PET was performed using the CT image. The typical image size for CT was 512 x 512 x 307404 voxels with a voxel size of 0.78 x 0.78 x 2.5 mm, whereas the typical image size for PET was 128 x 128 x 154202 voxels with a voxel size of 5.31 x 5.31 x 5 mm.
To achieve a trade-off between maintaining CT resolution and obtaining nearly isotropic voxels, we resampled the CT images for all cases to get nearly isometric voxels with dimensions of 1.82.4 x 1.82.4 x 2.5 mm, depending on the original voxel dimensions. Resampling reduced the spatial resolution of CT images; however, the resulting images still had better spatial resolution than the PET images (the lower resolution image controls the accuracy of image registration in general) and nearly isotropic voxels. No preprocessing steps were used for the PET images.
Registration Algorithm
The algorithm is based on maximization of NMI and uses a hierarchical (pyramid) volume subdivision scheme to elastic image registration. Our algorithm initially recovers the global rigid-body mismatch between the reference (CT) and the floating (PET) image, followed by hierarchical refinement of the localized matching between the globally registered images. Initial development of the algorithm has been reported previously (22).
Global registration uses the 6-parameter rigid-body transformation model and is based on maximization of NMI between the 2 images. Next, we implement a hierarchical octree-based volume subdivision scheme. At each subdivision level, the floating image is registered with the individual subvolumes of the reference image, considered one at a time. Each subvolume registration continues until the iterative optimization process converges to a transformation solution (% NMI change < 0.1) or the maximum number of iterations (200) is reached. When all subvolume registrations at a given level are performed, the algorithm continues on to the next level of subdivision. Volume subdivision and subvolume registration continue until the voxel count for an individual subvolume remains above a predefined limit of 163. This lower threshold was selected on the basis of the observation from pilot experiments for the given pair of modalities, which showed that over half the subvolume registrations with less than 163 voxels per subvolume failed to converge to a solution and thus did not significantly and robustly improve the existing alignment. Subvolume registration also uses a 6-parameter rigid-body transformation model. Initial seeding of the undivided floating image with respect to each subvolume is given by the transformation obtained from registration involving the floating image and the corresponding parent subvolume at the previous level of subdivision.
Subvolume registration is also based on maximization of NMI. During calculation of NMI for any given subvolume registration, we use the prior registration information for all remaining parts of the image, available from the previous hierarchical level. The prior information is assimilated into subvolume registration during calculation of NMI, by compiling the mutual histogram as a sum of 2 separate mutual histograms: MHsubvolume, calculated for the subvolume voxels, and MHprior, calculated for all remaining voxels of the image, with transformations derived from the immediately preceding hierarchical level (22). The typical number of bins selected for the mutual histogram was 128 x 128, based on calculations of optimal bin width as described (24). We implemented the steps in the NMI approach as outlined previously (16).
If the result of a given subvolume registration causes the effective subvolume displacement at that level above a preset threshold, we discard the solution and retain the transformation from the previous subdivision level. At each hierarchical level, this threshold is set equal to 25% of the smallest subvolume dimension at that level. This provides a constraint on the subvolume registration and aids in maintaining image integrity by preventing individual subvolumes from drifting far off from their starting positions at each hierarchical level.
After registration at the last hierarchical level, the rigid-body transformations are assigned to the centers of the respective subvolumes. For every voxel in the reference image, a unique mapping transformation is determined by performing tricubic interpolation between the subvolume centers. The 3D translational component of the transformation is interpolated separately as 3 scalars, due to the independence of the translations along the 3 coordinate axes. We perform interpolation of 3D rotation in the quaternion domain, by converting the rotational matrices into quaternions (Appendix). The concept of quaternions allows a unique and complete representation of 3D rotation with elegant formulations of a range of interpolation methods. We use the equivalent of cubic interpolation for quaternions called spherical cubic interpolation or squad (spherical quadrangle) as introduced by Shoemake (25). Next, using the transformation field, we resample the original PET image (22) to generate a continuous elastically transformed PET image that aligns well with the CT image.
Validation
We evaluate the accuracy of the whole-body PET/CT registration by comparing the alignment of several anatomic landmarks (3D locations within the images) as predicted by the algorithm against a reference. Because of the lack of a gold standard for this registration application (23), we assume the ability of clinical experts to locate landmarks in both CT and PET images as a suitable benchmark performance. We contend that comparing the variability in landmark matching between algorithm- and expert-defined registrations with the variability among the 3 expert-defined registrations is a reasonable way to estimate the 3D registration accuracy of the algorithm and assess whether its performance is comparable to that of the experts.
Three clinical experts, experienced in interpreting whole-body PET and CT images, were involved in the validation procedure. All experts were unaware of the identities of the subjects. Each expert was asked to identify and mark anatomic landmarks identifiable in both imaging modalities from a list of 32 well-described anatomic landmarks. Examples of anatomic landmarks are right and left lung apex, upper and lower tips of kidneys, dome of the liver, any cancerous nodule, and so forth. The experts were asked to mark the maximum possible landmarks from this list, provided they were confident of identifying a given landmark in both CT and PET images. They, on average, spent 2030 min marking all such landmarks in a pair of PET/CT images. Three thin-plate splinebased elastic deformation fields were generated based on the homologous landmarks identified by the 3 experts, representing the respective expert-defined PET/CT elastic image registration.
Because the location of a specific landmark as marked by an expert varied slightly from expert to expert, a set of "test landmarks" was created for each case separately, by defining the location of each landmark as the centroid of the expert-defined locations for that landmark in CT. The expert-defined deformation fields were then used to determine distinct sets of homologous PET landmarks (PET1, PET2, and PET3, respectively), each representing the transformed locations of the test landmarks according to the manual registration performed by one of the experts independently. The average expert-defined transformed locations (PETEXPERT) were determined as the average of PET1, PET2, and PET3. The algorithm-determined transformation field was used to determine a set of landmarks in the PET image (PETALGO) representing the transformed locations of the test landmarks after the automatic elastic registration. Figure 1 graphically illustrates the concept of our validation approach.
|
The results of the validation study were analyzed separately for the 15 cases involving images acquired on separate scanners and for the 5 cases from the combined PET/CT scanner. For the 5 combined PET/CT cases, we tabulate the mean difference values for mechanical registration against the experts before presenting similar data for the algorithm. Decrease in the variability after elastic registration would indicate the inability of combined PET/CT scanners to achieve perfect alignment and the ability of our algorithm to improve on their result.
| RESULTS |
|---|
|
|
|---|
Figure 2 shows an example of improvement in image alignment after elastic registration as compared with only rigid-body registration for PET and CT acquired on separate scanners. Figure 3 highlights the residual misalignment in PET/CT images acquired on the combined scanner and the subsequent improvement in image alignment after registration. Figure 2 shows the ability of the registration algorithm to improve alignment of a cancerous mass as seen in CT and PET images.
|
|
Table 1 summarizes the analysis of the interobserver variability in landmark identification averaged over the 15 cases having PET and CT from separate scanners. The results have been analyzed for the overall anatomy and then separately for the thoracic and abdominal regions of the anatomy. It can be seen from Table 1 that for a given region (overall/thoracic/abdominal), the value of the mean difference in transformed locations for any given group lies within the 95% confidence interval (CI) of the mean for the remaining 3 groups, indicating that the algorithm can replace any expert without increasing the interobserver variability in registration. On the basis of these results it can be concluded that the performance of the algorithm-determined registration is comparable to all 3 expert-defined registrations.
|
= 0.05) in the mean difference values for groups 24 in Table 2 when mechanical registration was replaced with our algorithm. The difference between the algorithm and the mechanical registration is also statistically significant for registration in the thoracic region, but not so for the abdominal region, indicating that both registrations are comparable for features of the abdominal anatomy.
|
| DISCUSSION |
|---|
|
|
|---|
Accuracy is paramount to the success of any image registration method; therefore, we adopted a very rigorous validation approach, despite being labor intensive and time-consuming. A common problem with evaluating the performance of multimodality elastic registration is the lack of a gold standard. Expert opinion, on which often the clinical care rests, appears to be the closest substitute for a gold standard. Because experts generally differ among themselves, it is not possible to demonstrate that a new method is superior to a human expert even if that were the case. What is possible, however, is to show that the new method is at least equivalent in performance to an average human expert. Comparable interexpert and 2 experts + algorithm variabilities in Tables 1 and 2 led us to conclude that our algorithm performed as well as an average human expert and could possibly be viewed as the fourth expert. We also observed slightly higher mean difference values for images from a combined PET/CT scanner (Table 2) than those from separate scanners (Table 1). This could be attributed to low-dose noncontrastenhanced CT images. Though none of the experts explicitly complained about this fact, lower contrast could have impacted their selection of landmarks, leading to increased variability. Another possibility is the smaller sample size (5 vs. 15 cases) used for combined-scanner images.
Previously reported elastic registration methods for PET/CT have been validated either only qualitatively or with not as much rigor. Tai et al. (7) used the number of mismatched voxels as a measure for registration accuracy. Slomka et al. (8) used indirect means such as reproducibility tests, algorithm convergence, and qualitative visual assessment for validation of their algorithm. To the best of our knowledge, only Mattes et al. (23) have reported registration accuracy in terms of physical dimensions for whole-body PET/CT registration. However, they evaluated only the in-plane translational accuracy of the registration by using several reformatted sagittal, coronal, and axial images and reported 0- to 6-mm error range for the thoracic region versus 6- to 11-mm error range for the abdominal region. The mean registration error and other results presented in Tables 1 and 2 from our comprehensive validation approach compare favorably with the registration accuracy reported by Mattes et al. (23).
Our reported execution time of 4575 min compares favorably with approximately 100 min for the multiresolution free-form deformation (FFD)based approach reported (23) for similar but smaller images (in-plane voxel count about a quarter of our images). This is because an image subdivision-based approach (like ours) is computationally simpler (the transformation mode remains rigid locally) as compared with the FFDbased approach, making our algorithm more efficient to compute and, hence, faster without compromising accuracy. Independent of the current work, we have developed a new hardware platform called FAIR (fast automatic image registration) for accelerated execution of MI/NMIbased image registration (12,26,27). Further reduction in the reported execution time is possible when we implement our algorithm on the FAIR platform.
The question of whether to use emission or transmission PET often arises in PET/CT registration. The transmission scan, for being similar to CT, has been successfully used by many investigators (7,23). The emission scan is substituted for the transmission scan after registration in these methods. Although inadequate for clinical decision-making, the nonspecific uptake of the PET tracer by different organs in emission scans often permits using them directly in image registration. In a comparative study, Skalski et al. (28) reported achieving higher accuracy with the transmission scan, but perhaps the optimal approach is to use a weighted sum of the 2 images, as suggested by Slomka et al. (8). In general, when working with separately acquired CT and PET and using the transmission scan for registration, any movement between the transmission and emission scans cannot be corrected. Transmission images, which offer poorer contrast resolution in the abdominal region, may also lower registration accuracy in the abdominal region compared with that in the thoracic region, as Mattes et al. (23) report. Indeed, no such debate exists for combined PET/CT scans because the CT itself is used for attenuation correction and a transmission scan does not exist. Thus, any registration must use emission PET. The accuracy of our method validates the use of the emission scan for PET/CT registration. Using a misregistered CT scan for attenuation correction in PET in combined PET/CT can itself be a source of misalignment and uptake errors (29). Further improvement of whole-body PET/CT image registration by using motion-corrected CT images for PET attenuation correction constitutes a future direction of our research.
Software image registration is flexible, powerful, and versatile (30) and we contend that it will continue to be needed despite the increasing deployment of combined PET/CT scanners. Combined PET/CT scanners neutralize rigid misalignment but not nonrigid misalignment. For applications demanding high registration accuracy, such as intensity-modulated radiotherapy, our method can be used to refine combined PET/CT images. Indeed, our method can be used in facilities lacking a combined scanner and for retrospective registration of an existing diagnostic CT scan with a combined PET/CT scan. Additionally, serial comparison of combined whole-body PET/CT images acquired at different times can only be accomplished through an algorithmic method such as ours.
| CONCLUSION |
|---|
|
|
|---|
| APPENDIX |
|---|
|
|
|---|
![]() | (Eq. 1A) |
![]() | (Eq. 2A) |
about an axis given by a unit vector a = [ax, ay, az] can be represented using a unit quaternion (magnitude unity) as follows:
![]() | (Eq. 3A) |
![]() | (Eq. 4A) |
Quaternion interpolation (for uniquely interpolating 3D rotational pose) is performed on the surface of a 4-dimensional (4D) hypersphere. The equivalent of linear interpolation for quaternions on a 4D hypersphere is called "spherical linear interpolation" or slerp. For unit quaternions q0, q1 and interpolation fraction t
[0,1] the spherical linear interpolation curve is defined as:
![]() | (Eq. 5A) |
[0,1], the following interpolation function was used:
![]() | (Eq. 6A) |
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
For correspondence or reprints contact: Raj Shekhar, PhD, Department of Diagnostic Radiology, University of Maryland School of Medicine, 22 S. Greene St., Baltimore, MD 21201.
E-mail: rshekhar{at}umm.edu
| REFERENCES |
|---|
|
|
|---|
Related articles in JNM:
This article has been cited by other articles:
![]() |
A. Grgic, U. Nestle, A. Schaefer-Schuler, S. Kremp, E. Ballek, J. Fleckenstein, C. Rube, C.-M. Kirsch, and D. Hellwig Nonrigid Versus Rigid Registration of Thoracic 18F-FDG PET and CT in Patients with Lung Cancer: An Intraindividual Comparison of Different Breathing Maneuvers J. Nucl. Med., December 1, 2009; 50(12): 1921 - 1926. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Dickfeld, P. Lei, V. Dilsizian, J. Jeudy, J. Dong, A. Voudouris, R. Peters, M. Saba, R. Shekhar, and S. Shorofsky Integration of three-dimensional scar maps for ventricular tachycardia ablation with positron emission tomography-computed tomography. J. Am. Coll. Cardiol. Img., January 1, 2008; 1(1): 73 - 82. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Z. Wong, E. K. Paulson, R. C. Nelson, E. F. Patz Jr., and R. E. Coleman Practical Approach to Diagnostic CT Combined with PET Am. J. Roentgenol., March 1, 2007; 188(3): 622 - 629. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Martinez-Moller, M. Souvatzoglou, N. Navab, M. Schwaiger, and S. G. Nekolla Artifacts from Misaligned CT in Cardiac Perfusion PET/CT Studies: Frequency, Effects, and Potential Solutions J. Nucl. Med., February 1, 2007; 48(2): 188 - 193. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | RSS | TABLE OF CONTENTS |
| JOURNAL OF NUCLEAR MEDICINE TECHNOLOGY | THE JOURNAL OF NUCLEAR MEDICINE |