|
|
||||||||
Clinical Investigations |
1 Division of Nuclear Medicine, University Hospital of Liège, Liège, Belgium
2 Department of Biostatistics, University Hospital of Liège, Liège, Belgium
| ABSTRACT |
|---|
|
|
|---|
Key Words: instrumentation oncology 18F-FDG PET standardized uptake value test-retest variability
| INTRODUCTION |
|---|
|
|
|---|
To our knowledge, there has been no published report of the test-retest within-patient variability of SUV in normal tissues. The purpose of this work was to evaluate this issue and to verify whether one correction method for SUV is more reproducible than the others. Both the qualitative and the quantitative evaluation of 18F-FDG PET images could be affected.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Average body weight (70 ± 12 kg) and height (168 ± 10 cm) were stable between the 2 studies. The change in weight was 0.7 ± 4.2 kg between studies. Only 9 of 70 patients had changes exceeding 10% of their initial weight (13% of the population sample). The blood glucose level was measured in all patients before injection. The average glucose level was 4.8 ± 0.9 mmol/L (3.29.6 mmol/L) for study A (test) and 4.9 ± 0.9 mmol/L (3.68.3 mmol/L) for study B (retest) (P = 0.29).
18F-FDG PET Studies
The dose injected and the uptake period of 18F-FDG were 185 ± 28 MBq (range, 148244 MBq) and 67 ± 10 min (range, 4995 min), respectively, for study A and 183 ± 28 MBq (range, 148262 MBq) (P = 0.59) and 69 ± 9 min (range, 53102 min) (P = 0.33), respectively, for study B. All patients were imaged from the base of the skull to the proximal thigh, using the C-PET scanner (UGM-Philips), which has been fully described elsewhere (6).
Attenuation correction using a 137Cs source was applied to each study. Each emission scan and transmission scan lasted 5 min and 1 min, respectively. Most patients with lymphoma and melanoma received 5 mg of diazepam orally approximately 15 min before 18F-FDG injection. There was no difference between the numbers of diazepam doses given on the 2 occasions. Images were reconstructed using the ordered-subsets expectation maximization algorithm and were corrected for decay, scatter, random events, and attenuation.
Image Analysis
Four regions of interest (ROIs) were placedon the liver (central region of right lobe), lung (basal region of the right lung, at a distance from the diaphragm), mediastinum (upper region, at the level of the large vessels), and trapezius muscleat similar levels for each study. These ROIs were placed on a single slice and were of similar size (19.2 cm3 for the liver, 5.6 cm3 for the mediastinum, 13.8 cm3 for the lung, and 4.5 cm3 for the muscle). Mean and maximum SUVs were calculated using the formula:
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
Statistical Analysis
Results are expressed as mean ± SD. Mean differences between the 2 studies (test and retest) for each SUV and each organ were assessed using a paired Student t test. Furthermore, the within-individual variation for each SUV (SDw) was calculated using the following formula: SDw =
d2/2n, where d represents the difference between test and retest values for each subject and n the number of subjects. SDw was then expressed in terms of coefficient of variation (CV) as follows: CV (%) = SDw/M, where M denotes the average value of test and retest means; that is, M = (mean test + mean retest)/2. In addition, to estimate the degree of agreement between test and retest measures, we computed the intraclass correlation coefficient (ICC). The closer the ICC to 1, the better the agreement. A lower 95% ICC confidence boundary was also calculated to test the statistical significance of the observed ICC. To compare test-retest differences between SUV measurement methods, a general linear mixed-model approach was used to account for repeated measurements on the study subjects. All results were considered to be significant at the 5% critical level (P < 0.05). Calculations were performed using version 8.2 of the statistical package of SAS Institute Inc.
| RESULTS |
|---|
|
|
|---|
|
|
|
The general linear mixed-model analysis did not evidence any significant advantage or disadvantage for any of the SUVs (normalized for body weight or LBM, average or maximum pixel value, corrected for blood glucose level or not).
| DISCUSSION |
|---|
|
|
|---|
The purpose of the present work was 2-fold: first, to evaluate whether the metabolic activity of various normal tissues remains stable over time in a single patient population, and second, to assess the SUV variability associated with the different corrections and normalizations that can be applied.
We found that, on average, all SUVs in the mediastinum and most SUVs in the liver remained stable over time. We used a statistical methodology slightly different from that of Minn et al. (10), but the CV, which we calculated, and the percentage difference, which Minn et al. calculated, measure the same parameter and provide almost the same values. In our population, we found similar variations in the liver and mediastinum (11% and 12.3%, respectively), as Minn et al. did in tumors (10%). On the other hand, all mean SUVs in the lungs and muscle significantly differed between the 2 studies. Even though these differences were statistically significant, their clinical relevance remains questionable. Regarding mean SUV, the maximum difference observed was 0.05 ± 0.17 in lung and 0.07 ± 0.21 in muscle. In addition, several factors may contribute to these findings: The pulmonary or cardiovascular status of these patients may change during 2.516 mo. These changes may in turn reflect on 18F-FDG uptake in the lung. As for muscle uptake, it may be influenced by the patients stress or anxiety at the time of injection and by other factors such as room temperature. A patients being more accustomed to PET examination may contribute to lowering the average uptake on the second scan, thus explaining why most SUVs decreased from study A to study B. Also, because the lung and muscle regions have low 18F-FDG metabolism, they are more easily affected by statistical noise. At high SUVs, the measurements reflect mainly the phosphorylated fraction of 18F-FDG, but at lower SUVs, the proportion of nonphosphorylated 18F-FDG is higher, which in turn contributes to higher variability. In any case, our study showed that if tumor-to-background activity ratios are to be used, reliance on the liver and mediastinum, rather than the lung or muscle, for measuring background activity is to be preferred. However, although this observation appears valid in cancer-free patients, it needs to be confirmed in cancer patients.
The SUV calculation includes a calibration factor that takes into account scanner efficiency (image counting rate per voxel vs. MBq/mL). Because scanner efficiency may vary significantly over time, this issue may influence the SUVs and contribute to their variability, independently of any variation in the true metabolic activity. This issue was not a factor in the present study, as scanner efficiency was found to be stable over time. The normalization tables were renewed according to our routine quality-control guidelines, that is, on a systematic basis rather than as a result of changes observed in the phantom studies. Indeed, SUV changes did not exceed 5% for any of the phantom studies during this investigation.
To evaluate the impact of the method of measurement on reproducibility of values, the statistical analysis focused on tissues that did not change between the 2 studies, that is, mediastinum and liver. The mean SUVs observed in these organs were within the range of those previously described in the literature for a study using a comparable method of measurement (iterative reconstruction with segmented attenuation correction) (4). The liver SUV has previously been described as being constant between patients (4,11), but to our knowledge, the present study was the first attempt to evaluate the long-term variability of SUV in normal tissues.
When looking at which of the correction methods for SUV reduces variability and improves test-retest agreement, we found none to be significantly superior. The best ICCs were obtained for SUV normalized for body weight, in both the liver and the mediastinum, but the CVs were similar for all 3 mean SUVs that were not corrected for glucose level. We did find, however, that normalizing for blood glucose level increased the variability and decreased the level of concordance between studies, possibly in relation to the low variability of plasma blood glucose between the 2 studies and to the absence of extreme values of blood glucose level in our patient population. The same phenomenon was reported by Menda et al. (9). We also observed a trend for the mean SUV to be better than the maximum SUV, but this held true only for CV. No significant difference was seen for ICC or when performing the general linear mixed-model analysis. In fact, a higher variability of maximum SUV could be expected since it is more affected by statistical noise than is mean SUV.
Our results tend to support those of Menda et al. (9), who showed no benefit to any correction method in lung tumors. Some authors have reported better results with SUV normalized for BSA in tumor tissues (1216), whereas others found SUV normalized for LBM to be better in normal tissues (8,11). However, the clinical superiority of one SUV method over another observed for tumor metabolic assessment does not necessarily imply similar conclusions for the reproducibility of metabolic measurements in normal tissue. On the other hand, we did not observe better agreement among SUVs normalized for BSA or LBM in our population. The subjects body weight did not change significantly from one study to the other. Body weight increased or decreased by 10% or more in only 9 patients. Such a small number does not permit any meaningful statistical analysis. Obviously, in a population sample whose body height and weight remain fairly stable over time, one cannot expect significant differences between the various methods of SUV measurement. However, during and after chemotherapy for cancer, patients can lose significant weight, so that such normalizations may become significant. In further evaluating this issue, it would be interesting to verify these results in cancer patients before and after treatment.
| CONCLUSION |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
For correspondence or reprints contact: Roland Hustinx, MD, PhD, Division of Nuclear Medicine, University Hospital of Liège, Sart-Tilman B35, 4000 Liège 1, Belgium.
E-mail: rhustinx{at}chu.ulg.ac.be
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C. Nahmias and L. M. Wahl Reproducibility of Standardized Uptake Value Measurements Determined by 18F-FDG PET in Malignant Tumors J. Nucl. Med., November 1, 2008; 49(11): 1804 - 1808. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Belakhlef, C. Church, A. Hays, R. Fraser, and S. Lakhanpal Quantitative Assessment of the Influence of Location, Internal Temperature, Idle Time, and Normalization on the Sensitivity of a Mobile PET/CT Scanner J. Nucl. Med. Technol., September 1, 2008; 36(3): 147 - 150. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Hautzel, O. Sander, A. Heinzel, M. Schneider, and H.-W. Muller Assessment of Large-Vessel Involvement in Giant Cell Arteritis with 18F-FDG PET: Introducing an ROC-Analysis-Based Cutoff Ratio J. Nucl. Med., July 1, 2008; 49(7): 1107 - 1113. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Belakhlef, C. Church, R. Fraser, and S. Lakhanpal Sensitivity and Daily Quality Control of a Mobile PET/CT Scanner Operating in 3-Dimensional Mode J. Nucl. Med. Technol., December 1, 2007; 35(4): 229 - 231. [Abstract] [Full Text] [PDF] |
||||
![]() |
K G Tournoy, S Maddens, R Gosselin, G Van Maele, J P van Meerbeeck, and A Kelles Integrated FDG-PET/CT does not make invasive staging of the intrathoracic lymph nodes in non-small cell lung cancer redundant: a prospective study Thorax, August 1, 2007; 62(8): 696 - 701. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Louis, G. Ancion, A. Colard, V. Spote, J. Belaiche, and R. Hustinx Noninvasive Assessment of Crohn's Disease Intestinal Lesions with 18F-FDG PET/CT J. Nucl. Med., July 1, 2007; 48(7): 1053 - 1059. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Couturier, G. Jerusalem, J.-M. N'Guyen, and R. Hustinx Sequential Positron Emission Tomography Using [18F]Fluorodeoxyglucose for Monitoring Response to Chemotherapy in Metastatic Breast Cancer. Clin. Cancer Res., November 1, 2006; 12(21): 6437 - 6443. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Delbeke, R. E. Coleman, M. J. Guiberteau, M. L. Brown, H. D. Royal, B. A. Siegel, D. W. Townsend, L. L. Berland, J. A. Parker, K. Hubner, et al. Procedure Guideline for Tumor Imaging with 18F-FDG PET/CT 1.0 J. Nucl. Med., May 1, 2006; 47(5): 885 - 895. [Full Text] [PDF] |
||||
![]() |
S. S. Shim, K. S. Lee, B.-T. Kim, M. J. Chung, E. J. Lee, J. Han, J. Y. Choi, O J. Kwon, Y. M. Shim, and S. Kim Non-Small Cell Lung Cancer: Prospective Comparison of Integrated FDG PET/CT and CT Alone for Preoperative Staging Radiology, September 1, 2005; 236(3): 1011 - 1019. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Weber Use of PET for Monitoring Cancer Therapy and for Predicting Outcome J. Nucl. Med., June 1, 2005; 46(6): 983 - 995. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| JOURNAL OF NUCLEAR MEDICINE TECHNOLOGY | THE JOURNAL OF NUCLEAR MEDICINE |