Abstract
The purpose of this study was to compare the efficacy of short-τ inversion-recovery (STIR) MRI and 18F-FDG PET/CT for the detection of metastasis in mediastinal and hilar lymph nodes in patients with lung cancer. Methods: Ninety-three patients with known or suspected lung cancer with mediastinal and hilar lymph node swelling underwent STIR MRI and 18F-FDG PET/CT examinations. STIR MRI scans were obtained with a 2% copper sulfate phantom placed along the back of each patient, with the lymph node–to–phantom ratio calculated for quantitative analysis. For qualitative analysis, the results of all STIR MRI scans were evaluated using a 5-point visual scoring system. To evaluate the diagnostic capabilities of STIR MRI and 18F-FDG PET/CT, we used receiver-operating-characteristic curve analysis to determine the optimal thresholds for the lymph node–to–phantom ratio, visual score, and maximal standardized uptake value. Further, the capability of each to determine N-stage was compared in each patient using the McNemar test. Results: A total of 137 lymph nodes (82 malignant lesions, 55 benign lesions) were analyzed. When optimal threshold values were adopted, the quantitative and qualitative sensitivity, specificity, and accuracy of STIR MRI were not significantly different from those of 18F-FDG PET/CT. However, 18F-FDG PET/CT in combination with qualitative STIR MRI analysis had a significantly higher capability to detect nodal involvement on an individual-patient basis (96.9% specificity, 90.3% accuracy) than did 18F-FDG PET/CT alone (65.6% specificity, 81.7% accuracy). Conclusion: We found that the diagnostic capability of STIR MRI was not significantly different from that of 18F-FDG PET/CT. However, when those methods were combined, the diagnostic capability for N-staging was significantly improved.
Imaging with 18F-FDG PET, which is used to detect the biologic activities of tumor cells, has been increasingly applied for diagnosing, staging, and determining the type of lung cancer (1,2). However, recent studies have reported poor accuracy for 18F-FDG PET in terms of lung cancer nodal staging (3–5). According to the evidence-based clinical practice guidelines of the American College of Chest Physicians, it is recommended that further evaluation of the mediastinum with sampling of the abnormal lymph node should be performed before surgical resection of the primary tumor in patients with an abnormal result on 18F-FDG PET scans (6). In addition, several investigators have reported that integrated PET/CT is superior to conventional contrast-enhanced CT or 18F-FDG PET alone for detecting nodal metastasis in patients with non–small cell lung cancer (3,7,8). However, PET/CT staging of lymph node involvement does not replace the need for mediastinoscopy (9,10).
Mediastinoscopy is the gold standard for mediastinal lymphatic staging because of its remarkable sensitivity of approximately 80%, with a specificity of, by definition, 100% (11,12). Nevertheless, mediastinoscopy has the risk of life-threatening complications, such as hemorrhage, mediastinitis, pneumothorax, and vocal cord paresis (13). Thus, it would be preferable to develop less invasive procedures that would also have improved diagnostic capabilities. 18F-FDG PET can reflect physiologic and biochemical functions and is useful for differential diagnosis of benign and malignant lesions (14–16). However, there are numerous causes of 18F-FDG uptake in benign processes, which lead to false-positive results (1,17–19) even when PET/CT is also used (20,21). Some investigators have reported that short-τ inversion-recovery (STIR) MRI has utility for the detection of metastases in mediastinal and hilar lymph nodes in lung cancer patients (22–24).
On the basis of these backgrounds, we hypothesized that STIR MRI would improve the accuracy of 18F-FDG PET for N-staging in patients with lung cancer. To test our hypothesis, we prospectively evaluated and compared the accuracy of STIR MRI, 18F-FDG PET/CT, and 18F-FDG PET combined with STIR MRI for the diagnosis of nodal involvement in patients suspected to have lung cancer.
MATERIALS AND METHODS
Patients
The study was approved by the Institutional Review Board of our hospital, and written informed consent was obtained from all patients who participated. We prospectively studied patients with known or suspected lung cancer and mediastinal and hilar lymph node swelling detected by chest CT. Mediastinal and hilar lymph nodes were assessed if the short-axis diameter on transaxial chest CT images was greater than 10 mm. Patients with hyperglycemia (blood glucose levels > 126 mg/dL) at the time of 18F-FDG injection were excluded from the study. All patients underwent STIR MRI and 18F-FDG PET/CT before a biopsy and treatment for lung cancer, and the 2 examinations were completed within 1 wk of each other. STIR MRI and 18F-FDG PET/CT findings for all primary lesions and nodal involvement were compared with the histopathologic diagnosis within 4 wk. Histopathologic diagnoses of all mediastinal and hilar lymph nodes were confirmed by the results of surgical resection, mediastinoscopy, or transbronchial needle aspiration. Because the negative predictive value of transbronchial needle aspiration is considered to be low (25), surgical confirmation was always obtained in cases that had no malignant lymph node invasion demonstrated by that endoscopic technique.
CT Examination
CT was performed using a 16-detector-row CT system (Sensation 16; Siemens). Continuous 5-mm-thick sections were obtained at 5-mm intervals from the lung apices to the adrenal glands before and during an intravenous bolus injection of contrast material (iopamidol [Iopamiron 300; Bayer] or iohexol [Omnipaque 300; GE Healthcare]) given at 2 mL/s with a power injector. Transaxial images were obtained for all patients.
MRI
MRI examination was performed using a 1.5-T superconducting magnet (Signa Excite; GE Healthcare) with a body coil. In all patients, axial electrocardiographically triggered STIR fast spin-echo MRI scans (repetition time, 4,000–14,285 ms; effective echo time, 58 ms; inversion time, 150 ms; number of excitations, 2; slice thickness, 7 mm; slice gap, 1 mm) were obtained with a 2% copper sulfate phantom (26), which was placed along the back of each patient (23).
18F-FDG PET/CT Examination
All 18F-FDG PET/CT examinations were performed with a whole-body scanner (Discovery LS; GE Healthcare). All patients fasted overnight (for at least 12 h) before radiotracer administration. 18F-FDG PET images from the skull through the mid thigh were obtained 50 min after intravenous injection of 18F-FDG at a dose (185 MBq) that is commonly used for clinical 18F-FDG PET examinations in Japan, and CT-based attenuation correction was performed. CT-based attenuation correction factors were then applied to the emission data, and the attenuation-corrected emission images were reconstructed using an ordered-subset expectation maximization iterative reconstruction algorithm (2 iterations and 14 subsets). The reconstructed images were converted to standardized-uptake-value (SUV) images using patient body weight and dose of 18F-FDG (tumor activity concentration/injected dose/body weight).
STIR MRI Scan Analysis
On axial images obtained by STIR MRI, signal intensity (SI) was measured in a circular region of interest drawn over each lymph node diagnosed as malignant on CT to determine its size and the phantom of 2% copper sulfate. Each region of interest drawn over the lymph node encompassed the entire cross-sectional area of the node and the phantom (10 mm in diameter). To quantitatively evaluate the SI of each of those lymph nodes, we calculated the lymph node–to–phantom ratio (LPR) using the following formula (23):To determine the ability to perform qualitative analysis using STIR MRI scans, the results of all examinations were prospectively evaluated by 2 experienced radiologists using a 5-point visual scoring system according to the method of Ohno et al. (23), as follows: 1, SI of lymph node could not be evaluated; 2, SI of lymph node was less than or equal to SI of mediastinal fat; 3, SI of lymph node was greater than SI of mediastinal fat and less than or equal to SI of muscle; 4, SI of lymph node was greater than SI of muscle and less than or equal to SI of primary lesion; and 5, SI of lymph node was greater than SI of primary lesion. The final diagnosis of lymph node metastasis was made after the 2 reviewers reached a consensus.
Analysis of 18F-FDG PET/CT Images
18F-FDG PET images were interpreted independently and prospectively by an experienced radiologist and nuclear medicine physician, without knowledge of histopathologic or other radiologic data. Semiquantitative analysis of the 18F-FDG uptake was based on region-of-interest analysis that produced maximal SUV and mean SUV. Swollen lymph nodes were evaluated using 18F-FDG PET/CT. A region of interest was drawn over each mediastinal or hilar lymph node at the most active site on 18F-FDG PET/CT images by the same radiologist and experienced nuclear medicine physician.
Statistical Analysis
To determine observer performance for qualitatively analyzed STIR MRI scans on a per-node basis, a κ-statistic was used. Interobserver agreement was considered as slight for a κ of less than 0.21, fair for a κ of 0.21–0.40, moderate for a κ of 0.41–0.60, substantial for a κ of 0.61–0.80, and nearly perfect for a κ of 0.81–1.00 (27). To evaluate the ability of the LPR and visual score to enable differentiation of lymph nodes with malignancy from benign nodes, we used the Student t test. The Student t test was also used to evaluate the diagnostic capability of STIR MRI and18F-FDG PET/CT while the optimal thresholds of LPR, visual score, maximal SUV, and mean SUV were determined using receiver-operating-characteristic curve–based analysis. The sensitivity, specificity, and diagnostic accuracy of each modality were determined. The diagnostic accuracy of each method was compared using a McNemar test. For all statistical analyses, P values of less than 0.05 were considered to be statistically significant.
RESULTS
Subject Demography and Clinical Characteristics
We studied 93 consecutive patients evaluated between January 2005 and June 2008 (age range, 30–87 y; mean age ± SD, 66.1 ± 10.9 y; 76 men and 17 women) (Table 1) and evaluated 137 lymph nodes, of which 82 were pathologically diagnosed as malignant and 55 as benign (Table 2). In the group without malignancy, 11 cases of sarcoidosis were included. For those 11 patients, the diagnosis of sarcoidosis was based on compatible clinical and follow-up findings, histologic evidence of noncaseating epithelioid cell granulomas, and exclusion of known causes of granulomatous diseases according to the consensus statement by the American Thoracic Society, European Respiratory Society, and World Association of Sarcoidosis and Other Granulomatous Disorders (28). The lymph nodes ranged in size from 10 to 36 mm, and the mean size of nodes with malignancy was significantly greater than that of benign nodes (mean, 18.4 ± 6.2 mm vs. 13.9 ± 3.1 mm, P < 0.0001, analyzed by Student t test).
Patient Characteristics
Lesion Characteristics
LPR and Visual Scores for Malignant and Benign Lymph Nodes
The mean LPR values for malignant and benign lymph nodes were 0.22 ± 0.32 and 0.08 ± 0.03, respectively (P < 0.001, analyzed by Student t test), which was a statistically significant difference. Table 3 shows the results of the rated visual scores based on STIR MRI by the 2 reviewers. Interobserver agreement for qualitative analysis of STIR MRI data was substantial (κ = 0.76).
Interobserver Agreement for Qualitative Analysis of Lymph Nodes Using STIR MRI on Individual-Node Basis
Accuracy of STIR MRI and 18F-FDG PET/CT for Diagnosing Lymph Nodes
An LPR of 0.1 for quantitative STIR MRI analysis and a visual score of 4 for qualitative analysis were adopted as optimal cutoff values for receiver-operating-characteristic analysis. Receiver-operating-characteristic analysis was also used to evaluate the effectiveness of SUV as a marker for malignant lymph nodes versus benign nodes. Sensitivity, specificity, positive predictive value, negative predictive value, and accuracy were calculated for each level of SUV, by varying the level that signified an optimal threshold value (29). A maximal SUV of 4.1 and mean SUV of 3.5 were adopted as optimal cutoff values for analysis. Table 4 shows the optimal cutoff values, sensitivities, specificities, positive and negative predictive values, and accuracies of STIR MRI and 18F-FDG PET/CT for diagnosing mediastinal and hilar lymph nodes. When the optimal cutoff values of LPR, visual score, and maximal SUV were adopted, sensitivity, specificity, and accuracy did not significantly differ between the 2 methods (STIR quantitative analysis vs. 18F-FDG PET/CT: P = 0.28; STIR qualitative analysis vs. 18F-FDG PET/CT: P = 0.35, analyzed by McNemar test). On the basis of STIR quantitative analysis, 5 lymph nodes were suggested to be artifacts, 3 of which were related to anthracosis and 2 to sarcoidosis. All showed false-positive results. Each node was less than 14 mm in size, and the mean size of these 5 was 12.0 ± 2.0 mm.
Comparisons of Quantitative and Qualitative Diagnostic Capabilities of STIR MRI, 18F-FDG PET/CT, and 18F-FDG PET/CT with STIR MRI on Individual-Node Basis
For cases in which lymph nodes were found positive both on 18F-FDG PET/CT and through qualitative analysis of STIR MRI and were determined to be malignant, the results of comparisons of diagnostic capability based on the definition and using only 18F-FDG PET/CT are shown in Table 4. According to the criteria noted above, specificity (94.5%) and accuracy (89.8%) were significantly higher than those of 18F-FDG PET/CT alone (65.5% and 80.3%, respectively) (P = 0.003, analyzed by McNemar test). A comparison of 18F-FDG PET/CT and 18F-FDG PET/CT with STIR MRI on an individual-patient basis for the evaluation of nodal staging is shown in Table 5. The accuracy of 18F-FDG PET/CT with STIR MRI was significantly higher than that of 18F-FDG PET/CT alone (90.3% vs. 81.7%, P = 0.021, analyzed by McNemar test). In a comparison of 18F-FDG PET/CT combined with STIR MRI and STIR MRI alone, the accuracy of the former method was significantly higher on an individual-node basis (89.8% vs. 84.7%, P = 0.035, analyzed by McNemar test) and individual-patient basis (90.3% vs. 81.7%, P = 0.045, analyzed by McNemar test). We also evaluated using mean SUV with an optimal cutoff value of 3.5 and obtained similar results. In a comparison of 18F-FDG PET/CT combined with STIR MRI and 18F-FDG PET/CT using mean SUV alone, the accuracy of the former method was significantly higher on an individual-patient basis (90.3% vs. 81.7%, P = 0.021, analyzed by McNemar test). When an SUV of less than 2.5 was used as the threshold level for normalcy, as defined by the American College of Chest Physicians, the sensitivity, specificity, and accuracy of 18F-FDG PET/CT were 98.4%, 37.5% and 77.4%, respectively, on an individual-patient basis. Adding qualitative STIR MRI analysis to PET/CT also improved specificity and accuracy, to 78.1% and 87.1%, respectively (P = 0.029, analyzed by McNemar test).
Comparison of 18F-FDG PET/CT and18F-FDG PET/CT Combined with STIR MRI on Individual-Patient Basis
Figure 1 shows a representative case of adenocarcinoma and mediastinal lymph node metastasis. The results of STIR MRI and 18F-FDG PET/CT were both true-positive. The SI of the lymph node in STIR MRI was greater than that of muscle and equal to the primary tumor in the right upper lobe. Figure 2 shows a representative case of malignant lymphoma that also had a true-positive result in both STIR and 18F-FDG PET/CT. Figure 3 shows a representative case of sarcoidosis for which only STIR was useful for diagnosis. 18F-FDG PET accumulation was intense in the lymph node.
A 60-y-old man with lung adenocarcinoma and metastasis in mediastinal lymph node 3 in pretracheal area. 18F-FDG PET (A) and integrated 18F-FDG PET/CT (B) show strong accumulation in primary tumor in right upper lobe and mediastinal lymph nodes (maximal SUV of 11.4 in lymph node 3), and STIR MRI (C) shows lymph node 3 and primary tumor as high-signal-intensity areas. LPR of this lymph node was 0.33, and rated visual score was 4.
A 63-y-old man with malignant lymphoma. 18F-FDG PET (A) and integrated 18F-FDG PET/CT (B) show strong accumulation in pretracheal and bilateral axillary lymph nodes (maximal SUVs of 10.9–13.6), and STIR MRI (C) shows lymph nodes as high-signal-intensity areas. LPRs were 0.11, and rated visual score was 4 for each node.
A 36-y-old man with sarcoidosis. 18F-FDG PET (A) and integrated 18F-FDG PET/CT (B) show strong accumulation in pretracheal and paraaortic lymph nodes (maximal SUVs of 5.3–6.0), and STIR MRI (C) shows lymph nodes as low-signal-intensity areas. LPRs were 0.06, and rated visual score was 2 or 3 for each node.
DISCUSSION
To our knowledge, this is the first report of the utility of 18F-FDG PET/CT combined with STIR MRI for the diagnosis of nodal involvement in thoracic malignancy. In the present study, we assessed STIR MRI for the detection of malignant findings in lymph nodes and compared the results with those obtained on 18F-FDG PET/CT. Our results clearly suggest an excellent diagnostic ability for those 2 methods used in combination and indicate the possibility that invasive examinations can be eliminated.
For patients with lung cancer, accurate tumor staging is a prerequisite for determining the disease course, because knowing the stage guides the selection of available treatment modalities and predicts survival. Although size criteria using CT have been applied in conventional assessments of lymph nodes for metastasis, CT has a limited ability to differentiate between benign and malignant lymph nodes. Recent studies have shown both advantages and limitations of 18F-FDG PET/CT for nodal staging and cast doubt on its specificity (3–5,10,19–21). Ohno et al. reported that STIR MRI is more accurate and should be considered as valuable as 18F-FDG PET/CT for the assessment of N-stage of patients with non–small cell lung cancer (22). Previous studies that evaluated the results and utility of STIR MRI for diagnosing mediastinal nodal involvement in lung cancer patients have reported sensitivity of 86%−100%, specificity of 86%−97%, and accuracy of 86%−96% (22–24).
In the present study, the specificities of STIR MRI and 18F-FDG PET/CT were lower than in previous studies. STIR enhances differences between the water content of various tissues, as increasing the T1 or T2 value in the tissue increases the relative SI in this imaging modality. Thus, false-positive cases are increased because not only the malignant tumor but also inflammation or lymphatic edema prolongs T1 and T2 relaxation times (30). Moreover, respiratory motion can result in prominent artifacts (31,32). False-positive results obtained by STIR in the present patients were cases of inflammation, such as active sarcoidosis, which increases the serum angiotensin-converting enzyme level, and motion artifacts. In the present study, electrocardiographically triggered STIR was used; thus, artifacts of cardiac motion could be reduced while noise caused by respiratory motion remained. The influence of artifacts mainly increases SI from small lymph nodes, whereas artifacts also cause an increase in false-positive results from STIR quantitative analysis. If a trigger for both cardiac pulse (electrocardiograph) and respiratory motion were possible, the artifacts would be reduced. A pneumatic belt or a compressive cushion placed at the upper abdomen or the chest of the patient was reported as a straightforward method to compensate for respiratory motion (32) and might be useful for this purpose. However, 18F-FDG PET/CT combined with STIR MRI provided a significantly more accurate diagnosis than did STIR MRI alone.
In our observation, false-positive results obtained by 18F-FDG PET/CT were cases of sarcoidosis and follicular hyperplasia in lymph nodes caused by obstructive pneumonia. An increase in glucose metabolism by chronic inflammation is the cause of false-positive uptake of 18F-FDG PET. In the present results, the specificity of PET/CT was low, at 65.6% on an individual-patient basis, because a few cases of sarcoidosis determined to be false-positive were the result of including cases with lymphadenopathy in this prospective study. The causes of false-positive results in STIR (water content) are different from those in 18F-FDG PET/CT (increase in glucose metabolism). In patients with sarcoidosis, the lymph nodes become enlarged by granulomatous formation through the hyperplasia of fibroblasts (33). Therefore, when STIR showed a low intensity, indicating relatively little water content, a differential diagnosis between malignancy and sarcoidosis was obtained. According to our criteria of malignancy diagnosed as positive on both 18F-FDG PET/CT and STIR MRI, there were only 3 false-positive results, 2 nodes related to sarcoidosis and 1 related to follicular hyperplasia, in which the size of all nodes was less than 14 mm. In addition, 11 false-negative nodes were also less than 14 mm in size. We considered that the spatial resolution of 18F-FDG PET/CT and STIR MRI might have been the cause of the false results. With lymph nodes greater than 15 mm, sensitivity, specificity, and accuracy were each 100%.
Previous studies have reported the utility of 18F-FDG PET/CT for mediastinal and extrathoracic staging of non–small cell lung cancer (8,20,34). Among the malignant cases in the present study, we also assessed cases of small cell lung cancer, malignant lymphoma, and melanoma. Although the evidence is insufficient for these diseases, the utility of 18F-FDG PET/CT for staging diseases was previously reviewed (35). Thus, it is recommended that 18F-FDG PET should be used for diagnosis of various tumors. Moreover, the utility of MRI combined with 18F-FDG PET for the assessment of uterine tumors was reported (36). Some investigators have discussed the possibility and utility of PET/MRI (37,38). Because clinical use of PET/STIR MRI results in both anatomic and characteristic images, fine diagnostic capability is expected of this modality. For the assessment of lymph node involvement in thoracic malignancy, including lung cancer, our findings demonstrated the utility of 18F-FDG PET/CT combined with STIR MRI and imply that additional invasive examinations can be eliminated, thereby lowering medical costs and avoiding hazardous diagnostic methods.
CONCLUSION
In this study, we compared the diagnostic capability of STIR MRI with that of 18F-FDG PET/CT. STIR MRI was able to differentiate false-positive results of 18F-FDG PET/CT and in combination significantly improved the accuracy of 18F-FDG PET/CT alone. For noninvasive diagnosis of nodal involvement in thoracic malignancy, including lung cancer, STIR MRI for the cases shown positive by 18F-FDG PET/CT is useful.
Acknowledgments
We thank Yasutaka Kawamura, Kameda Medical Center, for his clinical support and helpful comments. This work was supported by the 21st Century COE program “Biomedical Imaging Technology Integration Program,” funded by the Japan Society for the Promotion of Science (JSPS).
Footnotes
-
COPYRIGHT © 2009 by the Society of Nuclear Medicine, Inc.
References
- Received for publication July 29, 2008.
- Accepted for publication October 8, 2009.