See the associated article on page 1778.
It would be easy to presume that advanced hybrid imaging modalities, such as PET/CT and PET/MR, would produce more clinically valuable information than older techniques such as bone scintigraphy. The advantage of multiplanar reconstructions over planar imaging (1), the higher resolution of PET than SPECT (2,3), and the anatomic correlation provided by the addition of CT or MR (4,5) make the battle between bone scintigraphy (David) and PET/CT or PET/MR (the Goliaths) appear unequal. Indeed, multiple previous studies have concluded that the Goliaths handily defeat David (6–11).
This is what makes the report by Löfgren et al. (12) remarkable. Löfgren et al. expertly perform a prospective trial comparing the ability of David versus the Goliaths to detect bone metastases in a cohort of patients with known malignancy and clinical suspicion for bone metastases, and find no difference between them. The use of a modality including CT or MR did increase the confidence of reader interpretation, thus reducing equivocal reads, but the authors conclude that there was little improvement in the detection of osseous malignancy when using PET/CT or PET/MR compared with bone scintigraphy. This report challenges the presumption that the use of advanced imaging modalities will result in greater clinical value.
There are notable limitations to the study. First, there was histologic proof of suggestive imaging findings in only 2 patients. In 1 of these 2 patients, the suggestive imaging finding was histologically evaluated after a partial resection of a rib lesion, and the histology turned out to be benign. Thus, for most patients, the reference standard for the presence of malignancy used the imaging findings in determining the reference standard. This could result in circular logic, that is, the imaging findings are used to determine the reference standard of metastases, and the reference standard of metastases is then used as evidence that the imaging findings are correct. Having a reference standard, such as histology, that is independent of the imaging findings would provide much stronger evidence. Of course, there are logistical and ethical difficulties in designing a study in which all patients have histologic proof of their imaging findings. This is a conundrum that introduces limitations in many studies. However, it should be recognized that the reference standard used in this study may be suboptimal.
Second, as the authors admit, the study is underpowered. The power of a study is the probability of rejecting the null hypothesis when the null hypothesis is false (13). In this study, the null hypothesis is that the ability of bone scintigraphy to detect the presence of osseous metastases is equal to that of the advanced imaging modalities. If we reject the null hypothesis, then we assert that one method is superior to the other. A study should have sufficient statistical power to reduce the chance of concluding that the null hypothesis is correct when it is actually false. This would result in a false-negative study, also known as a type II error (13). Power is best calculated prospectively. For this study, power was prospectively calculated by assuming that 30% of patients would have osseous metastases. For a power of 80% and an α (type I error rate) of 5%, the sample size calculation was determined to be 120 patients. With a power of 80%, there would be only a 20% chance of a false-negative study. However, in actuality, only 16 of 117 (14%) patients were determined to have osseous metastases during the study. Given this small proportion, the number of patients needed for 80% power would need to be much higher than 120. If the power of the study is too low, then it increases the chance of the study failing to detect a difference when one truly exists, and thus it may not be accurate to conclude there is no difference between the modalities. There may truly be a difference between the modalities, but the study did not detect it. It is normally not appropriate to determine the power of a study retrospectively; however, this may be done in certain circumstances (14). It would be interesting to determine the power of this study based on the actual patient characteristics. Clinicaltrials.gov contains a study of 18F-NaF PET/CT versus 99mTc-methylene diphosphonate bone scintigraphy for the detection of osseous metastases (NCT00882609) for which the completion date has passed, though the study’s results have not yet been published. This study is designed to determine whether 18F-NaF PET/CT leads to improved treatment and patient outcomes compared with bone scintigraphy. It should have greater statistical power than the study by Löfgren et al., and the results will be of great interest when published.
So what can we glean from Löfgren et al.? In a population for which the likelihood of osseous metastases is relatively low, bone scintigraphy may perform equally well compared with more advanced imaging modalities for the detection of the presence of osseous metastases. This is good news for bone scintigraphy. Given the low cost and widespread availability of bone scintigraphy (15), Löfgren et al. could breathe new life into this nuclear medicine standard during an era when the most advanced imaging possible is increasingly used. Clearly, when determining the value of an imaging study, it is not just the technical aspects of the imaging modality, but also the intended patient population, that will help determine the optimal imaging method. For some patient populations, David may be equally strong as the Goliaths.
DISCLOSURE
No potential conflict of interest relevant to this article was reported.
Footnotes
Published online Aug. 17, 2017.
- © 2017 by the Society of Nuclear Medicine and Molecular Imaging.
REFERENCES
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- Received for publication August 9, 2017.
- Accepted for publication August 10, 2017.