The Move from Accuracy Studies to Randomized Trials in PET: Current Status and Future Directions

Bettina Siepe; Poul Flemming Hoilund-Carlsen; Oke Gerke; Wolfgang A. Weber; Edith Motschall; Werner Vach

doi:10.2967/jnumed.113.127076

Abstract

Since the influential study by van Tinteren et al. published in The Lancet in 2002, there have been an increasing number of diagnostic randomized controlled trials (RCTs) investigating the benefit of PET. If they provide valid and useful information on the benefit, these studies can play an important role in informing guideline developers and policy makers. Our aim was to investigate how far the nuclear medicine community has come on its way from accuracy studies to RCTs and which issues we have to take into account in planning future studies. Methods: We conducted a systematic review of diagnostic randomized trials, in which PET was applied in only one arm. We covered published studies as well as registered unpublished and planned studies. We considered 3 quality indicators related to the usefulness of a trial to generate evidence for a clinical benefit: use of patient-important outcome, sufficient sample size, and current standard as comparator. Results: Fourteen published and 15 planned studies were identified. Five of the published studies and 12 of the planned studies did not use a patient-important outcome. Sample sizes were often so small that a significant result could be expected only under the assumption of a substantial reduction in the event rate. Comparators typically reflected the current standard. Conclusion: If we consider the traditional areas of primary diagnosis, staging, and follow-up, then the number and quality of RCTs on PET is currently not sufficient to provide a major source for evidence-based decisions on the clinical benefit of PET. There will also be a future need in these traditional areas to deduce the clinical benefit of PET from the results of accuracy studies. The situation may be more favorable for the areas of treatment planning and response evaluation. Choice of patient-important outcomes and sufficient sample sizes are crucial issues in planning RCTs to demonstrate the clinical benefit of using PET.

Improvement of diagnostic accuracy is no longer the ultimate criterion to establish new diagnostic procedures. Guideline developers and policy makers now require evidence of clinical benefit (1,2). Randomized controlled trials (RCTs) are often the preferred source of this evidence (3–5), because the step to deduce evidence for a clinical benefit from accuracy studies by combination with results from clinical trials or prognostic studies (1,6) is often rather cumbersome (7,8).

RCTs have gained some popularity in the area of PET since the influential multicentre study by van Tinteren et al. published in 2002 in The Lancet (9). In a randomized design, that study investigated the effect of adding ¹⁸F-FDG PET to conventional work-up before surgery in patients with suspected non–small cell lung cancer. In a recent review, Scheibler et al. (10) identified 12 published RCTs and 42 planned or unpublished RCTs from study registers. However, if RCTs are not appropriately designed to demonstrate a clinical benefit according to the established principles of evidence-based medicine and comparative effectiveness research, then they will not help overcome the limitations of accuracy studies. Scheibler et al. (10) already considered some general indicators related to the risk of bias in RCTs, such as the description of the randomization process, masking, or use of the intention-to-treat principle. In contrast, our investigation focused on aspects related to the usefulness of these studies to demonstrate a clinical benefit beyond diagnostic accuracy. Thus, we considered the use of patient-important outcomes, adequate choice of comparators, and sufficient sample size as quality indicators.

The aim of our investigation was to examine how far the nuclear medicine community has come on its way from accuracy studies to RCTs and to identify strengths and weaknesses to improve future study planning and rational promotion of clinical molecular imaging.

MATERIALS AND METHODS

Study Inclusion Criteria and Search Strategies

We searched systematically for all diagnostic RCTs in which PET was used in one arm and another modality or clinical examination was used in the other arm. No further restrictions were applied. Before the search, 2 experts from the field of nuclear medicine (PFHC, WW) each compiled a list of RCTs fulfilling the inclusion criteria based on their personal knowledge. These lists included 5 and 7 studies, respectively.

We searched in MEDLINE (National Library of Medicine; OvidSP [Ovid Technologies], 1948 to November 2011), EMBASE (Elsevier; OvidSP, 1974 to November 2011), and the Cochrane Library (Wiley Interscience; up to 2011). The search strategy is documented in the supplemental material (available at http://jnm.snmjournals.org). The search was performed on November 21, 2011, and was restricted to publications since 1990. This strategy allowed us to find all RCTs identified by the experts. After the identification of eligible trials, we conducted a forward reference search (Web of Science; Thomson Reuters) on February 20, 2012. To complete the search, we also checked the references of the eligible trials.

To find unpublished or ongoing studies, we searched ClinicalTrials.gov, the International Standard Randomised Controlled Trial Number Register (ISRCTN), and the International Clinical Trials Registry platform maintained by the World Health Organization. This search was performed on January 21, 2012, using the terms diagnostic and randomized and PET (ClinicalTrials.gov) and on August 25, 2012, using the terms pet and diagnos* (ISRCTN and International Clinical Trials Registry platform).

Study Selection and Data Extraction

One reviewer (BS) screened titles and abstracts of the retrieved publications. The full text of potentially eligible publications was obtained, and a final check of the inclusion criteria was performed. In cases of multiple publications of the same trial, the primary publication was identified. The entries retrieved from the trial registers were also screened. Here, the decision about inclusion was made only on the basis of the information in the trial register. The reviewer copied the relevant passages of the identified publications or trial register entries into a spreadsheet. Study characteristics and quality criteria were then assessed jointly by BS and WV.

Study Characteristics and Quality Criteria

For each study, we extracted 2 basic study characteristics: the disease or disease stage characterizing the patients included and the clinical situation, categorized as primary diagnosis, staging, follow-up, treatment planning, or response evaluation. The specific category treatment planning refers to a situation in which the same treatment (e.g., radiation or heart surgery) was performed in both arms but PET was used in one arm to identify the regions to be treated. For registered studies, we also extracted the start of the enrollment. In addition, we evaluated the following 3 predefined criteria to assess the ability of each study to contribute to an assessment of the clinical benefit of PET imaging.

Patient-Important Outcome

If the aim of an RCT should be to assess clinical benefit, the outcome should be relevant for the patient (11). Typically, such outcomes are related to mortality, morbidity, or quality of life. They should ensure that a difference between the modalities implies an actual advantage for the patient. In diagnostic RCTs, there are 2 major sources of outcomes, which are not patient-important. The first arises from outcomes reflecting exclusively diagnostic accuracy, such as the number of correct diagnoses. As pointed out above, this is exactly what we would like to overcome in RCTs. The second stems from outcomes reflecting a management decision, the benefit of which is not known. A typical example is the frequency with which an additional, invasive diagnostic procedure is used. Reducing this frequency probably improves the quality of life on average, but we do not know whether some patients would have benefited from this procedure had it been performed. For each trial, we extracted the primary outcome and checked whether it could be regarded as being patient-important.

Sufficient Sample Size

Sufficient sample size is a crucial issue in diagnostic RCTs (12,13), as distinct improvements in diagnostic accuracy often translate into only moderate improvements in long-term outcome. For example, PET may increase both the sensitivity and the specificity from 60% to 80% (compared with a standard modality) in distinguishing between 2 different stages. If no incorrect changes occur, this implies a change in only every fifth patient. If adequate treatment at each stage implies an increase in surviving 1 y by 30 percentage points compared with inadequate treatment due to incorrect staging, then we can expect an improvement by 30 percentage points in 20% of the patients, and overall by only 6 percentage points. Regarding 1-y mortality as a binary outcome and assuming a mortality of 50% in the control arm, then we need 1,486 patients in each arm to reach 90% power in a 2-sided test with significance level 5%.

In a first step, we checked whether a sample size calculation was reported. As diagnostic RCTs can often build on results from accuracy studies, this information should be used in the sample size calculation to qualify the assumptions made. Hence, in a second step, we checked whether the assumptions made were justified by references to other publications. In a third step, we computed the relative risk, hazard ratio, or Cohen d, which is necessary to assume to obtain a power of 90% for the given trial, while taking into consideration the sample size or number of events of the study and the failure rate observed in the control arm. This post hoc power analysis gives an idea about the actual power of the study.

Current Standard as Comparator

Since the aim of RCTs should be to provide arguments for policy makers, it is important to ensure that the comparator reflects the current standard. If PET is compared in an RCT to an “old-fashioned” modality, the results cannot be used directly. To decide whether the comparator could be regarded as the current standard, we tried to identify guidelines valid at the time the study was performed and checked whether the comparator was mentioned as a possible choice.

Because of the restricted scope of information in the trial registers, in these we could apply only the criterion about the patient-important outcome.

RESULTS

Published Studies

The search in MEDLINE, EMBASE, and the Cochrane Library resulted in 3,311 citations at first and 2,228 after the duplicate check. Screening of titles and abstracts resulted in 63 potentially relevant papers. Screening of these resulted in identifying 30 papers describing 14 trials that fulfilled our inclusion criterion. Table 1 summarizes the results of our investigation for the 14 published studies identified. Ten studies were from oncology, 3 from cardiology, and 1 from neurology. Six studies considered staging, 5 treatment planning, and 3 follow-up evaluation. One publication (14) reported a study protocol, not a finished study. The study of Plewnia et al. (15) used a cross-over design.

View this table:

TABLE 1

Selected Characteristics of Published Studies

In 5 of the 14 studies, the outcome was not a patient-important one. The number of positive scans used by Mullani et al. (16) covered both false-positive and true-positive scans, so that any increase might have been due to just an increasing number of false-positive scans, which is not an advantage for the patients. Correct upstaging used by Maziak et al. (17) implies avoiding unnecessary surgery and, hence, is an advantage, but its frequency has to be balanced against the frequency of incorrect upstaging, which implies that patients miss the advantage of surgery. Indeed, this study observed a significant increase in both correct upstaging by PET and incorrect upstaging by PET. Recurrence during follow-up was the primary outcome in the study of Sobhani et al. (18). In treatment trials, this is a patient-important outcome reflecting morbidity. However, Sobhani et al. did not use PET as a means to prolong the true time until recurrence but rather tested whether PET could shorten the time until detection of recurrences, which is a diagnostic decision and not a therapeutic effect. Actually, the authors reported that only 27 of the 44 recurrences they observed could be confirmed by biopsy or surgery. Therefore, since it remains unclear whether their patients actually benefited from the earlier detection, this is not a patient-important outcome either. Similar arguments apply to one further study (19). Finally, one study (20) considered the number of investigations needed to finalize staging as primary outcome. Reducing this number may be beneficial to the quality of life of the patient. However, this advantage may be counterbalanced by a poorer quality of staging. Consequently, this outcome can be regarded as patient-important only if we are sure that there was no loss in diagnostic accuracy. The authors provided additional information on the agreement between clinical and final staging in both arms addressing this issue. However, with limited sample sizes it is difficult to ensure true equivalence.

Nine studies used a patient-important outcome. Three of these studies (21–23) used long-term outcomes related to (disease-free) survival. One study (15) applied a disease-specific quality-of-life measure. The outcome used in the study of Viney et al. (24) was the performance of a thoracotomy. As that is a management decision, we have to be careful: if a diagnostic method never points to performing a thoracotomy, this method may turn out to be the best one. However, the authors listed in their Table 3 an overview of the reasons for not performing a thoracotomy, and in the text they provided an explicit reason why the thoracotomy would be futile for each patient lacking thoracotomy. Hence, in this case, to regard the outcome as being patient-important seems justified, as it reflects avoidance of an unnecessary invasive procedure. Another 3 studies (9,25,26) used the number of futile thoracotomies or futile laparotomies as outcome. Again, we have to be careful because reducing the number of surgeries also reduces the number of futile surgeries. However, there is only limited doubt about the fact that this is a patient-important outcome because the authors provided detailed information on why the surgical procedure would have been futile for each patient not exposed to it. Moreover, in all 3 studies, the definition of futile surgery also took into account recurrence or death within 6 or 12 mo, and a reduction in recurrence or death actually contributed substantially to the reduction in futile surgeries. Therefore, this outcome also represents improved patient management after surgery beyond the simple avoidance of surgeries. Consequently, this outcome is close to a long-term outcome such as survival, and all in all, we can regard it as being patient-important. The published study protocol (14) defines futile indications for direct laryngoscopy as primary outcome. The final decision on patient importance depends on whether the authors will be able to provide sufficient evidence for the validity of no-surgery decisions.

A sample size calculation was performed for most studies. However, only 2 studies succeeded in presenting explicit justification for the assumed effects in both arms. Moreover, if we look at the effects to be assumed to reach a power of 90% (Table 2), we observe that in most of these studies relative risks or hazard ratios of less than 0.5 have to be assumed to reach this power. In treatment research, such large effects are rare, and Djulbegovic et al. (27) suggested regarding the few studies that were able to demonstrate effects of this magnitude as “breakthroughs.” On the other hand, all 3 studies using futile surgery as primary outcome (9,25,26) actually estimated relative risks in the magnitude of 0.5. A Cohen d of 0.3 is typically regarded as a rather moderate effect.

View this table:

TABLE 2

Effects to Be Assumed in Each Study to Reach Power of 90%

With respect to the choice of the comparator, we could identify appropriate guidelines for 9 studies from oncology (Table 1). In all cases, the comparator chosen was not in conflict with the existing guidelines. In one study (26), CT was applied in all patients of the control arm, although this was not yet recommended in the guidelines. For 2 cardiologic studies, the onset of the study was not sufficiently specified to locate the corresponding guideline. The comparator in the study of Beanlands et al. (21) was in accordance with existing guidelines.

Registered Studies

The search in the trial registers resulted in 263 studies (clinicaltrials.gov, 63; International Clinical Trials Registry Platform, 166; ISRCTN, 34). Removing duplicates and screening the information available in the trial register resulted in 20 eligible trials. Among these, 5 were already included in the 14 published studies. The remaining 15 studies are summarized in Table 3. Fourteen studies are from oncology, and one from cardiology. In these studies the clinical situation was treatment planning for 5 studies, staging for 4, primary diagnosis for 2, follow-up for 2, and response evaluation for 1. In most studies, the primary outcome was related to accuracy, diagnostic decisions, or management decisions and, therefore, could not be regarded as patient-important. Only 3 studies used an unequivocally patient-important outcome. For 2 further studies, this decision depends on whether the events considered will be validated. One study used the response rate, which is today typically regarded as a surrogate outcome and not a true patient-important outcome (28). However, the description of the primary outcome was often close to a description of the general aim of the study, and hence we cannot exclude that the primary outcome actually chosen in the analysis will be patient-important.

View this table:

TABLE 3

Selected Characteristics of Studies Found in Clinical Trial Registers

Three clinical situations contributed at least 2 studies both among the published studies and among the registered studies: staging, treatment planning, and follow-up. The median sample sizes in the published studies were 188, 210, and 130, respectively. In the registered studies, the corresponding median sample sizes were 220, 288, and 214, suggesting a slight increase.

Comparison with Results of Scheibler et al.

The review of Scheibler et al. (10) identified 12 published studies. We identified 3 additional studies that were probably excluded by the authors for the following reasons: use of ¹⁸F-FDG coincidence imaging by dual-head γ camera rather than a real PET scanner (19), focus on risk factors instead of modality comparison (16), and the nature of a study protocol (14). One study included in the review of Scheibler et al. was excluded in our review, as PET was administered to all patients (29). Scheibler et al. identified 42 registered studies, in contrast to 20 identified by us, 4 of which were not included in their review. The much larger number of Scheibler et al. probably resulted from their searching for all randomized trials, whereas we included the term diagnos* in the search strategy. Moreover, we included only comparative RCTs, comparing PET with a competitor. Of the 28 additionally identified studies, 18 used an enrichment or interaction design; that is, they were not comparative in our sense. Six studies considered treatment planning, and 10 studies considered response evaluation. If we focus on primary diagnosis, staging, and follow-up evaluation—the most common context of accuracy studies—there is only one additional comparative study (ClinicalTrials.gov identifier NCT00329706) in the review of Scheibler et al. This study compared the impact of making the result of a PET scan available to the clinician immediately or after 2 y. This study actually was identified in our search but was excluded, because both arms were PET-based.

DISCUSSION

The systematic reviews by Scheibler et al. (10) and us both indicate an increasing number of RCTs on PET in recent years. One nice example illustrating this move from accuracy studies to RCTs is the increasing number of trials comparing PET-based and non–PET-based treatment planning. In this area, accuracy studies can typically provide information on improved sensitivity or specificity only on the level of single lesions or local regions. Whether such an improvement results in better treatment planning cannot be directly addressed in these studies because this result will depend on the specific strategy chosen and on the accuracy at the patient level, for example, whether all lesions have been detected. Interestingly, we overlooked some of these planned studies, as the study registration did not mention anything about diagnosing, probably because most principal investigators regard such studies as having a therapeutic rather than a diagnostic focus. Another area with an increasing number of RCTs is response evaluation. This increase may reflect the fact that response evaluation is often a rather new topic with no established methods that can serve as a gold standard in accuracy studies. Prognostic studies using survival as a gold standard are hampered by the fact that effective second-line therapies invalidate this gold standard (8). Consequently, a direct comparison of no response evaluation against a combination of response evaluation by PET and an effective second-line therapy is often the only way to demonstrate the value of PET.

The picture is less favorable if we focus on those clinical situations in which accuracy studies are feasible and have a long tradition: primary diagnosis, staging, and follow-up evaluation. These are also areas in which PET competes with existing modalities. PET has been considered a diagnostic modality for nearly all cancer entities, typically for various different clinical situations (3,30). The 10 published studies (5 alone on staging in non–small cell lung cancer patients) and 6 planned studies identified in our review and fitting into these areas have, then, to be regarded as a small number. This may simply reflect the fact that it is easier and cheaper to aim to demonstrate an improvement in accuracy in a single-center study than it is to aim to translate a better accuracy into a pathophysiologically more correct and, consequently, more patient-favorable treatment in a multidisciplinary clinical study. It is time that in the field of nuclear imaging, also, we overcome the traditional distinction between clinical trials and diagnostic studies, similar to the case in biomarker research (31,32). Close collaboration between clinical departments and imaging units and financial support by funding agencies are important issues in reaching this aim.

However, the impact of RCTs may not be restricted only by their small number. According to our review, there are still many RCTs using accuracy or management decisions as primary outcome, that is, RCTs that fail to use patient-important outcomes. Such is particularly the case among the clinical settings mentioned above, in which half the published RCTs do not use a patient-important outcome. These studies are of limited value because guidelines and reimbursement decisions should be based on evidence for a clinical benefit. Our post hoc power analysis of the published RCTs also indicates a lack of power to demonstrate clinically relevant effects that are not at the level of a “breakthrough,” Unfortunately, according to our review the intended sample sizes in the planned studies did not differ substantially from those observed in the published studies, and they do not plan to use more efficient designs such as randomizing only discordant patients. Consequently, this problem may persist. In contrast, the choice of the comparator was adequate for all studies as far as we could judge.

Besides the 3 quality indicators considered in this paper, further aspects are important to allow a generalization from the RCT to clinical practice. These include a description of basic technical features of the imaging procedure, a clear definition of the target population, and a description of the actual recruitment setting. For all 3 aspects, we could observe deficits in the reporting of some trials (supplemental data).

The choice of a patient-important outcome is a prerequisite for a convincing assessment of the clinical benefit of PET. However, the fact that this is not a trivial task may explain why we have few RCTs covering the above-mentioned clinical situations. In our review, we could actually observe only 3 different types of patient-important outcomes: futile surgery, (disease-free) survival, and quality of life. Futile surgery is special not only because it is restricted to patient courses with a surgical treatment option but also because, as an outcome, it needs a valid assessment of actually nonperformed surgeries to be futile, that is, a gold standard applicable in the absence of surgery. The absence of such a gold standard is a major limitation in many accuracy studies and, hence, is also the case for RCTs. It is somewhat suspicious that in all 3 studies with futile surgery as outcome (9,25,26), no single case of a false-negative decision—a wrong suggestion to avoid surgery—appeared, despite the overall reduction in the number of surgeries from 226 to 190 when the control arms were compared with the PET arms. A consensus is needed on which outcomes should be regarded as patient-important in diagnostic RCTs. Moreover, to inform reimbursement decisions adequately, costs should also be included as outcomes to allow cost-effectiveness analyses.

CONCLUSION

Although the number of RCTs in PET is increasing, RCTs are still lacking in those areas in which accuracy studies have traditionally been performed. Thus, in PET research the move from accuracy studies to RCTs seems to be rather slow, suggesting that in the near future we will still have to deduce clinical benefit from accuracy studies to inform guidelines or reimbursement decisions. Furthermore, in the future the emphasis should not only be on randomization when increasing the number of RCTs but also on quality criteria ensuring that these studies can actually contribute to an assessment of a genuine clinical benefit. To achieve these improvements, the choice of patient-important outcomes and sufficient sample sizes may be the main issues.

DISCLOSURE

The costs of publication of this article were defrayed in part by the payment of page charges. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734. No potential conflict of interest relevant to this article was reported.

Acknowledgments

We acknowledge the excellent support of Franklin Torres in editing and proofreading the manuscript and of Monika Richards in preparing the final version. We are grateful to the 3 reviewers for their valuable comments.

Footnotes

Published online Jun. 9, 2014.

REFERENCES

1.↵
1. Schünemann HJ,
2. Oxman AD,
3. Brozek J,
4. et al
.; GRADE Working Group. Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. BMJ. 2008;336:1106–1110.
OpenUrl FREE Full Text
2.↵
1. Tunis SR,
2. Benner J,
3. McClellan M
. Comparative effectiveness research: policy context, methods development and research infrastructure. Stat Med. 2010;29:1963–1976.
OpenUrl CrossRef PubMed
3.↵
1. Fletcher JW,
2. Djulbegovic B,
3. Soares HP,
4. et al
. Recommendations on the use of ¹⁸F-FDG PET in oncology. J Nucl Med. 2008;49:480–508.
OpenUrl Abstract/FREE Full Text
4.
1. Scheibler F,
2. Raatz H,
3. Suter K,
4. et al
. Benefit assessment of PET in malignant lymphomas: the IQWiG point of view. Nuklearmedizin. 2010;49:1–5.
OpenUrl PubMed
5.↵
1. Ferrante di Ruffano L,
2. Hyde CJ,
3. McCaffery KJ,
4. Bossuyt PM,
5. Deeks JJ
. Assessing the value of diagnostic tests: a framework for designing and evaluating trials. BMJ. 2012;344:e686.
OpenUrl FREE Full Text
6.↵
Medical Services Advisory Committee. Guidelines for the Assessment of Diagnostic Technologies. Canberra, Australian Capital Territory, Australia: Medical Services Advisory Committee; 2005.
7.↵
1. Trikalinos TA,
2. Siebert U,
3. Lau J
. Decision-analytic modeling to evaluate benefits and harms of medical tests: uses and limitations. Med Decis Making. 2009;29:E22–E29.
OpenUrl Abstract
8.↵
1. Vach W,
2. Høilund-Carlsen PF,
3. Gerke O,
4. Weber WA
. Generating evidence for clinical benefit of PET/CT in diagnosing cancer patients. J Nucl Med. 2011;52(suppl 2):77S–85S.
OpenUrl Abstract/FREE Full Text
9.↵
1. van Tinteren H,
2. Hoekstra OS,
3. Smit EF,
4. et al
. Effectiveness of positron emission tomography in the preoperative assessment of patients with suspected non-small-cell lung cancer: the PLUS multicentre randomised trial. Lancet. 2002;359:1388–1393.
OpenUrl CrossRef PubMed
10.↵
1. Scheibler F,
2. Zumbé P,
3. Janssen I,
4. et al
. Randomized controlled trials on PET: a systematic review of topics, design, and quality. J Nucl Med. 2012;53:1016–1025.
OpenUrl Abstract/FREE Full Text
11.↵
1. Guyatt G,
2. Montori V,
3. Devereaux PJ,
4. Schünemann H,
5. Bhandari M
. Patients at the center: in our practice, and in our use of language. ACP J Club. 2004;140:A11–A12.
OpenUrl PubMed
12.↵
1. Deeks JJ
. Assessing outcomes following tests. In: Price CP, Christenson EH, eds. Evidence-Based Laboratory Medicine: Principles, Practice and Outcomes. 2nd ed. Washington, DC: AACC Press; 2007:95–111.
13.↵
1. Lu B,
2. Gatsonis C
. Efficiency of study designs in diagnostic randomized clinical trials. Stat Med. 2013;32:1451–1466.
OpenUrl CrossRef PubMed
14.↵
1. de Bree R,
2. van der Putten L,
3. Hoekstra OS,
4. et al
.; RELAPS Study Group. A randomized trial of PET scanning to improve diagnostic yield of direct laryngoscopy in patients with suspicion of recurrent laryngeal carcinoma after radiotherapy. Contemp Clin Trials. 2007;28:705–712.
OpenUrl CrossRef PubMed
15.↵
1. Plewnia C,
2. Reimold M,
3. Najib A,
4. Reischl G,
5. Plontke SK,
6. Gerloff C
. Moderate therapeutic efficacy of positron emission tomography-navigated repetitive transcranial magnetic stimulation for chronic tinnitus: a randomised, controlled pilot study. J Neurol Neurosurg Psychiatry. 2007;78:152–156.
OpenUrl Abstract/FREE Full Text
16.↵
1. Mullani NA,
2. Caras D,
3. Ahn C,
4. et al
. Fewer women than men have positive SPECT and PET cardiac findings among patients with no history of heart disease. J Nucl Med. 2000;41:263–268.
OpenUrl Abstract/FREE Full Text
17.↵
1. Maziak DE,
2. Darling GE,
3. Inculet RI,
4. et al
. Positron emission tomography in staging early lung cancer: a randomized trial. Ann Intern Med. 2009;151:221–228.
OpenUrl CrossRef PubMed
18.↵
1. Sobhani I,
2. Tiret E,
3. Lebtahi R,
4. et al
. Early detection of recurrence by ¹⁸FDG-PET in the follow-up of patients with colorectal cancer. Br J Cancer. 2008;98:875–880.
OpenUrl CrossRef PubMed
19.↵
1. Monteil J,
2. Vergnenègre A,
3. Bertin F,
4. et al
. Randomized follow-up study of resected NSCLC patients: conventional versus ¹⁸F-DG coincidence imaging. Anticancer Res. 2010;30:3811–3816.
OpenUrl Abstract/FREE Full Text
20.↵
1. Herder GJ,
2. Kramer H,
3. Hoekstra OS,
4. et al
. Traditional versus up-front [¹⁸F] fluorodeoxyglucose-positron emission tomography staging of non-small-cell lung cancer: a Dutch cooperative randomized study. J Clin Oncol. 2006;24:1800–1806.
OpenUrl Abstract/FREE Full Text
21.↵
1. Beanlands RS,
2. Nichol G,
3. Huszti E,
4. et al
. F-18-fluorodeoxyglucose positron emission tomography imaging-assisted management of patients with severe left ventricular dysfunction and suspected coronary disease: a randomized, controlled trial (PARR-2). J Am Coll Cardiol. 2007;50:2002–2012.
OpenUrl CrossRef PubMed
22.
1. Siebelink HM,
2. Blanksma PK,
3. Crijns HJ,
4. et al
. No difference in cardiac event-free survival between positron emission tomography-guided and single-photon emission computed tomography-guided patient management: a prospective, randomized comparison of patients with suspicion of jeopardized myocardium. J Am Coll Cardiol. 2001;37:81–88.
OpenUrl CrossRef PubMed
23.↵
1. Tsai CS,
2. Lai CH,
3. Chang TC,
4. et al
. A prospective randomized trial to study the impact of pretreatment FDG-PET for cervical cancer patients with MRI-detected positive pelvic but negative para-aortic lymphadenopathy. Int J Radiat Oncol Biol Phys. 2010;76:477–484.
OpenUrl CrossRef PubMed
24.↵
1. Viney RC,
2. Boyer MJ,
3. King MT,
4. et al
. Randomized controlled trial of the role of positron emission tomography in the management of stage I and II non-small-cell lung cancer. J Clin Oncol. 2004;22:2357–2362.
OpenUrl Abstract/FREE Full Text
25.↵
1. Fischer B,
2. Lassen U,
3. Mortensen J,
4. et al
. Preoperative staging of lung cancer with combined PET-CT. N Engl J Med. 2009;361:32–39.
OpenUrl CrossRef PubMed
26.↵
1. Ruers TJ,
2. Wiering B,
3. van der Sijp JR,
4. et al
. Improved selection of patients for hepatic surgery of colorectal liver metastases with ¹⁸F-FDG PET: a randomized study. J Nucl Med. 2009;50:1036–1041.
OpenUrl Abstract/FREE Full Text
27.↵
1. Djulbegovic B,
2. Kumar A,
3. Soares HP,
4. et al
. Treatment success in cancer: new cancer treatment successes identified in phase 3 randomized controlled trials conducted by the National Cancer Institute sponsored cooperative oncology groups, 1955 to 2006. Arch Intern Med. 2008;168:632–642.
OpenUrl CrossRef PubMed
28.↵
1. Fleming TR,
2. DeMets DL
. Surrogate end points in clinical trials: are we being misled? Ann Intern Med. 1996;125:605–613.
OpenUrl CrossRef PubMed
29.↵
1. Picardi M,
2. De Renzo A,
3. Pane F,
4. et al
. Randomized comparison of consolidation radiation versus observation in bulky Hodgkin’s lymphoma with post-chemotherapy negative positron emission tomography scans. Leuk Lymphoma. 2007;48:1721–1727.
OpenUrl CrossRef PubMed
30.↵
1. Rohren EM,
2. Turkington TG,
3. Coleman RE
. Clinical applications of PET in oncology. Radiology. 2004;231:305–332.
OpenUrl CrossRef PubMed
31.↵
1. Buyse M,
2. Michiels S,
3. Sargent DJ,
4. et al
. Integrating biomarkers in clinical trials. Expert Rev Mol Diagn. 2011;11:171–182.
OpenUrl CrossRef PubMed
32.↵
1. Sargent DJ,
2. Conley BA,
3. Allegra C,
4. Collette L
. Clinical trial designs for predictive marker validation in cancer treatment trials. J Clin Oncol. 2005;23:2020–2027.
OpenUrl Abstract/FREE Full Text
33.
1. Klocke FJ,
2. Baird MG,
3. Lorell BH,
4. et al
. ACC/AHA/ASNC guidelines for the clinical use of cardiac radionuclide imaging: executive summary—a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation. 2003;108:1404–1418.
OpenUrl FREE Full Text
34.
1. Silvestri GA,
2. Tanoue LT,
3. Margolis ML,
4. et al
. The noninvasive staging of non-small cell lung cancer: the guidelines. Chest. 2003;123(suppl):147S–156S.
OpenUrl CrossRef PubMed
35.
1. Silvestri GA,
2. Gould MK,
3. Margolis ML,
4. et al
. Noninvasive staging of non-small cell lung cancer: ACCP evidenced-based clinical practice guidelines (2nd ed.). Chest. 2007;132(suppl):178S–201S.
36.
The American Thoracic Society and the European Respiratory Society. Pretreatment evaluation of non-small-cell lung cancer. Am J Respir Crit Care Med. 1997;156:320–332.
OpenUrl CrossRef PubMed
37.
1. Desch CE,
2. Benson AB,
3. Somerfield MR,
4. et al
. Colorectal cancer surveillance: 2005 update of an American Society of Clinical Oncology practice guideline. J Clin Oncol. 2005;23:8512–8519.
OpenUrl Abstract/FREE Full Text
38.
1. Desch CE,
2. Benson AB,
3. Smith TJ,
4. et al
. Recommended colorectal cancer surveillance guidelines by the American Society of Clinical Oncology. J Clin Oncol. 1999;17:1312–1321.
OpenUrl Abstract/FREE Full Text
39.
1. Benson AB,
2. Desch CE,
3. Flynn PJ,
4. et al
. 2000 update of American Society of Clinical Oncology colorectal cancer surveillance guidelines. J Clin Oncol. 2000;18:3586–3588.
OpenUrl FREE Full Text
40.
1. Balleyguier C,
2. Sala E,
3. Da Cunha T,
4. et al
. Staging of uterine cervical cancer with MRI: guidelines of the European Society of Urogenital Radiology. Eur Radiol. 2011;21:1102–1110.
OpenUrl CrossRef PubMed

Received for publication June 10, 2013.
Accepted for publication April 28, 2014.

In this issue

Download PDF

Article Alerts

Email Article

Citation Tools

Bookmark this article

Cited By...

Google Scholar

More in this TOC Section

Show more Clinical Investigations

Keywords

[1] 1.↵
Schünemann HJ,
Oxman AD,
Brozek J,
et al
.; GRADE Working Group. Grading quality of evidence and strength of recommendations for diagnostic tests and strategies. BMJ. 2008;336:1106–1110.
OpenUrl FREE Full Text

[2] Schünemann HJ,

[3] Oxman AD,

[4] Brozek J,

[5] et al

[6] 2.↵
Tunis SR,
Benner J,
McClellan M
. Comparative effectiveness research: policy context, methods development and research infrastructure. Stat Med. 2010;29:1963–1976.
OpenUrl CrossRef PubMed

[7] Tunis SR,

[8] Benner J,

[9] McClellan M

[10] 3.↵
Fletcher JW,
Djulbegovic B,
Soares HP,
et al
. Recommendations on the use of ¹⁸F-FDG PET in oncology. J Nucl Med. 2008;49:480–508.
OpenUrl Abstract/FREE Full Text

[11] Fletcher JW,

[12] Djulbegovic B,

[13] Soares HP,

[14] et al

[15] 4.
Scheibler F,
Raatz H,
Suter K,
et al
. Benefit assessment of PET in malignant lymphomas: the IQWiG point of view. Nuklearmedizin. 2010;49:1–5.
OpenUrl PubMed

[16] Scheibler F,

[17] Raatz H,

[18] Suter K,

[19] et al

[20] 5.↵
Ferrante di Ruffano L,
Hyde CJ,
McCaffery KJ,
Bossuyt PM,
Deeks JJ
. Assessing the value of diagnostic tests: a framework for designing and evaluating trials. BMJ. 2012;344:e686.
OpenUrl FREE Full Text

[21] Ferrante di Ruffano L,

[22] Hyde CJ,

[23] McCaffery KJ,

[24] Bossuyt PM,

[25] Deeks JJ

[26] 6.↵
Medical Services Advisory Committee. Guidelines for the Assessment of Diagnostic Technologies. Canberra, Australian Capital Territory, Australia: Medical Services Advisory Committee; 2005.

[27] 7.↵
Trikalinos TA,
Siebert U,
Lau J
. Decision-analytic modeling to evaluate benefits and harms of medical tests: uses and limitations. Med Decis Making. 2009;29:E22–E29.
OpenUrl Abstract

[28] Trikalinos TA,

[29] Siebert U,

[30] Lau J

[31] 8.↵
Vach W,
Høilund-Carlsen PF,
Gerke O,
Weber WA
. Generating evidence for clinical benefit of PET/CT in diagnosing cancer patients. J Nucl Med. 2011;52(suppl 2):77S–85S.
OpenUrl Abstract/FREE Full Text

[32] Vach W,

[33] Høilund-Carlsen PF,

[34] Gerke O,

[35] Weber WA

[36] 9.↵
van Tinteren H,
Hoekstra OS,
Smit EF,
et al
. Effectiveness of positron emission tomography in the preoperative assessment of patients with suspected non-small-cell lung cancer: the PLUS multicentre randomised trial. Lancet. 2002;359:1388–1393.
OpenUrl CrossRef PubMed

[37] van Tinteren H,

[38] Hoekstra OS,

[39] Smit EF,

[40] et al

[41] 10.↵
Scheibler F,
Zumbé P,
Janssen I,
et al
. Randomized controlled trials on PET: a systematic review of topics, design, and quality. J Nucl Med. 2012;53:1016–1025.
OpenUrl Abstract/FREE Full Text

[42] Scheibler F,

[43] Zumbé P,

[44] Janssen I,

[45] et al

[46] 11.↵
Guyatt G,
Montori V,
Devereaux PJ,
Schünemann H,
Bhandari M
. Patients at the center: in our practice, and in our use of language. ACP J Club. 2004;140:A11–A12.
OpenUrl PubMed

[47] Guyatt G,

[48] Montori V,

[49] Devereaux PJ,

[50] Schünemann H,

[51] Bhandari M

[52] 12.↵
Deeks JJ
. Assessing outcomes following tests. In: Price CP, Christenson EH, eds. Evidence-Based Laboratory Medicine: Principles, Practice and Outcomes. 2nd ed. Washington, DC: AACC Press; 2007:95–111.

[53] Deeks JJ

[54] 13.↵
Lu B,
Gatsonis C
. Efficiency of study designs in diagnostic randomized clinical trials. Stat Med. 2013;32:1451–1466.
OpenUrl CrossRef PubMed

[55] Lu B,

[56] Gatsonis C

[57] 14.↵
de Bree R,
van der Putten L,
Hoekstra OS,
et al
.; RELAPS Study Group. A randomized trial of PET scanning to improve diagnostic yield of direct laryngoscopy in patients with suspicion of recurrent laryngeal carcinoma after radiotherapy. Contemp Clin Trials. 2007;28:705–712.
OpenUrl CrossRef PubMed

[58] de Bree R,

[59] van der Putten L,

[60] Hoekstra OS,

[61] et al

[62] 15.↵
Plewnia C,
Reimold M,
Najib A,
Reischl G,
Plontke SK,
Gerloff C
. Moderate therapeutic efficacy of positron emission tomography-navigated repetitive transcranial magnetic stimulation for chronic tinnitus: a randomised, controlled pilot study. J Neurol Neurosurg Psychiatry. 2007;78:152–156.
OpenUrl Abstract/FREE Full Text

[63] Plewnia C,

[64] Reimold M,

[65] Najib A,

[66] Reischl G,

[67] Plontke SK,

[68] Gerloff C

[69] 16.↵
Mullani NA,
Caras D,
Ahn C,
et al
. Fewer women than men have positive SPECT and PET cardiac findings among patients with no history of heart disease. J Nucl Med. 2000;41:263–268.
OpenUrl Abstract/FREE Full Text

[70] Mullani NA,

[71] Caras D,

[72] Ahn C,

[73] et al

[74] 17.↵
Maziak DE,
Darling GE,
Inculet RI,
et al
. Positron emission tomography in staging early lung cancer: a randomized trial. Ann Intern Med. 2009;151:221–228.
OpenUrl CrossRef PubMed

[75] Maziak DE,

[76] Darling GE,

[77] Inculet RI,

[78] et al

[79] 18.↵
Sobhani I,
Tiret E,
Lebtahi R,
et al
. Early detection of recurrence by ¹⁸FDG-PET in the follow-up of patients with colorectal cancer. Br J Cancer. 2008;98:875–880.
OpenUrl CrossRef PubMed

[80] Sobhani I,

[81] Tiret E,

[82] Lebtahi R,

[83] et al

[84] 19.↵
Monteil J,
Vergnenègre A,
Bertin F,
et al
. Randomized follow-up study of resected NSCLC patients: conventional versus ¹⁸F-DG coincidence imaging. Anticancer Res. 2010;30:3811–3816.
OpenUrl Abstract/FREE Full Text

[85] Monteil J,

[86] Vergnenègre A,

[87] Bertin F,

[88] et al

[89] 20.↵
Herder GJ,
Kramer H,
Hoekstra OS,
et al
. Traditional versus up-front [¹⁸F] fluorodeoxyglucose-positron emission tomography staging of non-small-cell lung cancer: a Dutch cooperative randomized study. J Clin Oncol. 2006;24:1800–1806.
OpenUrl Abstract/FREE Full Text

[90] Herder GJ,

[91] Kramer H,

[92] Hoekstra OS,

[93] et al

[94] 21.↵
Beanlands RS,
Nichol G,
Huszti E,
et al
. F-18-fluorodeoxyglucose positron emission tomography imaging-assisted management of patients with severe left ventricular dysfunction and suspected coronary disease: a randomized, controlled trial (PARR-2). J Am Coll Cardiol. 2007;50:2002–2012.
OpenUrl CrossRef PubMed

[95] Beanlands RS,

[96] Nichol G,

[97] Huszti E,

[98] et al

[99] 22.
Siebelink HM,
Blanksma PK,
Crijns HJ,
et al
. No difference in cardiac event-free survival between positron emission tomography-guided and single-photon emission computed tomography-guided patient management: a prospective, randomized comparison of patients with suspicion of jeopardized myocardium. J Am Coll Cardiol. 2001;37:81–88.
OpenUrl CrossRef PubMed

[100] Siebelink HM,

[101] Blanksma PK,

[102] Crijns HJ,

[103] et al

[104] 23.↵
Tsai CS,
Lai CH,
Chang TC,
et al
. A prospective randomized trial to study the impact of pretreatment FDG-PET for cervical cancer patients with MRI-detected positive pelvic but negative para-aortic lymphadenopathy. Int J Radiat Oncol Biol Phys. 2010;76:477–484.
OpenUrl CrossRef PubMed

[105] Tsai CS,

[106] Lai CH,

[107] Chang TC,

[108] et al

[109] 24.↵
Viney RC,
Boyer MJ,
King MT,
et al
. Randomized controlled trial of the role of positron emission tomography in the management of stage I and II non-small-cell lung cancer. J Clin Oncol. 2004;22:2357–2362.
OpenUrl Abstract/FREE Full Text

[110] Viney RC,

[111] Boyer MJ,

[112] King MT,

[113] et al

[114] 25.↵
Fischer B,
Lassen U,
Mortensen J,
et al
. Preoperative staging of lung cancer with combined PET-CT. N Engl J Med. 2009;361:32–39.
OpenUrl CrossRef PubMed

[115] Fischer B,

[116] Lassen U,

[117] Mortensen J,

[118] et al

[119] 26.↵
Ruers TJ,
Wiering B,
van der Sijp JR,
et al
. Improved selection of patients for hepatic surgery of colorectal liver metastases with ¹⁸F-FDG PET: a randomized study. J Nucl Med. 2009;50:1036–1041.
OpenUrl Abstract/FREE Full Text

[120] Ruers TJ,

[121] Wiering B,

[122] van der Sijp JR,

[123] et al

[124] 27.↵
Djulbegovic B,
Kumar A,
Soares HP,
et al
. Treatment success in cancer: new cancer treatment successes identified in phase 3 randomized controlled trials conducted by the National Cancer Institute sponsored cooperative oncology groups, 1955 to 2006. Arch Intern Med. 2008;168:632–642.
OpenUrl CrossRef PubMed

[125] Djulbegovic B,

[126] Kumar A,

[127] Soares HP,

[128] et al

[129] 28.↵
Fleming TR,
DeMets DL
. Surrogate end points in clinical trials: are we being misled? Ann Intern Med. 1996;125:605–613.
OpenUrl CrossRef PubMed

[130] Fleming TR,

[131] DeMets DL

[132] 29.↵
Picardi M,
De Renzo A,
Pane F,
et al
. Randomized comparison of consolidation radiation versus observation in bulky Hodgkin’s lymphoma with post-chemotherapy negative positron emission tomography scans. Leuk Lymphoma. 2007;48:1721–1727.
OpenUrl CrossRef PubMed

[133] Picardi M,

[134] De Renzo A,

[135] Pane F,

[136] et al

[137] 30.↵
Rohren EM,
Turkington TG,
Coleman RE
. Clinical applications of PET in oncology. Radiology. 2004;231:305–332.
OpenUrl CrossRef PubMed

[138] Rohren EM,

[139] Turkington TG,

[140] Coleman RE

[141] 31.↵
Buyse M,
Michiels S,
Sargent DJ,
et al
. Integrating biomarkers in clinical trials. Expert Rev Mol Diagn. 2011;11:171–182.
OpenUrl CrossRef PubMed

[142] Buyse M,

[143] Michiels S,

[144] Sargent DJ,

[145] et al

[146] 32.↵
Sargent DJ,
Conley BA,
Allegra C,
Collette L
. Clinical trial designs for predictive marker validation in cancer treatment trials. J Clin Oncol. 2005;23:2020–2027.
OpenUrl Abstract/FREE Full Text

[147] Sargent DJ,

[148] Conley BA,

[149] Allegra C,

[150] Collette L

[151] 33.
Klocke FJ,
Baird MG,
Lorell BH,
et al
. ACC/AHA/ASNC guidelines for the clinical use of cardiac radionuclide imaging: executive summary—a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation. 2003;108:1404–1418.
OpenUrl FREE Full Text

[152] Klocke FJ,

[153] Baird MG,

[154] Lorell BH,

[155] et al

[156] 34.
Silvestri GA,
Tanoue LT,
Margolis ML,
et al
. The noninvasive staging of non-small cell lung cancer: the guidelines. Chest. 2003;123(suppl):147S–156S.
OpenUrl CrossRef PubMed

[157] Silvestri GA,

[158] Tanoue LT,

[159] Margolis ML,

[160] et al

[161] 35.
Silvestri GA,
Gould MK,
Margolis ML,
et al
. Noninvasive staging of non-small cell lung cancer: ACCP evidenced-based clinical practice guidelines (2nd ed.). Chest. 2007;132(suppl):178S–201S.

[162] Silvestri GA,

[163] Gould MK,

[164] Margolis ML,

[165] et al

[166] 36.
The American Thoracic Society and the European Respiratory Society. Pretreatment evaluation of non-small-cell lung cancer. Am J Respir Crit Care Med. 1997;156:320–332.
OpenUrl CrossRef PubMed

[167] 37.
Desch CE,
Benson AB,
Somerfield MR,
et al
. Colorectal cancer surveillance: 2005 update of an American Society of Clinical Oncology practice guideline. J Clin Oncol. 2005;23:8512–8519.
OpenUrl Abstract/FREE Full Text

[168] Desch CE,

[169] Benson AB,

[170] Somerfield MR,

[171] et al

[172] 38.
Desch CE,
Benson AB,
Smith TJ,
et al
. Recommended colorectal cancer surveillance guidelines by the American Society of Clinical Oncology. J Clin Oncol. 1999;17:1312–1321.
OpenUrl Abstract/FREE Full Text

[173] Desch CE,

[174] Benson AB,

[175] Smith TJ,

[176] et al

[177] 39.
Benson AB,
Desch CE,
Flynn PJ,
et al
. 2000 update of American Society of Clinical Oncology colorectal cancer surveillance guidelines. J Clin Oncol. 2000;18:3586–3588.
OpenUrl FREE Full Text

[178] Benson AB,

[179] Desch CE,

[180] Flynn PJ,

[181] et al

[182] 40.
Balleyguier C,
Sala E,
Da Cunha T,
et al
. Staging of uterine cervical cancer with MRI: guidelines of the European Society of Urogenital Radiology. Eur Radiol. 2011;21:1102–1110.
OpenUrl CrossRef PubMed

[183] Balleyguier C,

[184] Sala E,

[185] Da Cunha T,

[186] et al

Main menu

User menu

Search

The Move from Accuracy Studies to Randomized Trials in PET: Current Status and Future Directions

Abstract

MATERIALS AND METHODS

Study Inclusion Criteria and Search Strategies

Study Selection and Data Extraction

Study Characteristics and Quality Criteria

Patient-Important Outcome

Sufficient Sample Size

Current Standard as Comparator

RESULTS

Published Studies

Registered Studies

Comparison with Results of Scheibler et al.

DISCUSSION

CONCLUSION

DISCLOSURE

Acknowledgments

Footnotes

REFERENCES

In this issue

Citation Manager Formats

Related Articles

Cited By...

More in this TOC Section

Similar Articles

Keywords

Main menu

User menu

Search

The Move from Accuracy Studies to Randomized Trials in PET: Current Status and Future Directions

Abstract

MATERIALS AND METHODS

Study Inclusion Criteria and Search Strategies

Study Selection and Data Extraction

Study Characteristics and Quality Criteria

Patient-Important Outcome

Sufficient Sample Size

Current Standard as Comparator

RESULTS

Published Studies

Registered Studies

Comparison with Results of Scheibler et al.

DISCUSSION

CONCLUSION

DISCLOSURE

Acknowledgments

Footnotes

REFERENCES

In this issue

Citation Manager Formats

Jump to section

Related Articles

Cited By...

More in this TOC Section

Similar Articles

Keywords