Skip to main content

Main menu

  • Home
  • Content
    • Current
    • Ahead of print
    • Past Issues
    • JNM Supplement
    • SNMMI Annual Meeting Abstracts
    • Continuing Education
    • JNM Podcasts
  • Subscriptions
    • Subscribers
    • Institutional and Non-member
    • Rates
    • Journal Claims
    • Corporate & Special Sales
  • Authors
    • Submit to JNM
    • Information for Authors
    • Assignment of Copyright
    • AQARA requirements
  • Info
    • Reviewers
    • Permissions
    • Advertisers
  • About
    • About Us
    • Editorial Board
    • Contact Information
  • More
    • Alerts
    • Feedback
    • Help
    • SNMMI Journals
  • SNMMI
    • JNM
    • JNMT
    • SNMMI Journals
    • SNMMI

User menu

  • Subscribe
  • My alerts
  • Log in
  • Log out
  • My Cart

Search

  • Advanced search
Journal of Nuclear Medicine
  • SNMMI
    • JNM
    • JNMT
    • SNMMI Journals
    • SNMMI
  • Subscribe
  • My alerts
  • Log in
  • Log out
  • My Cart
Journal of Nuclear Medicine

Advanced Search

  • Home
  • Content
    • Current
    • Ahead of print
    • Past Issues
    • JNM Supplement
    • SNMMI Annual Meeting Abstracts
    • Continuing Education
    • JNM Podcasts
  • Subscriptions
    • Subscribers
    • Institutional and Non-member
    • Rates
    • Journal Claims
    • Corporate & Special Sales
  • Authors
    • Submit to JNM
    • Information for Authors
    • Assignment of Copyright
    • AQARA requirements
  • Info
    • Reviewers
    • Permissions
    • Advertisers
  • About
    • About Us
    • Editorial Board
    • Contact Information
  • More
    • Alerts
    • Feedback
    • Help
    • SNMMI Journals
  • View or Listen to JNM Podcast
  • Visit JNM on Facebook
  • Join JNM on LinkedIn
  • Follow JNM on Twitter
  • Subscribe to our RSS feeds
Research ArticleOncology

The Dark Side of Radiomics: On the Paramount Importance of Publishing Negative Results

Irène Buvat and Fanny Orlhac
Journal of Nuclear Medicine November 2019, 60 (11) 1543-1544; DOI: https://doi.org/10.2967/jnumed.119.235325
Irène Buvat
Imagerie Moléculaire In Vivo, CEA-SHFJ, INSERM, CNRS, Université Paris-Sud, Université Paris-Saclay, Orsay, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fanny Orlhac
Imagerie Moléculaire In Vivo, CEA-SHFJ, INSERM, CNRS, Université Paris-Sud, Université Paris-Saclay, Orsay, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Info & Metrics
  • PDF
Loading

See the associated article on page 1545.

Over the past few years, we have witnessed an exponential increase in the number of radiomic-related publications. In the PET literature, a PubMed search using a criterion of “(radiomic OR radiomics OR texture OR textural) AND PET” yielded 37 publications in 2015 increasing to 110 in 2018. Interestingly, an extensive survey of these publications demonstrates that 94% reported positive or “promising” results involving some sophisticated radiomic features, with large variations in the performance supporting that conclusion. As few as 6% of them clearly concluded negative results, including the paper by Saadani et al. (1) published in this issue of The Journal of Nuclear Medicine. These numbers highlight a publication bias well acknowledged in many different research fields, with an ongoing debate about the actual false-discovery rate in the medical literature (2–4). This publication bias has well-understood roots. From the editors’ standpoint, positive results are often thought to be much more exciting and valuable than negative ones and are more likely to be cited, hence favorably increasing the journal influence metrics (e.g., 2-y impact factor) (5). From the authors’ perspective, positive results are more rewarding than negative findings and might better contribute to boosting their careers. As a consequence, negative radiomic results, the dark side of radiomics, currently remain mostly unpublished.

Yet, publishing negative results is a must for ethical reasons. Criteria for publishing should be the quality of the study and its statistical power, whatever the outcome. A clinically or biologically relevant question and a methodologically well-designed study should warrant publication, no matter if the null hypothesis is rejected or not. In some instances, negative studies might be even more impactful than positive ones, as they may challenge existing paradigms and invite investigators to focus research efforts on different paths. The conclusion of the Saadani et al. paper (1) is that although B-rapidly accelerated fibrosarcoma (BRAF) mutation drives the mitogen-activated protein kinase pathway and glucose metabolism in some cancers, BRAF valine 600 status could not be successfully detected using radiomic features calculated from 18F-FDG PET/CT in melanoma patients. This observation should be an incentive to further explore the connection between the genetic mutations or pathway alterations and their macroscopic consequences detectable using our in vivo imaging devices. It is reasonable to expect that some biologic dysfunctions resulting from mutations will produce a cascade of events that might ultimately yield a signal detectable by our exquisite molecular imaging scanners. However, both the magnitude and the spatiotemporal extent of the biologic effect will determine our ability to identify an abnormality from in vivo images using a given radiotracer. Investigations of the relationships between the triggering signal (here a mutation) and the net observable result (here a change in tumor glucose metabolism) are absolutely needed for two reasons: first, to establish realistic expectations regarding the potential power of radiomic features; second, to use radiomic observations as a driver to formulate more precise biologic assumptions regarding the underlying processes and subsequently test them. Advancing that field will require the publication of both positive and negative radiomic results.

One could argue that given the overwhelming number of radiomic-related publications reporting positive results, which might be the trees that hide the forest, the publication of negative results in that domain will be practically challenging. What we need are methodologically sound, properly powered, and robust radiomic studies addressing a biologically driven hypothesis and described in such a way that independent investigators can reproduce and confirm the findings. Indeed, replication has often more scientific value than original discovery in the radiomic field. To the best of our knowledge, since 2014 no PET radiomic model has ever been validated by an independent group. Even worse, misleading interpretation of some of the most famous radiomic findings (6) has recently been demonstrated (7), confirming that some radiomic features initially interpreted as biomarkers of tumor heterogeneity were actually surrogates of tumor volume, as shown as early as 2014 (8,9). These findings suggest that radiomic-related publications should be thoroughly designed with sufficient statistical power, described in such a way that they can be repeated, and they should also include well-supported interpretation, possibly based on dedicated experiments.

Among the methodologic arsenal that can be used to chase spurious and confounding effects in radiomic studies, “sham” data and permutation/randomization tests could be used more often. Sham data can be obtained from healthy regions or by reshuffling samples in artificial groups or randomly changing voxel values (7). Such sham data are useful to check that the findings are present in the real data and not in the sham data. It is thanks to such sham data that the erroneous interpretation of the radiomic features highlighted in one study (6) could be nicely demonstrated in another (7). Permutation/randomization tests also yield a precise estimate of the distribution of a test statistic under the null hypothesis in a nonparametric setting by calculating all possible values of the test statistic under rearrangements of the labels on the observed data points. They are frequently used in genomic data analysis to assign a statistical significance to the findings (10). Last, statistical reviewing should be systematic, as it is effective in improving the quality of published articles.

The validity and soundness of radiomic studies also depend on how the “vibration of effects” is handled. Vibration of effects means that results can differ over a broad range depending on how the analysis is performed. This is observed when many variations of analyses are used, such as by selecting different variables, making statistical adjustments, testing different statistical models, sorting the data differently, or using different inclusion and exclusion criteria. The variability of results as a function of the chosen analysis strategy gives an indication about their robustness. Reporting only the analyses that yield a particular result, called the chrysalis effect (11), can yield a bias, whereas consistent conclusions across various methodologic approaches, as reported in the Saadani et al. paper (1), provide a persuasive indication of the correctness of the findings.

Radiomic results should be interpreted with great circumspection. Radiomic observations are easy to report given the wide availability of software enabling the calculation of radiomic features and of packages supporting sophisticated statistical analysis and data mining. Yet, radiomics should now go beyond reporting associations and designing predictive models. It should explain what the observed patterns and associations mean from a biologic point of view, so as to enhance our understanding of the mechanisms. Dedicated experiments and multiomics approaches are needed to fully take advantage of the radiomic approach, so that the radiomic phenotype and associated models can be additional exploratory tools to untangle the complexity of the biologic processes and of their macroscopic repercussions.

A more stringent selection of radiomic studies should thus be based on tougher editorial standards. More skepticism on the part of referees is needed to avoid the inflation of false-positive findings or overzealous interpretation of radiomic results. In that respect, the 5 aspects discussed above might be systematically considered: biologic rationale justifying the study, ability to independently replicate and reproduce the findings, comprehensive control of false discoveries validated by systematic statistical reviewing, investigation of the vibration of the effects, and proofs supporting the interpretation of the results. Careful selection of publishable manuscripts based on these criteria will then leave room for publishing radiomic studies yielding negative results and confirmatory radiomic studies, both of which are absolutely needed. Dedicated sections in top-tier journals could actually be devoted to important negative and confirmatory studies. Still, it will remain virtually impossible to publish all negative radiomic findings that would be useful to avoid unnecessary and costly repeats of experiments already done. Several practical solutions can be suggested. First, proposed future studies could be registered in international radiomic trial registries, so that it would be easier to determine whether a given question has already been or is being addressed by any investigator before an investment is made in that research. Second, negative results could be made accessible in public repositories, such as arxiv.org, and referred to in review papers to facilitate their identification. Last, funding agencies should request a registration identification number of the funded radiomic studies and access to the results, including the negative findings. All these actions together could considerably reduce the number of weekly contributive radiomic studies, clarify the state of the art, balance positive against negative radiomic results, and contribute to a proper identification of radiomic models that are useful either for patient care or for advancing our knowledge regarding how in vivo imaging can probe microscopic biologic mechanisms. In addition, establishing and enforcing best practices should help us get continuous support from external stakeholders that provide funding to advance sustainable research in radiomics.

DISCLOSURE

No potential conflict of interest relevant to this article was reported.

Footnotes

  • Published online Sep. 20, 2019.

  • © 2019 by the Society of Nuclear Medicine and Molecular Imaging.

REFERENCES

  1. 1.↵
    1. Saadani H,
    2. van der Hiel B,
    3. Aalbersberg EA,
    4. et al
    . Metabolic biomarker–based BRAFV600 mutation association and prediction in melanoma. J Nucl Med. 2019;60:1545–1552.
    OpenUrl
  2. 2.↵
    1. Ioannidis JPA
    . Why most published research findings are false. PLoS Med. 2005;2:e124.
    OpenUrlCrossRefPubMed
  3. 3.
    1. Jager LR,
    2. Leek JT
    . An estimate of the science-wise false discovery rate and application to the top medical literature. Biostatistics. 2014;15:1–12.
    OpenUrlCrossRefPubMed
  4. 4.↵
    1. Chalkidou A,
    2. O’Doherty MJ,
    3. Marsden PK
    . False discovery rates in PET and CT studies with texture features: a systematic review. PLoS One. 2015;10:e0124165.
    OpenUrlCrossRefPubMed
  5. 5.↵
    1. Duyx B,
    2. Urlings MJE,
    3. Swaen GHM,
    4. Bouter LM,
    5. Zeegers MP
    . Scientific citations favor positive results: a systematic review and meta-analysis. J Clin Epidemiol. 2017;88:92–101.
    OpenUrl
  6. 6.↵
    1. Aerts HJWL,
    2. Velazquez ER,
    3. Leijenaar RT,
    4. et al
    . Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. 2014;5:4006.
    OpenUrlCrossRefPubMed
  7. 7.↵
    1. Welch ML,
    2. McIntosh C,
    3. Haibe-Kains B,
    4. et al
    . Vulnerabilities of radiomic signature development: the need for safeguards. Radiother Oncol. 2019;130:2–9.
    OpenUrl
  8. 8.↵
    1. Orlhac F,
    2. Soussan M,
    3. Maisonobe JA,
    4. Garcia CA,
    5. Vanderlinden B,
    6. Buvat I
    . Tumor texture analysis in 18F-FDG PET: relationships between texture parameters, histogram indices, standardized uptake values, metabolic volumes, and total lesion glycolysis. J Nucl Med. 2014;55:414–422.
    OpenUrlAbstract/FREE Full Text
  9. 9.↵
    1. Buvat I,
    2. Orlhac F,
    3. Soussan M
    . Tumor texture analysis in PET: where do we stand? J Nucl Med. 2015;56:1642–1644.
    OpenUrlFREE Full Text
  10. 10.↵
    1. Dudoit S,
    2. Popper Shaffer J,
    3. Boldrick JC
    . Multiple hypothesis testing in microarray experiments. Stat Sci. 2003;1:71–103.
    OpenUrl
  11. 11.↵
    1. O’Boyle EH,
    2. Banks GC,
    3. Gonzales-Mulé E
    . The chrysalis effect: how ugly initial results metamorphosize into beautiful articles. J Manage. 2014;43:376–399.
    OpenUrl
  • Received for publication August 20, 2019.
  • Accepted for publication September 10, 2019.
PreviousNext
Back to top

In this issue

Journal of Nuclear Medicine: 60 (11)
Journal of Nuclear Medicine
Vol. 60, Issue 11
November 1, 2019
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
Print
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Journal of Nuclear Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
The Dark Side of Radiomics: On the Paramount Importance of Publishing Negative Results
(Your Name) has sent you a message from Journal of Nuclear Medicine
(Your Name) thought you would like to see the Journal of Nuclear Medicine web site.
Citation Tools
The Dark Side of Radiomics: On the Paramount Importance of Publishing Negative Results
Irène Buvat, Fanny Orlhac
Journal of Nuclear Medicine Nov 2019, 60 (11) 1543-1544; DOI: 10.2967/jnumed.119.235325

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
The Dark Side of Radiomics: On the Paramount Importance of Publishing Negative Results
Irène Buvat, Fanny Orlhac
Journal of Nuclear Medicine Nov 2019, 60 (11) 1543-1544; DOI: 10.2967/jnumed.119.235325
Twitter logo Facebook logo LinkedIn logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Bookmark this article

Jump to section

  • Article
    • DISCLOSURE
    • Footnotes
    • REFERENCES
  • Info & Metrics
  • PDF

Related Articles

  • Metabolic Biomarker–Based BRAFV600 Mutation Association and Prediction in Melanoma
  • This Month in JNM
  • PubMed
  • Google Scholar

Cited By...

  • Is PET Radiomics Useful to Predict Pathologic Tumor Response and Prognosis in Locally Advanced Cervical Cancer?
  • Effects of Tracer Uptake Time in Non-Small Cell Lung Cancer 18F-FDG PET Radiomics
  • Google Scholar

More in this TOC Section

Oncology

  • The diagnostic role of 68Ga-DOTATATE PET/CT in the detection of neuroendocrine tumors
  • Utility of bone scans in patients with RCC
  • Disseminated xanthogranulomatous oophoritis mimicking malignancy on 18F-fluorodeoxyglucose positron emission tomography/computed tomography
Show more Oncology

Clinical

  • Detection of Early Progression with 18F-DCFPyL PET/CT in Men with Metastatic Castration-Resistant Prostate Cancer Receiving Bipolar Androgen Therapy
  • TauIQ: A Canonical Image Based Algorithm to Quantify Tau PET Scans
  • Dual PET Imaging in Bronchial Neuroendocrine Neoplasms: The NETPET Score as a Prognostic Biomarker
Show more Clinical

Similar Articles

SNMMI

© 2025 SNMMI

Powered by HighWire