Skip to main content

Main menu

  • Home
  • Content
    • Current
    • Ahead of print
    • Past Issues
    • JNM Supplement
    • SNMMI Annual Meeting Abstracts
    • Continuing Education
    • JNM Podcasts
  • Subscriptions
    • Subscribers
    • Institutional and Non-member
    • Rates
    • Journal Claims
    • Corporate & Special Sales
  • Authors
    • Submit to JNM
    • Information for Authors
    • Assignment of Copyright
    • AQARA requirements
  • Info
    • Reviewers
    • Permissions
    • Advertisers
  • About
    • About Us
    • Editorial Board
    • Contact Information
  • More
    • Alerts
    • Feedback
    • Help
    • SNMMI Journals
  • SNMMI
    • JNM
    • JNMT
    • SNMMI Journals
    • SNMMI

User menu

  • Subscribe
  • My alerts
  • Log in
  • Log out
  • My Cart

Search

  • Advanced search
Journal of Nuclear Medicine
  • SNMMI
    • JNM
    • JNMT
    • SNMMI Journals
    • SNMMI
  • Subscribe
  • My alerts
  • Log in
  • Log out
  • My Cart
Journal of Nuclear Medicine

Advanced Search

  • Home
  • Content
    • Current
    • Ahead of print
    • Past Issues
    • JNM Supplement
    • SNMMI Annual Meeting Abstracts
    • Continuing Education
    • JNM Podcasts
  • Subscriptions
    • Subscribers
    • Institutional and Non-member
    • Rates
    • Journal Claims
    • Corporate & Special Sales
  • Authors
    • Submit to JNM
    • Information for Authors
    • Assignment of Copyright
    • AQARA requirements
  • Info
    • Reviewers
    • Permissions
    • Advertisers
  • About
    • About Us
    • Editorial Board
    • Contact Information
  • More
    • Alerts
    • Feedback
    • Help
    • SNMMI Journals
  • View or Listen to JNM Podcast
  • Visit JNM on Facebook
  • Join JNM on LinkedIn
  • Follow JNM on Twitter
  • Subscribe to our RSS feeds
Meeting ReportEducational Exhibits - Correlative Imaging (including instrumentation, image fusion and data analysis)

DALL-E, Midjourney, Stable Diffusion: A Nuclear Medicine How-To for Commercial AI Text-To-Image Generation Tools

Rick Wray and Randy Yeh
Journal of Nuclear Medicine June 2023, 64 (supplement 1) P1463;
Rick Wray
1Memorial Sloan Kettering Cancer Center
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Randy Yeh
1Memorial Sloan Kettering Cancer Center
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Info & Metrics
Loading

Abstract

P1463

Introduction: In 2021 we saw the release of the first invite-only commercially available text-to-image neural network graphical user interface, DALL-E. This platform combines computer vision with natural language processing allowing the user to quickly and easily create an image using commands. Later in 2022 came Midjourney, a similar competing platform open to the public. And quickly after came Stable Diffusion, the main open-source text-to-image platform available. The power of these tools is that they allow the user to create images using only simple natural language commands instantly. While these were initially created to produce digital art images, medical images could be created for machine learning and artificial intelligence research. This educational exhibit aims to introduce these novel technologies and guide nuclear medicine professionals to understand better how to use these artificial intelligence models.

Methods: All three platforms are available online with different methods of access. DALL-E requires an email account to sign up and is a fee-per-image service. Images are generated via text entry into a web-based GUI, https://openai.com/dall-e-2/. Midjourney requires a Discord account to sign up and use, and it is a subscription-based service, https://discord.com/. To generate images, the user chats with an AI bot on the Discord server. Stable diffusion is a bit more involved. There are second-party GUI's that allow you to enter text to generate images, such as Hugging Face, https://huggingface.co/spaces/stabilityai/stable-diffusion. However, Stable Diffusion is an open-source model that can be downloaded onto your computer and used for free to generate images.

Results: These models are text-to-image Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) with GUIs. No code is needed to use the models; instead, natural language in the form of "prompts" acts as the code. A prompt is a set of text instructions sent to the model to generate a specific image output. In the case of nuclear image generation, a prompt may look like this, "planar, anterior-posterior, black on white background, whole body bone scintigraphy image, with linear increase along the tibias, in the pattern of shin splints". It is important to use simple, clear, and straightforward language when prompting a model. The AI tends to directly translate prompt language to image characteristics. Once an image is created, you have the option to create a variation of the image with additional prompts to fine-tune the next iteration to optimize your image generation. Therefore the process of image generation with these models is iterative and occurs as an evolution over time. Once you have the desired final image, that comprehensive prompt can be used to make infinite additional images in a series. While DALL-E and Midjourney do allow image input for image generation, they do not effectively allow input of a sample of images to train the model to create similar images. Stable Diffision on the other hand, does allow image training on even small subsets of only 5-10 images. This represents a powerful tool for the future of medical image generation at a low cost with minimal effort and essentially free processing power.

Conclusions: We outlined the three main AI text-to-image generative models available today, explained the fundamental of image prompts, and proposed the possibility of using Stable-Diffusion as a free open-source method to generate nuclear medicine images for future machine learning research.

Previous
Back to top

In this issue

Journal of Nuclear Medicine
Vol. 64, Issue supplement 1
June 1, 2023
  • Table of Contents
  • Index by author
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Journal of Nuclear Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
DALL-E, Midjourney, Stable Diffusion: A Nuclear Medicine How-To for Commercial AI Text-To-Image Generation Tools
(Your Name) has sent you a message from Journal of Nuclear Medicine
(Your Name) thought you would like to see the Journal of Nuclear Medicine web site.
Citation Tools
DALL-E, Midjourney, Stable Diffusion: A Nuclear Medicine How-To for Commercial AI Text-To-Image Generation Tools
Rick Wray, Randy Yeh
Journal of Nuclear Medicine Jun 2023, 64 (supplement 1) P1463;

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
DALL-E, Midjourney, Stable Diffusion: A Nuclear Medicine How-To for Commercial AI Text-To-Image Generation Tools
Rick Wray, Randy Yeh
Journal of Nuclear Medicine Jun 2023, 64 (supplement 1) P1463;
Twitter logo Facebook logo LinkedIn logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Bookmark this article

Jump to section

  • Article
  • Info & Metrics

Related Articles

  • No related articles found.
  • Google Scholar

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • Identifying nuclear medicine terms for inclusion in a nuclear medicine-specific ontology (NucLex)
  • Final analysis of our experience with Lu-177-PSMA radioligand treatment at Emory University
  • Nuclear Medicine Imaging Artifacts
Show more Educational Exhibits - Correlative Imaging (including instrumentation, image fusion and data analysis)

Similar Articles

SNMMI

© 2025 SNMMI

Powered by HighWire