Abstract
An important need exists for strategies to perform rigorous objective clinical-task-based evaluation of artificial intelligence (AI) algorithms for nuclear medicine. To address this need, we propose a four-class framework to evaluate AI algorithms for promise, technical task-specific efficacy, clinical decision making, and post-deployment efficacy. We provide best practices to evaluate AI algorithms for each of these classes. Each class of evaluation yields a claim that provides a descriptive performance of the AI algorithm. Key best practices are tabulated as the RELAINCE (Recommendations for EvaLuation of AI for NuClear medicinE) guidelines. The report was prepared by the Society of Nuclear Medicine and Molecular Imaging AI taskforce Evaluation team, which consisted of nuclear-medicine physicians, physicists, computational imaging scientists, and representatives from industry and regulatory agencies.
- Image Processing
- Image Reconstruction
- PET
- Research Methods
- SPECT
- Artificial intelligence
- Clinical task-based evaluation
- Clinical utility
- Generalizability
- Post-deployment monitoring
- Copyright © 2022 by the Society of Nuclear Medicine and Molecular Imaging, Inc.