TABLE 2

Quality Factors in Radiomics Studies

FactorDescription
Imaging
 Standardized imaging protocolsImaging acquisition protocols are well described and ideally similar across patients. Alternatively, methodologic steps are taken toward standardizing them.
 Imaging quality assuranceMethodologic steps are taken to incorporate only acquired images of sufficient quality.
 CalibrationComputation of radiomics features and image-processing steps matches benchmarks of the IBSI.
Experimental setup
 Multiinstitutional/external datasetsModel construction or performance is evaluated using cohorts from different institutions, ideally from different parts of world.
 Registration of prospective studyProspective studies provide the highest level of evidence supporting clinical validity and usefulness of radiomics models.
Feature selection
 Feature robustnessRobustness of features is evaluated against segmentation variations and varying imaging settings (e.g., noise fluctuations and interscanner differences). Unreliable features are discarded.
 Feature complementarityIntercorrelation of features is evaluated; redundant features are discarded.
Model assessment
 False-discovery correctionsCorrections for multiple testing comparisons (e.g., Bonferroni or Benjamini–Hochberg) is applied in univariate analysis.
 Estimation of model performanceTeaching dataset is separated into training and validation sets to estimate optimal model parameters (e.g., using bootstrapping, cross-validation, and random subsampling)
 Independent testingA testing set distinct from the teaching set is used to evaluate performance of complete models (i.e., without retraining and without adaptation of cutoffs). Evaluation of performance is unbiased and not used to optimize model parameters.
 Performance results consistencyModel performance in training, validation and testing sets is reported. Consistency checks of performance measures across the different sets are performed.
 Comparison to conventional metricsPerformance of radiomics-based models is compared against conventional metrics such as tumor volume and clinical variables (e.g., staging) to evaluate the added value of radiomics (e.g., by assessing the significance of AUC increase with the DeLong test).
 Multivariable analysis with nonradiomics variablesMultivariable analysis integrates variables other than radiomics features (e.g., clinical information, demographic data, and panomics).
Clinical implications
 Biologic correlateRelationship between macroscopic tumor phenotypes described with radiomics and underlying microscopic tumor biology is assessed.
 Potential clinical applicationCurrent and potential applications of proposed radiomics-based models in clinical setting are discussed.
Material availability
 Open dataImaging data, tumor region of interest, and clinical information are made available.
 Open codeSoftware code for computation of features, statistical analysis, machine learning, and exact reproduction of results, is open-source. Code package is ideally shared as easy-to-run organized scripts pointing to other relevant pieces of code, along with useful sets of instructions.
 Open modelsComplete models are available, including model parameters and cutoffs.
  • AUC = area under the receiver-operating-characteristic curve.