Skip to main content

Main menu

  • Home
  • Content
    • Current
    • Ahead of print
    • Past Issues
    • JNM Supplement
    • SNMMI Annual Meeting Abstracts
    • Continuing Education
    • JNM Podcasts
  • Subscriptions
    • Subscribers
    • Institutional and Non-member
    • Rates
    • Journal Claims
    • Corporate & Special Sales
  • Authors
    • Submit to JNM
    • Information for Authors
    • Assignment of Copyright
    • AQARA requirements
  • Info
    • Reviewers
    • Permissions
    • Advertisers
  • About
    • About Us
    • Editorial Board
    • Contact Information
  • More
    • Alerts
    • Feedback
    • Help
    • SNMMI Journals
  • SNMMI
    • JNM
    • JNMT
    • SNMMI Journals
    • SNMMI

User menu

  • Subscribe
  • My alerts
  • Log in
  • My Cart

Search

  • Advanced search
Journal of Nuclear Medicine
  • SNMMI
    • JNM
    • JNMT
    • SNMMI Journals
    • SNMMI
  • Subscribe
  • My alerts
  • Log in
  • My Cart
Journal of Nuclear Medicine

Advanced Search

  • Home
  • Content
    • Current
    • Ahead of print
    • Past Issues
    • JNM Supplement
    • SNMMI Annual Meeting Abstracts
    • Continuing Education
    • JNM Podcasts
  • Subscriptions
    • Subscribers
    • Institutional and Non-member
    • Rates
    • Journal Claims
    • Corporate & Special Sales
  • Authors
    • Submit to JNM
    • Information for Authors
    • Assignment of Copyright
    • AQARA requirements
  • Info
    • Reviewers
    • Permissions
    • Advertisers
  • About
    • About Us
    • Editorial Board
    • Contact Information
  • More
    • Alerts
    • Feedback
    • Help
    • SNMMI Journals
  • View or Listen to JNM Podcast
  • Visit JNM on Facebook
  • Join JNM on LinkedIn
  • Follow JNM on Twitter
  • Subscribe to our RSS feeds
Meeting ReportData Analysis & Management

Effect of harmonization and oversampling methods on multi-center imbalanced PET datasets: Application to radiomics-based NSCLC-subtype prediction

Dongyang Du, Isaac Shiri, Fereshteh Yousefirizi, Habib Zaidi, Lijun Lu and Arman Rahmim
Journal of Nuclear Medicine August 2022, 63 (supplement 2) 3166;
Dongyang Du
1School of Biomedical Engineering and Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, Guangdong 510515, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Isaac Shiri
2Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211, Geneva 4, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fereshteh Yousefirizi
3BCCRC/UBC
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Habib Zaidi
4Geneva University Hospital
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lijun Lu
1School of Biomedical Engineering and Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, Guangdong 510515, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Arman Rahmim
5University of British Columbia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
Loading

Abstract

3166

Introduction: Medical imaging data frequently encounter image-generation heterogeneity and class imbalance properties, challenging strong generalized predictive performances with data-driven learning methods. The purpose of this study was to investigate the impact of harmonization and oversampling methods for multi-center imbalanced datasets in PET, with specific application to radiomics-based predictive modeling of histologic subtype of non-small cell lung cancer (NSCLC).

Methods: Radiomics analysis was performed on PET images acquired on multi-vendor (Philips, Siemens and GE) PET/CT scanners, and reconstructed using different methods (i.e., VPFX, VPHD, VPHDS, OSEM). Hundred twenty five patients with adenocarcinoma (ADC) and 27 patients with squamous cell carcinoma (SCC) from two independent institutions were randomly divided into training (50%) and testing (50%) datasets, with approximately matching class-imbalance proportions, repeating this process 50 times for further statistical analysis. The predictive performance was investigated for 25 cross-combinations derived from no harmonization or 4 harmonization methods (ComBat, centering-scaling, Singular Value Decomposition (SVD)-based matrix factorization and Independent Component Analysis (ICA)-based matrix factorization) coupled to no oversampling (NOS) or 4 oversampling methods (synthetic minority oversampling technique (SMOTE), adaptive synthetic (ADASYN), borderline-SMOTE (BLSMOTE) and safe-level-SMOTE (SLSMOTE)). Before feature extraction all images were interpolated to isotropic voxel spacing of 4 ×4×4 mm3. Two hundred fifteen radiomic features (79 first order (including morphological, statistical, histogram and intensity-histogram), 136 3D texture features (including GLCM, GLRLM, GLSZM, GLDZM, NGTDM and NGLDM matrices) ) were extracted using the standardized publicly-available standardized environment for radiomics analysis (SERA). The minimum redundancy-maximum relevance (MRMR) feature selection method was used to reduce feature dimensionality, and the top k features were input into logistic regression classifier (k was determined via 5-fold cross validation within the training set). Area under the receiver operating characteristic curve (AUC) and balanced accuracy were used to evaluate predictive performance, and p-values were reported using the paired t-test for comparison of methods.

Results: ComBat harmonization (AUC 0.711; balanced accuracy 0.641) and BLSMOTE oversampling method (AUC 0.667; balanced accuracy 0.607) showed good mean performance amongst harmonization and oversampling methods employed in this study. The optimal harmonization and oversampling methods ComBat + BLSMOTE performed significantly better than combination of no harmonization + no oversampling (NOS) in terms of AUC (0.708 ± 0.062 vs. 0.640 ± 0.064, p < 0.0001) and balanced accuracy (0.651 ± 0.067 vs. 0.544 ± 0.031, p < 0.0001).

Conclusions: Our study showed a significant positive improvement in NSCLC-subtype predictive performance in multi-center imbalanced PET radiomics analysis when applying harmonization and oversampling methods. Harmonization improved data overall-consistency by removing batch effect, while oversampling increased intra-class diversity by generating new samples, which could potentially improve biological status capturing in NSCLC.

Figure
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure
  • Download figure
  • Open in new tab
  • Download powerpoint
Previous
Back to top

In this issue

Journal of Nuclear Medicine
Vol. 63, Issue supplement 2
August 1, 2022
  • Table of Contents
  • Index by author
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Journal of Nuclear Medicine.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Effect of harmonization and oversampling methods on multi-center imbalanced PET datasets: Application to radiomics-based NSCLC-subtype prediction
(Your Name) has sent you a message from Journal of Nuclear Medicine
(Your Name) thought you would like to see the Journal of Nuclear Medicine web site.
Citation Tools
Effect of harmonization and oversampling methods on multi-center imbalanced PET datasets: Application to radiomics-based NSCLC-subtype prediction
Dongyang Du, Isaac Shiri, Fereshteh Yousefirizi, Habib Zaidi, Lijun Lu, Arman Rahmim
Journal of Nuclear Medicine Aug 2022, 63 (supplement 2) 3166;

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Share
Effect of harmonization and oversampling methods on multi-center imbalanced PET datasets: Application to radiomics-based NSCLC-subtype prediction
Dongyang Du, Isaac Shiri, Fereshteh Yousefirizi, Habib Zaidi, Lijun Lu, Arman Rahmim
Journal of Nuclear Medicine Aug 2022, 63 (supplement 2) 3166;
Twitter logo Facebook logo LinkedIn logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One
Bookmark this article

Jump to section

  • Article
  • Figures & Data
  • Info & Metrics

Related Articles

  • No related articles found.
  • Google Scholar

Cited By...

  • No citing articles found.
  • Google Scholar

More in this TOC Section

  • Automated liver reference region localization in PET/CT in presence of non-physiological tracer uptake
  • Impact of Equilibration Time (t*) on Patlak Quantitation in Dynamic Total-Body Imaging using the uEXPLORER PET Scanner
  • Progression-free survival prediction in head and neck cancer: A comparative study between conventional PET indices and radiomics models.
Show more Data Analysis & Management

Similar Articles

SNMMI

© 2025 SNMMI

Powered by HighWire