Abstract
PET combined with CT and prostate-specific membrane antigen (PSMA) ligands has gained significant interest for staging prostate cancer (PC). In this study, we propose 2 multimodal quantitative indices as imaging biomarkers for the assessment of osseous tumor burden using 68Ga-PSMA PET/CT and present preliminary clinical data. Methods: We defined 2 bone PET indices (BPIs) that incorporate anatomic information from CT and functional information from 68Ga-PSMA PET: BPIVOL is the percentage of bone volume affected by tumor and BPISUV additionally considers the level of PSMA expression. We describe a semiautomatic computation method based on segmentation of bones in CT and of lesions in PET. Data from 45 patients with castration-resistant PC and bone metastases during 223Ra-dichloride were retrospectively analyzed. We evaluated the computational stability and reproducibility of the proposed indices and explored their relation to the prostate-specific antigen blood value, the bone scan index (BSI), and disease classification using PERCIST. Results: On the technical side, BPIVOL and BPISUV showed an interobserver maximum difference of 3.5%, and their computation took only a few minutes. On the clinical side, BPIVOL and BPISUV showed significant correlations with BSI (r = 0.76 and 0.74, respectively, P < 0.001) and prostate-specific antigen values (r = 0.57 and 0.54, respectively, P < 0.01). When the proposed indices were compared against expert rating using PERCIST, BPIVOL and BPISUV showed better agreement than BSI, indicating their potential for objective response evaluation. Conclusion: We propose the evaluation of BPIVOL and BPISUV as imaging biomarkers for 68Ga-PSMA PET/CT in a prospective study exploring their potential for outcome prediction in patients with bone metastases from PC.
Patients with prostate cancer (PC), even when treated early, can later develop recurrence with cancer spread to other structures, in particular the bones. Both for primary tumor and for recurrence, bone scintigraphy, CT, and PET/CT are commonly used for staging. For bone scintigraphy, the bone scan index (BSI) (1) and its automatic computation method (2) can be used for quantitative analysis. However, in the absence of SPECT or SPECT/CT, bone scintigraphy is a 2-dimensional modality that lacks detailed anatomic information, has suboptimal specificity, and cannot reveal lymph node and visceral metastases. Because of its high detection rates and superb specificity for PC lesions 68Ga-prostate-specific membrane antigen (PSMA) PET/CT is increasingly used for staging of bone metastases instead of bone scintigraphy (3,4). So far, for cross-sectional imaging only qualitative to semiquantitative methods for assessment of tumor burden and response such as RECIST (5) for CT and PERCIST (6,7) for PET/CT have been developed. Additional important limitations for PC staging are RECIST is suboptimal for bone metastases staging because osteoblastic metastases without extraosseous tumor involvement are mainly regarded as nonmeasurable and PERCIST has been evaluated only for 18F-FDG PET, which is rarely used in these patients.
Therefore, a comprehensive quantitative imaging biomarker that is capable of measuring the whole-body tumor burden, exploring the potential of a 68Ga-PSMA, is an unmet clinical need. A first approach for the quantitative assessment of 18F-fluoride PET has been developed by Etchebehere et al. (8). However, it is unimodal and neglects the anatomic information contained in the CT image, therefore impeding interpatient comparability by, for example, not taking into account differences in patient size.
In this work, our aim was to define multimodal quantitative imaging indices using hybrid information from a 68Ga-PSMA PET/CT scan. In a first step toward full-body quantification, we focused on bone tumor burden. We also developed a method to compute the indices automatically with possible manual corrections, so that they can be easily implemented in clinical practice. In addition, we have performed preliminary clinical analyses applying this method to a retrospective cohort of PC patients with bone metastases who underwent 68Ga-PSMA PET/CT under 223Ra-dichloride and compared the results with a reference standard of clinical expert reading, BSI, and serum prostate-specific antigen (PSA).
MATERIALS AND METHODS
Bone PET Index (BPI)
Definition
BSI is defined as the percentage of skeletal mass affected by tumor calculated on a bone scintigram (1). Because bone scintigraphy is intrinsically 2-dimensional and lacks detailed anatomic information, a standard weighting of bones is incorporated in the calculation. The commercial software EXINI Bone BSI (EXINI Diagnostics AB) allows for automated calculation of the BSI. Inspired by this definition, we defined 2 new multimodal imaging indices for PET/CT: BPIVOL is the percentage of bone volume (including bone marrow) affected by tumor and BPISUV additionally considers the target expression measured by the average SUV. In both indices, the anatomic information was extracted from the CT image whereas the functional information was extracted from the PET image, making them intrinsically multimodal. They were calculated as follows, where N is the number of pixels p that belong to bone metastases:Notably, BPIVOL differs from the metabolic tumor volume and BPISUV from the total lesion glycosis because of normalization by skeleton volume. Contrarily to the calculation of BSI, no standard weighting of the bones is needed because patient-specific anatomic information from the CT image is used instead. In PET/CT, depending on the type of cancer, regularly only the trunk and not the whole body is imaged. Therefore, arms and legs as well as part of the head were excluded from the computation. To achieve a standardized calculation of the BPI that accounted for variation in the field of view and allowed for inter- and intrapatient comparison, only the slices between the bottom of the ischium (easily recognized on CT) and the caudal edge of the sublingual gland (easily recognized because of glandular uptake in PET) were considered. Of note, in the computation of BSI by EXINI BoneBSI, the forearms and lower legs are excluded as well.
Semiautomatic Computation Method
For the computation of BPIVOL, SUVmean, and BPISUV, a precise segmentation of the skeleton in CT and of bone metastases in PET were necessary. These segmentations could have been done manually with appropriate software, but would have been time consuming. We proposed instead an automatic method with possible manual corrections that we have implemented using the programming language Python (9).
Our tool read images in DICOM format. PET and CT were affinely registered using the information contained in the DICOM headers. This was possible because both images were acquired on the same scanner during the same session. The bed was automatically removed from the CT by simple morphologic operations.
On CT, the skeleton can be segmented using its density in Hounsfield units, which is higher than that of soft tissue and air. Following the first 2 steps of the method of Kang et al. (10), we used global thresholding of the CT image followed by local adaptive thresholding to obtain a first bone mask. Then we applied a supplementary morphologic hole closing to obtain a segmentation of the bones and bone marrow. In the case of heavy calcification or in the case of artifact resulting, for example, from implants, manual corrections were possible: corrections could be applied either with a brush or by removing in 1 click the whole “bone” from a slice. Details on the computation method are given in the supplemental materials (available at http://jnm.snmjournals.org).
68Ga-PSMA usually does not exhibit unspecific uptake in the bones and bone marrow. Therefore, regions of the skeleton with increased uptake can be considered as bone metastases. Thus, the lesions were segmented using a SUV threshold on PET and restricting the result to the skeleton segmented from the CT image (Fig. 1). This use of anatomic information avoided manual removal of normal uptake sites (e.g., bladder, kidneys) as proposed by Etchebehere et al. (8). Even though CT and PET are acquired consecutively on the same scanner, for example, breathing can cause misalignment in the region of the ribs. Because the liver and spleen show high physiologic uptake of 68Ga-PSMA, projection of the ribs in CT on liver and spleen in PET can potentially generate false-positives. When our current tool was used, such false-positives had to be manually corrected.
The final segmentation depended on the SUV used to threshold the PET as described in the previous paragraph. Rather than choosing an arbitrary threshold, we therefore developed a method to choose the threshold based on the maximum tolerated false-positive rate using negative training patients. For each patient, the lowest SUV threshold resulting in the given false-positive rate was automatically computed. BPIVOL values obtained during this search are shown for 1 training patient in Figure 2. Using the maximum of the thresholds computed for all training patients ensured that the false-positive rate did not exceed the maximum for any training patient. The effects of choosing different maximum tolerated false-positive rates on BPIVOL and BPISUV are presented in the “Results” section.
Patients Cohort
Between January 2014 and February 2016, 45 patients with metastatic castration-resistant PC (mCRPC) with bone metastases but no organ or relevant lymph node (>3 cm) metastases underwent 223Ra-dichloride therapy (Xofigo; Bayer Healthcare) at our institution. The mean age of the patients was 71 (SD, ±8 y). All patients received therapy at a dose of 50 kBq/kg per therapy cycle in monthly intervals with up to 6 cycles. Fifteen patients who underwent a 68Ga-PSMA PET/CT at our institution and were regarded as negative for bone metastases by an experienced nuclear medicine physician were also retrospectively randomly selected to serve as a negative training cohort for the method.
All patients underwent 68Ga-PSMA PET/CT within 4 wk before initiation of 223Ra-dichloride therapy. Thirty-one patients also underwent bone scintigraphy. Thirty-two patients underwent additional 68Ga-PSMA PET/CT and 22 of them bone scintigraphy 3–6 mo after the first scan. Table 1 shows a summary of data available for all patients.
The institutional review board of the Technical University Munich approved the retrospective analysis (permit 5665/13), and all subjects signed a written informed consent form for anonymized evaluation and publication of their data.
Data Acquisition
68Ga-PSMA PET/CT was performed approximately 54 min (±8 min; range, 43–88 min) after injection of a mean of 133 MBq (379 MBq; range, 52–239 MBq) of 68Ga-labeled HBED-CC. A diagnostic CT scan was obtained in the portal venous phase after intravenous injection of contrast agent (Imeron 300; Bracco IMAGING Deutschland GmbH). Immediately after the CT, the PET scan was acquired with 6–8 bed positions (3–5 min per bed position). PET was reconstructed using ordered-subset expectation-maximization with point-spread function and time-of-flight information (3 iterations, 21 subsets) and corrected for normalization, attenuation, scatter, randoms, and decay. The transaxial pixel size was 4.07 mm for PET and 1.52 mm for CT, and the slice thickness was 5 mm for both. 99mTc-HDP whole-body bone scintigraphy was performed in planar imaging mode with an acquisition time of 1 min/10 cm of body height. Activity was body weight–adjusted (9 MBq/kg) and injected 3 h before imaging.
Data and Statistical Analysis
To validate our new tool and the introduced BPI, we performed a reproducibility analysis. Randomly selected datasets of 10 mCRPC patients before application of 223Ra-dichloride were analyzed by 2 trained observers applying manual corrections independently. The reproducibility threshold was then defined as the maximum absolute difference observed between both observers for each index. For BSI, Anand et al. defined the reproducibility threshold as 0.30 (11).
For response assessment, BPI was compared with BSI, PERCIST performed by an experienced reader, and PSA. BSI was computed from the bone scintigraphy images using the commercially available EXINI BoneBSI. Response by PERCIST was evaluated by an experienced physician using recently published criteria (6,7), and criteria were adapted for 68Ga-PSMA similarly to a recent work (12). In brief, SUVpeak was measured in 1–5 target lesions and the appearance of new lesions was investigated.
Because PERCIST is not quantitative, but only classifies the patient as having progressive metabolic disease, stable metabolic disease, or partial metabolic response, we also defined these categories for BPI and BSI using the respective reproducibility threshold: a change of magnitude smaller than the reproducibility threshold was considered as stable metabolic disease, an increase in value larger than the reproducibility threshold was considered as progressive disease, and a decrease in value larger than the reproducibility threshold was considered as partial metabolic response. Moreover, 2 separate analyses based on PERCIST were performed: metastatic status based on all types of target lesions (including potential new lymph node and visceral metastases, as prescribed by the criteria) and metastatic status based on bone involvement only (to allow for direct comparison with BPI).
For comparing quantitative methods (i.e., BPI, BSI, and PSA), we used the Pearson coefficient r. For all tests, a P value smaller than 0.05 was considered significant.
RESULTS
Technical Validation
Bone Segmentation and Manual Corrections
After manual correction, the mean bone volume of the 45 treated patients was 4,184 cm3 (±503 cm3; range, 3,327–5,739 cm3). For 32 patients with 2 sequential 68Ga-PSMA PET/CT scans, the mean difference in computed bone volume between 2 scans was 66 cm3 (±61 cm3), with a maximum of 270 cm3. After bones had been segmented and manually corrected, false-positives (e.g., in the rib cage) were corrected by an expert reader. On 54 scans from the patient cohort, an average of 3.8 cm3 false-positives per patient had to be manually corrected. This represents an average difference in BPIVOL of 0.0008 per patient.
Selection of SUV Threshold for Lesion Segmentation
The SUV threshold lesion segmentation was determined using 15 negative training patients. For each patient, we computed the threshold that resulted in a BPIVOL of 0.1 and 1 (equal to 0.1% and 1% of false-positive voxels), respectively. Corresponding SUV thresholds for all negative training patients were in the range of 1.15–1.95 (mean, 1.42) for a false-positive threshold of 1 and a range of 1.7–2.65 (mean, 2.06) for 0.1, respectively. Figure 2 shows the different BPIVOL values obtained for different thresholds for 1 negative training patient.
SUV Threshold Influence on Lesion Segmentation
On the basis of these results, thresholds of 1.5 (mean value obtained for the scenario of 1% false-positive results) and 3 (conservative approach ensuring a maximum of 0.1% false-positive lesions in all patients) were used for the initial analysis of the baseline 68Ga-PSMA PET/CT of all 45 patients. There was a strong correlation between the BPISUV values obtained with these 2 thresholds (r = 0.99, P < 0.001; Fig. 3). This also held for BPIVOL and SUVmean (r = 0.95 for both, P < 0.001). Because of the high correlation between both values, we chose a threshold of 3 to ensure a high specificity of the BPI, with less than 0.1 of BPIVOL being related to false-positives. All following results were computed using a SUV threshold of 3.
Reproducibility
Comparison from 2 independent observers using 10 randomly selected datasets showed a nearly perfect correlation (r = 0.999, P < 0.001 for both). The maximum observed percentage differences between both observers were 3.5% for BPISUV and 2.2% for BPIVOL. The maximum absolute difference was 0.055 for BPISUV and 0.37 for BPIVOL. Rounding up these values, we defined 0.06 and 0.4 as reproducibility thresholds for the respective index. Note that a wide range of disease was present in the analyzed patients (range, BPISUV: 0.09–3.39, and BPIVOL: 1.53–38.05).
Quantification Using BPIVOL, SUVmean, and BPISUV
BPIVOL, SUVmean, and BPISUV Before and After Therapy
The average values of BPIVOL, SUVmean, and BPISUV before therapy were 19.5, 8.3, and 1.59, respectively. After therapy, the average values were 26.0, 7.7, and 1.99. These values represented changes of +33%, −7%, and +25%, respectively.
Correlation of BPI to Clinical Parameters
At baseline, BPIVOL and BPISUV showed a moderate and significant correlation with BSI (r = 0.76 and 0.74, respectively, P < 0.001; Fig. 4C). There was a tendency to a stronger correlation with PSA value for BPIVOL and BPISUV (r = 0.57 and 0.54, respectively, P < 0.01) than for BSI (r = 0.49, P < 0.01).
A moderate correlation between change of BPIVOL and BPISUV and percentage change of PSA value after treatment was observed (r = 0.70, P < 0.01). There was no correlation of change in BSI with percentage change in PSA value (r = 0.24, P = 0.32).
When compared with PERCIST for all type of target lesions (Table 2), BPIVOL, BPISUV, and BSI showed agreement for 65.6% (21/32), 68.7% (22/32), and 57.9% (11/19) of patients and opposite results for 25.0% (8/32), 15.6% (5/32), and 21.1% (4/19), respectively. When compared with PERCIST for bone involvement only (Table 2), BPIVOL, BPISUV, and BSI showed agreement for 62.5% (20/32), 68.7% (22/32), and 63.2% (12/19) of patients and opposite results for 12.5% (4/32), 6.2% (2/32), and 10.5% (2/19), respectively. An exemplary case of a patient showing divergent results with considerable decrease of PSMA expression under therapy indicating response in BPIVOL and BPISUV but progressive disease due to new lesions in PERCIST is given in Figure 5. Notably, cases in which results from one method indicated stable disease but the other method indicated progression or response were not counted as opposite results.
DISCUSSION
In this study, we have described the BPIs as new quantitative multimodal imaging indices for the assessment of bone metastases in 68Ga-PSMA PET/CT using an automatic computation method. We have shown that, on our patient cohort, small differences in SUV thresholds and small misalignments between PET and CT did not substantially influence the results. Different observers also obtained similar values of the indices, showing a good interobserver reproducibility. A small amount of manual correction was still necessary, especially because of calcifications and endoprothesis. The computation of both indices with our software took only a few minutes, including manual corrections of skeleton segmentation and false-positives. In contrast, a complete manual segmentation of a whole-body dataset would take several hours even for an experienced physician, because the skeleton and not only the lesions have to be segmented.
From a clinical perspective, our preliminary data indicated that BPI holds potential for quantitative response assessment. This was documented by a high correlation of BPI with BSI and PSA in mCRPC patients, and reasonable prediction of tumor response compared with PERCIST. We are aware that these results allow only for a first estimation of the potential clinical usability due to the known limitations of theses comparators. Therefore, future prospective clinical studies using more objective endpoints (overall survival, radiographic progression-free survival, skeletal adverse events) are necessary to fully investigate the potential of the proposed quantitative biomarkers for response prediction.
Compared with the imaging biomarkers FTV10, representing the volume of fluoride bone metastases, and TLF10, representing the total fluoride metastatic uptake introduced by Etchebehere et al. (8), calculation of BPIVOL and BPISUV was an intrinsically multimodal approach. TLF10 and FTV10 exclusively rely on the information from PET, and no correction is possible with regard to the patient size. In contrast, for BPIVOL and BPISUV, the skeleton volume based on additional information from CT was included as well, thus allowing for interpatient comparison.
In a first step, we demonstrated that the bone volume using the newly introduced tool was reproducible between different scans of the same patient (mean difference, <2%). The small discrepancy can be explained by slightly different positions of the patient in the scanner. The absolute values obtained in our study (mean, 4,184 ± 503 cm3) were in the expected range of a reference human skeleton, with an estimated total volume (including bone marrow, averaged for both sexes) of 7,700 cm3 (13), of which an average of 50.3% was included in our segmentation (14).
In a second step, both imaging indices BPIVOL and BPISUV proved highly reproducible, with a maximum interobserver difference of 3.5%. From a practical point of view, the false-positive correction was done with only a few clicks and the full image segmentation with manual corrections took only a few minutes, which makes it usable in clinical practice. The limited influence of different SUV thresholds for computation underlined the robustness for clinical analyses. For the quantitative read-out of both indices, we finally chose a conservative approach (SUV cutoff, 3) to ensure that less than 0.1 is due to false-positives.
In principle, the 2 introduced indices BPIVOL and BPISUV were highly correlated. Nevertheless, it is important to mention that the information they provide is not completely equivalent, because BPISUV also took into account the level of expression of PSMA. The percentage changes of BPIVOL and BPISUV during therapy were highly correlated. Interestingly, whereas the average values of BPIVOL and BPISUV increased during therapy, the average value of SUVmean decreased, showing that BPIVOL, SUVmean, and BPISUV provided different information.
Notably, some clear outliers in the comparison between BPI and BSI (Fig. 4C) were observed. It has to be respected that bone scintigraphy and 68Ga-PSMA PET/CT image 2 different biologic processes: bone scintigraphy displays the reactive changes of the tumor on the skeleton (15), whereas 68Ga-PSMA PET directly shows the intensity of PSMA expression on viable tumor cells. Thus, no absolute equivalence of BSI and BPI could be expected. Other explaining factors were the higher sensitivity of 68Ga-PSMA PET/CT for bone metastases (16), the delay after which bone scintigraphy usually shows changes as well as the flare phenomenon (15,17).
Compared with standardized imaging response evaluation using PERCIST results for all types of target lesion, BPISUV showed a higher agreement than BSI and also BPIVOL. This was not unexpected because BPISUV considers also quantitative values, as does PERCIST. An even better correlation with a lower number of patients showing divergent results between PERCIST and both indices was achieved when they were compared with PERCIST results based only on bone involvement. This clearly implied the need of further adaptation of BPI as a quantitative PET imaging index also for the assessment soft-tissue tumor burden. A further argument for the potential clinical value of BPI compared with conventional methods was the statistically significant substantial correlation of BPI with percentage change of PSA value during therapy (r = 0.70, P < 0.01) compared with BSI, which showed no correlation (r = 0.24, P = 0.32).
Our study had several limitations. First, the study was based on retrospective data analysis. Second, after the baseline scan before 223Ra-dichloride follow-up scans for assessment of therapy response were either after 3 or after 6 cycles. Nevertheless, the aim of the study was primarily the introduction of a new quantitative imaging biomarker and only second to establish first preliminary correlation to clinical data. The influence of previous lines of treatment was not assessed, which could potentially affect signals derived from both 68Ga-PSMA PET and bone scanning (18,19). These confounding factors should be investigated in future studies encompassing larger patient cohorts. Another limitation is the fact that no respiratory gating was used, which potentially would minimize the need for manual correction.
CONCLUSION
We have introduced BPISUV and BPIVOL as new multimodal quantitative imaging indices for 68Ga-PSMA PET/CT, representing a robust tool for quantitative assessment of osseous tumor burden. We have shown that their automatic computation (with minimal manual corrections) is feasible and highly reproducible on a retrospective cohort of mCRPC patients. Finally, our results demonstrated that BPIVOL and BPISUV provide clinically meaningful information when correlated to PERCIST, BSI, and PSA value. However, the full clinical value, for example, predicting patient outcome, has to be investigated in future prospective studies.
DISCLOSURE
This work was partially funded by the German Ministry for Education and Research (Bundesministerium für Bildung und Forschung) under grant agreement no. 01IS12057, by the European Union Seventh Framework Program (FP7) under grant agreement no. 294582 ERC MUMI, and by the German Research Society (DFG) under grant agreement no. SFB 824. No other potential conflict of interest relevant to this article was reported.
Footnotes
- © 2017 by the Society of Nuclear Medicine and Molecular Imaging.
REFERENCES
- Received for publication January 3, 2017.
- Accepted for publication April 13, 2017.