Updating approach for lexicographic optimization-based planning to improve cervical cancer plan quality

Caricato, Paolo; Trivellato, Sara; Pellegrini, Roberto; Montanari, Gianluca; Daniotti, Martina Camilla; Bordigoni, Bianca; Faccenda, Valeria; Panizza, Denis; Meregalli, Sofia; Bonetto, Elisa; Voet, Peter; Arcangeli, Stefano; De Ponti, Elena

doi:10.1007/s12672-023-00800-5

Updating approach for lexicographic optimization-based planning to improve cervical cancer plan quality

Research
Open access
Published: 30 September 2023

Volume 14, article number 180, (2023)
Cite this article

Download PDF

You have full access to this open access article

Discover Oncology Aims and scope Submit manuscript

Updating approach for lexicographic optimization-based planning to improve cervical cancer plan quality

Download PDF

Paolo Caricato^1,2,
Sara Trivellato¹,
Roberto Pellegrini³,
Gianluca Montanari¹,
Martina Camilla Daniotti^1,2,
Bianca Bordigoni^1,4,
Valeria Faccenda^1,2,
Denis Panizza^1,5,
Sofia Meregalli^5,6,
Elisa Bonetto⁶,
Peter Voet⁷,
Stefano Arcangeli^5,6 &
…
Elena De Ponti^1,5

803 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Background

To investigate the capability of a not-yet commercially available fully automated lexicographic optimization (LO) planning algorithm, called mCycle (Elekta AB, Stockholm, Sweden), to further improve the plan quality of an already-validated Wish List (WL) pushing on the organs-at-risk (OAR) sparing without compromising target coverage and plan delivery accuracy.

Material and Methods

Twenty-four mono-institutional consecutive cervical cancer Volumetric-Modulated Arc Therapy (VMAT) plans delivered between November 2019 and April 2022 (50 Gy/25 fractions) have been retrospectively selected. In mCycle the LO planning algorithm was combined with the a-priori multi-criterial optimization (MCO). Two versions of WL have been defined to reproduce manual plans (WL01), and to improve the OAR sparing without affecting minimum target coverage and plan delivery accuracy (WL02). Robust WLs have been tuned using a subset of 4 randomly selected patients. The remaining plans have been automatically re-planned by using the designed WLs. Manual plans (MP) and mCycle plans (mCP01 and mCP02) were compared in terms of dose distributions, complexity, delivery accuracy, and clinical acceptability. Two senior physicians independently performed a blind clinical evaluation, ranking the three competing plans. Furthermore, a previous defined global quality index has been used to gather into a single score the plan quality evaluation.

Results

The WL tweaking requests 5 and 3 working days for the WL01 and the WL02, respectively. The re-planning took in both cases 3 working days. mCP01 best performed in terms of target coverage (PTV V_95% (%): MP 98.0 [95.6–99.3], mCP01 99.2 [89.7–99.9], mCP02 96.9 [89.4–99.5]), while mCP02 showed a large OAR sparing improvement, especially in the rectum parameters (e.g., Rectum D_50% (Gy): MP 41.7 [30.2–47.0], mCP01 40.3 [31.4–45.8], mCP02 32.6 [26.9–42.6]). An increase in plan complexity has been registered in mCPs without affecting plan delivery accuracy. In the blind comparisons, all automated plans were considered clinically acceptable, and mCPs were preferred over MP in 90% of cases. Globally, automated plans registered a plan quality score at least comparable to MP.

Conclusions

This study showed the flexibility of the Lexicographic approach in creating more demanding Wish Lists able to potentially minimize toxicities in RT plans.

Effectiveness of Multi-Criteria Optimization-based Trade-Off exploration in combination with RapidPlan for head & neck radiotherapy planning

Article Open access 23 November 2018

Planning comparison of five automated treatment planning solutions for locally advanced head and neck cancer

Article Open access 10 September 2018

Automated volumetric modulated arc therapy planning for whole pelvic prostate radiotherapy

Article Open access 21 December 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

With the advent of inverse planning and adaptive techniques in all the domains of the Radiotherapy (RT) world from Hadrontherapy to Conventional RT and Brachytherapy, the need to automate workflows to ensure speed and consistency has become more and more pressing. In this framework, the evolution towards automated contouring and automated planning techniques has been perceived as the resolution of these two classical bottlenecks, always requiring extensive manual activities by radiation oncologists and planners.

Nowadays, automatic planning tools are widely known and spread with their main characteristic to emulate the human planners’ interactions with the treatment planning system (TPS). Their operation mode is largely described in the literature. It is so widely known how automation can reduce planning time, and increase plan efficiency and consistency, potentially leading to improved patient outcomes [1,2,3,4,5,6,7]. The automated-planning capability to generate plans at least comparable with the manual ones has been extensively reported [2, 8,9,10,11,12,13,14,15]. A question naturally arises: “If automated strategies can easily achieve clinical performances can we go further by stressing these techniques? And if the answer is ‘yes’, how far can we go?”. Only a few studies focus on automated tools updating and upgrading, deeply investigating their performances. Two studies recently reported how updating a Knowledge-Based Planning (KBP) model would improve plan quality and consistency, although overfitting issues have to be carefully managed [16, 17], and how developing different KBP models on the same plan library optimized by different TPS could lead to different dosimetric and modulation complexity performances [18]. The not-yet commercially available fully-automated lexicographic optimization (LO) planning algorithm, called mCycle (Elekta AB, Stockholm, Sweden) has been recently validated in head and neck volumetric-modulated arc therapy (VMAT) treatment planning [5], conventional treatment of prostate cancer, prostate Stereotactic Body Radiation Therapy (SBRT), rectal cancer [19], prostate treatment on an MR-Linac [6]. While the Erasmus MC Cancer Institute of Rotterdam first introduced and implemented LO in the self-standing iCycle software [13], now mCycle is newly implemented into the Monaco TPS research version (v5.59.13). The LO optimization problem follows the hierarchical order of a treatment site specific list of requests, a so-called Wish-List (WL). The WLs are generated by imitating the plan discussion process between radiation oncologists and planners and are characterized by clinical and planning constraints (CC and PC, respectively), which cannot be violated, and a list of prioritized objective functions according to their importance degree [20, 21]. At our Institute, a recent validation in cervical cancer treatment has been concluded, demonstrating that automated plans were dosimetric comparable with manual plans, but outperformed manual ones at the blinded clinical scoring [21]. Now that mCycle has been validated in different anatomic sites, the same question on further planning performances arises. The aim of this study is to deeply explore mCycle capability to go further the manual plan quality, stressing the organs-at-risk (OARs) sparing while preserving a minimum acceptable target coverage and accuracy of the plan delivery. The following comparison of these plans has been based on dose distributions, complexity, delivery accuracy, and clinical acceptability.

2 Material and methods

2.1 Pathology

Cervical cancer is one of the most common cancers in females worldwide for both incidence and mortality [22]. According to the Worldwide Health Organization (WHO), about 340,000 females die of cervical cancer every year in the world, 90% of deaths occur in low- and middle-income countries, and 99% of cervical cancers are caused by infection with human papillomavirus (HPV) [23]. Cervical cancer patients represent more than 10% of the overall annual workload at our Department of Radiation Oncology.

2.2 Patient population

Twenty-four mono-institutional consecutive cervical cancer patients treated between November 2019 and April 2022 have been retrospectively selected. In order to be as generalizable as possible, 9 out of 24 patients had undergone surgery and the other 15 patients had not, thus challenging the mCycle algorithm’s robustness to manage very different anatomies. The criterion of inclusion was a prescription dose of 50 Gy in 25 fractions, representing the most frequent Institute’s cervical cancer protocol. On the other hand, the presence of mono- or bi-lateral femoral prosthesis was considered an exclusion criterion due to a non-standard planning setup chosen for each specific case. All patients underwent a CT simulation with a 3 mm slice thickness in the supine position. A specific OARs preparation requiring an empty rectum and a filled bladder was carried out before the simulation and each treatment fraction to ensure internal anatomy as reproducible as possible [24]. Two experienced radiation oncologists contoured the original structure sets that have been used for planning and analysis purposes. The structure sets included targets, involving cervix, uterus (if present), proximal vagina, pelvic nodes, and OARs, i.e., rectum, bladder, bowel bag (outer contour of bowel loops including the mesenterium, upper limit linked to the target extension, sigmoid as lower limit), and femoral heads [24]. The planning target volume (PTV) was defined as a 7-mm isotropic expansion of the clinical target volume (CTV) as prescribed by the institutional protocol. The selected DICOM sets were deeply anonymized by RSNA-CTP DicomAnonymizer (MIRC project, RSNA) prior to conducting the research. No ethical committee approval was needed for this retrospective dosimetric planning study.

2.3 Manual treatment planning

Clinical manual VMAT plans (MP) were optimized according to Institutional protocol dose tolerances PTV V_95% > 97%, acceptable > 95%, D_1% < 107%; rectum D_50% < 44.7 Gy; bladder D_50% < 57.3 Gy; small bowel V_45Gy < 195 cm³; femoral heads D_5% < 44.7 Gy [25,26,27,28]. All plans were optimized with Monaco TPS (version 5.51.10) using a 6 MV-coplanar dual 330°-arc (165–195°) with up to 150 control points (CP), and sequencing parameters such as 1 cm-minimum segment width (SW), and highly smoothed fluence. The parameters of the Monte Carlo calculation were a 3 mm-dose grid and 1%-statistical uncertainty per plan. Patients were treated using an Elekta VersaHD linear accelerator equipped with the Agility Multileaf Collimator (MLC, 160 leaves, 5 mm thickness, up to 6.5 cm/sec), with the Monitor Unit (MU) calibration of 1 MU = 1 cGy with the reference field at the reference depth. The clinical objectives were accounted for in the a-priori MCO of the clinical Monaco^™ TPS, as comprehensively treated described in Trivellato et al. [21]. Final normalization of dose distribution to achieve minimum PTV coverage or to satisfy small bowel constraints has been allowed. Whenever it was not possible to respect the above constraints for PTV or at least one OAR, minor or major deviations were discussed with and accepted by the approving clinician.

2.4 mCycle auto-planning

Unlike the previous iCycle, mCycle is now implemented into the Monaco TPS research version (v5.59.13) and it applies the LO approach to the typical Monaco cost functions and Monte Carlo Algorithm (XVMC). Moreover, it is based on a completely new code including a new mathematical solver and a new patient model [19]. Furthermore, a new Segment optimization has been made available, the Pseudo-Gradient Descent Segment Shape Optimizer (PGDSSO). It is a new method of refining a set of MLC segments for a plan using a search method analogous to gradient descent. At each loop, segments are chosen starting from the desired maximum number of segments among all the possible segments and then gradually reduced by 10% each loop down to 50%, where the algorithm then stays throughout the rest of the optimization.

The mCycle fluence optimization (FMO) uses a two-pass automated lexicographic MCO in which constraints and prioritized objectives are managed by the planner through the WL. The WL tuning process is a multi-step iterative method described by Hussein et al. [2], while the description of the two-passed fluence LO was thoroughly discussed by Trivellato et al. [21].

The previous WL tweaking has been performed aiming to reproduce the manual plans (WL01). In this study, a second WL was generated to investigate the possibility to improve the plan quality in terms of organs-at-risk (OARs) sparing without affecting plan delivery accuracy. The WLs tuning has been done on the same subset of CTs and structure sets to get a robust hierarchical list of requests giving clinically acceptable dose distributions limiting any manual intervention as much as possible.

The designed WLs have been exploited to automatically re-plan the remaining selected treatment plans, using the same treatment arc with up to 150 CP, and the same sequencing parameters of the manual plans, highly smoothed fluence, 3 mm-dose grid, and 0.3%-statistical uncertainty per CP in the Monte Carlo calculation. No further WLs changes were allowed in this test phase. To satisfy the clinical objectives, the manual interventions on mCycle plans (mCP01 and mCP02, respectively) were limited to a re-optimization with a 0.75 cm-minimum segment width or a final re-normalization of the dose distribution in order to reach the minimum PTV coverage of V_47.5 Gy > 95% or to comply with the small bowel constraint V_45Gy < 195 cm³. These interventions were allowed to ensure comparability with other plans, similar to our manual planning workflow. Any other extensive manual tweaking has been avoided to prevent introducing any bias in the plan comparison.

2.5 Plan comparison

MP, mCP01, and mCP02 were recalculated with a statistical uncertainty of 0.5% per plan to provide an unbiased comparison. Manual and automatic plans were compared by assessing differences in PTV V_100%, V_95%, and D_1%. The dose distributions were compared in terms of the conformality index (CI_95% and CI_50%), defined by the ratio between the total volume covered by the specified dose (95% and 50% of the prescription dose) and the volume of the PTV, and the homogeneity index (HI), represented by the formula HI = (D_2%–D_98%)/D_p, where D_p is the prescription dose. The OAR mean doses, the rectum and bladder D_50%, and the femoral heads D_5% have been also reported. The plan quality score introduced by Trivellato et al. [21] was used in the comparison.

2.6 Plan complexity and delivery accuracy

Manual and automated planning modalities have also been analyzed in terms of plan complexity through the total number of MUs, the number of segments, and the modulation complexity score (MCS), as defined by McNiven [29]. All plans have been recalculated on the CT scan of the Delta⁴⁺ phantom (ScandiDos, Uppsala, Sweden) using a 2-mm grid and a 0.5%-statistical uncertainty. All plans were delivered at the linac VersaHD to test the plan delivery accuracy and to assess the agreement between calculated and measured dose distributions by performing a 3D-gamma analysis (ɣ). Automatic and manual plans were consecutively delivered on the phantom on the same day to avoid daily delivery variations. The local ɣ has been performed with Scandidos software (version 1.00.0180). The gamma passing rate was evaluated with a local 3%/3 mm criteria [PR(3%/3 mm)] excluding any pixel registering a dose lower than 8% of the maximum dose (threshold), according to the institutional clinical routine. A ɣ-passing criterion of 90% was used, as clinically applied [30].

2.7 Blind physician scoring

To clinically evaluate the mCycle plans, two experienced radiation oncologists (ROs) have been asked to perform an independent blind plan evaluation. The request was to rank the three competing plans in order of acceptability as 1st, 2nd, and 3rd according to the institutional guidelines, i.e., based on dose distribution, dose-volume histograms, and clinical objectives. It’s worth noticing that the plans were randomly anonymized and no information about the planning method was provided. Cohen’s kappa coefficient has been calculated to assess the agreement between the two raters, providing valuable insights into the degree of concordance. Cohen’s kappa score was defined as excellent (k > 0.81), good (0.61 < k < 0.80), moderate (0.41 < k < 0.60), fair (0.21 < k < 40), and poor (k < 0.20) [31]. Moreover, the RO ranking has been evaluated in terms of the ranking agreement, which provides information about how many times the two raters ranked a plan in the same position, and the total agreement, as the sum of each ranking agreement.

2.8 Statistical analysis

The normality test of Shapiro–Wilk has been performed to establish whether to perform the parametric t-test or the non-parametric Wilcoxon rank-sum test. The Bonferroni correction for multiple tests has been applied and the selected significance level has been set at 5% (p = 0.05). According to whether a sample is parametric or non-parametric, Bartlett’s or Levene’s test has been carried out to check if the samples belong to populations with equal variances [4]. The analysis of Bland–Altman (B-A) plots was used to compare two measurements of the same variables and to identify any systematic differences, outliers, and particular disagreement patterns [33]. Furthermore, the box-and-whisker plots were used to display in a single chart how data of different populations are distributed. All the statistical tests have been performed using Rstudio (2021.09.0), while the B-A plots have been performed using Python 3-Release (Python 3.9.7).

3 Results

3.1 Wish-Lists tweaking

The WL01 and WL02 preparation and fine tuning required about 5 and 3 working days, respectively. The WL01 was the starting point of WL02 tuning: WL01 has been iteratively modified to reach the WL02 goals of getting OARs sparing as high as possible, accepting a slightly lower target coverage without compromising the plan delivery accuracy. The two detailed WLs are presented in Tables 1 and 2. In both cases, the fulfilment of the bowel bag constraint is indicated as CC because its violation implies plan rejection most of the time. It is followed by dose gradient requests (PC). The main differences between the two WLs regard the PTV coverage and the OAR mean doses requests priority order. In the WL02, a less strict PTV coverage request is performed as a 1^st-priority request, and a last-priority level request was added to achieve PTV coverage as high as possible, while the bladder and rectum mean doses claims are swapped with the right- and left-femoral heads ones. In both WLs, if there was a double PTV with the same dose prescription (PTV uterus and PTV pelvis) the requests were doubled and kept both as first-priority objectives function.

Table 1 mCycle Wish-List 01 for auto-planning of cervical cancer at 50 Gy in 25 fractions

Full size table

Table 2 mCycle Wish-List 02 for auto-planning of cervical cancer at 50 Gy in 25 fractions

Full size table

3.2 mCycle auto-planning

The automatic re-planning for the remaining 20 patients (test set) took 3 working days for each WLs. The obtained mCP01 and mCP02 required manual fine-tuning in 30% and 35% of plans, respectively. The plan re-normalization was required for 6 (30%) and 7 (35%) plans, respectively, while a re-optimization with 0.75 cm of minimum-SW was performed for 2 (10%) mCP01 and 1 (5%) mCP02.

3.3 Dosimetric comparison

The median PTV volume was 1073.7 cm³ [608.4–1453.9 cm³].

Target dose results are summarized in Table 3. Although not statistically significant once the Bonferroni correction was applied, mCP01 showed a higher target coverage than MP. This coverage increase resulted statistically significant with respect to mCP02. This analysis demonstrated significant growth in the PTV D_1%, never exceeding the protocol constraint. As it may be noticed in the related box-and-whisker and Bland Altman plots reported in Figs. 1 and 2 the data variability is less for AP than MP and the median value is generally higher for AP. Unlike the comparable conformality results, mCP02 is significantly more inhomogeneous than MP and mCP01.

Table 3 Comparison of original manual plans (MP) and mCycle plans (mCP01 and mCP02) in terms of PTV dose metrics

Full size table

OAR results are reported in Table 4 and Figs. 1 and 2. While OAR sparing in mCP01 is comparable to MP, OAR metrics showed a slight if not large decrease in mCP02, with a statistical and clinical relevance in median values of the rectum D_50% and D_mean. In B-A plots, it is worth noticing the lower position of mCP02 bias lines, meaning a mCP02 overall trend to overperform MP and mCP01. The variance test showed a statically relevant difference for the rectum D_mean and right femoral head D_mean parameters. In Fig. 1, it is worth noticing an extremely narrow boxplot for bowel V_45Gy coupled with a cluster of points in the B-A plot just below the constraint (Fig. 2).

Table 4 Comparison of original manual plans (MP) and mCycle plans (mCP01 and mCP02) in terms of OARs dose metrics

Full size table

The dose distributions and the relative DVHs for a representative patient are graphically reported in Fig. 3 illustrating the best performances of mCP01 regarding the PTV coverage and a large reduction in rectum and bladder doses in mCP02. Furthermore, it is worth noticing that mCP01 presented a slightly worse bladder DVH, as well as a larger extension of low doses with respect to MP and mCP02.

The results of the plan quality index (PQI) and its sub-metrics are reported in the Additional file 1. It is worth observing the PQI trend of mCP01 and mCP02 in comparison to the gold standard MP: the former shows a slight improvement in the overall plan quality, while the latter demonstrates comparable results.

3.4 Plan complexity and delivery accuracy

Plan complexity and delivery accuracy results are reported in Table 5 and in Figs. 4 and 5. All metrics reported an increase in the complexity of the automated plans without affecting the accuracy of the plan delivery. MCS results showed an increased complexity passing from MP to mCP01 and from mCP01 to mCP02. This is coupled with an increase in the number of MU in mCP01 (6.9% [−7.93 ± 27.2]%, p = 1.000) and in mCP02 (22.5% [−0.74 ± 61.5]%, p = 0.005). This trend is clearly highlighted by the related bias line in Fig. 5. The lower number of segments in automatic plans was obtained thanks to the novel pseudo-gradient descent segment shape optimizer (PGDSSO). Comparing mCP02 to mCP01, a statistically significant increase is registered in the number of segments testifying the stronger request for plan modulation. As reported in Fig. 5 and Table 5 all the plan delivery accuracy metrics registered similar results, although PR (3%/3 mm) revealed a downward trend as the passing rate means an increase in BA plots. At the variance test, it is worth noticing that both mCP01 and mCP02 showed a statistically significant difference compared to MP due to higher minimum values of gamma passing rates.

Table 5 Comparison of original manual plans (MP) and mCycle plans (mCP01 and mCP02) in terms of plan complexity and plan delivery accuracy

Full size table

3.5 Blind physician scoring results

All MP and mCPs were considered clinically acceptable. However, it has been highlighted that two MP presented a large deviation from the protocol criteria due to an overcoming of the bowel V_45Gy constraint strictly due to unfavorable anatomies. Despite these isolated cases, the remaining 58 plans satisfied the Institute protocol, although in a few cases minor deviations were accepted in PTV coverage, bowel, and femoral heads constraints.

It is crucial to consider the decision-making process of the ROs (Table 6). The two clinicians ranked mCP02 as the best strategy in 80% and 70% of cases, respectively, demonstrating its consistent performance. On the other hand, mCP01 was the preferred choice in 15% of cases. MPs were considered the best plan in merely 5% and 15% of cases, respectively, suggesting they were less favored by the ROs. The physicians’ total agreement was 63.3%, with a Cohen's kappa statistic of 0.45, indicating a moderate agreement among the raters.

Table 6 Plan ranking by two experienced radiation oncologists (RO1/RO2) of original manual plans (MP) and mCycle plans (mCP01 and mCP02)

Full size table

4 Discussions

To our knowledge, this is the first study proving how it is possible to make a step further in the mCycle automatic planning for cervical cancer treatment. The results presented here confirmed how fast the automatic re-planning can be: for both WLs, the fast automatic re-planning took slightly more than one hour per plan to achieve 20 clinically acceptable and deliverable plans, supporting the idea that mCycle application in the clinical routine would strongly reduce planners’ workload on cervical treatment planning confirming what was proved for Erasmus-iCycle tool [34,35,36]. Furthermore, mCycle mostly created an optimal plan in an almost “one-button click” procedure without any planner intervention for manual tuning. A plan re-optimization with manual refinements was required in 10% and 5% of the cases for mCP01 and mCP02, respectively. This decrease in manual intervention can be seen as a further improvement of the WL leading to a further reduction of manual workload.

This study shows how auto-planning can generate at least comparable plans to manual-planning with higher efficiency and less inter-planner variability. It is worth noticing that these results did not affect what was already obtained in mCP01, especially looking at the sparing of the bowel proving how robust the LO is. Further studies are needed to evaluate the therapeutic ratio of mCP02 with respect to MP and mCP01, to do so these dosimetric results should be used to assess tumor control probability (TCP) and normal tissue complication probability (NTCP) for a different clinical endpoint.

The plan complexity analysis revealed the significantly higher complexity of mCycle plans compared to manual ones. Although the new PGDSSO led to a lower number of segments, the results showed the required MUs increase in the automatic plans. The WL02 pressing requests on OAR sparing led to a further complexity increase in the mCP02. Nevertheless, the preserved gamma passing ratio testified that the increased complexity did not affect the plan delivery accuracy, guaranteeing a treatment at least as safe and precise as the manual ones. This outcome seems to be common in several automatic planning systems. Bijman et al. demonstrated a slight increase of needed MUs for the mCycle system with mean differences between 11 and 19% linked to the anatomical site under consideration [19]. In the study by Heijmen et al., a median increase of 13% in the requested MUs obtained with the iCycle system was related to a larger reduction in rectum parameters [3]. Also Pinnacle Autoplanning and Genetic Planning Solution (Raystation TPS) showed statistically significant growth of the MUs per plan in all the explored anatomic sites without a lower passing rate in the pre–treatment verifications [4, 37]. On the other hand, Yang et al. demonstrated that RayStation TPS, coupled with the IronPython language platform, obtained a comparable number of MUs between automatic and manual plans for nasopharyngeal carcinoma, with at least comparable plan quality [38].

The blind choice performed by two experienced ROs revealed that a large decrease in OAR doses with a guaranteed minimal acceptable target coverage (mCP02) was mostly preferred to the higher target coverage of the opposing MP and mCP01. It is worth noticing that the blind choice resulted in a ‘moderate agreement’ in the final ranking which has been interpreted as mainly due to the selected 3-degrees scale of preferences permitting a full spectrum of plan discussion and acceptance levels. In particular, two interesting outliers (Figs. 1 and 2) have been comprehensively discussed because of a large decrease of the bowel V45Gy in the automated plans coupled with a strongly reduced PTV coverage. ROs finally and independently claimed that, given the possibility to choose between these treatments, they would have confirmed their choice in the clinical routine.

On the other hand, the PQI analysis showed that the two automated strategies are at least comparable to manual planning, with mCP01 slightly outperforming mCP02. It is worth noticing that ROs and PQI plan scoring disagreed in the mCP01 and mCP02 ranks. A possible explanation can be found in the relation between the PQI definition and mCP02 excellent results. mCycle capability to strongly reduce OARs parameters could change what ROs can expect. The PQI definition based on MP daily routine needs to be updated in light of mCycle capabilities, changing the sub-metrics weights to better fit the clinical evaluation. It has been demonstrated that automated strategies can be stressed to go further than the well-known manual planning routine. Furthermore, the possibility to generate different WLs allows to promptly answer clinicians’ requests: it would be possible to choose, patient by patient, the preferred compromise between DVH and plan complexity. Furthermore, these fast and customizable results suggest exploiting automated planning systems in a fast adaptive workflow soon, as demonstrated by Castriconi et al. [39] who reported that a well-defined KBP model could reduce planning time and inter-planner variability.

Only a few other studies faced the same issue in KBP planning. Hundvin et al. reported modest but significant improvements in both plan quality and consistency for high-risk prostate cancers performing a KBP model tuning [17], while Nakamura et al. showed that the last update of their model could make a better estimation of the DVH in the open-loop validation plans [16].

This study demonstrated how far an automated tool could lead the radiotherapy routine but it is worth emphasizing that the WLs development and evaluation is a challenging iterative process, strongly dependent on many factors as the institutional protocol on which it is based, the user know-how [35], and the lack of human and time resources to deeper investigate the tool full potential.

Future studies will focus on LO capabilities to adapt the here presented WLs to a multiple dose levels scenario, doubling and differentiating the PTVs requests and coherently adapting the OAR objective functions [5, 6]. Furthermore, a prospective analysis on a larger patient cohort is suggested. Indeed, Fogliata et al. highlighted those systematic investigations are needed to test the performance and robustness of the automated tools [9], and Wortel et al. pointed out the importance of periodically checking the quality and the acceptance rates of automatic plans after their clinical introduction [40]. Finally, to assess the generalization of the WLs, a multi-centric validation would be suggested.

5 Conclusion

This comprehensive dosimetric and clinical study demonstrated that mCycle generates plans at least comparable and often superior to accepted manual plans in the selected patients’ cohort, outperforming manual plans at the blinded clinical ranking. The WL02 tuning showed the possibility of going further than manual planning quality in cervical cancer treatment. By considering the workload, dosimetric, and clinical advantages, mCycle proved to be an effective and flexible tool to generate automatic high-quality VMAT treatment plans according to the cervical treatment institutional protocol and its results are suggestive of a reliable methodology application to the clinical routine as soon as it will become commercially available.

Data availability

Research data are stored in an institutional repository and will be shared upon request to the corresponding author.

Abbreviations

LO:: Lexicographic optimization
OAR:: Organ-at-risk
VMAT:: Volumetric-modulated arc therapy
MCO:: Multicriterial optimization
WL:: Wish list
MP:: Manual plans
mCP:: MCycle plans
PTV:: Planning target volume
RT:: Radiotherapy
TPS:: Treatment planning system
KBP:: Knowledge-based planning
SBRT:: Stereotactic body radiation therapy
CC:: Clinical constraint
PC:: Planning constraint
HPV:: Human papillomavirus
CTV:: Clinical target volume
CP:: Control points
SW:: Segment width
MLC:: Multileaf collimator
MUs:: Monitor units
FMO:: Fluence matrix optimization
CI:: Conformality index
MCS:: Modulation complexity score
ROs:: Radiation oncologists
PQI:: Plan quality index
B-A:: Bland–Altman
PGDSSO:: Pseudo-gradient descent segment shape optimizer
TCP:: Tumor control probability
NTCP:: Normal tissue complication probability

References

Hansen CR, Hussein M, Bernchou U, Zukauskaite R, Thwaites D. Plan quality in radiotherapy treatment planning - review of the factors and challenges. J Med Imaging Radiat Oncol. 2022;66(2):267–78. https://doi.org/10.1111/1754-9485.13374.
Article PubMed Google Scholar
Hussein M, Heijmen BJM, Verellen D, Nisbet A. Automation in intensity modulated radiotherapy treatment planning - a review of recent innovation. Br J Radiol. 2018;91:20180270. https://doi.org/10.1259/bjr.20180270.
Article PubMed PubMed Central Google Scholar
Heijmen B, Voet P, Fransen D, Penninkhof J, Milder M, Akhiat H, Bonomo P, Casati M, Georg D, Goldner G, Henry A, Lilley J, Lohr F, Marrazzo L, Pallotta S, Pellegrini R, Seppenwoolde Y, Simontacchi G, Steil V, Stieler F, Wilson S, Breedveld S. Fully automated, multi-criterial planning for volumetric modulated arc therapy- an international multi-center validation for prostate cancer. Radiother Oncol. 2018;128:343–8.
Article PubMed Google Scholar
Cilla S, Ianiro A, Romano C, Deodato F, Macchia G, Buwenge M, Dinapoli N, Boldrini L, Morganti AG, Valentini V. Template-based automation of treatment planning in advanced radiotherapy: a comprehensive dosimetric and clinical evaluation. Sci Rep. 2020. https://doi.org/10.1038/s41598-019-56966-y.
Article PubMed PubMed Central Google Scholar
Biston MC, Costea M, Gassa F, Serre AA, Voet P, Larson R, et al. Evaluation of fully automated a priori MCO treatment planning in VMAT for head-and-neck cancer. Phys Med. 2021;87:31–8. https://doi.org/10.1016/j.ejmp.2021.05.037.
Article PubMed Google Scholar
Naccarato S, Rigo M, Pellegrini R, Voet P, Akhiat H, Gurrera D, De Simone A, Sicignano G, Mazzola R, Figlia V, Ricchetti F, Nicosia L, Giaj-Levra N, Cuccia F, Stavreva N, Pressyanov DS, Stavrev P, Alongi F, Ruggieri R. Automated planning for prostate stereotactic body radiation therapy on the 15 T MR-Linac. Adv Radiat Oncol. 2022;7(3):100865. https://doi.org/10.1016/j.adro.2021.100865.
Article PubMed PubMed Central Google Scholar
Yusufaly TI, Meyers SM, Mell LK, Moore KL. Knowledge-based planning for intact cervical cancer. Semin Radiat Oncol. 2020;30(4):328–39. https://doi.org/10.1016/j.semradonc.2020.05.009.
Article PubMed Google Scholar
Momin S, Fu Y, Lei Y, Roper J, Bradley JD, Curran WJ, Liu T, Yang X. Knowledge-based radiation treatment planning: a datadriven method survey. J Appl Clin Med Phys. 2021;22(8):16–44. https://doi.org/10.1002/acm2.13337.
Article PubMed PubMed Central Google Scholar
Fogliata A, Belosi F, Clivio A, Navarria P, Nicolini G, Scorsetti M, Vanetti E, Cozzi L. On the pre-clinical validation of a commercial model-based optimisation engine: application to volumetric arc therapy for patients with lung or prostate cancer. Radiother Oncol. 2014;113:385–91.
Article PubMed Google Scholar
Lian J, Yuan L, Ge Y, Chera BS, Yoo DP, Chang S, Yin FF, Wu JQ. Modeling the dosimetry of organ-at-risk in head and neck IMRT planning: an inter-technique and inter-institutional study. Med Phys. 2013;40:1217041–9.
Article Google Scholar
Tol JP, Dahele M, Peltola J, Nord J, Slotman BJ, Verbakel WFAR. Automatic interactive optimization for volumetric modulated arc therapy planning. Radiat Oncol. 2015. https://doi.org/10.1186/s13014-015-0388-6.
Article PubMed PubMed Central Google Scholar
Marrazzo L, Meattini I, Arilli C, Calusi S, Casati M, Talamonti C, Livi L, Pallotta S. Auto-planning for VMAT accelerated partial breast irradiation. Radiother Oncol. 2019;132:85–92.
Article PubMed Google Scholar
Breedveld S, Storchi PRM, Voet PWJ, Heijmen BJM. iCycle: Integrated, multicriterial beam angle, and profile optimization for generation of coplanar and noncoplanar IMRT plans. Med Phys. 2012;39(2):915–63. https://doi.org/10.1118/1.3676689.
Article Google Scholar
Craft D, Bortfeld TR. How many plans are needed in an IMRT multi-objective plan database? Phys Med Biol. 2008;53:2785–96.
Article PubMed Google Scholar
Monz M, Bortfeld TR, Kufer KH, Thieke C. Pareto navigation-algorithmic foundtion of interactive multi-criteria IMRT planning. Phys Med Biol. 2008;53:985–98.
Article CAS PubMed Google Scholar
Nakamura K, Okuhata K, Tamura M, Otsuka M, Kubo K, Ueda Y, Nakamura Y, Nakamatsu K, Tanooka M, Monzen H, Nishimura Y. An updating approach for knowledge-based planning models to improve plan quality and variability in volumetric-modulated arc therapy for prostate cancer. J Appl Clin Med Phys. 2021;22(9):113–22.
Article PubMed PubMed Central Google Scholar
Hundvin JA, Fjellanger K, Pettersen HES, Nygaard B, Revheim K, Sulen TH, Ekanger C, Hysing LB. Clinical iterative model development improves knowledge-bassed plan quality for high-risk prostate cancer with four integrated dose levels. Acta Oncol. 2021;60(2):237–44.
Article CAS PubMed Google Scholar
Ueda Y, Miyazaki M, Sumida I, Ohira S, Tamura M, Monzen H, Tsuru H, Inui S, Isono M, Ogawa K, Teshima T. Knowledge-based planning for oesophageal cancers using a model trained with plans from a different treatment planning system. Acta Oncol. 2020;59(3):274–83.
Article PubMed Google Scholar
Bijman R, Sharfo AW, Rossi L, Breedveld S, Heijmen BJM. Pre-clinical validation of a novel system for fully-automated treatment planning. Radiother Oncol. 2021;158:253–61.
Article PubMed Google Scholar
Jee KW, McShan DL, Fraass BA. Lexicographic ordering: intuitive multicriteria optimization for IMRT. Phys Med Biol. 2007;52(7):1845–61. https://doi.org/10.1088/0031-9155/52/7/006.
Article PubMed Google Scholar
Trivellato S, Caricato P, Pellegrini R, Montanari G, Daniotti MC, Bordigoni B, Faccenda V, Panizza D, Meregalli S, Bonetto E, Arcangeli S, De Ponti E. Comprehensive dosimetric and clinical evaluation of lexicographic optimization-based planning for cervical cancer. Front Oncol. 2022. https://doi.org/10.2289/fonc.2022.1041839.
Article PubMed PubMed Central Google Scholar
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global Cancer Statistics 2018: GLOBOSCAN estimates of incidence and mortality worldwide for 36 cancers in 185 Countries. CA Cancer J Clin. 2018;68:394–424.
Article PubMed Google Scholar
World Health Organization, Cervical cancer Elimination Initiative, 2020. https://www.who.int/initiatives/cervical-cancer-elimination-initiative/. Accessed Aug 2020.
Potter R, Tanderup K, Kirisits C, de Leeuw A, Kurchheiner K, Nout R, Tan LT, Haie-Meder C, Mahantshetty U, Segedin B, Hoskin P, Bruheim K, Rai B, Huang F, Van Limbergen E, Schmid M, Nesvacil N, Sturdza A, Fokdal L, Jensen NBK, Georg D, Assenholt M, Seppenwoolde Y, Nomden C, Fortin I, Chopra S, van der Heide U, Rumpold T, Lindegaard JC, Jürgenliemk-Schulz I. The EMBRACE II study: The outcome and prospect of two decades of evolution within the GEC-ESTRO GYN working group and the EMBRACE studies. Clin Transl Radiat Oncol. 2018;9:48–60.
PubMed PubMed Central Google Scholar
ICRU, ICRU Report 62, Prescribing, Recording and Reporting Photon Beam Therapy (Supplement to ICRU 50), International Commission on Radiation Units and Measurements, Bethesda, Md, 1999.
Buckey CR, Swanson GP, Stathakis S, Papanikolaou N. Optimizing prostate intensity-modulated radiation therapy (IMRT): Do stricter constraints produce better dosimetric results? European J Clin Med Oncol. 2010;2(2):139–44.
Google Scholar
Roeske JC, Bonta D, Mell LK, Lujan AE, Mundt AJ. A dosimetric analysis of acute gastrointestinal toxicity in women receiving intensity-modulated whole-pelvic radiation therapy. Radiother Oncol. 2003;69(2):201–7. https://doi.org/10.1016/j.radonc.2003.05.001.
Article PubMed Google Scholar
Lawton CA, Michalski J, El-Naqa I, Buyyounouski MK, Lee WR, Menard C, O’Meara E, Rosenthal SA, Ritter M, Seider M. RTOG GU Radiation oncology specialists reach consensus on pelvic lymph node volumes for high-risk prostate cancer. Int J Radiat Oncol Biol Phys. 2009;74(2):383–7. https://doi.org/10.1016/j.ijrobp.2008.08.002.
Article PubMed Google Scholar
McNiven AL, Sharpe MB, Purdie TG. A new metric for assessing IMRT modulation complexity and plan deliverability. Med Phys. 2010;37(2):505–15. https://doi.org/10.1118/1.3276775.
Article PubMed Google Scholar
Venselaar J, Welleweerd H, Mijnheer B. Tolerances for the accuracy of photon beam dose calculations of treatment planning systems. Radiother Oncol. 2001;60(2):191–201. https://doi.org/10.1016/s0167-8140(01)00377-2.
Article CAS PubMed Google Scholar
Landis JR, Koch GG. The measurment of observer agreement fo categorical data. Biometrics. 1977;33:159–74.
Article CAS PubMed Google Scholar
Nelms BE, Robinson G, Markham J, Velasco K, Boyd S, Narayan S, Wheeler J, Sobczak ML. Variation in external beam treatment plan quality: an inter-institutional study of planners and planning systems. Pract Radiat Oncol. 2012;2(4):296–305. https://doi.org/10.1016/j.prro.2011.11.012.
Article PubMed Google Scholar
Franco F, Di Napoli A. Valutazione della concordanza tra misurazioni di carattere di tipo quantitativo: il metodo di Bland- Altman. G Tec Nefrol Dial. 2017;29:56–61.
Google Scholar
Voet PWJ, Dirkx MLP, Breedveld S, Fransen D, Levendag PC, Heijmen BJM. Toward fully automated multicriterial plan generation: a prospective clinical study. Int J Radiat Oncol Biol Phys. 2013;85:866–72.
Article PubMed Google Scholar
Sharfo AWM, Breedveld S, Voet PWJ, Heijkoop ST, Mens JWM, Hoogeman MS, Heijmen BJM. Validation of fully-automated VMAT plan generation for library-based plan-of-the-day cervical cancer radiotherapy. PLoS ONE. 2016. https://doi.org/10.1371/journal.pone.0169202.
Article PubMed PubMed Central Google Scholar
Voet PWJ, Dirkx MLP, Breedveld S, Al-Mamgani A, Incrocci L, Heijmen BJM. Fully automated volumetric modulated arc therapy plan generation for prostate cancer patients. Int J Radiat Oncol Biol Phys. 2014;88:1175–9.
Article PubMed Google Scholar
Fiandra C, Rossi L, Alparone A, Zara S, Vecchi C, Sardo A, Bartoncini S, Loi G, Pisani C, Gino E, Redda MGR, Deotto GM, Tini P, Comi S, Zerini D, Ametrano G, Borzillo V, Strigari L, Strolin S, Savini A, Romeo A, Reccanello S, Rumeileh IA, Ciscognetti N, Guerrisi F, Balestra G, Ricardi U, Heijmen B. Automatic genetic planning for volumteric modulated arc therapy: A large multi-centre validation for prostate cancer. Radiother Oncol. 2020;148:126–32.
Article CAS PubMed Google Scholar
Yang Y, Shao K, Zhang J, Chen M, Chen Y, Shan G. Automatic planning for nasopharyngeal carcinoma based on progressive optimization in raystation treatment planning system. Technol Cancer Res Treat. 2020. https://doi.org/10.1177/1533033820915710.
Article PubMed PubMed Central Google Scholar
Castriconi R, Fiorino C, Passoni P, Broggi S, Di Muzio NG, Cattaneo GM, Calandrino R. Knowledge-based automatic optimization of adaptive early-regression-guided VMAT for rectal cancer. Physica Med. 2020;70:58–64.
Article Google Scholar
Wortel G, Eekhout D, Lamers E, van der Bel R, Kiers K, Wiersma T, Janssen T, Damen E. Characterization of automatic treatment planning approaches in radiotherapy. Phys Imag Radiat Oncol. 2021;19:60–5.
Article Google Scholar

Download references

Acknowledgements

San Gerardo Hospital (Monza Italy) Radiation Oncology and Medical Physics Departments.

Funding

Not applicable.

Author information

Authors and Affiliations

Medical Physics Department, Fondazione IRCCS San Gerardo Dei Tintori, Monza, Italy
Paolo Caricato, Sara Trivellato, Gianluca Montanari, Martina Camilla Daniotti, Bianca Bordigoni, Valeria Faccenda, Denis Panizza & Elena De Ponti
Department of Physics, University of Milan, Milan, Italy
Paolo Caricato, Martina Camilla Daniotti & Valeria Faccenda
Medical Affairs, Elekta AB, Stockholm, Sweden
Roberto Pellegrini
Department of Physics, University of Milano Bicocca, Milan, Italy
Bianca Bordigoni
School of Medicine and Surgery, University of Milan Bicocca, Milan, Italy
Denis Panizza, Sofia Meregalli, Stefano Arcangeli & Elena De Ponti
Department of Radiation Oncology, Fondazione IRCCS San Gerardo Dei Tintori, Monza, Italy
Sofia Meregalli, Elisa Bonetto & Stefano Arcangeli
Research Clinical Liaison, Elekta AB, Stockholm, Sweden
Peter Voet

Authors

Paolo Caricato
View author publications
You can also search for this author in PubMed Google Scholar
Sara Trivellato
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Pellegrini
View author publications
You can also search for this author in PubMed Google Scholar
Gianluca Montanari
View author publications
You can also search for this author in PubMed Google Scholar
Martina Camilla Daniotti
View author publications
You can also search for this author in PubMed Google Scholar
Bianca Bordigoni
View author publications
You can also search for this author in PubMed Google Scholar
Valeria Faccenda
View author publications
You can also search for this author in PubMed Google Scholar
Denis Panizza
View author publications
You can also search for this author in PubMed Google Scholar
Sofia Meregalli
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Bonetto
View author publications
You can also search for this author in PubMed Google Scholar
Peter Voet
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Arcangeli
View author publications
You can also search for this author in PubMed Google Scholar
Elena De Ponti
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

PC was the lead author, who participated in study design, data collection, data analysis, manuscript drafting, table/figure creation, and manuscript revision. ST contributed to study design, participated in data collection and statistical analysis, manuscript drafting, table/figure creation, and manuscript revision. RP participated in data collection, data analysis, and manuscript revision. GM participated in data collection and analysis, and manuscript revision. MD participated in table/figure creation, and manuscript revision. BB, VF and DP aided in data analysis and manuscript revision. SM and EB participated in the plan blind choice and manuscript revision. PV contributed to technical inputs and manuscript revision. SA is a senior author who aided in data analysis and manuscript revision. EP is a senior author who aided in the study design, contributed to statistical analysis, and revised the manuscript. All authors contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Paolo Caricato.

Ethics declarations

Ethics approval and consent to participate

No ethical committee approval was needed for this retrospective dosimetric planning study (ASST Monza Committee).

Consent for publication

Not applicable.

Competing interests

Dr. Pellegrini R serves as Senior Scientist in Medical Affairs at Elekta AB and Dr. Peter Voet serves as Senior Researcher. All other authors have no disclosures to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S7.

Comparison of original manual plans (MP) and mCycle plans (mCP01 and mCP02) in terms of Plan Quality Index (PQI) and its submetrics: coverage, OAR sparing, plan delivery accuracy, and plan complexity. Median values and ranges are reported. Figure S6. Box-and-whisker plots (left) and related Bland Altman plots (right) for PQI for manual plans (MP) and mCycle plans (mCP01 and mCP02). In Bland Altman plots orange circles and blue triangles represent "MP vs mCP01" and “MP vs mCP02” comparison, respectively. Dashed lines: bias line, solid lines: agreement limits lines. Abbreviations: PQI: Plan Quality Index.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Caricato, P., Trivellato, S., Pellegrini, R. et al. Updating approach for lexicographic optimization-based planning to improve cervical cancer plan quality. Discov Onc 14, 180 (2023). https://doi.org/10.1007/s12672-023-00800-5

Download citation

Received: 20 May 2023
Accepted: 25 September 2023
Published: 30 September 2023
DOI: https://doi.org/10.1007/s12672-023-00800-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Updating approach for lexicographic optimization-based planning to improve cervical cancer plan quality

Abstract

Background

Material and Methods

Results

Conclusions

Similar content being viewed by others

Effectiveness of Multi-Criteria Optimization-based Trade-Off exploration in combination with RapidPlan for head & neck radiotherapy planning

Planning comparison of five automated treatment planning solutions for locally advanced head and neck cancer

Automated volumetric modulated arc therapy planning for whole pelvic prostate radiotherapy

1 Introduction

2 Material and methods

2.1 Pathology

2.2 Patient population

2.3 Manual treatment planning

2.4 mCycle auto-planning

2.5 Plan comparison

2.6 Plan complexity and delivery accuracy

2.7 Blind physician scoring

2.8 Statistical analysis

3 Results

3.1 Wish-Lists tweaking

3.2 mCycle auto-planning

3.3 Dosimetric comparison

3.4 Plan complexity and delivery accuracy

3.5 Blind physician scoring results

4 Discussions

5 Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Table S7.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation