Recent advances in the clinical applications of machine learning in proton therapy
==================================================================================

* Vanessa L. Wildman
* Jacob F. Wynne
* Aparna H. Kesarwala
* Xiaofeng Yang

## ABSTRACT

The present systematic review is an effort to explore the different clinical applications and current implementations of machine/deep learning in proton therapy. It will assist as a reference for scientists, researchers, and other health professionals who are working in the field of proton radiation therapy and need up-to-date knowledge regarding recent technological advances. This review utilized Pubmed and Embase to search for and identify research studies of interest published between 2019 and 2024. This systematic literature review utilized PubMed and Embase to search for and identify studies pertinent to machine learning in proton therapy. The time period of 2019 to 2024 was chosen to capture the most recent signficant advances. An initial search on PubMed was made with the search strategy “‘proton therapy’, ‘machine learning’, ‘deep learning’”, with filters including only research articles from 2019 to 2024, returning 84 results. Next, “(“proton therapy”) AND (“machine learning” OR “deep learning”)” was searched on Embase, retrieving 546 results. When filtered between 2019 to 2024 and to only research articles, 250 results were retrieved on Embase. Reviews, editorials, technical notes, and articles in any language other than English were excluded from the broad search on both databases. Filtering by title, papers were chosen based on two inclusion factors: explicit application to, or mention of, proton therapy, and inclusion of a machine learning algorithm. Assessing by abstract, works irrelevant to specific aspects of the proton therapy workflow in the scope of the review were excluded. Upon assessing and evaluating full texts for quality, studies were excluded that lacked a clear explanation of model architecture. If multiple studies of the same architecture applied to the same workflow step were identified, chronologically only the most recent advancement in application was included. An additional 5 studies that met all inclusion criteria were identified from references of chosen papers. In total, 38 relevant studies have been summarized and incorporated into this review. This is the first systematic review to comprehensively cover all current and potential areas of application of machine learning to the proton therapy clinical workflow.

## Introduction

The usage of protons in radiation was first proposed in 1946 by Wilson, a Harvard particle physicist [1]. The first treatments on human patients began in 1954 at the Berkeley Radiation Laboratory [2], and the first therapy center was opened in Boston in 1961. In 1988, the FDA approved proton therapy for clinical use. Currently, the National Association for Proton Therapy (NAPT) reports 45 active proton therapy centers located across the United States, and a recent clinical investigation found that between 2012 and 2021, the total annual number of patients treated with proton therapy increased from 5,377 to 15,829 [3]. Demand for proton therapy continues to rise, due to its increased precision and reduction of integral dose.

The physics behind proton therapy originates from the unique energetic nature of the proton. As beams travel through tissues, protons undergo inelastic and elastic Coulombic interactions. Kinetic energy is continuously lost through frequent inelastic interactions with atomic electrons. However, most protons continue to travel in a nearly straight line through the medium as their rest mass is significantly larger than that of an electron and only the electron is ejected. A proton passing close to the nucleus experiences a repulsive elastic interaction which, owing to the large mass of the nucleus, deflects the proton from its originally straight trajectory [4]. Such interactions are less frequent, but must be considered in treatment planning as they result in a change of particle trajectory. As energy loss of the proton beam determines range in the patient, an understanding of these interactions in the clinical context is essential. The Bragg peak is the fundamental phenomena supporting the efficacy of proton therapy treatment. The Bragg curve plots the energy loss of ionizing radiation, such as photons or protons, during its travel through matter. For protons, the peak occurs immediately prior to the particles coming to rest. This is the depth where the proton will deposit most of its dose [4]. With precise planning, this unique property has the potential to spare skin and other healthy tissue from radiation exposure both proximal and distal to the tumor target.

Conventional photon therapy and proton therapy each possess unique advantages and disadvantages. Because they are uncharged particles, photons primarily interact with matter through indirect processes such as Compton scattering and the photoelectric effect. These interactions result in a relatively uniform deposition of energy along their path through tissue, which leads to broader scatter to surrounding tissue and skin in the setting of conventional radiotherapy. Photon therapy is associated with resultant side effects, such as radiation dermatitis, due to increased skin exposure [5]. As a result of the Bragg peak, proton radiation is delivered with greater precision to the tumor site, sparing healthy surrounding tissue and skin relative to conventional photon radiotherapy. Specific techniques such as pencil beam scanning and intensity modulated proton therapy have been developed to further optimize proton therapy treatment delivery. Proton therapy is currently preferred in pediatric cases and in scenarios where tumor location is adjacent to sensitive organs at risk (OARs), such as in cancers of the head and neck and central nervous system. It is also useful in geometrically complex treatments with challenging dosimetry, such as when treating synchronous bilateral breast cancers.

Numerous machine learning (ML) applications are currently employed in conventional radiotherapy and may easily be adapted to proton therapy [6]. At present, models are employed primarily in segmentation, dose calculation and risk management. ML methods to improve clinical decision-making are needed. Adapting existing ML protocols for traditional photon treatment to proton applications may be expected to improve these processes, enhancing physician efficiency and quality in treatment planning and delivery.

ML, as a subfield of artificial intelligence (AI), encompasses models which learn from datasets to generate decisions without being explicitly coded for a repetitive, standard task. ML models may be trained using supervised on unsupervised techniques. Supervised learning provides the model labeled and correctly paired training and test datasets, where the correct output is known for each input [7]. These methods are typically applied to classification tasks where the goal is to map outputs from a given input, such as verifying Bragg peak range or optimizing treatment plans with Knowledge-Based Planning (KBP). In contrast, unsupervised learning is commonly applied to complex tasks where labeled datasets are not readily available, such as imaging segmentation or contouring. Deep learning (DL) is a subset of ML which utilizes neural networks. Inspired by information processing in the brain, neural networks pass raw input data through layers of nodes, each comprised of simple mathematical operations, to create an output. With additional (deeper) layers, more complex functions, such as image recognition, are possible [8]. Convolutional neural networks (CNNs) are specialized DL models equipped to process grid-like data. This is beneficial in medical imaging, as CNNs can extract features from the pixels or voxels of patient imaging data across several modalities. The layers of a CNN, known as convolutional layers, apply kernels to the input image to extract specific features such as contours or patterns across progressively smaller spatial scales [9].

A common form of DL that has been applied in the planning stage of radiotherapy treatment delivery is the Generative Adversarial Network (GAN). This type of network was originally created in 2017 by Goodfellow *et al*. and is a novel approach to unpaired image-to-image translation superior to traditional DL models when generating synthetic images [10]. The GAN architecture consists of two connected neural networks, a generator which creates synthetic data, and a discriminator which evaluates the resemblance to the real data. These two neural networks are trained simultaneously, using an adversarial framework that ultimately guides the generator to improve performance with each iteration, producing increasingly realistic data that challenges the discriminator. Training is complete when the min-max loss function is optimized. GANs are particularly useful in complex image translation tasks, such as converting images from one modality to another (e.g. CT to MRI). U-Nets, CNNs with a U-shaped architecture, are particularly suited to tasks in image segmentation, due to their ability to extract and retain image details across several spatial scales [11]. Such a network is able to be trained on few images, and performs segmentation-related tasks very quickly.

## Methods

This review utilized Pubmed and Embase to search for and identify research studies of interest published between 2019 and 2024. This systematic literature review utilized PubMed and Embase to search for and identify studies pertinent to machine learning in proton therapy. The time period of 2019 to 2024 was chosen to capture the most recent signficant advances. An initial search on PubMed was made with the search strategy “‘proton therapy’, ‘machine learning’, ‘deep learning’”, with filters including only research articles from 2019 to 2024, returning 84 results. Next, “(“proton therapy”) AND (“machine learning” OR “deep learning”)” was searched on Embase, retrieving 546 results. When filtered between 2019 to 2024 and to only research articles, 250 results were retrieved on Embase. Reviews, editorials, technical notes, and articles in any language other than English were excluded from the broad search on both databases. Filtering by title, papers were chosen based on two inclusion factors: explicit application to, or mention of, proton therapy, and inclusion of a machine learning algorithm. Assessing by abstract, works irrelevant to specific aspects of the proton therapy workflow in the scope of the review were excluded. Upon assessing and evaluating full texts for quality, studies were excluded that lacked a clear explanation of model architecture. If multiple studies of the same architecture applied to the same workflow step were identified, chronologically only the most recent advancement in application was included. An additional 5 studies that met all inclusion criteria were identified from references of chosen papers. In total, 38 relevant studies have been summarized and incorporated into this review.

## Results

### 1. ML for Patient Selection and Predictive Outcome Modeling in Proton Therapy

ML models are being developed to improve the efficiency of pre-screening processes and determining which modality of radiotherapy will result in the least adverse effects. Pre-selection enables patient informatics to be compared to an established criteria protocol that assesses whether or not an individual is eligible for, or will respond better to, proton therapy versus photon radiotherapy. This automated method enables more patients to be pre-screened in a shorter amount of time as well as eliminating any physician bias in the selection process. It is also possible to utilize a predictive model to choose proton or photon therapy based on post-radiation toxicity prediction as a marker of preferability.

#### 1.1 Pre-screening for Therapy Selection

Kouwenberg *et al*. [12] developed a pre-selection algorithm specifically designed for intensity modulated proton therapy (IMPT) for head and neck cancer patients. The group found that the proposed algorithm significantly reduces time and labor costs by utilizing ML and automated planning. The algorithm created a fully-automated IMPT plan for patients with a prior model-based selection (MBS) approach for photon treatment. A Gaussian naive Bayes classifier for MBS outcome prediction was trained on the dosimetric differences between the IMPT auto-generated plan and the previous photon plan, as well as on the outcomes of the MBS approach. The training process was curated to strongly avoid generating false negatives of IMPT eligible patients. The variables for training compared the differences in xerostomia, dysphagia, and feeding tube dependency for the photon therapy and IMPT plan, and whether or not the MBS approach deems the individual eligible. After training, a three-dimensional decision boundary was generated based on the xerostomia, dysphagia, and feeding tube dependency measurements. The boundary separates pre-selected patients eligible for proton therapy from those that were not pre-selected by the model, and is given by the conditional probability generated from the Gaussian classifier that outlines whether the patient eligible for proton therapy was larger than the decision threshold. Such efficient models provide valuable insight as to whether an individual patient would be selected for proton therapy prior to any other further treatment planning, conserving both time and money.

An alternative model, titled AI-PROTIPP (Artificial Intelligence Predictive Radiation Oncology Treatment Indication to Photons/Protons), features a U-Net architecture and is able to quantitatively assess which treatment method is superior depending on patient factor inputs [13]. The model is able to predict dose distributions for both photon and proton therapy, and use Normal Tissue Complication Probability (NTCP) to determine which therapy would result in the least side effects. This model was employed in clinical cases of patients with oropharyngeal cancer, and resulted in 87.4% accuracy of treatment selection based on the parameters given. AI-PROTIPP effectively uses NTCP data to predict dose distributions, select treatment plans, and reduce the time needed for manual comparisons. Similar to AI-PROTIPP, Chen *et al.* [14] generated two 3D U-Nets to predict photon and proton doses for patients with localized prostate therapy. They selected NTCP models of grade 2 or higher to determine proton therapy partiality. The deep-learning based dose prediction technique returned with 90%-93.5% selection accuracy based on the NTCP model parameters. These studies demonstrate that DL methods can accurately predict dosage and distinguish NTCP differences between photon and proton plans to select the superior modality for the patient [12–14]. Clinically, ML applied to this area has the potential to greatly decrease time spent on plan comparison down to as little as 5 seconds, expediting the start of patient treatment.

Geng *et al.* conducted a multicenter study using a KBP ML model to evaluate and improve radiotherapy plan quality in a clinical trial for non-small-cell lung cancer (NSCLC) [15]. The trial aimed to compare the effectiveness of IMPT versus traditional IMRT, with a focus on reducing toxicity to OARs. The KBP model was utilized to assess the quality of both IMPT and IMRT plans across the various centers, showing that while proton therapy often demonstrated dosimetric advantages, it also had greater variability in plan quality compared to photon therapy. This study highlights the KBP model’s utility in QA, and its potential for enhancing radiotherapy plans in multicenter clinical trials. The research group also plans to publish both photon and proton KBP models for broader use in radiotherapy plan optimization and QA, one of the first for proton therapy.

#### 1.2 Toxicity Prediction and Strategy

Padannayil *et al*. [16] applied a ML algorithm to improve toxicity prediction and to specifically reduce radiation dermatitis for IMPT. Radiation dermatitis is a common side effect of all radiotherapy modalities characterized by skin inflammation and redness. It has been reported that up to 67.4% of nasopharyngeal carcinoma patients experience Grade 2 (medium) radiation dermatitis [17], therefore, finding ways to mitigate its severity is of high importance. The group developed an unsupervised k-means clustering ML algorithm that was trained on previous head and neck cancer patients treated with IMPT. The algorithm created clusters based on dermatitis severity (low, medium, or high grade) and treatment parameters to locate patterns found between the two. By optimizing the distance between clusters, the algorithm was able to group patients according to how likely they are to develop varying grades of radiation dermatitis depending on their treatment parameters. By implementing this model which optimizes where the proton beam is placed, the algorithm is able to lower the skin dose of radiation while maintaining comparable treatment outcomes to traditional photon therapy.

### 2. Simulation & Treatment Planning

The clinical workflow of radiotherapy relies heavily on manual inputs and standardized protocols for simulation and treatment planning, critical components that guarantee safe and effective treatment delivery. While current methodologies are effective, they may be time-consuming, require substantial processing power, and fall susceptible to patient-specific variability. With the incorporation of ML, many of these areas may be automated and optimized to tailor each nuance of treatment planning to the individual patient case.

#### 2.1 Auto-planning and Treatment Plan Optimization

A 3-dimensional cycle generative adversarial network (CycleGAN) was trained on CT and MRI data pairs from a training cohort composed of patients with base-of-skull tumors [18]. The clinical study consisted of 50 patients, split into two non-overlapping cohorts, one for training and one for study purposes. Post-training, synthetic CT (sCT) scans were generated by the algorithm and compared against the initial CT scans where the mean error ranged from 38.65 HU to 65.12 HU. Proton plans with 2 beams each were generated based on the initial CT scans as well as the sCT images, and were compared based on dose-volume histogram endpoints and proton distal range along the beams. Quantitative results showed that the algorithm-generated sCT was in agreement with the ground truth CT; the dosimetric evaluation of the dose-volume histogram endpoints did not have statistically significant differences, and 96% of the proton ranges were clinically accepted. Shafai-Erfani *et al.* showcased an algorithm that is capable of generating clinically-acceptable sCT images and highlighted the potential for MRI-based proton treatment planning, which was subsequently built upon by Zimmerman *et al*. [19].

A DL algorithm capable of generating sCT images independent of MRI sequence data was created in 2022 [19]. 47 meningioma patients treated with pencil beam scanning (PBS) therapy were split into training, validation and test cohorts. MRI sequences in combination with planning CT (pCT) data were used to train a 3D U-Net framework with ResNet-Blocks, a type of CNN block in which the output from one layer is added into a deeper layer. The training outcome was assessed by metric, dosimetric, and spot difference map accuracy in comparison to the original treatment plans. The synthetic dose parameters of the proposed model agreed within 1% of the real-world original plans, and 98% of the spots on the spot difference maps had less than 1 cm difference from the original plan. The novel MRI sequence independent sCT generator created by Zimmerman *et al.* suggests that “the training phase of neural networks can be disengaged from specific MRI acquisition protocols” [19].

Incorporating KBP pipelines improved with ML into commercial treatment planning may decrease inter-plan variability. KBP in the modern day best refers to data-driven approaches in treatment planning, incorporating clinicians’ knowledge into precise algorithms. Post-processing and dose mimicking are critical steps to create clinically acceptable ML-planned IMPT plans. Borderias-Villarroel *et al.* [20] investigated the quality of such plans utilizing four different KBP pipelines. The four models were generated from two sets of conditions; post-processing or no post-processing, isodose-based or commercial RayStation dose mimicking. Using data from 60 oropharyngeal cancer patients, the study trained 11 dose prediction models and results produced comparable target coverage and improved OAR preservation compared to manual plans. However, post-processing appeared to compromise the robustness of the plan. The combination of no post-processing and RayStation dose mimicking produced the IMPT plan with the best robustness with sufficient OAR sparing.

#### 2.2 Target Delineation and Segmentation

Contouring OARs is a laborious, manually-intensive task in the clinical workflow. Studies have shown that contours created by AI are more consistent than those completed manually by oncologists with the same level of NTCP [21]. Applying related technology to the clinical workflow of proton therapy is therefore preferable to ensure patients receive the highest quality treatment plan. Segmentation of the target and OARs utilizing ML has not been explored for proton therapy treatment specifically. At the current time, all published ML algorithms related to contouring in treatment planning pertain to conventional radiation methodologies. Nielsen et al. is the only group to have evaluated a CNN and local AI to contour OARs specifically for future proton therapy [21]. Specific ML models have been developed for adaptive proton therapy purposes, but the planning stage remains underdeveloped.

Van der Veen *et al.* explored how DL models can be implemented in the identification and delineation of OARs in head and neck cancer radiotherapy treatment [22]. While this study does not directly address the applications to proton therapy but rather radiotherapy broadly, the technology has great potential for practical application. Precise delineation of tumors in the head and neck is crucial during treatment planning to ensure that the radiation is sufficiently covering and concentrated on the tumor while sparing surrounding healthy tissue. OAR delineation in the majority of cases is auto-contoured with physician confirmation, but some difficult areas may be delineated manually. The study assesses how a CNN, a type of DL model, can automatically delineate OARs from patient CT or MRI scans used in treatment planning. Using a CNN can significantly decrease the time it takes to delineate the tumor from OAR while simultaneously improving consistency when trained on a large dataset. The authors discuss potential limitations as well: the availability of vast training datasets is still expanding, and the necessity of carefully validating the algorithm’s accuracy remains for successful implementation.

#### 2.3 Dose Prediction, Deposition & Verification

PBS dose calculation often may be inaccurate due to approximations used to work around heterogeneities in tissue, but the result is provided quickly. The doses are calculated using the water equivalent path length along the center of the proton beam, and the medium is treated as infinite and homogeneous. However, tissue is rarely homogenous in clinical reality, leading to range uncertainties and inaccurate modeling of multiple Coulomb scattering. Despite being the current gold standard and extremely accurate, Monte Carlo (MC) dose calculation suffers from slow processing due to tracking each individual particle. To mitigate the tradeoff between accuracy and time when utilizing PBS dose calculation or MC simulations, incorporating ML algorithms may improve the shortcomings of either method.

To boost the accuracy of PBS dose planning while maintaining the short time scale, Wu *et al.* created a DL model that converts a PBS dose to a MC dose using the initial PBS data and the CT images [23]. The model was trained on data from four tumor sites; head and neck, liver, prostate, and lung. The group found that, “Training the model on data from all tumor sites together and using the dose distribution of each individual beam as input yielded the best performance for all four tumor sites” [23]. The average gamma passing rate, a QA measure, was >88% between the converted PBS dose and the MC dose, and the conversion time was less than 4 seconds. Such results demonstrate high accuracy and rapid speed, combining the strengths of both planning techniques. The group reports that the model is able to be applied to new datasets through transfer learning, and may efficiently be added to the clinical workflow of proton therapy treatment planning.

Proton Source Model Commissioning (PSMC) is a critical component of the planning workflow which aims to optimize the match between calculated dose and delivered dose in PBS therapy. Presently, PSMC ensures accurate dose calculation via MC simulations. Nominal energy refers to the average or expected energy value of the protons in the simulation, while the energy spread is the range of energies around the nominal energy. Setting up, or calibrating, the nominal energy and energy spread parameters in the PSMC is difficult as these parameters are not able to be easily solved from an equation. To facilitate these calculations, a CNN known as “PSMC-Net” was developed [24]. PSMC-Net was trained on a range of 33 clinical-level energies, from 75-225 MeV. For each of the 33 energies, a dataset was generated with 15 nominal energies, 10 spreads, and the corresponding 150-calculated depth doses (IDDs). 130 of the data pairs were used for training, 10 for validation, and 10 for testing. Results showed that the gamma pass rate between the MC and measured IDDs was 99% when PSMC-Net was implemented. Without PSMC-Net, the gamma pass rate reduced to ∼54%. This difference in plan quality showcases the significant improvement in accuracy and efficiency of the PSMC process when commissioning a CNN-based proton source model.

Linear energy transfer (LET) refers to the rate at which energy is transferred per unit length. X-rays in photon therapy would be classified as low LET and cause smaller amounts of DNA damage, compared to particles with higher LET, such as alpha particles that subsequently cause greater damage. Protons are advantageous in radiation therapy due to their characteristic Bragg peak, which creates a mixed radiation field of both low LET components further from the tumor and high LET components at the tumor site. MC simulations are utilized as the gold standard to predict yield and LET at a certain depth of the proton treatment plan to assess biological effectiveness. These simulations are difficult to validate experimentally. Gao *et al.* implemented a method to verify LET via an entirely synthetic computational approach for proton stereotactic body radiation therapy (SBRT) [25]. To date, this is the first time a KBP methodology has been adopted in the context of LET distributions. Utilizing the dose distribution map, the DL framework predicts the LET distribution map of the protons to better estimate the relative biological effectiveness (RBE). The framework utilized data from 50 prostate cancer patients receiving proton SBRT, and featured a 5-fold cross-validation method, dividing the patient dataset into five subsets. In each iteration, one subset was used for testing while the remaining four were used for training, ensuring each patient’s data was tested exactly once. The proposed model consists of two sub-networks; two U-net based generators that create a synthetic LET map image from the dose map image, and two discriminations which minimize judgement error. The supervised CycleGAN model demonstrated superior performance to other GAN-based models with a mean absolute error (MAE) of 0.096 ± 0.019 keV *μ*m−1. This model may significantly improve the efficacy of proton therapy by improving treatment plan quality through RBE optimization, aided by the provision of highly accurate LET information.

#### 2.4 Range Prediction, Calculation & Verification

An alternative method to estimate proton beam range is the use optical camera systems equipped with scintillators. Scintillators are materials that absorb ionizing radiation, or the energy from charged particles, and convert it into short bursts of visible photons or UV light. Such devices are excellent candidates for dosimetric or range QA measures in radiation therapy. As passive detectors, scintillators provide a non-invasive, safe option for clinical usage. Additionally, the devices are able to provide real-time feedback with high spatial resolution provided by the sensitive ability to capture minute light emissions. They are cost-effective, adaptable to different radiation energies, and generally compatible with optical imaging systems.

Utilizing ML to hybridize dosimetric and range predictions, Ranjith *et al.* developed artificial neural networks (ANN) able to predict six variables related to dosimetric parameters: proton beam spot size in the x-axis, y-axis, major axis, minor axis, and relative x and y-axis positional errors [26]. All of the ANN models utilized a multi-layer perception (MLP) network with one input layer, three hidden layers, and an output layer. Trained on data from 9000 proton spots, all predicted spot size location and relative positional errors were compared with scintillator-measured data as reference. The ANN models resulted in lower prediction errors compared to the scintillators, and demonstrated excellent beam spot size predications as well as positional errors.

Lee *et al.*’s study appears to be one of the first combining scintillators with DL as a QA measure for both range and dose [27]. The group remarks the difficulty previous methods have had with analytically deriving dose distribution from scintillation images manually due to quenching and optical effects. Their study introduces a a 2D residual U-net DL approach to predict beam range and spread-out Bragg peak (SOBP) using a scintillation light distribution (LD) captured in a water phantom. These predictions were reached via 2D map conversion of the LD maps into dose maps. The training and validation set was comprised of 8659 image pairs generated via MC simulations with varying proton beam conditions, and the LD was “reconstructed using photons backpropagated from the aperture as a visual lens” [27]. The model showed high accuracy with beam range and SOBP width resolutions of 0.02 mm and 0.19 mm, respectively, and deviations of less than 0.1 mm and 0.8 mm from reference simulations. Results demonstrated good agreement in gamma analysis, validating the feasibility and accuracy of this DL-based QA method in clinical practice.

#### 2.5 HU-to-SPR Conversion (CT/DECT)

Derived from the nature of proton interactions, stopping power is an essential aspect of treatment planning which measures how quickly protons lose kinetic energy as they travel through a medium. Linear stopping power is proportional to the density of electrons in the absorbing material due to energy loss resulting from Coulombic interactions between the proton and atomic electrons. Linear stopping power is the property that determines where the Bragg peak occurs. The stopping power ratio (SPR) is the ratio of the linear stopping power of the tissue to the stopping power of water [4]. This is an essential parameter in determining how protons will interact with different bodily tissues, such as air, bone, fat, or muscle. To calculate dose distribution, Hounsfield Units (HU) from CT images are converted into SPR values in order to estimate the linear stopping power of tissues, ensuring precise proton dose delivery in the patient. Conventionally, HU-to-SPR conversion is achieved with a calibration curve relating the two units based on known relationships. ML is increasingly being implemented in this clinical step to increase conversion accuracy, reduce systematic errors, and improve time efficiency.

Wang *et al.* present a noise-robust learning-based method to predict RSP maps from DECT images for proton therapy [28]. DECT images may be utilized to derive the RSP maps by obtaining the energy dependence of proton interactions, however the maps are easily compromised due to noise levels and artifacts from physics-based mapping methods. Utilizing ML to reduce the noise when predicting the RSP maps shows great clinical potential. The model developed utilized a residual attention cycle-consistent generative adversarial network (ccGAN), similar to a CycleGAN, to “bring DECT-to-RSP mapping close to a 1-to-1 mapping by introducing an inverse RSP-to-DECT mapping” [28]. In simulation, 20 head-and-neck cancer patients with DECT images were evaluated. A leave-one-out cross-validation strategy evaluated ground truth RSP values which served as learning targets against results from the proposed model. The predicted RSP maps from the model resulted in a mean square error of 2.83% and a MAE less than 3%. Even with additional simulated noise in the DECT datasets, the model maintained a comparably accurate performance, while the traditional physics-based method faced decreased accuracy due to the noise. Regarding DVH metrics, clinical target volumes were less than 0.2 Gy with no statistical significance, with OAR metrics around 1 Gy. The results highlight the excellent accuracy of the predicted RSP maps generated by the ML model and the potential for improving treatment planning and dose calculation.

Translating CT numbers into material properties contributes to uncertainty in dose calculations. A physics-constrained DL-based multimodal imaging (PDMI) framework was proposed to bridge the gap between proton range uncertainty and material density of different tissue types [29]. This model integrated physics, DL, MRI, and DECT to generate accurate mass density maps, and built upon previous work by the same group investigating a similar physics-informed DL framework [30]. The prior network successfully generated accurate mass density and relative SPR maps from DECT images from anthropomorphic adult male, female, and child phantoms with unspecified tissue models. Building upon this work, the new study modeled adipose, brain, muscle, liver, skin, spongiosa, and hydroxyapatite bone in phantoms, and MRI images were taken of each. From the PDMI framework, an empirical model and various residual networks were generated. Parameters included training with and without a physics constraint, and the utilization of MRI and DECT images or solely DECT images. Supervised learning further enhanced the ResNet model trained with a physics constraint compared to the ResNest trial without. Additionally, the ResNet utilizing both MRI and DECT images opposed to solely DECT predicted the densities of each tissue most closely to established literature. As of currently, this novel framework is the first approach to inform DL models with both physics insights and MRI data to derive accurate mass density maps.

A present concern when considering radiotherapy plans for children is to avoid extraneous radiation, including substitution of MR for CT imaging whenever possible. A group affiliated with St. Jude Children’s Research Hospital worked to create synthetic relative proton stopping power (sRPSP) images from MRI sequences utilizing a particular GAN model [31]. Utilizing pCT and MR images from 195 pediatric brain tumor patients, 17 ccGAN models were trained on paired CT-converted RPSP and MRI datasets. T1-weighted, T2-weighted, and FLAIR MRI were tested in 17 total combinations, with or without preprocessing, to cover all optimal training sequences for the models. The intended purpose of the model was to transform the patient MRIs into sRPSP images by learning from the paired CT dataset. To evaluate the performance of the ccGAN and potential for clinical use, the group developed an online QA tool to ensure the safe integration of MR-only proton planning into practice. The generated sRPSP images were converted to sCT, then the QA technique adjusted the sCTs to match a standard reference template created from the training dataset to identify areas where the sCT deviated from the ground-truth CT by >100 HUs. Additionally, a gamma intensity analysis was conducted for similar purposes to analyze the accuracy of the sCT. The group concluded that accurate sRPSP images from T1-weighted and T2-weighted MRI were able to be generated by the ccGAN, with the QA tool highlighting regions of inaccuracy to remove unsuitable sRPSP images from clinical use.

### 3. In-Vivo Monitoring and Adaptivity in Proton Therapy

Continual adaptation and replanning are crucial components of both photon and proton therapy, and demonstrate reductions in NTCP and improved target coverage [32]. The two primary adaptive therapy techniques include offline replanning, which occurs when a new plan is generated between fractions and online replanning, which occurs while the patient is on the treatment table. ML has the potential to benefit both areas of adaptive proton replanning, which currently are time-consuming and potentially require additional irradiation to the patient through repetitive imaging.

#### 3.1 Anatomy Changes

As noted by Wang *et al.*, the sculpting ability benefits of IMPT are countered by increased sensitivity to anatomical variations during and between treatment sessions [33]. As of 2021, anatomic changes were still not effectively accounted for during pediatric treatment plans. Such limitations may result in suboptimal delivery to the tumor or increased dosage to OARs. Adaptive anatomy planning is resource intensive and usually requires additional CT simulation, where in pediatric patients, increased radiation exposure is not preferable. DL may improve this area of clinical workflow by allowing sCTs to be derived from offline on-treatment MRI images to calculate dose delivery based on daily anatomy, as well as flagging cases which may require adaptation.

The group developed a novel CycleGAN model with self-attenuation with abilities to identify pediatric patients that would benefit from adaptive replanning [33]. The goal of creating such a model, opposed to utilizing a conventional CycleGAN, is to assess if the addition of self-attenuation generates more accurate sCTs for children with brain tumors between the ages of 2 and 21. The goal of introducing self-attenuation to a conventional CycleGAN is to enhance boundary delineation of bone-air tissue interfaces, specific to common pediatric brain tumors, and to reduce noise. The training dataset was composed of both CT and T1-weighted MRI images of 125 children with brain tumors. Seven patients between the ages of 2 and 14 that underwent adaptive planning due to anatomy changes discovered via MRI during proton therapy comprised the test dataset. Results were obtained by comparing the MRI taken during proton therapy with the model-generated sCT and the replanning CT (ground truth). The HU MAE with the self-attenuation CycleGAN was 65.3 ± 13.9 versus 88.9 ± 19.3 for the conventional CycleGAN, demonstrating improved accuracy by the proposed model. The self-attenuated model also demonstrated improved gamma passing rates and appropriately triggered plan adaptation in all test patients.

Similar to the study utilizing a CNN to contour OARs in conventional radiation planning [22], Elmahdy *et al.* utilized a CNN to automatically segment the bladder from CT scans of IMPT prostate cancer patients online [34]. This study focuses on online adaptive IMPT, where the patient’s anatomy is continuously tracked to generate real-time treatment adjustments in the case of observed changes. In the case of prostate cancer, the goal of this algorithm was to monitor changes in bladder or prostate shape or location to minimize OAR dosage. Particular attention was paid to accurate segmentation of the bladder and adjacent structures (prostate, seminal vesicles, and lymph nodes). The group acknowledges commercially available automatic recontouring applications for adaptive therapy, but reaffirms their current status as a black box for clinicians.

The proposed CNN is able to predict deformation vector fields which are required for accurate contour propagation by detecting spatial transformations between the reference CT image and the real-time treatment imaging [34]. The CNN was trained on manually-delineated images and corresponding deformation vector fields of 20 prostate cancer patients. As the network makes the continuous comparison between the two images, this enables adaptations in the treatment plan to be generated based on the patient’s anatomical variations during the treatment session. Their results found that the combined CNN and image registration technique improved the target delineation accuracy significantly with a dice similarity coefficient of 88% compared to manual attempts. The CNN-propagated contours met dose coverage constraints in 86%, 91%, and 99% of cases for the prostate, seminal vesicles, and lymph nodes, respectively. The group reported that 80% of the automatically generated treatment plans were directly usable without manual correction, improving clinical efficiency and potentially reducing adverse side effects by sparing healthy tissue.

#### 3.2 Range

Protoacoustic signals in proton therapy are acoustic waves generated when protons interact with tissues. According to the Bragg peak, protons suddenly and rapidly deposit energy at a certain depth, causing a local transient increase in temperature and pressure [35]. This minute expansion generates protoacoustic signals, which may be detected with ultrasound sensors. The clinical relevance relates to recreation of the Bragg peak and real-time monitoring of proton beam range, ensuring that the beam is depositing at the intended target depth. Protoacoustic signal denoising is quickly gaining traction in the field of proton therapy research, and ML techniques are readily being incorporated.

While the protoacoustic technique is able to determine the Bragg peak location in vivo, it requires a large dose delivery in order to achieve an adequate signal-to-noise (SNR) ratio and high number of signaling average (NSA). Such large doses are not suitable for clinical use. A novel DL model demonstrated enhancement of the SNR of protoacoustic measures as well as improving Bragg peak location identification accuracy in range verification scenarios [36]. Three accelerometers were placed on cylindrical polyethylene phantoms to record protoacoustic signals during simulated treatment. Device-specific stack autoencoder (SAE) denoising models were trained to denoise low NSA signals. The supervised SAE outperformed the unsupervised SAE in Bragg peak range verification. Clinically, this model shows promise to decrease unnecessary dosage in the patient by accurately measuring the Bragg peak in vivo.

Expanding upon the range of applications of ML in online range monitoring, Jiang *et al*. proposed a novel architecture capable of denoising acoustic, protoacoustic, and electroacoustic signals both quantitatively and qualitatively [37]. Radiation-induced acoustic imaging generally requires a substantial number of recorded frames to achieve a satisfactory dose deposition average, exposing the patient to increased dose. The proposed model is a general deep inception convolution neural network, or a GDI-CNN. The model features radiation-induced acoustic signal denoising capabilities, which results in a reduced number of frames required for averaging. The group details that “the network employs convolutions with multiple dilations in each inception block, allowing it to encode and decode signal features with varying temporal characteristics” [37]. Such characteristics expand the reach of the network’s abilities to different radiation sources. Compared to prior inception networks, this study implements convolutions with different dilations, rather than different filter sizes, resulting in increased inception field size and decreased computational assumptions. The inputs of the model were the few-averaged noisy signals, which in turn outputs high-SNR denoised signals. For the protoacoustic radiation therapy trial, four of the five data were employed for model training, with the remainder used for validation. The performance of the GDI-CNN was assessed in comparison with experimental data of protoacoustic signals; it realized proton range accuracy parallel to 1500-frame-average results with only 20-frame-average measurements inputted. These results improved range verification frequency from 0.5 to 37.5 Hz in proton therapy. Additionally, it demonstrated lower mean-squared errors and higher peak-SNR, improving the reach of real-time proton therapy monitoring.

#### 3.3 Dosimetry

Protoacoustic imaging specifically focuses on generating images based on the acoustic signals generated by protons. This specific application of protoacoustic signaling is beneficial as it offers in vivo 3D dose verification; however, it is limited by the narrow angle of the ultrasound transducer. Jiang’s group developed a DL model featuring a deep cascaded CNN (DC-CNN) to improve proton-acoustic image reconstruction using proton-acoustic signals detected by a matrix array [38]. The framework was validated on data from 81 prostate cancer patients’ IMPT therapy plans, and dosage was calculated with RayStation. A matrix ultrasound array was simulated near patient treatment site to measure radiofrequency signals during dose delivery. To address how realistic the simulation may be, tissue heterogeneity and attenuation were considered by adding noise to the signals. The proposed model was trained on 204 samples from 69 patients and tested on 26 samples from 12 separate patients. 3D pressures and dose maps were generated, and qualitatively and quantitatively compared with the ground truth. The results showed that the proposed DC-CNN reconstructed high-quality 3D pressure images from the proton-acoustic images, potentially enabling 3D dose verification during treatment.

Pastor-Serrano *et al.* propose DoTA, a DL based millisecond speed dose calculation algorithm capable of accurately predicting pencil beam proton dose depositions [39]. The two current physics-based tools to calculate proton doses are analytical pencil beam algorithms (PBA) and MC simulations, which respectively feature better speed or precision. MC methods and PBAs are currently unable to synthesize the particle transport simulations in sub-second times necessary for next generation real-time adaptive radiotherapy. The importance of accurate particle transport calculations results from IMPT requiring MC simulations and PBAs to compute the spatial distribution of physical dose delivered by the many individual protons. Based upon previous long short-term memory (LSTM) networks [40, 41], the proposed algorithm sequentially calculates proton pencil beam dose distributions from relative SPR slices, and does not require a separate model per beam energy [39]. DoTA takes a novel approach by featuring an attention-based transformer backbone, which dispenses of convolutions and connects the encoder and decoder. This architecture is simpler than many other transduction models, which accounts for the additional speed that may benefit adaptive replanning.

With the parametric DoTA model, dose distributions from individual proton beamlets are able to be predicted from patient geometries (x) and beam energies (ɛ̝) with remarkable speed. The essential concept behind the algorithm is that it captures particle transport physics from provided data and learns the appropriate function y(θ)= f(x, ɛ̝) through a series of artificial neural networks with parameters (θ). Performance was based on speed and accuracy comparison to standard clinically used methods, such as PBA or MC ground truth dose distributions. In summary, DoTA predicted dose distributions from single pencil beams 100 times quicker than widely used PBAs. The distributions yielded a gamma pass rate of 99.37 ± 1.17, close to standard MC accuracy in a fraction of the computational time. DoTA outperformed previous PBA and MC approaches with a 10% improvement in gamma pass rate, and features speed close to commercial GPU MC methods, critical for adaptive replanning events.

### 4. Future directions

Significant advances have been recently made incorporating ML into the proton therapy clinical workflow. However, this is still an emerging area of investigation gaining momentum as proton therapy becomes more commonplace. While great strides have been made in method applicable to all radiotherapy generally, proton therapy-specific models have been less frequently published. Areas of priority for future development are discussed in the following section.

#### 4.1 Quality Assurance

At present, ML-assisted QA methods are sparse in proton therapy. QA is laborious in all high-precision radiation therapy modalities, and is comprises management of beam delivery mechanisms, beam parameters, and instrumentation. Currently, ML algorithms have been applied to improving gamma pass rates and predicting beam data in proton therapy, but no publications directly address ML for proton QA. Prerequisites for a quality plan, such as accurate delineation of the target and OARs, is beyond the scope of most QA procedures [15]. According to the AAPM Task Group 224, QA procedures have been outlined for three proton therapy techniques: scattering, uniform scanning, and PBS [42]. This is an area of work where ML may be incorporated to improve the accuracy, efficiency, and consistency in QA. Ono *et al.* reported on the applications of AI for QA in radiotherapy as of 2024, and proton therapy is absent [43]. Current focus areas include feature extraction and selection, mechanical hardware setup QA, and gamma passing rate calculations. With the recently developed QA procedure outlines by Task Group 224 [42], ML may now be addressed as it has been with various other radiotherapy techniques. Connecting back to ML for in-vivo monitoring, an underdeveloped area, projects can continue to be pursued to further improve real-time treatment monitoring by analyzing live delivery of dosage. Additionally, AI may be harnessed to provide streamlined QA workflows by automating manual routine tasks, such as data entry or documentation.

#### 4.2 Cost Calculation

A large barrier to proton therapy treatment is cost. Proton therapy is expensive and may be inaccessible to patients not geographically located near a treatment center. By training on patient geographic and socioeconomic data and associated treatment outcomes, ML could be leveraged to improve equity or personalize the costs of proton treatment. From an institutional perspective, these same data could be used to assess opportunities for growth. Verma *et al.* investigated the health economics of proton therapy in terms of sustainability and cost-effectiveness [44]. As this study was conducted purely from a health economics perspective, ML was not utilized. A significant concern brought to attention is the inherent inaccuracy of cost-effectiveness analyses due to the inability to account for all aspects of operation, such as electricity and maintenance, beam delivery time, and number of patients treated on a given day. The group reported that Markov and MC modeling have been utilized to address cost tabulations and comparisons, but all modeling studies have limitations due to probabilistic assumptions. As of 2016 when the study was conducted, the group reports that proton therapy for the routine treatment of breast cancer was not shown to be cost-effective. ML may greatly improve the accuracy and efficiency of cost-effectiveness analyses by reliably accounting for additional factors otherwise neglected through manual calculation [44]. Additionally, as demonstrated in previously discussed studies, ML is able to improve MC simulations in various aspects [23, 39, 45]. However, such an approach has not yet been accounted for in proton therapy economic analysis.

#### 4.3 Outcome Studies

Prognostic research is sparse in the context of both proton therapy and ML. The recent nature of proton therapy compared to the history of data available for conventional photon therapy is a primary limiting factor. ML has been utilized to predict toxicity and side effects [16, 46, 47], but not overall treatment prognostics in proton therapy. The integration of DL radiomics and circulating tumor cell counts has demonstrated improved accuracy in predicting the risk of reoccurrence for NSCLC patients treated with stereotactic body radiotherapy (SBRT). Jiao *et al.* accomplished this feat by building DL models on clinical measures and CT data from 421 NSCLC patients [48]. The models were able to not only predict recurrence risks, but were able to stratify patients into specific outcome subgroups based on the CTC counts. The proposed model achieved a concordance index of 0.880. It is promising for such a novel methodology to be applied to proton therapy subsequently, gleaning valuable predictions for physicians to consider throughout the treatment process. In particular, such a technique may serve beneficial in adaptive replanning.

#### 4.4 Challenges

The recent nature of proton therapy compared to the history of data available for conventional photon therapy is a limiting factor in the incorporation of ML. There are many large, high-quality datasets available for conventional radiation therapy, but few for proton therapy. This limits most training and testing data for proton therapy to clinical work, as deidentified and declassified patient datasets are not as common. Additional challenge arises from the variation in the nature of the physics. Proton therapy harnesses the complex power of the Bragg peak to provide targeted therapy with decreased side effects. However, with such benefit comes difficulty in applying existing ML algorithms used in conventional radiotherapy to proton therapy. ML applications are common in other areas of radiation therapy, but the challenge in adapting and applying them to proton therapy is inherent to the predicate physics and hardware differences.

## Discussion

Many variations and applications of machine learning (ML) have appeared in the proton therapy workflow over the past five years with promise of improving efficiency, accuracy, and efficacy of treatment. U-Net architectures are prevalent in the patient pre-screening process, and convolutional neural networks (CNNs) and CycleGANs frequent dose and range prediction in treatment planning. For adaptive monitoring, advanced deep learning architectures such as general deep inception convolution neural networks (GDI-CNNs) and deep cascaded CNNs (DC-CNNs) improve real-time dose verification and range monitoring. However, certain areas such as target segmentation in treatment planning, cost analysis, and quality assurance measures lack notable ML contributions and are scopes of future interest. In conclusion, with the increased clinical interest in proton therapy, ML algorithms have been incorporated into both the clinical and research workflows to facilitate treatment and discovery.

View this table:
[Table 1:](http://medrxiv.org/content/early/2024/10/28/2024.10.09.24314920/T1)

Table 1: Datasheet summary of current ML applications and associated data sources at various stages of proton therapy clinical treatment.

## Supporting information

Funding Information [[supplements/314920_file02.docx]](pending:yes)

## Data Availability

All data and studies referenced are publicly available in the cited sources.

## Footnotes

*   The funding declaration has been updated with appropriate information.

*   Received October 9, 2024.
*   Revision received October 28, 2024.
*   Accepted October 28, 2024.


*   © 2024, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  1.Wilson, R.R., Radiological Use of Fast Protons. Radiology, 1946. 47(5).
    
    
2.  2.Tobias, C.A., et al., Pituitary Irradiation with High-Energy Proton Beams A Preliminary Report*. Cancer Research, 1958. 18(2): p. 121–134.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NjoiY2FucmVzIjtzOjU6InJlc2lkIjtzOjg6IjE4LzIvMTIxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjQvMTAvMjgvMjAyNC4xMC4wOS4yNDMxNDkyMC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

3.  3. William F. Hartsell, et al., Temporal Evolution and Diagnostic Diversification of Patients Receiving Proton Therapy in the United States: A Ten-Year Trend Analysis (2012 to 2021) From the National Association for Proton Therapy. International Journal of Radiation Oncology Biology Physics, 2024. 119(4): p. 1069–1077.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=38163519&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

4.  4.Park, S.H. and  J.O. Kang, Basics of particle therapy I: physics. Radiat Oncol J, 2011. 29(3): p. 135–46.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3857/roj.2011.29.3.135&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22984664&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

5.  5.Prasanna, P.G., et al., Normal Tissue Injury Induced by Photon and Proton Therapies: Gaps and Opportunities. International Journal of Radiation Oncology\*Biology\*Physics, 2021. 110(5): p. 1325–1340.
    
    
6.  6. Hao H. Zhang, et al., Machine Learning Applications in Radiation Therapy, in Machine Learning Algorithms for Problem Solving in Computational Applications: Intelligent Techniques. 2012, IGI Global: Hershey, PA. p. 59–84.
    
    
7.  7.1.   M. Cord and 
    2.   P. Cunningham
    
    Cunningham, P.,  M. Cord, and  S.J. Delany, *Supervised Learning*, in Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval,  M. Cord and  P. Cunningham, Editors. 2008, Springer Berlin Heidelberg: Berlin, Heidelberg. p. 21–49.
    
    
8.  8.1.   R. Pandey
    
    Tyagi, K., et al., *Chapter 3 - Unsupervised learning*, in Artificial Intelligence and Machine Learning for EDGE Computing,  R. Pandey, et al., Editors. 2022, Academic Press. p. 33–52.
    
    
9.  9.O’Shea, K. and  R. Nash, An Introduction to Convolutional Neural Networks. ArXiv, 2015. abs/1511.08458.
    
    
10. 10.Goodfellow, I.J., et al. Generative Adversarial Networks. 2014. arXiv:1406.2661 DOI: 10.48550/arXiv.1406.2661.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.48550/arXiv.1406.2661&link_type=DOI) 

11. 11.Ronneberger, O.,  P. Fischer, and  T. Brox. U-Net: Convolutional Networks for Biomedical Image Segmentation. 2015. Cham: Springer International Publishing.
    
    
12. 12.Kouwenverg, J., et al., Model based patient pre-selection for intensity-modulated proton therapy (IMPT) using automated treatment planning and machine learning. Radiotherapy and Oncology, 2021. 158: p. 224–229.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33667584&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

13. 13.Huet-Dastarac, M., et al., Patient selection for proton therapy using Normal Tissue Complication Probability with deep learning dose prediction for oropharyngeal cancer. Medical Physics, 2023. 50(10): p. 6201–6214.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37140481&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

14. 14.Chen, M., et al., Evaluation of an automated clinical decision system with deep learning dose prediction and NTCP model for prostate cancer proton therapy. Physics in Medicine & Biology, 2024. 69(11).
    
    
15. 15.Geng, H., et al., Implementation of Machine Learning Models to Ensure Radiotherapy Quality for Multicenter Clinical Trials: Report from a Phase III Lung Cancer Study. Cancers, 2023. 15(4).
    
    
16. 16.Padannayil, N.M., et al., IMPT of head and neck cancer: unsupervised machine learning treatment planning strategy for reducing radiation dermatitis. Radiation Oncology, 2023. 18(1): p. 11.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36639667&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

17. 17.Fang, K.C., et al., Acute radiation dermatitis among patients with nasopharyngeal carcinoma treated with proton beam therapy: Prognostic factors and treatment outcomes. Int Wound J, 2023. 20(2): p. 499–507.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35880316&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

18. 18.Shafai-Erfani, G., et al., MRI-Based Proton Treatment Planning for Base of Skull Tumors. International Journal of Particle Therapy, 2019. 6(2): p. 12–25.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31998817&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

19. 19.Zimmermann, L., et al., An MRI sequence independent convolutional neural network for synthetic head CT generation in proton therapy. Z Med Phys, 2022. 32(2): p. 218–227.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34920940&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

20. 20.Borderias-Villarroel, E., et al., Machine learning-based automatic proton therapy planning: Impact of post-processing and dose-mimicking in plan robustness. Medical Physics, 2023. 50(7): p. 4480–4490.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37029632&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

21. 21.Nielsen, C.P., et al., Consistency in contouring of organs at risk by artificial intelligence vs oncologists in head and neck cancer patients. Acta Oncologica, 2023. 62(11): p. 1418–1425.
    
    
22. 22.Veen, J.v.d., et al., Benefits of deep learning for delineation of organs at risk in head and neck cancer. Radiotherapy & Oncology, 2019. 138: p. 68–74.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31146073&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

23. 23.Wu, C., et al., Improving proton dose calculation accuracy by using deep learning. Machine Learning: Science and Technology, 2021. 2(1).
    
    
24. 24.Liu, Y., et al., Commissioning dose computation model for proton source in pencil beam scanning therapy by convolution neural networks. Physics in Medicine & Biology, 2023. 68(15).
    
    
25. 25.Gao, Y., et al., Deep learning-based synthetic dose-weighted LET map generation for intensity modulated proton therapy. Physics in Medicine & Biology, 2024. 69(2).
    
    
26. 26.Ranjith, C.P., et al., An artificial neural network based approach for predicting the proton beam spot dosimetric characteristics of a pencil beam scanning technique. Biomedical Physics and Engineering Express, 2024. 10(3).
    
    
27. 27.Lee, E., et al., Deep learning proton beam range estimation model for quality assurance based on two-dimensional scintillated light distributions in simulations. Med Phys, 2023. 50(11): p. 7203–7213.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37517077&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

28. 28.Wang, T., et al., Learning-Based Stopping Power Mapping on Dual-Energy CT for Proton Radiation Therapy. International Journal of Particle Therapy, 2021. 12(7): p. 46–60.
    
    
29. 29.Chang, C.-W., et al., Multimodal imaging-based material mass density estimation for proton therapy using supervised deep learning. The British Journal of Radiology, 2023. 96(1152).
    
    
30. 30.Chang, C.-W., et al., Dual-energy CT based mass density and relative stopping power estimation for proton therapy using physics-informed deep learning. Physics in Medicine & Biology, 2022. 67(11).
    
    
31. 31.Wang, C., et al., Toward MR-only proton therapy planning for pediatric brain tumors: Synthesis of relative proton stopping power images with multiple sequence MRI and development of an online quality assurance tool. Med Phys, 2022. 49(3): p. 1559–1570.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35075670&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

32. 32.Koay, E.J., et al., Adaptive/Nonadaptive Proton Radiation Planning and Outcomes in a Phase II Trial for Locally Advanced Non-small Cell Lung Cancer. International Journal of Radiation Oncology\*Biology\*Physics, 2012. 84(5): p. 1093–1100.
    
    
33. 33.Wang, C., et al., Facilitating MR-Guided Adaptive Proton Therapy in Children Using Deep Learning-Based Synthetic CT. Int J Part Ther, 2022. 8(3): p. 11–20.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35127971&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

34. 34.Elmahdy, M.S., et al., Robust contour propagation using deep learning and image registration for online adaptive proton therapy of prostate cancer. Medical Physics, 2019. 46(8): p. 3329–3343.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mp.13620&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31111962&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

35. 35. Kevin C Jones,  Chandra M Seghal, and  S. Avery, How proton pulse characteristics influence protoacoustic determination of proton-beam range: simulation studies. Physics in Medicine & Biology, 2016. 61(6).
    
    
36. 36.Wang, J., et al., Deep learning-based protoacoustic signal denoising for proton range verification. Biomedical Physics and Engineering Express, 2023. 9(4).
    
    
37. 37.Jiang, Z., et al., Radiation-induced acoustic signal denoising using a supervised deep learning framework for imaging and therapy monitoring. Physics in Medicine & Biology, 2023. 68(23).
    
    
38. 38.Jiang, Z., et al., 3Din vivodose verification in prostate proton therapy with deep learning-based proton-acoustic imaging. Phys Med Biol, 2022. 67(21).
    
    
39. 39.Pastor-Serrano, O. and  Z. Perkó, Millisecond speed deep learning based proton dose calculation with Monte Carlo accuracy. Physics in Medicine & Biology, 2022. 67(10).
    
    
40. 40.Hochreiter, S. and  J. Schmidhuber, Long Short-Term Memory. Neural Computation, 1997. 9(8): p. 1735–1780.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1162/neco.1997.9.8.1735&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9377276&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997YA04500007&link_type=ISI) 

41. 41.Neishabouri, A., et al., Long short-term memory networks for proton dose calculation in highly heterogeneous tissues. Medical Physics, 2021. 48(4): p. 1893–1908.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=33332644&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

42. 42.Arjomandy, B., et al., AAPM task group 224: Comprehensive proton therapy machine quality assurance. Medical Physics, 2019. 46(8): p. e678–e705.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31125441&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

43. 43.Ono, T., et al., Applications of artificial intelligence for machine- and patient-specific quality assurance in radiation therapy: current status and future direction. Journal of Radiation Research, 2024.
    
    
44. 44.Verma, V., et al., Cost-comparativeness of proton versus photon therapy. Chinese Clinical Oncology, 2016. 5(4).
    
    
45. 45.Pietsch, J., et al., Automatic detection and classification of treatment deviations in proton therapy from realistically simulated prompt gamma imaging data. Medical Physics, 2023. 50(1): p. 506–517.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36102783&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

46. 46.Chamseddine, I., et al., Predictive Model of Liver Toxicity to Aid the Personalized Selection of Proton Versus Photon Therapy in Hepatocellular Carcinoma. International Journal of Radiation Oncology Biology Physics, 2023. 116(5): p. 1234–1243.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36739920&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

47. 47.Chang, C.-W., et al., Early in vivo Radiation Damage Quantification for Pediatric Craniospinal Irradiation Using Longitudinal MRI for Intensity Modulated Proton Therapy. Advances in Radiation Oncology, 2023. 8(5).
    
    
48. 48.Jiao, Z., et al., Integration of Deep Learning Radiomics and Counts of Circulating Tumor Cells Improves Prediction of Outcomes of Early Stage NSCLC Patients Treated With Stereotactic Body Radiation Therapy. Int J Radiat Oncol Biol Phys, 2022. 112(4): p. 1045–1054.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34775000&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

49. 49.Chen, S., et al., MR-based synthetic CT image for intensity-modulated proton treatment planning of nasopharyngeal carcinoma patients. Acta Oncol, 2022. 61(11): p. 1417–1424.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36305424&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

50. 50.Bruggen, I.G.v., et al., Automated Robust Planning for IMPT in Oropharyngeal Cancer Patients Using Machine Learning. International Journal of Radiation Oncology, Biology, Physics, 2023. 115(5): p. 1283–1290.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=36535432&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

51. 51.Grewal, H.S., et al., Prediction of the output factor using machine and deep learning approach in uniform scanning proton therapy. Journal of Applied Clinical Medical Physics, 2020. 21(7): p. 128–134.
    
    
52. 52.Stasica, P., et al., Single proton LET characterization with the Timepix detector and artificial intelligence for advanced proton therapy treatment planning. Physics in Medicine & Biology, 2023. 68(10).
    
    
53. 53.Tang, X., et al., Deep learning based linear energy transfer calculation for proton therapy. Phys Med Biol, 2024. 69(11).
    
    
54. 54.Uh, J., et al., A hybrid method of correcting CBCT for proton range estimation with deep learning and deformable image registration. Physics in Medicine & Biology, 2023. 68(16).
    
    
55. 55.Charyyev, S., et al., Learning-based synthetic dual energy CT imaging from single energy CT for stopping power ratio calculation in proton radiation therapy. The British Journal of Radiology, 2022. 95(1129).
    
    
56. 56.Thummerer, A., et al., Deep learning-based 4D-synthetic CTs from sparse-view CBCTs for dose calculations in adaptive proton therapy. Med Phys, 2022. 49(11): p. 6824–6839.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=35982630&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

57. 57.Zhang, G., et al., SWFT-Net: a deep learning framework for efficient fine-tuning spot weights towards adaptive proton therapy. Phys Med Biol, 2022. 67(24).
    
    
58. 58.Lerendegui-Marco, J., et al., Towards machine learning aided real-time range imaging in proton therapy. Scientific Reports, 2022. 12.
    
    
59. 59.Sato, S., et al., A simulation study of in-beam visualization system for proton therapy by monitoring scattered protons. Frontiers in Medicine, 2023. 10.
    
    
60. 60.Zhang, L., et al., Beam mask and sliding window-facilitated deep learning-based accurate and efficient dose prediction for pencil beam scanning proton therapy. Medical Physics, 2024. 51(2): p. 1484–1498.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=37748037&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom) 

61. 61.Harms, J., et al., Cone-beam CT-derived relative stopping power map generation via deep learning for proton radiotherapy. Medical Physics, 2020. 47(9): p. 4416–4427.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32579710&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F10%2F28%2F2024.10.09.24314920.atom)