PortPy, short for Planning and Optimization for Radiation Therapy, is an initiative aimed at creating an open-source Python library for cancer radiotherapy treatment planning optimization. Radiotherapy is a key treatment for over half of all cancer patients, either alone or alongside surgery, chemotherapy, and immunotherapy. It works by directing high-energy radiation beams at tumors to kill cancerous cells. Given that each patient has a unique anatomy, it is essential to customize the radiation beams' shape, angle, and intensity. The goal is to maximize damage to the tumor while minimizing exposure to healthy surrounding tissues. This process involves solving complex, large-scale mathematical optimization problems tailored to each individual patient. PortPy aims to accelerate research in this interdisciplinary field by offering tools, benchmark algorithms, and datasets.
Contents
- Radiotherapy Optimization 101
- Quick start and examples
- Benchmark data, benchmark algorithms, and PortPy toolkit
- High-Level Description of PortPy
- How to contribute?
- Data
- Installation
- Team
The key variables in treatment planning optimization are the parameters of the radiation beams (e.g., beams' shape, angle, and intensity). However, the quality of a treatment is primarily measured by the radiation dose delivered to the patient’s body. We can connect the beam parameters to the radiation dose using a straightforward linear relationship. First, we divide the patient’s body into small three-dimensional units called voxels and each radiation beam into small two-dimensional sections called beamlets. By calculating how much radiation each beamlet (j) delivers to each voxel (i), and representing this with a value aij, we create what is known as the dose deposition matrix (A). This matrix links the intensities of the beamlets (x) to the total radiation dose delivered (d) using the equation: d=Ax. A general radiotherapy optimization problem can be formulated as:
Minimize f(Ax,x) subject to g(Ax,x)<=0, x>=0
where f and g are functions that evaluate the quality of radiation dose (Ax) and the beamlet intensities (x). These optimization problems are typically large, non-convex, and involve multiple conflicting criteria (i.e., tumor irradiation vs healthy tissues’ sparing). They must be solved quickly for each patient, often within minutes, seconds, or even milliseconds, depending on whether the planning is done offline, online, or in real-time.
Note: If you are new to the field, we suggest reviewing relevant literature review papers (Zarepisheh et al. 2021, Breedveld et al. 2019, Ehrgott et al. 2010) and watching YouTube videos (Edelman competition, Varian IMRT, Elekta VMAT). In the near future, we plan to launch an educational YouTube channel to assist researchers new to this field. Meanwhile, you can support this effort by starring this GitHub page, which will help us secure funding for further development.
The easiest way to start is through the PorPy following Jupiter Notebook examples.
Example File | Description |
---|---|
1_basic_tutorial.ipynb | Demonstrates the main functionalities of PortPy (e.g., Access data, create an IMRT plan, visualize) |
vmat_scp_tutorial.ipynb | Creates a VMAT plan using sequential convex programming |
vmat_scp_dose_prediction.ipynb | Predicts 3D dose distribution using deep learning and converts it into a deliverable VMAT plan |
3d_slicer_integration.ipynb | Creates an IMRT plan and visualizes it in 3D-Slicer |
imrt_tps_import.ipynb | 1. Outputs IMRT plan in DICOM RT format and imports it into TPS. 2. Outputs IMRT plan optimal fluence in an Eclipse-compatable format and imports it into Eclipse |
vmat_tps_import.ipynb | Outputs VMAT plan in DICOM RT format and imports it into TPS |
imrt_dose_prediction.ipynb | Predicts 3D dose distribution using deep learning and converts it into a deliverable IMRT plan |
vmat_global_optimal.ipynb | Finds a globally optimal VMAT plan |
beam_orientation_global_optimal.ipynb | Finds globally optimal beam angles for IMRT |
dvh_constraint_global_optimal.ipynb | Finds a globally optimal plan meeting Dose Volume Histogram (DVH) constraints |
This figure illustrates the inspiration behind developing PortPy, drawing from successful open-source practices in the AI and computer science communities. Tools like PyTorch and TensorFlow, along with benchmark datasets such as ImageNet and algorithms like AlexNet, have revolutionized AI and data science. Our goal is to replicate this successful model in the field of radiotherapy by equipping researchers with PortPy toolkit, benchmark algorithms, and datasets, as outlined below:
- PortPy Toolkit. PortPy allows researchers to develop, test, and validate novel treatment planning optimization algorithms.
- Benchmark Datasets. We have curated and made publicly available a dataset of 50 lung cancer patients, which includes all the necessary data for treatment plan optimization (e.g., beamlets, voxels, dose influence matrix). These data are extracted from the commercial FDA-approved Eclipse treatment planning system using its API. For more info about data, see Data.
- Benchmark Algorithms. Many optimization problems in radiotherapy treatment planning suffer from “non-convexity”, a mathematical property that can cause optimization algorithms to become trapped in “local optima” rather than finding the global optimum. Several of these problems (e.g., VMAT planning) can be formulated using advanced optimization techniques like Mixed Integer Programming (MIP). Although MIP is computationally intensive, often taking days to solve for each patient, it can provide global optimal solutions that can serve as "ground truth" benchmarks, enabling researchers to develop and evaluate more computationally efficient algorithms. For more info, see our Jupyter Notebooks (vmat_global_optimal.ipynb, beam_orientation_global_optimal.ipynb, dvh_constraint_global_optimal.ipynb).
The above figure illustrates the PortPy design and its three main modules: “Data Management”, “Plan Generation”, and “Plan Evaluation”, which are discussed below. We recommend reviewing our Jupyter Notebooks examples for a more comprehensive understanding of these modules.
-
Data Management
- This module provides access to the curated benchmark PortPy dataset, allowing researchers to test their algorithms on a standardized dataset (see basic_tutorial.ipynb notebook)
- The available data includes:
- CT images and contours
- all necessary data for optimization extracted from Eclipse using its API (version 16.1)
- expert-selected beams for each patient
- an IMRT plan for each patient, generated using our in-house automated planning system, ECHO (YouTube Video, Paper). More information about data can be found in Data section.
- In the current version, you can only work with the benchmark dataset provided in this PortPy repo and cannot use your own dataset for now. We will address this problem in the near future
# Use PortPy DataExplorer class to explore PortPy data data = pp.DataExplorer(data_dir=''../data) # Load ct, structure set, beams for the above patient using CT, Structures, and Beams classes ct = pp.CT(data) structs = pp.Structures(data) beams = pp.Beams(data) # By default, PortPy uses the beams selected by an expert planner, which are included as part of the dataset.
-
Plan Generation
- This module facilitates the generation of treatment plans using either classical optimization methods or emerging AI-based techniques
- For optimization tasks, PortPy has been integrated with CVXPy, a widely-used open-source package. CVXPy enables the high-level formulation of optimization problems and offers out-of-the-box access to a range of free (e.g., SCIP, SCIPY) and commercial (e.g., MOSEK, CPLEX, GUROBI) optimization engines (available for free for research purposes) (see basic_tutorial.ipynb notebook)
- PortPy.AI module is equipped with essential functionalities for AI-based planning. These include data access, data pre-processing, model training and testing, and patient-specific 3D dose prediction (see imrt_dose_prediction.ipynb notebook)
# Load optimization parameters and clinical criteria clinical_criteria = pp.ClinicalCriteria(data, protocol_name='Lung_2Gy_30Fx') opt_params = data.load_config_opt_params(protocol_name='Lung_2Gy_30Fx') # Load influence matrix inf_matrix = pp.InfluenceMatrix(ct=ct, structs=structs, beams=beams) # create a plan object my_plan = pp.Plan(ct = ct, structs = structs, beams = beams, inf_matrix = inf_matrix, clinical_criteria=clinical_criteria) # create cvxpy problem using the clinical criteria and optimization parameters and solve it opt = pp.Optimization(my_plan, opt_params=opt_params, clinical_criteria=clinical_criteria) opt.create_cvxpy_problem() sol = opt.solve(solver='MOSEK', verbose=False)
-
Plan Visualization and Evaluation
- Basic built-in visualization tools (e.g., DVH, dose distribution) are integrated into PortPy
- Enhanced visualizations are available through the integration with the popular open-source 3DSlicer package (see 3d_slicer_integration.ipynb notebook)
- Plans can be quantitatively evaluated using well-established clinical protocols (e.g., Lung 2Gyx30, see basic_tutorial.ipynb)
- Plans can be imported into any TPS for final clinical evaluations (see imrt_tps_import.ipynb)
# plot fluence in 3d for the 1st beam pp.Visualization.plot_fluence_3d(sol=sol, beam_id=my_plan.beams.get_all_beam_ids()[0]) # plot dvh for the structures pp.Visualization.plot_dvh(my_plan, sol=sol, struct_names=['PTV', 'CORD'], title=data.patient_id) # plot 2d axial slice for the given solution and display the structures contours on the slice pp.Visualization.plot_2d_slice(my_plan=my_plan, sol=sol, slice_num=60, struct_names=['PTV']) # visualize plan metrics and compare them against the clinical criteria pp.Evaluation.display_clinical_criteria(my_plan, sol=sol, clinical_criteria=clinical_criteria)
As illustrated in the above figure, PortPy organization includes "PortPy", which is the current repository, and PortPy extensions, which are the repositories developed using the PortPy as a platform. To maintain the lightweight nature and user-friendliness of PortPy modules, our aim is to include only fundamental functionalities, along with benchmark data and algorithms in the PortPy repo, and establish separate repositories for other projects, similar to what we've done for projects like CompressRTP and ECHO VMAT.
If you're interested in contributing to existing PortPy modules or wish to create a new module, we encourage you to contact us first. This will help ensure that our objectives and priorities are aligned. If you use PortPy to build your own package, you're welcome to host your package within the PortPy-Project orgainization. Alternatively, you can host your package on your own GitHub page. In this case, please inform us so that we can fork it and feature it under the PortPy-Project organization. For those keen on creating a logo for their repository, we offer the option to customize our pre-designed logo.
PortPy equips researchers with a robust benchmark patient dataset, sourced from the FDA-approved Eclipse commercial treatment planning system through its API. This dataset embodies all necessary elements for optimizing various machine configurations such as beam angles, aperture shapes, and leaf movements. It includes
- Dose Influence Matrix (AKA dose deposition matrix, dij matrix): The dose contribution of each beamlet to each voxel,
- Beamlets/Voxels Details: Detailed information about the position and size of beamlets/voxels,
- Expert-Selected Benchmark Beams: An expert clinical physicist has carefully selected benchmark beams, providing reference beams for comparison and benchmarking,
- Benchmark IMRT Plan: A benchmark IMRT plan generated using our in-house automated treatment planning system called ECHO (YouTube Video, Paper). This plan serves as a benchmark for evaluating new treatment planning algorithms.
- Benchmark Clinical Criteria: A set of clinically relevant mean/max/DVH criteria for plan evaluation. Currently, this set encompasses only the Lung 2Gy×30 protocol but will be expanded in the future to more protocols as well as TCP/NTCP evaluation functions.
To access these resources, users are advised to download the latest version of the dataset, which can be found here. Subsequently, create a directory titled './data' in the current project directory and transfer the downloaded file into it. For example, ./data/Lung_Phantom_Patient_1. We have adopted the widely-used JSON and HDF5 formats for data storage. HDFViwer can be utilized to view the contents of the HDF5 files.
Note: Initially, we will utilize a lung dataset from TCIA. The original DICOM CT images and structure sets are not included in the PortPy dataset and need to be directly downloaded from the TCIA. Users can fetch the TCIA collection ID and the TCIA subject ID for each PortPy patient using the get_tcia_metadata() method in PortPy and subsequently download the data from TCIA (see imrt_tps_import)
-
Install using pip:
- Run the command
pip install portpy
- You can install optional packages using
pip install portpy[mosek, pydicom]
- Run the command
-
Install using conda:
conda install -c conda-forge portpy
-
Install from source:
-
Clone this repository using
git clone https://github.com/PortPy-Project/PortPy.git
-
Navigate to the repository with
cd portpy
-
Install the dependencies within a Python virtual environment or Anaconda environment. To set up in a Python virtual environment, install all the dependencies specified in requirements.txt as follows:
- Create the virtual environment with
python3 -m venv venv
- Activate the environment with
source venv/bin/activate
- Install the requirements using
pip install -r requirements.txt
- Create the virtual environment with
-
PortPy is a community project initiated at Memorial Sloan Kettering Cancer Center (MSK). It is currently developed and maintained by Masoud Zarepisheh (Principal Investigator, zarepism@mskcc.org) and Gourav Jhanwar (Lead Developer, jhanwarg@mskcc.org). Other team members include: Mojtaba Tefagh (Optimization/AI/ML expert from University of Edinburgh), Linda Hong (Medical Physicist from MSK), Vicki Taasti (Proton Physicist from Aarhus University), and Saad Nadeem (AI/Imaging expert from MSK).
PortPy code is distributed under Apache 2.0 with Commons Clause license, and is available for non-commercial academic purposes.
If you find our work useful in your research or if you use parts of this code please cite our AAPM'23 abstract :
@article{jhanwar2023portpy,
title={Portpy: An Open-Source Python Package for Planning and Optimization in Radiation Therapy Including Benchmark Data and Algorithms},
author={Jhanwar, Gourav and Tefagh, Mojtaba and Taasti, Vicki T and Alam, Sadegh R and Tuomaala, Seppo and Nadeem, Saad and Zarepisheh, Masoud},
journal={AAPM 65th Annual Meeting & Exhibition},
year={2023}
}