Deep learning-assisted multiple organ segmentation from whole-body CT images

Yazdan Salimi; Isaac Shiri; Zahra Mansouri; Habib Zaidi

doi:10.1101/2023.10.20.23297331

Abstract

Background Automated organ segmentation from computed tomography (CT) images facilitates a number of clinical applications, including clinical diagnosis, monitoring of treatment response, quantification, radiation therapy treatment planning, and radiation dosimetry.

Purpose To develop a novel deep learning framework to generate multi-organ masks from CT images for 23 different body organs.

Methods A dataset consisting of 3106 CT images (649,398 axial 2D CT slices, 13,640 images/segment pairs) and ground-truth manual segmentation from various online available databases were collected. After cropping them to body contour, they were resized, normalized and used to train separate models for 23 organs. Data were split to train (80%) and test (20%) covering all the databases. A Res-UNET model was trained to generate segmentation masks from the input normalized CT images. The model output was converted back to the original dimensions and compared with ground-truth segmentation masks in terms of Dice and Jaccard coefficients. The information about organ positions was implemented during post-processing by providing six anchor organ segmentations as input. Our model was compared with the online available “TotalSegmentator” model through testing our model on their test datasets and their model on our test datasets.

Results The average Dice coefficient before and after post-processing was 84.28% and 83.26% respectively. The average Jaccard index was 76.17 and 70.60 before and after post-processing respectively. Dice coefficients over 90% were achieved for the liver, heart, bones, kidneys, spleen, femur heads, lungs, aorta, eyes, and brain segmentation masks. Post-processing improved the performance in only nine organs. Our model on the TotalSegmentator dataset was better than their models on our dataset in five organs out of 15 common organs and achieved almost similar performance for two organs.

Conclusions The availability of a fast and reliable multi-organ segmentation tool leverages implementation in clinical setting. In this study, we developed deep learning models to segment multiple body organs and compared the performance of our models with different algorithms. Our model was trained on images presenting with large variability emanating from different databases producing acceptable results even in cases with unusual anatomies and pathologies, such as splenomegaly. We recommend using these algorithms for organs providing good performance. One of the main merits of our proposed models is their lightweight nature with an average inference time of 1.67 seconds per case per organ for a total-body CT image, which facilitates their implementation on standard computers.

Introduction

Segmentation of healthy organs from Computed Tomography (CT) images is critical and beneficial in a number of applications, including the generation of anthropomorphic computational models, delimitation of organs at risk in radiation therapy (RT) treatment planning (1–4), and other kinds of computer-assisted applications, such as pathologic detection (5, 6), prognosis and outcome prediction (7, 8), image quantification (9, 10), and radiation dosimetry calculations (11–13). The manual slice-by-slice segmentation of organs can be labor-intensive and time-consuming, in addition to the high inter- and intra-observer variability reported for segmentation of healthy organs and malignant lesions (14, 15). Since the emergence of machine learning and deep learning (DL) algorithms in medical imaging research, especially medical image segmentation, a number of studies focused on automatic segmentation of structures from CT images and other imaging modalities (16–18). Most published studies attempted to improve segmentation accuracy (commonly quantified by the Dice coefficient), robustness and generalizability on new unseen dataset acquired with different imaging settings on disparate patient characteristics and including a large number of organs (19–21). Newly developed neural network architectures, loss functions, and image processing algorithms contributed to the improvement of the performance of image segmentation models. Yet, the number of datasets and their diversity remains the bottleneck for successful implementation of deep learning-based algorithms (22). Most studies conveyed the performance of the developed models on a test set excluded from the training set, thus reaching very high Dice coefficients as reported in few challenges held on multiple organ segmentations (23). Yet, the majority of these studies didn’t investigate models’ performance on unseen external datasets. Xu et al. (24) focused on the occurrence of outliers during image segmentation and how to solve this problem. Recent studies addressed the limitations and benefits of DL-based organ segmentation in real-life clinical scenarios (14, 25). The comparison of the results achieved by different techniques using private/local databases is not straightforward given that the used datasets are not publicly available. Besides, it’s well established that acquisition, scanner, and demographic parameters can affect the performance of a model on external unseen datasets from other centers (14, 26). Ma et al. (19) described the low performance of segmentation models trained and inferenced on different databases for abdominal organs segmentation task. In this context, a segmentation model trained on a dataset presenting with a large variability and tested on an unseen dataset may be beneficial in estimating the performance in real clinical scenarios.

In this study, we aimed to develop a deep neural network to segment multiple healthy organs (28 organs) from total-body CT images targeting improvement of the accuracy and generalizability compared to previously developed models. We also compared the performance of our models with existing methods and considered the effect of post-processing algorithms to take into account organ-specific anatomical information during the segmentation process.

Materials and Methods

Patient population

This study included 3106 CT images (649,398 axial 2D CT slices, 13,640 3D image/segment pairs) collected from multiple online available datasets (27–32). A total of 300 pediatric cases with 18.9 ± 4.13 cm effective diameter and 2806 adult cases with 27.53 ± 5.35 cm effective diameter as defined by the AAPM #204 Report (33) were included. The average age was 6.32 ± 4.34 years for pediatric patients and 66.98 ± 9.84 years for adult patients. It should be noted that the age, gender, and acquisition parameters were available only in a limited number of datasets, the rest were either anonymized or in NIFTI format without additional information. The number of slices and patient size characteristics were summarized in supplementary Table 1. The data were split into training and test set for each organ according to the number of cases from each database to ensure the test and train data use cases from each available database, i.e., the training (80 %) and testing (20%) data for each organ include cases from all databases.

Figure 1 depicts the number of CT images used for training each model for each organ segmentation. The number of training and test datasets are summarized in Table 1. The detailed number of training cases from each database is provided in supplementary Table 2. The masks (segmentations) of the 23 different organs were used to train separate segmentation models. The summed gastrointestinal segment (GIs) was defined by adding distinct segmentations of the duodenum, small intestine, and colon together to define a single organ.

Figure 1.

Number of 3D CT images extracted from the clinical studies included in all databases used for training and testing of the models. The upper (color) bar depicts the test, whereas the bottom (gray) bar depicts the training numbers. UB: Urinary Bladder, SC: Spinal Cord, GB: Gall Bladder, AG: Adrenal Gland, IVC: Inferior Vena Cava, SI: Small Intestine, GIs: Gastrointestinal.

View this table:

Table 1.

Summary of image segmentation metrics, including Dice, Jaccard coefficients and the effect of post-processing, and the number of cases in the train/validation and test groups in strategy (a). UB: Urinary Bladder, SC: Spinal Cord, GB: Gall Bladder, AG: Adrenal Gland, IVC: Inferior Vena Cava, SI: Small Intestine, GIs: Gastrointestinal.

Preprocessing and Network architecture

The external body contour was extracted from axial CT images through image processing algorithms developed and used in previous studies (34, 35). The CT images were cropped to a bounding box (BB) including the body contour in the lateral and AP directions to remove the background area. The images were cropped to 30 slices in the superior direction and 30 slices in the inferior direction according to the BB covering organ segmentation in the Z-axis (cranio-caudal direction). The model was trained in 2D fashion, meaning that the input to the network consisted of 2D axial images with the output being the corresponding segmentation masks. A Res-UNET neural network architecture used in a previous PET segmentation study (36) was employed in this work (Figure 2). The 2D images and masks were resized to 304 (right to left) × 224 (anterior posterior) pixel dimensions. The image intensities were clipped between −70 HU and +170 HU and normalized between zero and one and then discretized to 240 intensity values.

Figure 2.

The Res-UNET neural network architecture adopted for multi-organ segmentation from CT images. Conv2d: 2D convolution layer.

The cropped information was stored in the image header and used later to reverse the cropped model output segmentation to the original image dimensions. Figure 3 summarizes the steps performed to train the network.

Figure 3.

The flowchart of training and inference steps followed in implementing the segmentation algorithm. From left to right: The upper axial slices show CT images, with the liver mask defined with the yellow contour, segmented body contour, and pre-processed (cropped, normalized, and resized) images. The lower segmentation shows the 3D visualization of the network output after post-processing.

Post-processing and prior knowledge implementation

Organ-specific post-processing algorithms were used to take into account the anatomical locations of the organs. For all test datasets, segmentation masks of six anchor organs including the liver, spleen, lung, femoral head, bladder, and kidneys were generated by our trained models. These generated segmentation masks were used to perform organ-specific post-processing. For each organ, a specific algorithm was used to remove the segmented voxels outside of the BB delimitating each organ. For instance, for spleen post-processing, the post-processing function input was the liver, lung, and femoral head segmentation. The BB of the femur and lung were determined considering the known prior anatomical information that the liver is in the abdomen and always higher than the femoral head BB and lower than the lungs apex. We used the same strategy for other organs, such as the gall bladder (GB), and adrenal glands (AG) to remove unwanted (false positive) voxels from the network output by exploiting the fact that the gall bladder is at the inferior part of the liver and the adrenal gland are upper than kidneys. In the end, the network performance evaluation was compared with/without organ-specific post-processing.

Evaluation metrics

For each organ segmentation task, 20% of each database was randomly defined and used as the test dataset. The predicted segmentation masks were compared to the ground truth masks by measuring Dice and Jaccard coefficients.

Real-life evaluation on external datasets

To evaluate the performance of our model in real clinical scenarios on an external unseen dataset and compare our method to previously reported deep learning models, we tested our trained models on the online available TotalSegmentator dataset published recently by Wasserthal et al. (37) and then tested their trained model on the databases we used for testing. The dataset used in the above reference was local for the involved centers and could be considered completely unseen data for our models, In addition, they separated the test and train dataset to make the comparison more reproducible. They have used state-of-the-art nnU-Net (38) network/training methodology and managed to won 9 out of 10 MICCAI 2020 (39) and AMOS (23) challenges. We compared the performance of our model and models reported in the reference above for organs included in both studies (15 organs listed in Table 2). Figure 4 shows the dataflow in this study. Overall, we performed three evaluations: a) our model tested on our test dataset (23 organs), b) our model tested on the TotalSegmentator test dataset (15 organs), and c) the TotalSegmentator model tested on our dataset (15 organs). TotalSegmentator trained model was collected from GitHub on April 28, 2023.

Figure 4.

The Dataflow adopted for external evaluation. Green, light blue, and dark blue lines show the dataflow for strategies (a), (b), and (c), respectively.

View this table:

Table 2.

Results of the external evaluation using Wasserthal et al. (37) algorithm in strategies (b) and (c). * The brain images were distorted and blurred in Wasserthal et al. dataset. UB: Urinary Bladder, SC: Spinal Cord, GB: Gall Bladder, AG: Adrenal Gland, IVC: Inferior Vena Cava, SI: Small Intestine, GIs: Gastrointestinal.

Statistical analysis

We used the Wilcoxon rank t-test to evaluate the effect of post-processing on each organ.

Results

Evaluation on the test dataset

Table 1 summarizes the Dice and Jaccard image segmentation metrics for our model on test sets separated from our data (strategy (a) mentioned in the Methods section). The highest Dice coefficients were achieved for the lung, spleen, liver, and brain organs, while the lowest values were obtained for the thymus, adrenal gland organs. Organ-specific post-processing increased the Dice coefficient in seven organs by more than 0.15 absolute value and this increase was statistically significant. However, it did not increase or even significantly decrease the Dice coefficient for the remaining organs. These seven organs were the heart, spleen, UB, SC, aorta, GB, and thymus. Figure 5 depicts examples of 3D visualization of segmentations of CT images corresponding to different subjects shown from eight different perspectives. Supplementary figure 1 extends the examples shown in Figure 5 for pediatric cases and cases with unusual anatomical variations such as splenomegaly. Figure 6 depicts the Dice coefficients in different organs with/without post-processing. The detailed results of segmentation accuracy for each database included in the assessment are presented in supplementary Table 2.

Figure 5.

3D visualization of segmentation masks of different organs showing: the kidneys (dark green), femoral heads (lime), bones (yellow), liver (dark red), aorta (light red), spleen (cyan), heart (purple), stomach (light green), spinal cord (dark blue), urinary bladder (light blue), and the rectum (pink).

Figure 6.

Box plots of Dice coefficients achieved for different organ segmentations before and after post-processing. The red, blue, and green reference lines depict 60%, 80%, and 90% Dice coefficients, respectively. UB: Urinary Bladder, SC: Spinal Cord, GB: Gall Bladder, AG: Adrenal Gland, IVC: Inferior Vena Cava, SI: Small Intestine, GIs: Gastrointestinal.

External comparison

Table 2 summarizes the Dice and Jaccard evaluation metrics for strategies (b) and (c). A relative difference larger than 2% for the Dice coefficient was considered as significant when comparing strategies (b) and (c). Our model outperformed TotalSegmentator for five organs, including the liver, pancreas, spleen, UB, and femur heads, while the outcome was the same for the lungs and kidneys. For the remaining organs listed in Table 2 (7 organs), the TotalSegmentator models outperformed our models. The brain mask was not valid due to the blurring generated on the face area for privacy preserving concerns on TotalSegmentator dataset. Organ-specific post-processing improved the segmentation accuracy, reflected by higher Dice coefficients for strategy (b) in seven other organs, including the liver, kidneys, spleen, UB, esophagus, femur heads, and GB. Table 3 compares our model’s performance in strategy (a) to recent studies reported in the literature for common organs.

View this table:

Table 3. Model performance reported in terms of Dice coefficient in strategy (a) compared to recent studies in the field. UB: Urinary Bladder, SC: Spinal Cord, GB: Gall Bladder, AG: Adrenal Gland, IVC: Inferior Vena Cava.

Discussion

Automated organ segmentation is a critical step in a wide range of clinical applications, including personalized radiation dosimetry and quantification, and radiation treatment planning. The availability of a fast and reliable organ segmentation tool can facilitate the automation of these procedures and their adoption/deployment in clinical setting. In this work, we developed novel deep learning models to segment multiple organs from total-body CT images and compared the performance of our models with previous algorithms reported in the literature. Our model was trained on images presenting with high variability using large datasets, including adults, pediatrics and patients presenting with a wide range of pathologies and anatomic pathologies. The proposed model demonstrated an acceptable outcome even on cases with uncommon anatomies and pathologies, such as splenomegaly as shown in Error! Reference source not found.Error! Reference source not found.. Besides, we used prior anatomical knowledge for some organs in the body for organ-specific post-processing to improve the outcome. This methodology enabled to successfully improve the results in nine organs by achieving higher Dice coefficients in strategy (a) and could help improving the Dice coefficient in more than five organs in strategy (b). We used the model output anchor organ segmentation as reference for prior knowledge implementation i.e., no manual segmentation or ground truth segmentation was needed in post-processing. The error in anchor organ masks was used as input for the post-processing function that can propagate to the post-processed mask and accumulated errors can be problematic. One possible explanation is that our decision algorithm to exclude false positive segmented voxels was not successful in a number of organs. We believe that using ground truth anchor organ segmentations for post-processing can improve the post-processing capabilities. According to the results achieved through post-processing, we suggest using these algorithms for organs achieving good performance (Table 2 and Table 3).

To evaluate the performance of the proposed model on real-world unseen external datasets, we tested our model on the online available dataset provided by Wasserthal et al. (37). Our model outperformed their model for five organs in strategies (b) and (c). For most small organs and gastrointestinal organs, our model’s performance was inferior to their model, which can be explained by the different 2D and 3D training strategies used in our and their models, respectively. They have used the nn-Unet (38) model which demands a high computational burden for both training and inference. Their images were resampled to 1.5 mm isotropic voxel dimension in a specific orientation. Conversely, we resampled the images again during cropping and resizing in strategy (b), while our test set in strategy (c), the images were in the original dimensions and the used preprocessing steps were similar to the training step in their model. As shown in

Table 2, the number of valid cases in the testing group was limited and the effect of statistical difference can be significant, while the number of our test dataset was larger. It should be mentioned that their brain images were blurred in the face region to preserve the privacy of subjects. This has affected the performance of our model in strategy (b). In addition, their initial model was trained and improved based on models trained on the same dataset used for training our models and then manually edited the segmentations. As such, our test dataset was not really unseen for their model in strategy (c), while in strategy (b) their local data were completely unseen for our model. The effect of post-processing improved the performance of image segmentation of five organs and there is still scope for improvement that can be explored in future studies by changing the number of anchor organs or providing manual edited segmentation as an aid to generate robust post-processing without initial anchor segmentation errors.

The comparison of our models with recent studies revealed that our model’s performance was better or at least comparable to algorithms reported in the literature for most organs, except for small and gastrointestinal organs (Table 3). One of the main merits of our proposed model is its lightweight requiring a small number of parameters (533 K). In addition, for each organ, we have a separate light model, and the user can select a lower number of organs to be segmented to save time. We calculated the inference time on an NVIDIA RTX 4090 GPU where the average inference time for a total body CT was 1.67 seconds per case per organ. Besides, we tested inferring our model on an Intel Corei9 13900KF CPU and the inference time on the CPU was 14.5 seconds per study and per organ, which is a bearable and acceptable inference time for centers lacking access to dedicated GPUs.

We trained different deep learning models to segment 23 organs from total-body CT images which can be beneficial in various clinical tasks. We evaluated our models on an external dataset. The number of cases was limited to a few organs, and the segmentation criteria were different for each manual segmentation available from the online databases, inherently causing inter-observer variability, e.g., some databases provided whole segmented kidneys while others excluded pelvicalyceal systems. These differences may mislead our models and affect their performance.

Conclusion

We have developed a fully automated deep learning algorithm capable of generating accurate masks for multiple organs from CT images in an affordable computing time. After training these models on a diverse dataset comprising images from various databases, we compared the performance of our model with other algorithms on external datasets in real clinical scenarios. The proposed model exhibited remarkable capabilities even in cases involving uncommon anatomies and pathologies, such as splenomegaly.

Based on our analysis and results, we recommend using this algorithm especially for organs achieving excellent performance. One key advantage of our proposed models is their lightweight nature, enabling to run them efficiently on standard devices without access to dedicated GPUs in a bearable time. With an average GPU inference time of only 1.67 seconds per organ for a total-body CT image, they provide fast results and can be exploited in most routine tasks, even for verification in RT positioning. Overall, a reliable organ segmentation tool enables wider adoption by medical professionals in clinical setting.

Competing interests

The authors have no relevant financial or non-financial interests to disclose and the authors have no competing interests to declare that are relevant to the content of this article.

Acknowledgments

This work was supported by the Euratom research and training programme 2019-2020 Sinfonia project under grant agreement No 945196.

References

1.↵
Shi F, Hu W, Wu J, Han M, Wang J, Zhang W, et al. Deep learning empowered volume delineation of whole-body organs-at-risk for accelerated radiotherapy. Nature communications. 2022;13(1):6566.
OpenUrl Google Scholar
2.
Mohammadi R, Shokatian I, Salehi M, Arabi H, Shiri I, Zaidi H. Deep learning-based auto-segmentation of organs at risk in high-dose rate brachytherapy of cervical cancer. Radiother Oncol. 2021;159:231–40.
OpenUrl Google Scholar
3.
Ibragimov B, Xing L. Segmentation of organs-at-risks in head and neck CT images using convolutional neural networks. Med Phys. 2017;44(2):547–57.
OpenUrl CrossRef Google Scholar
4.↵
Akhavanallaf A, Fayad H, Salimi Y, Aly A, Kharita H, Al Naemi H, et al. An update on computational anthropomorphic anatomical models. Digit Health. 2022;8:20552076221111941.
OpenUrl Google Scholar
5.↵
Tang Y, Huo Y, Xiong Y, Moon H, Assad A, Moyo T, et al. Improving splenomegaly segmentation by learning from heterogeneous multi-source labels. SPIE Medical Imaging: SPIE; 2019.
Google Scholar
6.↵
Yang Y, Tang Y, Gao R, Bao S, Huo Y, McKenna MT, et al. Validation and estimation of spleen volume via computer-assisted segmentation on clinically acquired CT scans. J Med Imaging (Bellingham). 2021;8(1):014004.
OpenUrl Google Scholar
7.↵
Hernandez-Boussard T, Macklin P, Greenspan EJ, Gryshuk AL, Stahlberg E, Syeda-Mahmood T, et al. Digital twins for predictive oncology will be a paradigm shift for precision cancer care. Nat Med. 2021;27(12):2065–6.
OpenUrl CrossRef Google Scholar
8.↵
Shiri I, Salimi Y, Pakbin M, Hajianfar G, Avval AH, Sanaat A, et al. COVID-19 prognostic modeling using CT radiomic features and machine learning algorithms: Analysis of a multi-institutional dataset of 14,339 patients. Comput Biol Med. 2022;145:105467.
OpenUrl CrossRef PubMed Google Scholar
9.↵
Lindgren Belal S, Sadik M, Kaboteh R, Enqvist O, Ulén J, Poulsen MH, et al. Deep learning for segmentation of 49 selected bones in CT scans: First step in automated PET/CT-based 3D quantification of skeletal metastases. European Journal of Radiology. 2019;113:89–95.
OpenUrl Google Scholar
10.↵
van Sluis J, Noordzij W, de Vries EGE, Kok IC, de Groot DJA, Jalving M, et al. Manual Versus Artificial Intelligence-Based Segmentations as a Pre-processing Step in Whole-body PET Dosimetry Calculations. Mol Imaging Biol. 2023;25(2):435–41.
OpenUrl Google Scholar
11.↵
Xie T, Zaidi H. Estimation of the radiation dose in pregnancy: an automated patient-specific model using convolutional neural networks. Eur Radiol. 2019;29(12):6805–15.
OpenUrl Google Scholar
12.
Fu W, Sharma S, Abadi E, Iliopoulos AS, Wang Q, Lo JY, et al. iPhantom: A Framework for Automated Creation of Individualized Computational Phantoms and Its Application to CT Organ Dosimetry. IEEE J Biomed Health Inform. 2021;25(8):3061–72.
OpenUrl Google Scholar
13.↵
Salimi Y, Akhavanallaf A, Mansouri Z, Shiri I, Zaidi H. Real-time, acquisition parameter-free voxel-wise patient-specific Monte Carlo dose reconstruction in whole-body CT scanning using deep neural networks. Eur Radiol. 2023.
Google Scholar
14.↵
Hobbis D, Yu NY, Mund KW, Duan J, Rwigema JM, Wong WW, et al. First Report On Physician Assessment and Clinical Acceptability of Custom-Retrained Artificial Intelligence Models for Clinical Target Volume and Organs-at-Risk Auto-Delineation for Postprostatectomy Patients. Pract Radiat Oncol. 2023;13(4):351–62.
OpenUrl Google Scholar
15.↵
Liao W, Luo X, He Y, Dong Y, Li C, Li K, et al. Comprehensive Evaluation of a Deep Learning Model for Automatic Organs at Risk Segmentation on Heterogeneous Computed Tomography Images for Abdominal Radiation Therapy. Int J Radiat Oncol Biol Phys. 2023.
Google Scholar
16.↵
Fu Y, Lei Y, Wang T, Curran WJ, Liu T, Yang X. A review of deep learning based methods for medical image multi-organ segmentation. Phys Med. 2021;85:107–22.
OpenUrl Google Scholar
17.
Vrtovec T, Močnik D, Strojan P, Pernuš F, Ibragimov B. Auto-segmentation of organs at risk for head and neck radiotherapy planning: From atlas-based to deep learning methods. Med Phys. 2020;47(9):e929–e50.
OpenUrl Google Scholar
18.↵
Xiaoyu Liu LQ, Ziyue Xie, Jiayue Zhao, Yonghong Shi, and Zhijian Song. Towards More Precise Automatic Analysis: A Comprehensive Survey of Deep Learning-based Multi-organ Segmentation. Arxive. 2023.
Google Scholar
19.↵
Ma J, Zhang Y, Gu S, Zhu C, Ge C, Zhang Y, et al. AbdomenCT-1K: Is Abdominal Organ Segmentation a Solved Problem? IEEE Trans Pattern Anal Mach Intell. 2022;44(10):6695–714.
OpenUrl Google Scholar
20.
Huang Z, Wang H, Deng Z, Ye J, Su Y, Sun H, et al. STU-Net: Scalable and Transferable Medical Image Segmentation Models Empowered by Large-Scale Supervised Pre-training. arXiv pre-print server. 2023.
Google Scholar
21.↵
Zhao Q, Zhong L, Xiao J, Zhang J, Chen Y, Liao W, et al. Efficient Multi-Organ Segmentation from 3D Abdominal CT Images with Lightweight Network and Knowledge Distillation. IEEE Trans Med Imaging. 2023;PP:1-.
Google Scholar
22.↵
Hadjiiski L, Cha K, Chan HP, Drukker K, Morra L, Näppi JJ, et al. AAPM task group report 273: Recommendations on best practices for AI and machine learning for computer-aided diagnosis in medical imaging. Med Phys. 2023;50(2):e1–e24.
OpenUrl Google Scholar
23.↵
Ji Y, Bai H, Ge C, Yang J, Zhu Y, Zhang R, et al. Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation. Advances in Neural Information Processing Systems. 2022;35:36722-32.
OpenUrl Google Scholar
24.↵
Xu Y, Tang O, Tang Y, Lee HH, Chen Y, Gao D, et al. Outlier Guided Optimization of Abdominal Segmentation. Proc SPIE Int Soc Opt Eng. 2020;11313:1131336.
OpenUrl Google Scholar
25.↵
Lucido JJ, DeWees TA, Leavitt TR, Anand A, Beltran CJ, Brooke MD, et al. Validation of clinical acceptability of deep-learning-based automated segmentation of organs-at-risk for head-and-neck radiotherapy treatment planning. Front Oncol. 2023;13:1137803.
OpenUrl Google Scholar
26.↵
Salimi Y, Shiri I, Akhavanallaf A, Mansouri Z, Saberi Manesh A, Sanaat A, et al. Deep learning-based fully automated Z-axis coverage range definition from scout scans to eliminate overscanning in chest CT imaging. Insights Imaging. 2021;12(1):162.
OpenUrl Google Scholar
27.↵
Patrick Bilic1a PFCa, Eugene Vorontsov1e,1, Grzegorz Chlebusr, Hao, Chenm, Qi Doum, Chi-Wing Fum, Xiao Hanp, Pheng-Ann Hengm, Jrgen Hesserq, Samuel, Kadourye, Tomasz Kopczyskiv, Miao Leo, Chunming Lio, Xiaomeng Lim, Jana Lipkováa,, John Lowengrubn HM, Jan Hendrik Moltzr, Chris Pale,1, Marie Pirauda,, Xiaojuan Qim JQ, 1, Markus Rempflera, Karsten Rothq, Andrea Schenkr, Anjany, Sekuboyinaa PZ, Christian Hülsemeyera, Marcel Beetza, Florian Ettlingera, Felix, et al. The Liver Tumor Segmentation Benchmark (LiTS). 2019.
Google Scholar
28.
Heller N, Sathianathen N, Kalapara A, Walczak E, Moore K, Kaluzniak H, et al. The kits19 challenge data: 300 kidney tumor cases with clinical context, ct semantic segmentations, and surgical outcomes. arXiv preprint arXiv:190400445. 2019.
Google Scholar
29.
Rister B, Yi D, Shivakumar K, Nobashi T, Rubin DL. CT-ORG, a new dataset for multiple organ segmentation in computed tomography. Sci Data. 2020;7(1):381.
OpenUrl Google Scholar
30.
Antonelli M, Reinke A, Bakas S, Farahani K, Kopp-Schneider A, Landman BA, et al. The Medical Segmentation Decathlon. Nature communications. 2022;13(1):4128.
OpenUrl Google Scholar
31.
Sekuboyina A, Husseini ME, Bayat A, Loffler M, Liebl H, Li H, et al. VerSe: A Vertebrae labelling and segmentation benchmark for multi-detector CT images. Medical image analysis. 2021;73:102166.
OpenUrl Google Scholar
32.↵
Jordan P, Adamson PM, Bhattbhatt V, Beriwal S, Shen S, Radermecker O, et al. Pediatric chest-abdomen-pelvis and abdomen-pelvis CT images with expert organ contours. Med Phys. 2022;49(5):3523–8.
OpenUrl Google Scholar
33.↵
AAPM. Use of Water Equivalent Diameter for Calculating Patient Size and Size-Specific Dose Estimates (SSDE) in CT. AAPM; 2014. Report No.: The Report of AAPM Task Group 220.
Google Scholar
34.↵
Salimi Y, Shiri I, Akhavanallaf A, Mansouri Z, Sanaat A, Pakbin M, et al. Deep Learning-based calculation of patient size and attenuation surrogates from localizer Image: Toward personalized chest CT protocol optimization. European Journal of Radiology. 2022;157:110602.
OpenUrl Google Scholar
35.↵
Salimi Y, Shiri I, Akavanallaf A, Mansouri Z, Arabi H, Zaidi H. Fully automated accurate patient positioning in computed tomography using anterior-posterior localizer images and a deep neural network: a dual-center study. Eur Radiol. 2023;33(5):3243–52.
OpenUrl Google Scholar
36.↵
Shiri I, Arabi H, Sanaat A, Jenabi E, Becker M, Zaidi H. Fully Automated Gross Tumor Volume Delineation From PET in Head and Neck Cancer Using Deep Learning Algorithms. Clin Nucl Med. 2021;46(11):872–83.
OpenUrl Google Scholar
37.↵
Wasserthal J, Breit H-C, Meyer MT, Pradella M, Hinck D, Sauter AW, et al. TotalSegmentator: Robust segmentation of 104 anatomic structures in CT images. Radiology: Artificial Intelligence. 2023;0:e230024.
OpenUrl Google Scholar
38.↵
Isensee F, Jaeger PF, Kohl SAA, Petersen J, Maier-Hein KH. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. 2021;18(2):203–11.
OpenUrl CrossRef PubMed Google Scholar
39.↵
Ma J. Cutting-edge 3D medical image segmentation methods in 2020: Are happy families all alike? arXiv preprint arXiv:210100232. 2021.
Google Scholar
40.
Tang Y, Gao R, Lee HH, Han S, Chen Y, Gao D, et al. High-resolution 3D abdominal segmentation with random patch network fusion. Medical image analysis. 2021;69:101894.
OpenUrl Google Scholar
41.
Duan J, Bernard M, Downes L, Willows B, Feng X, Mourad WF, et al. Evaluating the clinical acceptability of deep learning contours of prostate and organs-at-risk in an automated prostate treatment planning process. Med Phys. 2022;49(4):2570–81.
OpenUrl Google Scholar
42.
Xiao C, Jin J, Yi J, Han C, Zhou Y, Ai Y, et al. RefineNet-based 2D and 3D automatic segmentations for clinical target volume and organs at risks for patients with cervical cancer in postoperative radiotherapy. J Appl Clin Med Phys. 2022;23(7):e13631.
OpenUrl Google Scholar
43.
Song Y, Teoh JY, Choi KS, Qin J. Dynamic Loss Weighting for Multiorgan Segmentation in Medical Images. IEEE Trans Neural Netw Learn Syst. 2023;PP:1–12.
Google Scholar
44.
Wang J, Qu A, Wang Q, Zhao Q, Liu J, Wu Q. TT-Net: Tensorized Transformer Network for 3D medical image segmentation. Comput Med Imaging Graph. 2023;107:102234.
OpenUrl Google Scholar
45.
Crespi L, Portanti M, Loiacono D. Comparing Adversarial and Supervised Learning for Organs at Risk Segmentation in CT images. arXiv preprint arXiv:230317941. 2023.
Google Scholar
46.
Siciarz P, McCurdy B. U-net architecture with embedded Inception-ResNet-v2 image encoding modules for automatic segmentation of organs-at-risk in head and neck cancer radiation therapy based on computed tomography scans. Phys Med Biol. 2022;67(11).
Google Scholar
47.
Chen P-H, Huang C-H, Chiu W-T, Liao C-M, Lin Y-R, Hung S-K, et al. A multiple organ segmentation system for CT image series using Attention-LSTM fused U-Net. Multimed Tools Appl. 2022;81(9):11881–95.
OpenUrl Google Scholar

Comments

medRxiv aims to provide a venue for anyone to comment on a medRxiv preprint. Comments are moderated for offensive or irrelevant content (this can take ~24 h). Please avoid duplicate submissions and read our Comment Policy before commenting. The content of a comment is not endorsed by medRxiv.

Community Reviews

medRxiv aims to inform readers about online discussion of this preprint occurring elsewhere. The content at the links below is not endorsed by either medRxiv or the preprint's authors.

Community reviews for this article:

There are no community reviews for this paper.

Automated Evaluations

Certain services provide automated analysis of preprints. Analyses invited by the authors are displayed at the top of this tab. Those done independently of authors are shown underneath . None of these analyses is endorsed by medRxiv.

Automated Evaluations:

There are no automated evaluations for this paper.

[1] 1.↵
Shi F, Hu W, Wu J, Han M, Wang J, Zhang W, et al. Deep learning empowered volume delineation of whole-body organs-at-risk for accelerated radiotherapy. Nature communications. 2022;13(1):6566.
OpenUrl Google Scholar

[2] 2.
Mohammadi R, Shokatian I, Salehi M, Arabi H, Shiri I, Zaidi H. Deep learning-based auto-segmentation of organs at risk in high-dose rate brachytherapy of cervical cancer. Radiother Oncol. 2021;159:231–40.
OpenUrl Google Scholar

[3] 3.
Ibragimov B, Xing L. Segmentation of organs-at-risks in head and neck CT images using convolutional neural networks. Med Phys. 2017;44(2):547–57.
OpenUrl CrossRef Google Scholar

[4] 4.↵
Akhavanallaf A, Fayad H, Salimi Y, Aly A, Kharita H, Al Naemi H, et al. An update on computational anthropomorphic anatomical models. Digit Health. 2022;8:20552076221111941.
OpenUrl Google Scholar

[5] 5.↵
Tang Y, Huo Y, Xiong Y, Moon H, Assad A, Moyo T, et al. Improving splenomegaly segmentation by learning from heterogeneous multi-source labels. SPIE Medical Imaging: SPIE; 2019.
Google Scholar

[6] 6.↵
Yang Y, Tang Y, Gao R, Bao S, Huo Y, McKenna MT, et al. Validation and estimation of spleen volume via computer-assisted segmentation on clinically acquired CT scans. J Med Imaging (Bellingham). 2021;8(1):014004.
OpenUrl Google Scholar

[7] 7.↵
Hernandez-Boussard T, Macklin P, Greenspan EJ, Gryshuk AL, Stahlberg E, Syeda-Mahmood T, et al. Digital twins for predictive oncology will be a paradigm shift for precision cancer care. Nat Med. 2021;27(12):2065–6.
OpenUrl CrossRef Google Scholar

[8] 8.↵
Shiri I, Salimi Y, Pakbin M, Hajianfar G, Avval AH, Sanaat A, et al. COVID-19 prognostic modeling using CT radiomic features and machine learning algorithms: Analysis of a multi-institutional dataset of 14,339 patients. Comput Biol Med. 2022;145:105467.
OpenUrl CrossRef PubMed Google Scholar

[9] 9.↵
Lindgren Belal S, Sadik M, Kaboteh R, Enqvist O, Ulén J, Poulsen MH, et al. Deep learning for segmentation of 49 selected bones in CT scans: First step in automated PET/CT-based 3D quantification of skeletal metastases. European Journal of Radiology. 2019;113:89–95.
OpenUrl Google Scholar

[10] 10.↵
van Sluis J, Noordzij W, de Vries EGE, Kok IC, de Groot DJA, Jalving M, et al. Manual Versus Artificial Intelligence-Based Segmentations as a Pre-processing Step in Whole-body PET Dosimetry Calculations. Mol Imaging Biol. 2023;25(2):435–41.
OpenUrl Google Scholar

[11] 11.↵
Xie T, Zaidi H. Estimation of the radiation dose in pregnancy: an automated patient-specific model using convolutional neural networks. Eur Radiol. 2019;29(12):6805–15.
OpenUrl Google Scholar

[12] 12.
Fu W, Sharma S, Abadi E, Iliopoulos AS, Wang Q, Lo JY, et al. iPhantom: A Framework for Automated Creation of Individualized Computational Phantoms and Its Application to CT Organ Dosimetry. IEEE J Biomed Health Inform. 2021;25(8):3061–72.
OpenUrl Google Scholar

[13] 13.↵
Salimi Y, Akhavanallaf A, Mansouri Z, Shiri I, Zaidi H. Real-time, acquisition parameter-free voxel-wise patient-specific Monte Carlo dose reconstruction in whole-body CT scanning using deep neural networks. Eur Radiol. 2023.
Google Scholar

[14] 14.↵
Hobbis D, Yu NY, Mund KW, Duan J, Rwigema JM, Wong WW, et al. First Report On Physician Assessment and Clinical Acceptability of Custom-Retrained Artificial Intelligence Models for Clinical Target Volume and Organs-at-Risk Auto-Delineation for Postprostatectomy Patients. Pract Radiat Oncol. 2023;13(4):351–62.
OpenUrl Google Scholar

[15] 15.↵
Liao W, Luo X, He Y, Dong Y, Li C, Li K, et al. Comprehensive Evaluation of a Deep Learning Model for Automatic Organs at Risk Segmentation on Heterogeneous Computed Tomography Images for Abdominal Radiation Therapy. Int J Radiat Oncol Biol Phys. 2023.
Google Scholar

[16] 16.↵
Fu Y, Lei Y, Wang T, Curran WJ, Liu T, Yang X. A review of deep learning based methods for medical image multi-organ segmentation. Phys Med. 2021;85:107–22.
OpenUrl Google Scholar

[17] 17.
Vrtovec T, Močnik D, Strojan P, Pernuš F, Ibragimov B. Auto-segmentation of organs at risk for head and neck radiotherapy planning: From atlas-based to deep learning methods. Med Phys. 2020;47(9):e929–e50.
OpenUrl Google Scholar

[18] 18.↵
Xiaoyu Liu LQ, Ziyue Xie, Jiayue Zhao, Yonghong Shi, and Zhijian Song. Towards More Precise Automatic Analysis: A Comprehensive Survey of Deep Learning-based Multi-organ Segmentation. Arxive. 2023.
Google Scholar

[19] 19.↵
Ma J, Zhang Y, Gu S, Zhu C, Ge C, Zhang Y, et al. AbdomenCT-1K: Is Abdominal Organ Segmentation a Solved Problem? IEEE Trans Pattern Anal Mach Intell. 2022;44(10):6695–714.
OpenUrl Google Scholar

[20] 20.
Huang Z, Wang H, Deng Z, Ye J, Su Y, Sun H, et al. STU-Net: Scalable and Transferable Medical Image Segmentation Models Empowered by Large-Scale Supervised Pre-training. arXiv pre-print server. 2023.
Google Scholar

[21] 21.↵
Zhao Q, Zhong L, Xiao J, Zhang J, Chen Y, Liao W, et al. Efficient Multi-Organ Segmentation from 3D Abdominal CT Images with Lightweight Network and Knowledge Distillation. IEEE Trans Med Imaging. 2023;PP:1-.
Google Scholar

[22] 22.↵
Hadjiiski L, Cha K, Chan HP, Drukker K, Morra L, Näppi JJ, et al. AAPM task group report 273: Recommendations on best practices for AI and machine learning for computer-aided diagnosis in medical imaging. Med Phys. 2023;50(2):e1–e24.
OpenUrl Google Scholar

[23] 23.↵
Ji Y, Bai H, Ge C, Yang J, Zhu Y, Zhang R, et al. Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation. Advances in Neural Information Processing Systems. 2022;35:36722-32.
OpenUrl Google Scholar

[24] 24.↵
Xu Y, Tang O, Tang Y, Lee HH, Chen Y, Gao D, et al. Outlier Guided Optimization of Abdominal Segmentation. Proc SPIE Int Soc Opt Eng. 2020;11313:1131336.
OpenUrl Google Scholar

[25] 25.↵
Lucido JJ, DeWees TA, Leavitt TR, Anand A, Beltran CJ, Brooke MD, et al. Validation of clinical acceptability of deep-learning-based automated segmentation of organs-at-risk for head-and-neck radiotherapy treatment planning. Front Oncol. 2023;13:1137803.
OpenUrl Google Scholar

[26] 26.↵
Salimi Y, Shiri I, Akhavanallaf A, Mansouri Z, Saberi Manesh A, Sanaat A, et al. Deep learning-based fully automated Z-axis coverage range definition from scout scans to eliminate overscanning in chest CT imaging. Insights Imaging. 2021;12(1):162.
OpenUrl Google Scholar

[27] 27.↵
Patrick Bilic1a PFCa, Eugene Vorontsov1e,1, Grzegorz Chlebusr, Hao, Chenm, Qi Doum, Chi-Wing Fum, Xiao Hanp, Pheng-Ann Hengm, Jrgen Hesserq, Samuel, Kadourye, Tomasz Kopczyskiv, Miao Leo, Chunming Lio, Xiaomeng Lim, Jana Lipkováa,, John Lowengrubn HM, Jan Hendrik Moltzr, Chris Pale,1, Marie Pirauda,, Xiaojuan Qim JQ, 1, Markus Rempflera, Karsten Rothq, Andrea Schenkr, Anjany, Sekuboyinaa PZ, Christian Hülsemeyera, Marcel Beetza, Florian Ettlingera, Felix, et al. The Liver Tumor Segmentation Benchmark (LiTS). 2019.
Google Scholar

[28] 28.
Heller N, Sathianathen N, Kalapara A, Walczak E, Moore K, Kaluzniak H, et al. The kits19 challenge data: 300 kidney tumor cases with clinical context, ct semantic segmentations, and surgical outcomes. arXiv preprint arXiv:190400445. 2019.
Google Scholar

[29] 29.
Rister B, Yi D, Shivakumar K, Nobashi T, Rubin DL. CT-ORG, a new dataset for multiple organ segmentation in computed tomography. Sci Data. 2020;7(1):381.
OpenUrl Google Scholar

[30] 30.
Antonelli M, Reinke A, Bakas S, Farahani K, Kopp-Schneider A, Landman BA, et al. The Medical Segmentation Decathlon. Nature communications. 2022;13(1):4128.
OpenUrl Google Scholar

[31] 31.
Sekuboyina A, Husseini ME, Bayat A, Loffler M, Liebl H, Li H, et al. VerSe: A Vertebrae labelling and segmentation benchmark for multi-detector CT images. Medical image analysis. 2021;73:102166.
OpenUrl Google Scholar

[32] 32.↵
Jordan P, Adamson PM, Bhattbhatt V, Beriwal S, Shen S, Radermecker O, et al. Pediatric chest-abdomen-pelvis and abdomen-pelvis CT images with expert organ contours. Med Phys. 2022;49(5):3523–8.
OpenUrl Google Scholar

[33] 33.↵
AAPM. Use of Water Equivalent Diameter for Calculating Patient Size and Size-Specific Dose Estimates (SSDE) in CT. AAPM; 2014. Report No.: The Report of AAPM Task Group 220.
Google Scholar

[34] 34.↵
Salimi Y, Shiri I, Akhavanallaf A, Mansouri Z, Sanaat A, Pakbin M, et al. Deep Learning-based calculation of patient size and attenuation surrogates from localizer Image: Toward personalized chest CT protocol optimization. European Journal of Radiology. 2022;157:110602.
OpenUrl Google Scholar

[35] 35.↵
Salimi Y, Shiri I, Akavanallaf A, Mansouri Z, Arabi H, Zaidi H. Fully automated accurate patient positioning in computed tomography using anterior-posterior localizer images and a deep neural network: a dual-center study. Eur Radiol. 2023;33(5):3243–52.
OpenUrl Google Scholar

[36] 36.↵
Shiri I, Arabi H, Sanaat A, Jenabi E, Becker M, Zaidi H. Fully Automated Gross Tumor Volume Delineation From PET in Head and Neck Cancer Using Deep Learning Algorithms. Clin Nucl Med. 2021;46(11):872–83.
OpenUrl Google Scholar

[37] 37.↵
Wasserthal J, Breit H-C, Meyer MT, Pradella M, Hinck D, Sauter AW, et al. TotalSegmentator: Robust segmentation of 104 anatomic structures in CT images. Radiology: Artificial Intelligence. 2023;0:e230024.
OpenUrl Google Scholar

[38] 38.↵
Isensee F, Jaeger PF, Kohl SAA, Petersen J, Maier-Hein KH. nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation. Nat Methods. 2021;18(2):203–11.
OpenUrl CrossRef PubMed Google Scholar

[39] 39.↵
Ma J. Cutting-edge 3D medical image segmentation methods in 2020: Are happy families all alike? arXiv preprint arXiv:210100232. 2021.
Google Scholar

[40] 40.
Tang Y, Gao R, Lee HH, Han S, Chen Y, Gao D, et al. High-resolution 3D abdominal segmentation with random patch network fusion. Medical image analysis. 2021;69:101894.
OpenUrl Google Scholar

[41] 41.
Duan J, Bernard M, Downes L, Willows B, Feng X, Mourad WF, et al. Evaluating the clinical acceptability of deep learning contours of prostate and organs-at-risk in an automated prostate treatment planning process. Med Phys. 2022;49(4):2570–81.
OpenUrl Google Scholar

[42] 42.
Xiao C, Jin J, Yi J, Han C, Zhou Y, Ai Y, et al. RefineNet-based 2D and 3D automatic segmentations for clinical target volume and organs at risks for patients with cervical cancer in postoperative radiotherapy. J Appl Clin Med Phys. 2022;23(7):e13631.
OpenUrl Google Scholar

[43] 43.
Song Y, Teoh JY, Choi KS, Qin J. Dynamic Loss Weighting for Multiorgan Segmentation in Medical Images. IEEE Trans Neural Netw Learn Syst. 2023;PP:1–12.
Google Scholar

[44] 44.
Wang J, Qu A, Wang Q, Zhao Q, Liu J, Wu Q. TT-Net: Tensorized Transformer Network for 3D medical image segmentation. Comput Med Imaging Graph. 2023;107:102234.
OpenUrl Google Scholar

[45] 45.
Crespi L, Portanti M, Loiacono D. Comparing Adversarial and Supervised Learning for Organs at Risk Segmentation in CT images. arXiv preprint arXiv:230317941. 2023.
Google Scholar

[46] 46.
Siciarz P, McCurdy B. U-net architecture with embedded Inception-ResNet-v2 image encoding modules for automatic segmentation of organs-at-risk in head and neck cancer radiation therapy based on computed tomography scans. Phys Med Biol. 2022;67(11).
Google Scholar

[47] 47.
Chen P-H, Huang C-H, Chiu W-T, Liao C-M, Lin Y-R, Hung S-K, et al. A multiple organ segmentation system for CT image series using Attention-LSTM fused U-Net. Multimed Tools Appl. 2022;81(9):11881–95.
OpenUrl Google Scholar

Deep learning-assisted multiple organ segmentation from whole-body CT images

Abstract

Introduction