PT  - JOURNAL ARTICLE
AU  - Kim, Joon
AU  - Lee, Hoyeon
AU  - Park, Jonghyeok
AU  - Park, Sang Hyun
AU  - Lee, Myungjae
AU  - Sunwoo, Leonard
AU  - Kim, Chi Kyung
AU  - Kim, Beom Joon
AU  - Ryu, Wi-Sun
TI  - In-Silo Federated Learning vs. Centralized Learning for Segmenting Acute and Chronic Ischemic Brain Lesions
AID  - 10.1101/2024.05.24.24307154
DP  - 2024 Jan 01
TA  - medRxiv
PG  - 2024.05.24.24307154
4099  - http://medrxiv.org/content/early/2024/05/26/2024.05.24.24307154.short
4100  - http://medrxiv.org/content/early/2024/05/26/2024.05.24.24307154.full
AB  - Purpose To investigate the efficacy of federated learning (FL) compared to industry-level centralized learning (CL) for segmenting acute infarct and white matter hyperintensity.Materials and Methods This retrospective study included 13,546 diffusion-weighted images (DWI) from 10 hospitals and 8,421 fluid-attenuated inversion recovery images (FLAIR) from 9 hospitals for acute (Task I) and chronic (Task II) lesion segmentation. The mean ages (SD) for the training datasets were 68.1 (12.8) for Task I and 67.4 (13.0) for Task II. The frequency of male participants was 51.5% and 60.4%, respectively. We trained with datasets from 9 and 3 institutions for Task I and Task II, respectively, and externally tested them in datasets from 1 and 9 institutions each. For FL, the central server aggregated training results every four rounds with FedYogi (Task I) and FedAvg (Task II). A batch clipping strategy was tested for the FL models. Performances were evaluated with the Dice similarity coefficient (DSC).Results In Task I, the FL model employing batch clipping trained for 360 epochs achieved a DSC of 0.754±0.183, surpassing an equivalent CL model (DSC 0.691±0.229; p&amp;lt;0.001) and comparable to the best-performing CL model at 940 epochs (DSC 0.755±0.207; p=0.701). In Task II, no significant differences were observed amongst FL model with clipping, without clipping, and CL model after 48 epochs (DSCs of 0.761±0.299, 0.751±0.304, 0.744±0.304). Few-shot FL showed significantly lower performance. Task II reduced training times with batch clipping (3.5 to 1.75 hours).Conclusion Comparisons between CL and FL in identical settings suggest the feasibility of FL for medical image segmentation.Competing Interest StatementHoyeon Lee, Jonghyeok Park, Myungjae Lee, and Wi-Sun Ryu are employees of JLK Inc., Seoul, Republic of Korea. Other authors had nothing to declare.Funding StatementThis study was supported by the Multiministry Grant for Medical Device Development (KMDF_PR_20200901_0098).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The study protocol was approved by institutional review board of Dongguk University Ilsan Hospital (2017-09-017).I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.YesData generated or analyzed during the study are available from the corresponding author by request.