Deep Learning in Magnetic Resonance Enterography for Crohn’s Disease Assessment: A Systematic Review

Ofir Brem; David Elisha; Eli Konen; Michal Amitai; Eyal Klang

doi:10.1101/2023.12.27.23300507

Abstract

Crohn’s disease (CD) causes significant morbidity, underscoring the need for effective, non-invasive assessment of inflammation using magnetic resonance enterography (MRE).

This literature review evaluates recent publications on deep learning’s role in enhancing MRE segmentation, image quality, and visualization of inflammatory activity related to CD.

We searched MEDLINE/PubMed for studies that reported the use of deep learning algorithms for assessment of CD activity. The study was conducted according to the PRISMA guidelines. The risk of bias was evaluated using the QUADAS-2 tool.

Five eligible studies, encompassing 468 subjects, were identified.

Our study suggests that diverse deep learning applications, including image quality enhancement, bowel segmentation, and motility measurement, are useful and promising for CD assessment. However, most of the studies are preliminary and retrospective, and have a high risk of bias in at least one category.

Future research is needed to assess how automated deep learning can impact patient care, especially when considering the increasing integration of these models into hospital systems.

Introduction

Crohn’s disease (CD) is associated with substantial morbidity^1,2. Managing inflammation is crucial for preventing disease complications, which makes effective assessment of inflammation important³.

Colonoscopy is the gold standard for CD diagnosis. However, it is invasive, and its evaluation of the small bowel remains inadequate⁴. Magnetic resonance enterography (MRE) is a non-invasive technique that is effective for assessing CD activity^4–8. Its non-invasive nature also offers potential for evaluating treatment response or identifying therapeutic inefficacy. This can prompt early detection and timely adjustments in therapy to maintain clinical remission^9–11. Nevertheless, diagnosing Crohn’s disease using MRE is time-intensive and demands a high level of expertise.

In recent years, artificial intelligence (AI), especially convolutional neural networks (CNNs), has notably impacted computer vision. CNNs, a type of deep learning model (Figure 1), excel in pattern recognition and are changing the way medical images are analyzed^12,13. This technology offers an innovative approach to diagnosing and monitoring CD activity^14–17.

Figure 1

CNNs specialize in image processing, using small filters in each layer to identify recurring patterns. Their hierarchical structure enables shallow layers to detect low-level patterns and deeper layers to capture high-level image content.
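The filter idea in the Figure 1 caption can be illustrated with a minimal sketch. It assumes a toy 5x5 image and a hand-written 3x3 vertical-edge filter; in an actual CNN the filter weights are learned from data, and none of the names below come from the reviewed studies.

```python
import numpy as np

def conv2d_valid(image, kernel):
    """Slide a small filter over the image (no padding, stride 1).

    As in most deep learning frameworks, this is technically a
    cross-correlation: the kernel is not flipped.
    """
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            # Element-wise product of the filter with the current image patch.
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Hypothetical toy image: dark left columns, bright right columns (a vertical edge).
image = np.array([[0, 0, 1, 1, 1]] * 5, dtype=float)

# Hypothetical hand-written vertical-edge filter (learned in a real CNN).
kernel = np.array([[-1, 0, 1],
                   [-1, 0, 1],
                   [-1, 0, 1]], dtype=float)

feature_map = conv2d_valid(image, kernel)
print(feature_map)  # strong response where the patch covers the edge, zero in the flat region
```

The same small filter is reused at every position, which is why a single layer can pick up a recurring pattern anywhere in the image.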

We reviewed the literature to evaluate articles focused on the use of deep learning to improve MRE analysis in Crohn’s disease.

Methods

This review adhered to the guidelines outlined in the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). The PRISMA checklist for systematic reviews can be found in Supplementary Material 1.

Search strategy

We conducted an extensive literature search on October 1, 2023, using the PubMed/MEDLINE database to identify studies investigating the use of deep learning for detecting CD in MRI/MRE. Our search terms combined “MRI or MRE” and “Crohn’s disease” with “deep learning” or related terms such as “convolutional neural networks,” “machine learning,” and “artificial intelligence.”

Detailed information about the complete search strategies can be found in Supplementary Material 2. We also conducted a manual search of the references in the studies we included.

Inclusion criteria encompassed studies that (1) assessed the effectiveness of a deep learning model in detecting CD on MRI/MRE, (2) were published in the English language, (3) were peer-reviewed original publications, and (4) included an outcome measure.

Exclusions were applied to articles not related to computer vision, non-deep learning articles, non-original articles, and abstracts.

This study is registered with PROSPERO under the registration number CRD42023484725.

Study selection

Two authors (OB and EK) independently assessed the titles and abstracts to determine whether the studies satisfied the inclusion criteria. When the title met the inclusion criteria, or if any uncertainty arose, the full-text article was examined. If a relevant title appeared in the reference list of one of the included studies, it was also screened for inclusion. Disagreements were resolved by consulting a third reviewer (DE).

Data extraction

Using a uniform data extraction template in Microsoft Excel, the two reviewers (OB and EK) independently extracted the data. The data included publication year, study design and location, patient count, ethical considerations, inclusion and exclusion criteria, study population description, deep learning technique, use of an online database, database size, use of an independent test dataset, use of cross-validation, assessment metrics, and the main findings.

Quality assessment and risk of bias

We evaluated the quality of the studies using the modified Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) criteria¹⁸.

Results

Study selection and characteristics

The initial literature search resulted in 16 articles. Five studies met our inclusion criteria (Supplementary Figure 1). Studies were published between 2019 and 2023. A total of 468 subjects were analyzed. Table 1 summarizes the characteristics of the included studies. Figure 2 summarizes the clinical application of each study included.

Supplementary Figure 1:

Flow Diagram of the search and inclusion process

Figure 2

Graphical depiction categorizing deep learning studies according to their clinical application.

Table 1: A Summary of articles in the literature review that applied deep learning techniques for magnetic resonance imaging involving Crohn’s Disease

Four of the studies were retrospective and one was prospective. In four of the studies, a board-certified radiologist served as the reference standard. None of the studies performed external validation.

Descriptive summary of results

Studies included in this review utilized deep learning techniques for various tasks.

Son et al. used deep learning-based reconstruction (DLR) and Lian et al. employed a CNN to improve image quality. Lamash et al. used a CNN for inflammation assessment, while Van Harten et al. measured intestinal motility via centerline segmentation. McFarlane et al. used 3D image processing and reconstruction (3D-IPR) with AI to create pre-procedure 3D animations of perianal fistulas (Figure 2).

Son et al. utilized a DLR technique to enhance the quality of MRE images in Crohn’s patients. A qualitative assessment was carried out by two abdominal radiologists who evaluated three distinct sets of images: (1) original, (2) images processed with a conventional filter, and (3) images processed using the deep learning tool.

The mean scores assigned by the radiologists revealed a statistically significant improvement in overall image quality for the DLR image set in comparison to both the original and conventionally filtered images (e.g., Coronal overall image quality: 3.6 (Original), 3.8 (Filtered), 4.7 (DLR)). Additionally, they conducted a signal-to-noise ratio (SNR) analysis and observed a significant increase in SNR when employing the DLR method.

Lian et al. developed and tested a CNN algorithm with a dataset from 392 patients with epidemic IBD. For the diagnosis of epidemic IBD, they achieved a sensitivity of 95% and a specificity of 47%. They also reported that their CNN algorithm significantly improved image quality.

Lamash et al. employed a 1.5T MRI system and T1-weighted post-contrast VIBE sequence to examine 23 pediatric patients with Crohn’s disease. Their CNN-based segmentation demonstrated Dice Similarity Coefficients of 75% for the lumen, 81% for the wall, and 97% for the background. The median relative contrast enhancement value (P = 0.003) demonstrated discriminatory potential between active and non-active disease segments, while various other extracted markers showed differentiation capacity between segments with and without strictures (P < 0.05).
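For readers unfamiliar with the Dice Similarity Coefficient, the following is a minimal sketch of how it is commonly computed for two binary masks, as 2|A∩B| / (|A| + |B|). The masks and the function name are hypothetical and are not taken from Lamash et al.; returning 1.0 when both masks are empty is one convention among several.

```python
import numpy as np

def dice_coefficient(pred, truth):
    """Dice similarity between two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    total = pred.sum() + truth.sum()
    if total == 0:
        # Both masks are empty: treated here as perfect agreement (assumed convention).
        return 1.0
    intersection = np.logical_and(pred, truth).sum()
    return 2.0 * intersection / total

# Hypothetical 2x3 masks: model prediction vs. manual annotation of one structure.
predicted = [[1, 1, 0],
             [0, 1, 0]]
annotated = [[1, 0, 0],
             [0, 1, 1]]

dice = dice_coefficient(predicted, annotated)
print(round(dice, 3))  # 2 overlapping pixels, 3 + 3 labelled pixels
```

A value of 1 means the two masks overlap completely and 0 means they do not overlap at all, so a coefficient reported as 81% corresponds to 0.81 on this scale.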

Van Harten et al. used a deep learning technique to quantify intestinal motility as a marker of inflammation. They achieved a sensitivity of 80% and a positive predictive value (PPV) of 86% for severe bowel disease, with 312 annotated segments across the two groups (185 segments in the healthy group vs. 127 segments in the severe bowel disease group).
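The diagnostic metrics quoted in this section follow the standard definitions: sensitivity = TP / (TP + FN), specificity = TN / (TN + FP), and PPV = TP / (TP + FP). The sketch below applies them to hypothetical counts chosen only for illustration; they are not the confusion matrices of any included study, which are not reported here.

```python
def safe_ratio(numerator, denominator):
    """Return numerator / denominator, or None when the denominator is zero."""
    return numerator / denominator if denominator else None

def diagnostic_metrics(tp, fp, tn, fn):
    """Standard diagnostic accuracy metrics from confusion-matrix counts."""
    return {
        "sensitivity": safe_ratio(tp, tp + fn),  # diseased cases correctly flagged
        "specificity": safe_ratio(tn, tn + fp),  # healthy cases correctly cleared
        "ppv": safe_ratio(tp, tp + fp),          # flagged cases that are truly diseased
    }

# Hypothetical counts, for illustration only.
metrics = diagnostic_metrics(tp=40, fp=10, tn=45, fn=5)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity depend only on the classifier, whereas PPV also depends on how common the disease is in the sample, which is worth keeping in mind when comparing studies with different case mixes.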

McFarlane et al. evaluated the utility of deep learning-based 3D reconstruction models in 4 patients with perianal CD. They found that the model provided a more comprehensive visual representation of the disease. No quantitative measurements were reported.

Quality assessment

As per the QUADAS-2 tool, three papers were identified with a high risk of bias in at least one category. Additionally, none of the five papers discussed in this review have been externally validated. A detailed evaluation of the bias risk is provided in Supplementary Document 2, Table 1 and Table 2.

Table 2: A Summary of articles in the literature review that applied deep learning techniques for magnetic resonance imaging involving Crohn’s Disease

Discussion

Accurate assessment of CD is pivotal for patient management^19–21. MRE provides non-invasive insights into both structural and functional aspects without ionizing radiation. Objective endpoints for CD management have been evaluated and established in the form of MRE indices^10,11,22. However, prior research raises concern that MRE analysis is prone to human error, stemming from the time and attention radiologists need to interpret the entire bowel²³.

Deep learning models offer potentially significant advances, including enhanced accuracy when applied to MRE image analysis for CD^12,13,24. Dice coefficient values in the included studies ranged from 0.75 to 0.97, indicating similarity between the results of the deep learning models and those of conventional methods^25,26.

Recent studies in our review have underscored the capabilities of deep learning in improving MRE evaluation of CD. For example, Lamash et al. demonstrated the effectiveness of CNN-based segmentation in pediatric patients, showing high Dice Similarity Coefficients for disease burden characterization and stricture identification²⁵. Their findings indicated that such models may surpass the clinically recommended model in assessing ileal CD activity. This highlights the potential for precise, automated, and non-invasive monitoring of intestinal inflammation in CD patients²⁷.

The complex nature of CD can benefit from more reliable assessment of disease activity, distribution, and treatment response. This is especially true for clinical trials, where precise quantification of disease is essential. Interestingly, our review demonstrates varied clinical tasks. These range from improving image quality and segmenting disease to quantify burden to 3D reconstruction for surgical planning.

Despite the promise shown by these studies, they are not without limitations. All but one of the studies are retrospective, and none has been externally validated^24–29.

Future research needs to include direct comparisons between deep learning and conventional radiological assessments in diverse clinical scenarios. Multicenter prospective studies will be crucial in validating the effectiveness of these AI systems, thereby establishing their role in the clinical management of CD.

In conclusion, deep learning models in CD offer promising enhancements to current MRE readings. Preliminary research indicates acceptable sensitivity and specificity. However, these results are primarily based on retrospective studies and thus require further validation through future research in a clinical setting.

Data Availability

All data produced in the present work are contained in the manuscript.

References

1. Dahlhamer JM, Zammitti EP, Ward BW, Wheaton AG, Croft JB. Prevalence of Inflammatory Bowel Disease Among Adults Aged ≥18 Years - United States, 2015. MMWR Morb Mortal Wkly Rep. 2016;65(42):1166–1169. doi:10.15585/mmwr.mm6542a3
2. Molodecky NA, Soon IS, Rabi DM, et al. Increasing incidence and prevalence of the inflammatory bowel diseases with time, based on systematic review. Gastroenterology. 2012;142(1):46–54.e42; quiz e30. doi:10.1053/j.gastro.2011.10.001
3. Khanna R, Bressler B, Levesque BG, et al. Early combined immunosuppression for the management of Crohn’s disease (REACT): a cluster randomised controlled trial. Lancet. 2015;386(10006):1825–1834. doi:10.1016/S0140-6736(15)00068-9
4. Girometti R, Zuiani C, Toso F, et al. MRI scoring system including dynamic motility evaluation in assessing the activity of Crohn’s disease of the terminal ileum. Acad Radiol. 2008;15(2):153–164. doi:10.1016/j.acra.2007.08.010
5. Lee SS, Kim AY, Yang S-K, et al. Crohn disease of the small bowel: comparison of CT enterography, MR enterography, and small-bowel follow-through as diagnostic techniques. Radiology. 2009;251(3):751–761. doi:10.1148/radiol.2513081184
6. Quencer KB, Nimkin K, Mino-Kenudson M, Gee MS. Detecting active inflammation and fibrosis in pediatric Crohn’s disease: prospective evaluation of MR-E and CT-E. Abdom Imaging. 2013;38(4):705–713. doi:10.1007/s00261-013-9981-z
7. Amitai MM, Klang E, Levartovsky A, et al. Diffusion-weighted magnetic resonance enterography for prediction of response to tumor necrosis factor inhibitors in stricturing Crohn’s disease. Abdom Radiol (NY). 2018;43(12):3207–3212. doi:10.1007/s00261-018-1626-9
8. Klang E, Amitai MM, Lahat A, et al. Capsule Endoscopy Validation of the Magnetic Enterography Global Score in Patients with Established Crohn’s Disease. J Crohns Colitis. 2018;12(3):313–320. doi:10.1093/ecco-jcc/jjx156
9. Cheriyan DG, Slattery E, McDermott S, et al. Impact of magnetic resonance enterography in the management of small bowel Crohn’s disease. Eur J Gastroenterol Hepatol. 2013;25(5):550–555. doi:10.1097/MEG.0b013e32835d4e9c
10. Ordás I, Rimola J, Rodríguez S, et al. Accuracy of magnetic resonance enterography in assessing response to therapy and mucosal healing in patients with Crohn’s disease. Gastroenterology. 2014;146(2):374–82.e1. doi:10.1053/j.gastro.2013.10.055
11. Moy MP, Sauk J, Gee MS. The role of MR enterography in assessing crohn’s disease activity and treatment response. Gastroenterol Res Pract. 2016;2016:8168695. doi:10.1155/2016/8168695
12. Soffer S, Ben-Cohen A, Shimon O, Amitai MM, Greenspan H, Klang E. Convolutional neural networks for radiologic images: A radiologist’s guide. Radiology. 2019;290(3):590–606. doi:10.1148/radiol.2018180547
13. Klang E. Deep learning and medical imaging. J Thorac Dis. 2018;10(3):1325–1328. doi:10.21037/jtd.2018.02.76
14. Klang E, Barash Y, Margalit RY, et al. Deep learning algorithms for automated detection of Crohn’s disease ulcers by video capsule endoscopy. Gastrointest Endosc. 2020;91(3):606–613.e2. doi:10.1016/j.gie.2019.11.012
15. Klang E, Grinman A, Soffer S, et al. Automated detection of crohn’s disease intestinal strictures on capsule endoscopy images using deep neural networks. J Crohns Colitis. 2021;15(5):749–756. doi:10.1093/ecco-jcc/jjaa234
16. Soffer S, Klang E, Shimon O, et al. Deep learning for wireless capsule endoscopy: a systematic review and meta-analysis. Gastrointest Endosc. 2020;92(4):831–839.e8. doi:10.1016/j.gie.2020.04.039
17. Arkko A, Kaseva T, Salli E, Mäkelä T, Savolainen S, Kangasniemi M. Automatic detection of Crohn’s disease using quantified motility in magnetic resonance enterography: initial experiences. Clin Radiol. 2022;77(2):96–103. doi:10.1016/j.crad.2021.10.006
18. Whiting PF, Rutjes AWS, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–536. doi:10.7326/0003-4819-155-8-201110180-00009
19. Schiavone C, Romano M. Diagnosis and management of Crohn’s disease. J Ultrasound. 2015;18(1):1–2. doi:10.1007/s40477-015-0159-0
20. Rodrigues BL, Mazzaro MC, Nagasako CK, Ayrizono M de LS, Fagundes JJ, Leal RF. Assessment of disease activity in inflammatory bowel diseases: Non-invasive biomarkers and endoscopic scores. World J Gastrointest Endosc. 2020;12(12):504–520. doi:10.4253/wjge.v12.i12.504
21. Kucharzik T, Verstockt B, Maaser C. Monitoring of patients with active inflammatory bowel disease. Front Gastroenterol. 2023;2. doi:10.3389/fgstr.2023.1172318
22. Plumb AA, Menys A, Russo E, et al. Magnetic resonance imaging-quantified small bowel motility is a sensitive marker of response to medical therapy in Crohn’s disease. Aliment Pharmacol Ther. 2015;42(3):343–355. doi:10.1111/apt.13275
23. Brady AP. Error and discrepancy in radiology: inevitable or avoidable? Insights Imaging. 2017;8(1):171–182. doi:10.1007/s13244-016-0534-1
24. Son JH, Lee Y, Lee H-J, Lee J, Kim H, Lebel MR. LAVA HyperSense and deep-learning reconstruction for near-isotropic (3D) enhanced magnetic resonance enterography in patients with Crohn’s disease: utility in noise reduction and image quality improvement. Diagn Interv Radiol. 2023;29(3):437–449. doi:10.4274/dir.2023.232113
25. Lamash Y, Kurugol S, Freiman M, et al. Curved planar reformatting and convolutional neural network-based segmentation of the small bowel for visualization and quantitative assessment of pediatric Crohn’s disease from MRI. J Magn Reson Imaging. 2019;49(6):1565–1576. doi:10.1002/jmri.26330
26. van Harten LD, de Jonge CS, Beek KJ, Stoker J, Išgum I. Untangling and segmenting the small intestine in 3D cine-MRI using deep learning. Med Image Anal. 2022;78:102386. doi:10.1016/j.media.2022.102386
27. Guez I, Focht G, Greer M-LC, et al. Development of a multimodal machine-learning fusion model to non-invasively assess ileal Crohn’s disease endoscopic activity. Comput Methods Programs Biomed. 2022;227:107207. doi:10.1016/j.cmpb.2022.107207
28. Lian G, Peng Y, He J, et al. Diagnosis and prognosis of epidemic inflammatory bowel disease under convolutional neural network algorithm and nonlinear equation model. Results in Physics. 2021;22:103912. doi:10.1016/j.rinp.2021.103912
29. McFarlane J. Three-dimensional modelling as a novel interactive tool for preoperative.pdf. Colorectal Disease. 2023.