MAUDGAN: Motion Artifact Unsupervised Disentanglement Generative Adversarial Network of Multicenter MRI Data with Different Brain tumors

Mojtaba Safari; Ali Fatemi; Louis Archambault

doi:10.1101/2023.03.06.23285299

Abstract

Purpose This study proposed a novel retrospective motion reduction method named motion artifact unsupervised disentanglement generative adversarial network (MAUDGAN) that reduces the motion artifacts from brain images with tumors and metastases. The MAUDGAN was trained using a multimodal multicenter dataset of 3D T1-Gd and T2-fluid attenuated inversion recovery MRI images.

Approach Motion artifacts of different severity levels were simulated in k-space for the 3D T1-Gd MRI images. The MAUDGAN consisted of two generators, two discriminators, and two feature extractor networks constructed using residual blocks. The generators map the images from content space to artifact space and vice versa, while the discriminators attempt to discriminate the content codes to learn the motion-free and motion-corrupted content spaces.

Results We compared the MAUDGAN with the CycleGAN and Pix2pix-GAN. Qualitatively, the MAUDGAN removed the motion artifacts while preserving the highest level of soft-tissue contrast and without adding spatial or frequency distortions. Quantitatively, we reported six metrics: normalized mean squared error (NMSE), structural similarity index (SSIM), multi-scale structural similarity index (MS-SSIM), peak signal-to-noise ratio (PSNR), visual information fidelity (VIF), and multi-scale gradient magnitude similarity deviation (MS-GMSD). The MAUDGAN achieved the lowest NMSE and MS-GMSD. On average, the proposed MAUDGAN reconstructed motion-free images with the highest SSIM, PSNR, and VIF values and comparable MS-SSIM values.

Conclusions The MAUDGAN can disentangle motion artifacts from the 3D T1-Gd dataset under a multimodal framework. The motion reduction will improve automatic and manual post-processing algorithms including auto-segmentations, registrations, and contouring for guided therapies such as radiotherapy and surgery.

1 Introduction

Magnetic resonance imaging (MRI) with different sequences provides excellent soft tissue contrast for diagnosis and treatment planning. However, long MRI acquisition times limit the quality of high-resolution images¹ because they increase the probability of patient motion. Involuntary and voluntary subject motions during data acquisition cause image blurring and ghosting along the phase-encoding direction. The prevalence of motion artifacts is high for infants and patients with acute distress.²

To tackle motion artifacts, retrospective motion correction (RMC) and prospective motion correction (PMC) methods have been developed. PMC approaches modify the gradient magnetic fields using the imaged object positions, which are tracked during imaging, to maintain a constant relationship between the imaged object and the imaged volume.³,⁴ PMC can maintain a uniform k-space sampling density, which avoids Nyquist violations, and can compensate for spin-history effects.⁵ However, PMC methods require additional hardware and complicated pulse sequences that increase the imaging time. RMC methods, in contrast, are post-processing approaches that require neither additional hardware nor pulse sequence modifications during imaging. Traditional RMC methods include auto-focusing, which optimizes image quality metrics such as entropy and gradient,⁶ iterative estimation of the motion trajectory,⁷ compressed-sensing theory,⁸ and modified imaging sequences.⁹ They are either limited to 2D imaging or require raw k-space data, which are not widely available. In addition, these methods are computationally expensive.

Recently, deep learning techniques, in particular convolutional neural networks (CNNs), have been used to quantify¹⁰,¹¹ and reduce¹²,¹³ MRI motion artifacts retrospectively. These models learn the task in a supervised framework using simulated motion artifacts. Unpaired deep learning models have used data without motion artifacts as the ground truth to reduce the artifacts in MRI acquired with the same imaging sequence.¹⁴

This study aimed to address the problem in a more practical setting, where a motion-free MRI modality is used to remove artifacts from motion-corrupted images acquired with a different MRI sequence. We reformulated MRI motion artifact reduction as an unsupervised disentanglement problem and introduced a novel motion artifact unsupervised disentanglement generative adversarial network (MAUDGAN). The MAUDGAN was applied to reduce the motion artifacts of 3D T1-Gd MRI sequences using motion-free T2-fluid attenuated inversion recovery (FLAIR) sequences for patients with different brain tumors and metastases. A multicenter dataset was used to improve the MAUDGAN’s generalization.

This study leverages an inductive bias:¹⁵ the MAUDGAN learns to disentangle motion artifacts from motion-free content by comparing, in the latent space, 3D T1-Gd MRI sequences (typically with motion artifacts) with motion-free T2-FLAIR sequences (Figure 1).

Fig 1:

Content and artifact components of 3D T1-Gd MRI (x_a) in the motion-corrupted space 𝒯_a and T2-FLAIR in the artifact-free space 𝒯 are mapped to the content space 𝒞 and artifact space 𝒜, respectively. MAUDGAN maps the data in 𝒯_a space to 𝒯 space, shown by blue arrows. Conversely, MAUDGAN learns to map from 𝒯 space to 𝒯_a space (y → ŷ_a), shown by green arrows.

The MAUDGAN consists of U-net¹⁶ generators that perform different forms of image translation, including motion artifact reduction and synthesis. Discriminators were used to distinguish between the motion-free and the motion-corrupted MRI sequences in the latent spaces. To our knowledge, MAUDGAN is the first method for multi-modal anatomical MRI motion artifact reduction.

The rest of this paper is organized as follows: Section 2 explains the dataset and motion simulation steps. Section 3 details the MAUDGAN architecture and loss functions. Results and comparisons with two generative models are presented in Section 4. Finally, Sections 5 and 6 discuss the significance of the MAUDGAN and its possible use in the context of diagnosis and therapy.

2 Material

2.1 Dataset

We used the publicly available multicenter GLIS-RT dataset from the Cancer Imaging Archive,¹⁷ consisting of 230 patients (100 males and 130 females). All patients, with different brain tumor types, underwent 3D T1-Gd and 2D T2-FLAIR MRI sequences and a CT scan under different imaging protocols. The brain tumor types were glioblastoma (GBM - 198 cases), anaplastic astrocytoma (AAC - 23 cases), astrocytoma (AC - 5 cases), anaplastic oligodendroglioma (AODG - 2 cases), and oligodendroglioma (ODG - 2 cases). We used 80% (11246 image slices) and 20% (2276 image slices) of the data for training and testing our method, respectively.

The median resolution of the T2-FLAIR and 3D T1-Gd images was 1.1 × 1.1 × 5 mm³ (standard deviation 0.53 × 0.53 × 0.87 mm³) and 0.94 × 0.94 × 1. mm³ (standard deviation 0.24 × 0.24 × 1.21 mm³), respectively. The T2-FLAIR imaging parameters were (median ± std): TE = 119 ± 64.06 ms, TR = 9000 ± 936.20 ms, TI = 2500 ± 174.02 ms, and flip angle = 150° ± 13.56°. The corresponding parameters for T1-Gd were (median ± std): TE = 2.98 ± 3.86 ms, TR = 2200 ± 1031.76 ms, TI = 900 ± 235.50 ms, and flip angle = 9° ± 5.45°. About 30% of the data were acquired using MRI scanners with B₀ of 1.5 T, and the rest were acquired using 3 T scanners. Out of 230 cases, 55 were obtained using GE MRI scanners and the rest using Siemens MRI scanners.

Finally, we evaluated the MAUDGAN performance on anonymized clinical data with real motion artifacts. This retrospective single-centre study was approved by the institutional review board, and the requirement for written informed consent was waived.

2.2 Motion simulation

The head motion was simulated in the Fourier domain (k-space), and the motion-corrupted data were generated by the inverse discrete Fourier transform. We adopted the piecewise constant motion simulation approach, which has a low computational burden, because it provides generalization similar to that of more complex motion simulation techniques.¹³ Moreover, the generated motion artifacts were similar to real motion artifacts.¹³

We assumed the phase encoding interval was much shorter than the time scale of the head motion. Thus, the same motion parameters could be used for all k-space lines within a slab along the phase-encoding direction (Figure 2). The k-space lines within the randomly selected slabs were translated in the phase-encoding direction. However, the middle of the k-space, which corresponds to the low-frequency content of the MRI images, was excluded from the motion artifact simulation process; it is shown as a forbidden region in Figure 2. Our motion simulation method could successfully model the motion-induced ghosting of the bright fat tissue into the background around the skull, which is common in structural MRI images.¹⁸

Fig 2:

The motion simulation process. After choosing the phase-encoding direction, several random k-space regions were selected, and the k-space lines within these regions were translated by random amounts.
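The simulation described above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: it works on a single 2D slice, treats axis 0 as the phase-encoding axis, models the motion as a rigid translation applied through the Fourier shift theorem, and uses hypothetical parameter names and defaults (`n_slabs`, `slab_width`, `max_shift`, `center_fraction`) that are not reported in the paper.

```python
import numpy as np

def simulate_motion(image, n_slabs=3, slab_width=8, max_shift=4.0,
                    center_fraction=0.1, seed=0):
    """Piecewise-constant translation artifact along axis 0 (assumed phase-encoding axis).

    All parameter names and default values are illustrative assumptions.
    Returns the magnitude image and a boolean mask of the corrupted k-space lines.
    """
    rng = np.random.default_rng(seed)
    n_pe = image.shape[0]
    kspace = np.fft.fftshift(np.fft.fft2(image))
    # Spatial frequency (cycles/pixel) of each phase-encoding line, centred like kspace.
    freqs = np.fft.fftshift(np.fft.fftfreq(n_pe))

    # "Forbidden region": low-frequency lines around the k-space centre stay untouched.
    half = int(center_fraction * n_pe / 2)
    protected = np.zeros(n_pe, dtype=bool)
    protected[n_pe // 2 - half:n_pe // 2 + half + 1] = True

    corrupted = np.zeros(n_pe, dtype=bool)
    for _ in range(n_slabs):
        start = rng.integers(0, n_pe - slab_width + 1)
        lines = np.arange(start, start + slab_width)
        lines = lines[~protected[lines]]
        shift = rng.uniform(-max_shift, max_shift)  # pixels, constant within the slab
        # Fourier shift theorem: translating the object by `shift` pixels along the
        # phase-encoding axis multiplies the line at frequency k by exp(-2*pi*i*k*shift).
        kspace[lines, :] *= np.exp(-2j * np.pi * freqs[lines] * shift)[:, None]
        corrupted[lines] = True

    motion_image = np.abs(np.fft.ifft2(np.fft.ifftshift(kspace)))
    return motion_image, corrupted
```

Larger values of `max_shift` or `n_slabs` give heavier ghosting, which is one plausible way to obtain the different artifact levels; slabs that happen to overlap simply accumulate their shifts.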

3 Method

We denote 𝒯_a and 𝒯 as the motion-corrupted and motion-free image spaces, respectively. The paired and unpaired motion reduction process is formalized as ℳ = {(x_a, x) | x_a ∈ 𝒯_a, x ∈ 𝒯, f(x_a) = x}, where x_a and x are the motion-corrupted and motion-free images of a single MRI sequence and f : 𝒯_a → 𝒯.¹⁴,¹⁹ However, we assumed that no paired or unpaired dataset of a single modality is available to disentangle motion artifacts. Instead, another MRI sequence, T2-FLAIR, was employed to disentangle the motion artifact of the T1-Gd MRI sequence, which is more practical in clinical settings. Thus, the MAUDGAN is formalized as ℳ = {(x_a, y) | x_a ∈ 𝒯_a, y ∈ 𝒯, f(x_a) = x, g(x_a, y) = y_a}, where f : 𝒯_a → 𝒯 and g : 𝒯 → 𝒯_a are the encodings into a content space 𝒞 and an artifact space 𝒜, and x_a and y are the motion-corrupted T1-Gd and the motion-free T2-FLAIR MRI images. After training the MAUDGAN, the image data in the content space will be free of motion artifacts, while motion-corrupted T2-FLAIR images can be generated using the learned motion artifact model.

3.1 MAUDGAN

The MAUDGAN consists of two generators, ℱ : 𝒯_a → 𝒯 and 𝒢 : 𝒯 → 𝒯_a, that map from motion-corrupted space to motion-free space and vice versa (Figure 3). In addition, two feature-extractor networks were employed to extract features of the images before feeding them to the generators.

Fig 3:

The proposed MAUDGAN. The generator ℱ learns the disentanglement, while the generator 𝒢 learns to generate motion-corrupted images from motion-free images.

Given multimodal MRI images T1-Gd x_a ∈ 𝒯_a and T2-FLAIR y ∈ 𝒯, the training steps were as follows:

1. ℱ maps the motion-corrupted T1-Gd x_a to the motion-free space, giving a motion-corrected T1-Gd image.
2. 𝒢 maps the motion-free T2-FLAIR y to the motion-corrupted space, giving ŷ_a.
3. The ℱ trained in step 1 is used to recover the motion-free T2-FLAIR from the motion-corrupted ŷ_a simulated in step 2.
4. The 𝒢 trained in step 2 is used to recover the motion-corrupted T1-Gd from the motion-corrected T1-Gd of step 1 and the motion-corrupted ŷ_a of step 2.
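The data flow of these four steps can be sketched as below. The sketch is only meant to show which image feeds which generator: the names `x_hat`, `y_rec`, and `x_a_rec` are chosen here, the two-argument form of 𝒢 is an assumption based on g(x_a, y) = y_a, and `toy_F` and `toy_G` are arithmetic stand-ins, not neural networks.

```python
import numpy as np

def maudgan_forward(F, G, x_a, y):
    """Run the four translation steps once and return every intermediate image.

    F(image) -> motion-free image.
    G(content_image, artifact_source) -> motion-corrupted image (assumed signature).
    """
    x_hat = F(x_a)               # step 1: motion-corrected T1-Gd
    y_hat_a = G(y, x_a)          # step 2: motion-corrupted T2-FLAIR
    y_rec = F(y_hat_a)           # step 3: recover motion-free T2-FLAIR
    x_a_rec = G(x_hat, y_hat_a)  # step 4: recover motion-corrupted T1-Gd
    return x_hat, y_hat_a, y_rec, x_a_rec

# Toy stand-ins: the "content" is the integer part of each pixel and the
# "artifact" is the fractional part, so both mappings are exact.
def toy_F(image):
    return np.floor(image)

def toy_G(content_image, artifact_source):
    return content_image + (artifact_source - np.floor(artifact_source))

x = np.array([[1.0, 2.0], [3.0, 4.0]])  # motion-free T1-Gd, never seen in training
x_a = x + 0.25                          # motion-corrupted T1-Gd
y = np.array([[5.0, 6.0], [7.0, 8.0]])  # motion-free T2-FLAIR
x_hat, y_hat_a, y_rec, x_a_rec = maudgan_forward(toy_F, toy_G, x_a, y)
```

In this toy setting steps 3 and 4 return exactly `y` and `x_a`, which is the lossless behaviour the training is meant to encourage in the real networks.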

3.2 Learning

The MAUDGAN trains its generators in an adversarial scenario to achieve motion artifact disentanglement. Thus, the MAUDGAN employed loss functions to remove motion artifacts from T1-Gd using the content information of T2-FLAIR, as given in (1)-(4). The MAUDGAN employs four loss functions: two adversarial losses, a reconstruction loss ℒ_rec, and an artifact consistency loss ℒ_artif. The cost function is formalized as the weighted sum of the losses, where λ_adv, λ_rec, and λ_artif are the hyper-parameters controlling the importance of each term.
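For orientation, the weighted sum described above can be written as follows. This is a sketch consistent with the text rather than the paper's own equation: the superscripts that distinguish the two adversarial terms are notation assumed here, and sharing a single weight λ_adv between them is also an assumption.

```latex
\mathcal{L}_{\text{total}}
  = \lambda_{\text{adv}} \left( \mathcal{L}_{\text{adv}}^{\mathcal{T}} + \mathcal{L}_{\text{adv}}^{\mathcal{T}_a} \right)
  + \lambda_{\text{rec}} \, \mathcal{L}_{\text{rec}}
  + \lambda_{\text{artif}} \, \mathcal{L}_{\text{artif}}
```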

3.2.1 Adversarial loss

The MAUDGAN was trained to map from motion-corrupted space to motion-free space as given in (1) and (3) and vice versa as given in (2) and (4). Learning both tasks is important to disentangle the motion artifact from the image content. As the MAUDGAN is trained on multimodal MRI sequences, regression losses like ℒ₁ and ℒ₂ could not be employed because of the domain difference between T2-FLAIR and T1-Gd MRI images. Therefore, adversarial learning²⁰ with two discriminators, 𝒟_T and its counterpart for the 𝒯_a domain, was employed: one adversarial loss regularizes the plausibility between motion-corrected and motion-free images, and the other between motion-corrupted and motion-simulated images. Thus, the MAUDGAN is trained to fool the discriminators so that they cannot determine whether the motion was generated or real. In the adversarial losses, z is the latent variable of the generators, and the two discriminators distinguish between motion-free and motion-corrupted content data sampled from the 𝒯 and 𝒯_a domains, respectively. 𝕀 is a unit matrix of size M × M that matches the discriminators’ output, where M is substantially smaller than the image dimension.
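A patch-wise adversarial objective of this kind can be sketched as below. The exact form of the adversarial losses is not recoverable from this text, so the least-squares form used here is an assumption, as is reading 𝕀 as an M × M target of ones that the discriminator output is compared against; the function names are hypothetical.

```python
import numpy as np

def discriminator_loss(d_real, d_fake):
    """Least-squares patch loss (assumed form): real patches -> 1, generated -> 0.

    d_real and d_fake are the M x M discriminator outputs for a real and a
    generated image, respectively.
    """
    target = np.ones_like(d_real)  # plays the role of the M x M target matrix
    return np.mean((d_real - target) ** 2) + np.mean(d_fake ** 2)

def generator_loss(d_fake):
    """The generator is rewarded when the discriminator scores its output as real."""
    target = np.ones_like(d_fake)
    return np.mean((d_fake - target) ** 2)
```

Because the discriminator output is an M × M map rather than a single scalar, each entry judges one local patch of the image, which is why M can be much smaller than the image size.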

3.2.2 Reconstruction loss

In addition to disentangling motion artifacts, the whole process needed to be lossless. In other words, the MAUDGAN was required to recover the original motion-corrupted T1-Gd from the motion-corrected image and to recover the motion-free T2-FLAIR from the motion-simulated ŷ_a. Therefore, two reconstruction losses, given in (7), were used to encourage the MAUDGAN to preserve the information, where the reconstructed images shown in Figure 3 are given in (3) and (4). We adopted the ℒ₁ loss rather than the ℒ₂ loss to generate sharper images.²¹
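As a rough illustration of the two reconstruction terms, the sketch below sums an ℒ₁ distance for each recovered image. Averaging over pixels and weighting the two terms equally are assumptions, and the argument names are chosen here.

```python
import numpy as np

def reconstruction_loss(x_a, x_a_rec, y, y_rec):
    """Sum of two mean-absolute-error terms (assumed normalisation).

    x_a_rec: motion-corrupted T1-Gd recovered from the motion-corrected image.
    y_rec:   motion-free T2-FLAIR recovered from the motion-simulated image.
    """
    return np.mean(np.abs(x_a - x_a_rec)) + np.mean(np.abs(y - y_rec))
```

Unlike a squared error, the absolute error does not shrink small residuals quadratically, which is the usual argument for ℒ₁ giving sharper reconstructions.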

3.2.3 Artifact consistency loss

The adversarial losses encouraged the content of the generated motion-corrupted ŷ_a and of the motion-corrected images to be indistinguishable from the T1-Gd x_a and T2-FLAIR y images, respectively. However, the discriminators lose spatial resolution. To preserve the spatial resolution, ℒ₁ and ℒ₂ could be used, but because of the domain difference between T1-Gd and T2-FLAIR, direct use of these losses would translate the images to the other domain. We proposed the artifact loss ℒ_artif, given in (8), to induce motion artifacts to the motion-corrected images. Thus, ℒ_artif competes with the adversarial losses, and the overall learning process is a compromise between them.

Equation (8) encourages the difference between x_a and its motion-corrected counterpart to be similar to the difference between y and ŷ_a. Unlike a direct minimization by ℒ₁, which would cause an image domain translation, ℒ_artif requires the motion-corrected image and x_a to be anatomically close rather than identical, which preserves structural information.
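One way to express this idea in code is to compare the artifact removed from the T1-Gd image with the artifact added to the T2-FLAIR image. The sketch below follows the verbal description only: the sign convention, the ℒ₁ norm, and the name `x_hat` for the motion-corrected T1-Gd are assumptions.

```python
import numpy as np

def artifact_consistency_loss(x_a, x_hat, y, y_hat_a):
    """Compare the two artifact maps instead of the images themselves."""
    removed = x_a - x_hat   # artifact taken out of the T1-Gd image
    added = y_hat_a - y     # artifact put into the T2-FLAIR image
    return np.mean(np.abs(removed - added))

# The two modalities have very different intensities, yet the loss is zero
# whenever the same artifact map links both image pairs.
artifact = np.array([[0.5, 0.0], [0.0, -0.5]])
x_hat = np.array([[1.0, 2.0], [3.0, 4.0]])
y = np.array([[10.0, 20.0], [30.0, 40.0]])
matched = artifact_consistency_loss(x_hat + artifact, x_hat, y, y + artifact)
unmatched = artifact_consistency_loss(x_hat + artifact, x_hat, y, y)
```

Here `matched` is zero even though `x_hat` and `y` differ strongly in contrast, which is the property that lets the loss constrain spatial detail without forcing one modality to look like the other.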

3.3 Network architecture

The MAUDGAN network generator is illustrated in Figure 4-(a). The generator employed residual blocks²² (Figure 4-(b)) for a better generalization than convolution blocks without skip connection. To improve the generators’ performance,²³ the convolution layers were used to down-sample the data in the encoder part of the generator. However, the decoder part of the generator employed the up-sampling layers rather than the transpose convolution layers to preserve the image edge information and avoid the checkerboard effect.²⁴
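The checkerboard effect mentioned above can be reproduced in one dimension with plain NumPy. The sketch is not taken from the MAUDGAN code: it uses all-ones kernels and a constant input, and the function names are chosen here. With kernel size 3 and stride 2, a transposed convolution lets neighbouring kernel footprints overlap unevenly, whereas up-sampling followed by an ordinary convolution does not.

```python
import numpy as np

def transposed_conv1d(x, kernel, stride):
    """Scatter each input sample, scaled by the kernel, into the output."""
    out = np.zeros((len(x) - 1) * stride + len(kernel))
    for i, value in enumerate(x):
        out[i * stride:i * stride + len(kernel)] += value * kernel
    return out

def upsample_then_conv1d(x, kernel, factor):
    """Nearest-neighbour up-sampling followed by an ordinary convolution."""
    upsampled = np.repeat(x, factor)
    return np.convolve(upsampled, kernel, mode="valid")

x = np.ones(4)
kernel = np.ones(3)
print(transposed_conv1d(x, kernel, stride=2))     # [1. 1. 2. 1. 2. 1. 2. 1. 1.]
print(upsample_then_conv1d(x, kernel, factor=2))  # [3. 3. 3. 3. 3. 3.]
```

The alternating 1, 2, 1, 2 pattern in the first output is the one-dimensional analogue of a checkerboard; the second output is flat because every output sample is covered by the same number of kernel taps.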

Fig 4:

The generator, together with the blocks used to construct the discriminators and feature extractors.

Each discriminator consists of four residual blocks (Figure 4-(b)) and down-sampling blocks, followed by four convolution blocks (Figure 4-(c)) and a final layer with one convolution layer. The two feature extractors were constructed using five residual blocks (Figure 4-(b)).

We implemented the MAUDGAN in the PyTorch 1.12.0 deep learning framework using two NVIDIA RTX 3090 GPUs. The batch size, optimizer, and learning rate were 6, RAdam,²⁵ and 2 × 10⁻⁴, respectively. We trained the network using hyper-parameters λ_rec = 10, λ_adv = 5, and λ_artif = 50.
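The reported settings can be collected in a small configuration object, as sketched below. The class and function names are hypothetical, and applying the same λ_adv to both adversarial terms is an assumption; only the numeric values come from the text.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class TrainConfig:
    batch_size: int = 6
    optimizer: str = "RAdam"
    learning_rate: float = 2e-4
    lambda_rec: float = 10.0
    lambda_adv: float = 5.0
    lambda_artif: float = 50.0

def total_loss(adv_t, adv_ta, rec, artif, cfg=TrainConfig()):
    """Weighted sum of the four loss terms (shared adversarial weight assumed)."""
    return (cfg.lambda_adv * (adv_t + adv_ta)
            + cfg.lambda_rec * rec
            + cfg.lambda_artif * artif)
```

With these weights the artifact consistency term dominates unless its raw value is much smaller than the others, which suggests how strongly spatial detail was prioritised during training.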

4 Results

To our knowledge, the MAUDGAN is the first network that employs multi-modal MRI images to reduce MRI motion artifacts. Thus, we could only compare the MAUDGAN with two well-known image-to-image translation approaches, CycleGAN²⁶ and Pix2pix.²¹ The original implementations of the CycleGAN and Pix2pix were used to compare the results.

Supervised methods like U-Net²⁷ were excluded because the ground truth targets were unavailable and because the domain shift between the multi-modal images would transfer the input motion-corrupted 3D T1-Gd images to the domain of the motion-free T2-FLAIR dataset. We compared the MAUDGAN with those networks for different motion artifact levels. Finally, we evaluated the performance of the MAUDGAN in removing real motion artifacts from images of patients with head & neck cancer.

The motion-simulated dataset allowed us to perform qualitative and quantitative comparisons. We report six quantitative metrics: normalized mean squared error (NMSE), structural similarity index (SSIM),²⁸ multi-scale structural similarity index (MS-SSIM),²⁹ peak signal-to-noise ratio (PSNR), visual information fidelity (VIF),³⁰ and multi-scale gradient magnitude similarity deviation (MS-GMSD).³¹ Higher values indicate better motion artifact reduction and lower distortion for all metrics except NMSE and MS-GMSD, for which lower values are better.
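For reference, the two simplest of these metrics can be computed as below. The normalisation conventions are the common textbook ones and are assumed here, since the text does not state which variants were used; `data_range` is the assumed maximum possible intensity.

```python
import numpy as np

def nmse(reference, estimate):
    """Squared error normalised by the energy of the reference image."""
    return np.sum((reference - estimate) ** 2) / np.sum(reference ** 2)

def psnr(reference, estimate, data_range=1.0):
    """Peak signal-to-noise ratio in dB (undefined for identical images)."""
    mse = np.mean((reference - estimate) ** 2)
    return 10.0 * np.log10(data_range ** 2 / mse)
```

Both metrics are pixel-wise, so they reward smooth outputs; that is why the structural and gradient-based metrics are reported alongside them.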

Qualitative comparisons are illustrated in Figure 5 for different motion levels. The Pix2Pix method had the lowest performance in preserving the MRI soft tissue contrast. CycleGAN reduced soft-tissue contrast, smeared out the signal intensity, and unrealistically elevated the skull signals. MAUDGAN removed the motion artifact with better soft tissue contrast and realistic skull signal intensity.

Fig 5:

Visual comparisons of the motion-reduction methods on the motion-simulated data. The simulated motion artifact was added along the row in (a) and column in (b). The heavy, moderate, and minor motion simulation data and the motion-corrected results are from top to bottom rows.

In addition, CycleGAN generated images with high signal intensity voxels that mimic tumors (see Figure 6). The generation of these false tumors might be attributed to incorrect sampling from the data manifold. They differ from the water droplet-like artifacts³² caused by normalization layers. In particular, the false tumor shown in Figure 6b resembles post-surgery cases.

Fig 6:

The white arrows indicate the false tumors generated by the CycleGAN.

The quantitative metrics evaluating the motion-corrected image contrast, image distortion level, and structure and texture similarity to the ground truth data are illustrated in Figure 7. The MAUDGAN achieved the lowest NMSE and the highest PSNR values, which indicates that it removed the motion artifact with small spatial distortion. However, NMSE and PSNR tend to favor smoothness. The MS-SSIM and SSIM were reported to evaluate the structural similarity between the motion-corrected images and the ground truth; higher values indicate better similarity. Our method achieved better SSIM values and comparable MS-SSIM values for the different distortion levels. The MAUDGAN, with the highest VIF value, preserved more information than the other methods. Finally, to evaluate the image gradient, which is related to image contrast, the MS-GMSD was reported for the different distortion levels. Lower MS-GMSD indicates a smaller deviation between the gradients of the motion-corrected and ground truth data. The MAUDGAN, with the smallest MS-GMSD, preserved more of the contrast of the ground truth data, for example the soft-tissue contrast.

Fig 7:

Quantitative metrics to evaluate the quality of the motion-corrected data. The proposed MAUDGAN, Pix2Pix, and CycleGAN were evaluated on three motion distortion levels: heavy, moderate, and minor.

We tested the MAUDGAN model on data with real motion artifacts. The anonymized data were exported from the PACS system. The real artifact was reduced using the MAUDGAN, as shown in Figure 8.

Fig 8:

The anonymized data with real motion artifacts were exported from the PACS system to evaluate the MAUDGAN model to remove the real motion artifacts. The first row is the data with real artifact, and the second row illustrates the data after motion reduction. The arrows indicate the motion artifact.

5 Discussion

This study aimed to reduce 3D T1-Gd motion artifacts using T2-FLAIR images. 3D T1-Gd images, with their long acquisition times, are more likely to be corrupted by motion artifacts.² In addition, the quality of high-resolution images acquired at high B0 magnetic fields is limited by motion artifacts, which PMC methods can only partially remove.¹ Motion artifacts reduce the image quality, which in turn reduces the performance of manual and automatic post-processing approaches such as tumor and organs-at-risk auto-segmentation.³³,³⁴ This study introduced MAUDGAN to tackle motion reduction as a disentanglement problem. The multi-center dataset with different brain tumors and metastases was used to train the MAUDGAN, which is expected to improve its generalization. Our qualitative and quantitative comparisons with two well-known GAN methods indicate that the MAUDGAN could disentangle the motion artifact using T2-FLAIR with a lower spatial distortion and a better spatial contrast.

The MAUDGAN was qualitatively compared with the generative models CycleGAN and Pix2pix. The MAUDGAN preserved soft-tissue contrast better than both (see Figure 5). The Pix2pix approach did not preserve soft-tissue contrast, which might be because this method was proposed to work under a paired framework, unlike the setting of this study. The CycleGAN smeared out the MRI soft-tissue contrast, although it performed better than the Pix2pix.

When a network is trained on datasets with tumors, it is crucial that the network be robust against spatial distortions, because those distortions could be misinterpreted as a tumor. The MAUDGAN was free of spatial distortion, while the CycleGAN added spatial distortions (see Figure 6). The added spatial distortions resembled the brain tumor of a patient with edema and a post-resection cavity, as illustrated in Figure 6(a) and (b), respectively.

The quantitative comparisons shown in Figure 7 between the motion-free ground truth dataset and the motion-corrected images reconstructed by the CycleGAN, Pix2pix, and MAUDGAN suggest that the MAUDGAN-generated images had less distortion, with a lower NMSE and a higher PSNR. In addition, the MAUDGAN, with higher SSIM, MS-SSIM, and VIF and lower gradient deviations (MS-GMSD), generated images more similar to the ground truth dataset.

To the best of our knowledge, this is the first study reporting on the feasibility of disentangling the motion artifacts of 3D T1-Gd using T2-FLAIR. The dataset contains different brain tumors and metastases, which are enhanced differently on the different MRI sequences. Thus, we did not use motion-free images of other patients, which would need to be exported from PACS. This way, the data of patients without motion artifacts remain in the clinical system. Moreover, we can use all the data to train the network, which is more than is available for training under an unpaired scenario, since we do not need to export the same number of patients’ data without motion artifacts.

This study is more challenging than the unpaired studies¹⁴,³⁵ because the data domain of 3D T1-Gd differs from that of T2-FLAIR. Thus, the MAUDGAN must be robust to the domain shift between datasets. Owing to this robustness, the MAUDGAN could employ other image modalities, like the T1-w dataset, instead of T2-FLAIR, and is therefore applicable to other available MRI sequences. However, this study is limited to in-plane motion artifacts because the T2-FLAIR images were acquired in 2D, which inherently contains geometric distortion along the slice direction.³⁶

6 Conclusion

Our method, MAUDGAN, could disentangle motion artifacts from the 3D T1-Gd dataset under a multi-modal framework. The motion reduction will improve post-processing methods such as manual and automatic delineation of brain tumors and organs at risk and might increase the CT/MRI coregistration accuracy. In particular, the MAUDGAN would benefit elderly and infant patients, who show more involuntary motion during the long acquisition time of 3D T1-Gd imaging. This retrospective motion correction requires no additional hardware or sequence modifications during imaging, which makes it more practical.

Data Availability

All data produced are available online at

https://wiki.cancerimagingarchive.net/pages/viewpage.action?pageId=95224486

Disclosures

There are no conflicts of interest declared by the authors.

Acknowledgments

This work was supported by NSERC CREATE RHHDS program and NSERC discovery grant.

References

↵
D. Stucht, K. A. Danishad, P. Schulze, et al., “Highest resolution in vivo human brain mri using prospective motion correction,” PloS one 10(7), e0133921 (2015).
OpenUrl CrossRef PubMed
↵
J. M. Slipsager, S. L. Glimberg, J. Søgaard, et al., “Quantifying the financial savings of motion correction in brain mri: a model-based estimate of the costs arising from patient head motion and potential savings from implementation of motion correction,” Journal of Magnetic Resonance Imaging 52(3), 731–738 (2020).
OpenUrl
↵
J. G. Pipe, “Motion correction with propeller mri: application to head motion and free-breathing cardiac imaging,” Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 42(5), 963–969 (1999).
OpenUrl
↵
J. Maclaren, M. Herbst, O. Speck, et al., “Prospective motion correction in brain imaging: a review,” Magnetic resonance in medicine 69(3), 621–636 (2013).
OpenUrl CrossRef PubMed
↵
M. Zaitsev, J. Maclaren, and M. Herbst, “Motion artifacts in mri: A complex problem with many partial solutions,” Journal of Magnetic Resonance Imaging 42(4), 887–901 (2015).
OpenUrl CrossRef PubMed
↵
W. Lin and H. K. Song, “Improved optimization strategies for autofocusing motion compen-sation in mri via the analysis of image metric maps,” Magnetic resonance imaging 24(6), 751–760 (2006).
OpenUrl CrossRef PubMed
↵
M. W. Haskell, S. F. Cauley, and L. L. Wald, “Targeted motion estimation and reduction (tamer): data consistency based motion mitigation for mri using a reduced model joint opti-mization,” IEEE transactions on medical imaging 37(5), 1253–1265 (2018).
OpenUrl CrossRef PubMed
↵
M. Usman, D. Atkinson, F. Odille, et al., “Motion corrected compressed sensing for free-breathing dynamic cardiac mri,” Magnetic resonance in medicine 70(2), 504–516 (2013).
OpenUrl CrossRef PubMed
↵
S. Kecskemeti, A. Samsonov, J. Velikina, et al., “Robust motion correction strategy for struc-tural mri in unsedated children demonstrated with three-dimensional radial mpnrage,” Radi-ology 289(2), 509 (2018).
OpenUrl CrossRef PubMed
↵
A. Sciarra, S. Chatterjee, M. Dünnwald, et al., “Automated ssim regression for detection and quantification of motion artefacts in brain mr images,” arXiv preprint arXiv:2206.06725 (2022).
↵
I. Oksuz, B. Ruijsink, E. Puyol-Antón, et al., “Deep learning using k-space based data augmentation for automated cardiac mr motion artefact detection,” in International Confer-ence on Medical Image Computing and Computer-Assisted Intervention, 250–258, Springer (2018).
↵
T. Küstner, K. Armanious, J. Yang, et al., “Retrospective correction of motion-affected mr images using deep learning frameworks,” Magnetic resonance in medicine 82(4), 1527–1540 (2019).
OpenUrl
↵
B. A. Duffy, L. Zhao, F. Sepehrband, et al., “Retrospective motion artifact correction of struc-tural mri images using deep learning improves the quality of cortical surface reconstructions,” Neuroimage 230, 117756 (2021).
OpenUrl
↵
G. Oh, J. E. Lee, and J. C. Ye, “Unpaired mr motion artifact deep learning using outlier-rejecting bootstrap aggregation,” IEEE Transactions on Medical Imaging 40(11), 3125–3139 (2021).
OpenUrl
↵
F. Locatello, S. Bauer, M. Lucic, et al., “Challenging common assumptions in the unsu-pervised learning of disentangled representations,” in international conference on machine learning, 4114–4124, PMLR (2019).
↵
O. Ronneberger, P. Fischer, and T. Brox, “U-net: Convolutional networks for biomedical im-age segmentation,” in International Conference on Medical image computing and computer-assisted intervention, 234–241, Springer (2015).
↵
N. Shusharina & T. Bortfeld, “Glioma image segmentation for radiotherapy: Rt targets, bar-riers to cancer spread, and organs at risk [data set],” (2021). The Cancer Imaging Archive, https://doi.org/10.7937/TCIA.T905-ZQ20.
↵
B. Mortamet, M. A. Bernstein, C. R. Jack Jr, et al., “Automatic quality assessment in struc-tural brain magnetic resonance imaging,” Magnetic Resonance in Medicine: An Official Jour-nal of the International Society for Magnetic Resonance in Medicine 62(2), 365–372 (2009).
OpenUrl
↵
M. Torop, S. V. Kothapalli, Y. Sun, et al., “Deep learning using a biophysical model for robust and accelerated reconstruction of quantitative, artifact-free and denoised images,” Magnetic resonance in medicine 84(6), 2932–2942 (2020).
OpenUrl CrossRef
↵
I. Goodfellow, J. Pouget-Abadie, M. Mirza, et al., “Generative adversarial networks,” Com-munications of the ACM 63(11), 139–144 (2020).
OpenUrl
↵
P. Isola, J.-Y. Zhu, T. Zhou, et al., “Image-to-image translation with conditional adversarial networks,” in Proceedings of the IEEE conference on computer vision and pattern recogni-tion, 1125–1134 (2017).
↵
K. He, X. Zhang, S. Ren, et al., “Deep residual learning for image recognition,” in Proceed-ings of the IEEE conference on computer vision and pattern recognition, 770–778 (2016).
↵
A. Radford, L. Metz, and S. Chintala, “Unsupervised representation learning with deep con-volutional generative adversarial networks,” arXiv preprint arXiv:1511.06434 (2015).
↵
A. Odena, V. Dumoulin, and C. Olah, “Deconvolution and checkerboard artifacts,” Distill 1(10), e3 (2016).
OpenUrl
↵
L. Liu, H. Jiang, P. He, et al., “On the variance of the adaptive learning rate and beyond,” arXiv preprint arXiv:1908.03265 (2019).
↵
J.-Y. Zhu, T. Park, P. Isola, et al., “Unpaired image-to-image translation using cycle-consistent adversarial networks,” in Proceedings of the IEEE international conference on computer vision, 2223–2232 (2017).
↵
K. H. Jin, M. T. McCann, E. Froustey, et al., “Deep convolutional neural network for inverse problems in imaging,” IEEE Transactions on Image Processing 26(9), 4509–4522 (2017).
OpenUrl CrossRef PubMed
↵
Z. Wang, A. C. Bovik, H. R. Sheikh, et al., “Image quality assessment: from error visibility to structural similarity,” IEEE transactions on image processing 13(4), 600–612 (2004).
OpenUrl CrossRef PubMed Web of Science
↵
Z. Wang, E. P. Simoncelli, and A. C. Bovik, “Multiscale structural similarity for image quality assessment,” in The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003, 2, 1398–1402, Ieee (2003).
OpenUrl
↵
H. R. Sheikh and A. C. Bovik, “Image information and visual quality,” IEEE Transactions on image processing 15(2), 430–444 (2006).
OpenUrl CrossRef PubMed
↵
B. Zhang, P. V. Sander, and A. Bermak, “Gradient magnitude similarity deviation on mul-tiple scales for color image quality assessment,” in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1253–1257, IEEE (2017).
↵
T. Karras, S. Laine, M. Aittala, et al., “Analyzing and improving the image quality of style-gan,” in Proceedings of the IEEE/CVF conference on computer vision and pattern recogni-tion, 8110–8119 (2020).
↵
P. Kemenczky, P. Vakli, E. Somogyi, et al., “Effect of head motion-induced artefacts on the reliability of deep learning-based whole-brain segmentation,” Scientific reports 12(1), 1–13 (2022).
OpenUrl
↵
N. Aldoj, F. Biavati, F. Michallek, et al., “Automatic prostate and prostate zones segmenta-tion of magnetic resonance images using densenet-like u-net,” Scientific reports 10(1), 1–17 (2020).
S. Liu, K.-H. Thung, L. Qu, et al., “Learning MRI artefact removal with unpaired data,” Nature Machine Intelligence 3(1), 60–67 (2021).
R. W. Brown, Y.-C. N. Cheng, E. M. Haacke, et al., Magnetic resonance imaging: physical principles and sequence design, ch. 20. John Wiley & Sons (2014).

Posted March 08, 2023.

Subject Area

Radiology and Imaging
