Article

Practical Medical Image Generation with Provable Privacy Protection Based on Denoising Diffusion Probabilistic Models for High-Resolution Volumetric Images

1 Department of Radiology, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo, Tokyo 113-8655, Japan
2 Department of Computational Diagnostic Radiology and Preventive Medicine, The University of Tokyo Hospital, 7-3-1 Hongo, Bunkyo, Tokyo 113-8655, Japan
3 Department of Radiology, School of Medicine, Jichi Medical University, 3311-1 Yakushiji, Shimotsuke, Tochigi 329-0498, Japan
4 Center for Frontier Medical Engineering, Chiba University, 1-33 Yayoi-cho, Inage, Chiba 263-8522, Japan
* Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(8), 3489; https://doi.org/10.3390/app14083489
Submission received: 19 March 2024 / Revised: 19 April 2024 / Accepted: 19 April 2024 / Published: 20 April 2024
(This article belongs to the Special Issue Artificial Intelligence in Biomedical Image Processing)

Abstract

Local differential privacy algorithms combined with deep generative models can enhance secure medical image sharing among researchers in the public domain without central administrators; however, such approaches have been limited to generating low-resolution images, which are insufficient for diagnosis by medical doctors. To enable deep generative models to generate high-resolution medical images, we propose a large-scale diffusion model that can, for the first time, unconditionally generate high-resolution (256 × 256 × 256) volumetric medical images (head magnetic resonance images). This diffusion model has 19 billion parameters; to make it easy to train, we temporally divided the model into 200 submodels, each of which has 95 million parameters. Moreover, on the basis of this new diffusion model, we propose another formulation of image anonymization with which the processed images satisfy provable Gaussian local differential privacy and with which we can generate images that are semantically different from the original image but belong to the same class. We believe that the formulation of this new diffusion model and the implementation of local differential privacy algorithms combined with diffusion models can contribute to the secure sharing of practical images upstream of data processing.

1. Introduction

Local differential privacy (LDP) [1,2] combined with deep generative models [3,4] can simultaneously ensure a provable and controllable upper bound on information leakage and retain realistic utility [5,6] upstream of data processing. Therefore, we consider this combination a realistic tool for anonymizing and then sharing private medical images among researchers in the public domain. However, the total dimension of data modeled by deep generative models for images has typically been limited to 2,097,152 or below (see the section on related works for details). In contrast, to diagnose patients with volumetric medical images, medical doctors usually want those images to have a resolution of at least 256 × 256 × 256, i.e., 16,777,216 total dimensions. Against this background, the purpose of the present study is to build a deep generative model that can learn volumetric images at a state-of-the-art high resolution (256 × 256 × 256) and sample from the probability distribution of those images, so that LDP algorithms can be applied to high-resolution volumetric medical images. Specifically, we adopt diffusion models as the deep generative models because they can divide the problem of learning the probability distribution of images into virtually infinitely many subproblems, which enhances their expressive power. In connection with this division, the stochastic differential equations for diffusion models [7] are an interesting topic; however, we do not handle those equations in the present study. More specifically, we adopt pixel-space diffusion models [3] because they can handle real images directly, without the encoders needed by generative adversarial networks (GANs) [4] and latent-space diffusion models [8], which is an important and favorable feature in LDP processing.
Furthermore, the encoder-decoder pair of a latent-space diffusion model requires a significantly large amount of GPU memory when the model is scaled up to high-resolution volumetric images. Finally, we not only show unconditional image generation from pseudo-random noise with the proposed diffusion model but also apply the proposed model to generate high-resolution LDP volumetric medical images from a real image. Because differential privacy (DP) provides provable privacy protection, we concentrate on clarifying the utility of the generated images, including LDP-processed images, through visual evaluations by three medical doctors. We use head magnetic resonance (MR) images taken at the University of Tokyo Hospital throughout the present study. In summary, this study enhances diffusion models to facilitate the modeling of high-resolution volumetric images, contributing to the construction of practical medical systems. Additionally, we apply this improved diffusion model to propose and validate a method for sharing medical images with guaranteed privacy.

2. Related Works

2.1. Deep Generative Models for High-Dimensional Data

Shibata et al. [9] improved the training method of flow-based deep generative models [10,11,12] and successfully modeled three-dimensional chest computed tomography (CT) images with the models, but the image resolution was limited to 128 × 128 × 128 or below. Khader et al. [13] adopted denoising diffusion probabilistic models (DDPMs) to unconditionally generate three-dimensional medical images, but the total dimension of the image pixels was limited to 2,097,152 or below. Bieder et al. [14] adopted DDPMs to segment three-dimensional high-resolution (256 × 256 × 256) medical images, but they did not report the results of image generation. Dorjsembe et al. [15] adopted DDPMs to conditionally generate three-dimensional medical images, but their resolution was limited to 128 × 128 × 128. Finally, Sun et al. [16] proposed a three-dimensional GAN that can unconditionally generate high-resolution (256 × 256 × 256) medical images, but, unlike the present study, they did not report image generation conditioned on another image.

2.2. Differential Privacy for Multidimensional Data

Fan [17] proposed an LDP algorithm for image anonymization. The algorithm directly adds perturbation noise to images. In the present study, we adopt this algorithm, but we additionally postprocess the LDP-processed noisy images. Croft et al. [6], Li and Clifton [18], and Liu et al. [19] almost simultaneously proposed another LDP algorithm for the obfuscation of facial images, which adopts generative models to semantically change the identity in facial images while preserving the class (face) of those images. However, they did not adopt diffusion models as generative models. We stress that the above LDP algorithms are all different from DP-SGD [20], which adds perturbation noise to the parameters of deep discriminative or generative models.

3. Methods

Figure 1 shows the flowchart of the proposed method.

3.1. Score Matching and DDPMs

Given domain-specific data (we denote the set of the data by $D$ and a single datum by $x$), the purpose of generative models is to estimate the (log-)probability distribution $\log p(x)$ of those data. However, directly modeling $\log p(x)$ is very difficult. Therefore, we focus on the score $\nabla_x \log p(x)$. Directly modeling $\nabla_x \log p(x)$ is still difficult, so we estimate the score with implicit score matching [21]. However, scaling implicit score matching up to high-dimensional data is difficult, and overfitting is a significant problem. Therefore, we divide the problem: we add perturbation noise of different signal-to-noise ratios (SNRs) to the data and prepare datasets characterized by different SNRs. Implicit score matching then learns denoising from noisy data.
Denoising diffusion probabilistic models (DDPMs) [3] are another expression of score-based diffusion models and are formulated to maximize the evidence lower bound (ELBO) of the Kullback–Leibler divergence. Maximizing the ELBO is equivalent to learning noise prediction from noisy data. Specifically, we train a model, represented as a vector function $f_{t,\theta}$ conditioned on the time step, so that the model can predict the noise component at time step $t$, which is linearly combined with the normalized image ($x_t \in [-1, 1]$):
$$x_t = \sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{\bar{\beta}_t}\, \epsilon, \tag{1}$$
and
$$\hat{x}_{t-1} = f_{t,\theta}(x_t), \tag{2}$$
where the hat represents a predicted quantity throughout this paper, and
$$\epsilon \sim \mathcal{N}(0, I), \tag{3}$$
$$\alpha_t := 1 - \beta_t, \tag{4}$$
$$\bar{\alpha}_t := \prod_{s=1}^{t} \alpha_s, \tag{5}$$
$$\bar{\beta}_t := 1 - \bar{\alpha}_t, \tag{6}$$
where $0 < \beta_1 < \beta_2 < \cdots < \beta_T < 1$ are the control parameters for the magnitude of the deviations. In this study, we adopt sigmoid scheduling [22] for the betas.
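The forward process above can be sketched in a few lines of plain Python. This is a minimal illustration only: the function names are hypothetical, and `sigmoid_betas` is a placeholder sigmoid-shaped schedule with assumed endpoints, not the scheduling of [22]. The square roots follow from requiring the noise variance relative to the signal to be $\bar{\beta}_t / \bar{\alpha}_t$, as used later in Section 3.4.

```python
import math
import random


def sigmoid_betas(T, start=-3.0, end=3.0, beta_min=1e-4, beta_max=0.05):
    """Hypothetical sigmoid-shaped schedule with 0 < beta_1 < ... < beta_T < 1.

    The endpoints and range are assumptions made for illustration.
    """
    betas = []
    for t in range(1, T + 1):
        u = start + (end - start) * (t - 1) / (T - 1)
        s = 1.0 / (1.0 + math.exp(-u))
        betas.append(beta_min + (beta_max - beta_min) * s)
    return betas


def cumulative_terms(betas):
    """Return (alpha_bar, beta_bar) with alpha_bar_t = prod_{s<=t} (1 - beta_s)."""
    alpha_bar, beta_bar, prod = [], [], 1.0
    for b in betas:
        prod *= 1.0 - b
        alpha_bar.append(prod)
        beta_bar.append(1.0 - prod)
    return alpha_bar, beta_bar


def forward_sample(x0, t, alpha_bar, beta_bar, rng):
    """Draw x_t = sqrt(alpha_bar_t) * x_0 + sqrt(beta_bar_t) * eps, eps ~ N(0, I)."""
    a = math.sqrt(alpha_bar[t - 1])
    b = math.sqrt(beta_bar[t - 1])
    return [a * v + b * rng.gauss(0.0, 1.0) for v in x0]


betas = sigmoid_betas(T=200)
alpha_bar, beta_bar = cumulative_terms(betas)
rng = random.Random(0)
x0 = [-1.0, 0.0, 0.5, 1.0]          # toy "image" with pixels in [-1, 1]
x50 = forward_sample(x0, 50, alpha_bar, beta_bar, rng)
```

Lists stand in for image tensors so that the sketch has no dependencies; a real implementation would operate on volumes of 256 × 256 × 256 pixels.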

3.2. Scale Up to High-Resolution Volumetric Images by Model Unrolling

Previous DDPMs recursively apply the same neural network to denoise images of different SNRs, with the time step information prescribed inside the denoising network:
$$\hat{x}_0 = \underbrace{f_{t=1,\theta} \circ f_{t=2,\theta} \circ \cdots \circ f_{t=T,\theta}}_{T\text{-steps}}(x_T). \tag{7}$$
This DDPM network ($f_\theta$) requires large expressive power, i.e., numerous parameters, to denoise images of different SNRs, and it is very difficult to scale it up to high-resolution volumetric images owing to the GPU memory limitation. In contrast, if we train a different DDPM network for each time step, the expressive power required of each model is significantly relaxed. On the basis of this insight, we have
$$\hat{x}_0 = \underbrace{f_{\theta_1} \circ f_{\theta_2} \circ \cdots \circ f_{\theta_T}}_{T\text{-steps}}(x_T), \tag{8}$$
where $f_{\theta_i}$ learns denoising from $x_t$ to estimate the noise component and can generate the image at the previous time step, $x_{t-1}$. Specifically, for training, we prepare multiple instances of the network shown in Figure A1 for different time steps ($t$) and optimize their parameters using the Adam optimizer.
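The difference between a single shared network and the unrolled formulation is only in which function is applied at each step. The sketch below illustrates the unrolled composition with toy stand-ins; `make_step_model` and `reverse_process` are hypothetical names, and each "model" is a simple scaling function rather than a trained network.

```python
def make_step_model(t, T):
    """Hypothetical stand-in for f_{theta_t}: maps x_t to an estimate of x_{t-1}.

    Each step gets its own parameter (here a single toy scale factor).
    """
    scale = 1.0 - 0.1 * t / T

    def f(x):
        return [scale * v for v in x]

    return f


def reverse_process(models, x_start, start_step):
    """Apply f_{theta_start_step}, ..., f_{theta_2}, f_{theta_1} in that order."""
    x = list(x_start)
    applied = []
    for t in range(start_step, 0, -1):
        x = models[t](x)          # a different model for every time step
        applied.append(t)
    return x, applied


T = 5
models = {t: make_step_model(t, T) for t in range(1, T + 1)}
x_hat0, order = reverse_process(models, [1.0, -2.0], start_step=T)
```

Because only one per-step model has to be resident at a time, each model can be smaller than a network that must handle all SNRs, which is the memory argument made in the text.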

3.3. DP

We deal with LDP, as distinct from global differential privacy. LDP can anonymize the data itself upstream of data processing. The theoretical guarantee of privacy protection for LDP with the Gaussian mechanism [1,23] is given by
$$\Pr(\tilde{x} \mid x) \le e^{\epsilon} \Pr(\tilde{x} \mid x') + \delta, \tag{9}$$
where $\tilde{x}$ is an LDP-processed image, $x$ and $x'$ are different arbitrarily selected images in the probability distribution of images, $\epsilon \ge 0$ and $0 \le \delta < 1$ are the privacy budgets specified by the administrators of the images, and $\Pr(\tilde{x} \mid x)$ [or $\Pr(\tilde{x} \mid x')$] is the conditional probability that $\tilde{x}$ is generated when $x$ (or $x'$) is given.
To generate $\tilde{x}$ from $x$ (or $x'$) such that (9) is satisfied, we add a perturbation that obeys a normal (Gaussian) distribution to $x$ as follows:
$$\tilde{x}_{ijk} = x_{ijk} + n, \tag{10}$$
$$n \sim \mathcal{N}(0, \sigma^2), \tag{11}$$
$$\sigma^2 = \frac{2 \ln(1.25/\delta_{ijk}) \cdot (\Delta f_{ijk})^2}{\epsilon_{ijk}^2}, \tag{12}$$
where, for a single pixel ($x_{ijk}$) of $x$, $\epsilon_{ijk}$ and $\delta_{ijk}$ are the privacy budgets and $\Delta f_{ijk}$ is the sensitivity. After the addition of the perturbation, we clip the range of $\tilde{x}_{ijk}$ from −1 to 1.
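As a minimal sketch of the per-pixel Gaussian mechanism described above, the following computes the noise variance from the privacy budgets and the sensitivity, adds the perturbation, and clips the result. The function names and the budget values in the example are illustrative assumptions, not values from the experiments.

```python
import math
import random


def gaussian_sigma2(eps, delta, sensitivity):
    """Variance of the Gaussian mechanism: 2 ln(1.25/delta) * sensitivity^2 / eps^2."""
    return 2.0 * math.log(1.25 / delta) * sensitivity ** 2 / eps ** 2


def perturb_and_clip(x, sigma2, rng, lo=-1.0, hi=1.0):
    """Add N(0, sigma2) noise to every pixel, then clip to [lo, hi]."""
    sigma = math.sqrt(sigma2)
    return [min(hi, max(lo, v + rng.gauss(0.0, sigma))) for v in x]


# Illustrative (assumed) per-pixel budgets; the sensitivity is 2 for pixels in [-1, 1].
sigma2 = gaussian_sigma2(eps=10.0, delta=1e-5, sensitivity=2.0)
x = [-1.0, -0.3, 0.4, 1.0]
x_tilde = perturb_and_clip(x, sigma2, random.Random(0))
```

Smaller budgets give a larger variance, so the perturbed pixels carry less information about the original image.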

3.4. Integration of DDPM and DP

First, we handle ($\epsilon_{ijk}$, $\delta_{ijk}$)-Gaussian-LDP for a single fixed pixel ($x_{ijk}$) and extend this to all the pixels using the following composition theorem of DP:
$$\epsilon = \sum_{i,j,k} \epsilon_{ijk}, \tag{13}$$
$$\delta = \sum_{i,j,k} \delta_{ijk}, \tag{14}$$
where $\epsilon$ and $\delta$ are the total privacy budgets.
Because the deviations of perturbed images appear in DDPMs (1), we can write
$$\sigma_t^2 = \frac{\bar{\beta}_t}{\bar{\alpha}_t}. \tag{15}$$
We can now connect the above equation with (12) as
$$\sigma_t^2 = \sigma^2. \tag{16}$$
This indicates that we can compute virtually infinitely many pairs of $\epsilon$ and $\delta$ if the noise scheduling (betas) of the DDPM and the sensitivity $\Delta f_{ijk}$ are given.
Note that we can obtain LDP-processed images by just picking intermediate images $x_t$ in the DDPM (the forward process), but those images do not preserve the class (head MR images in the present study) if the privacy budgets are small. To preserve the class even when the privacy budgets are small, we apply the DDPM in reverse to the intermediate images as follows (the reverse process):
$$\hat{x}_0 = \underbrace{f_{\theta_1} \circ f_{\theta_2} \circ \cdots \circ f_{\theta_t}}_{t\text{-steps}}(x_t). \tag{17}$$
With this postprocessing, we can obtain images that (i) preserve the class, (ii) may be semantically different from the original images, and (iii) have provable indistinguishability and are therefore practical for medical data sharing.
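To make the connection explicit, the two expressions for the variance can be equated and solved for the per-pixel budget. The following is only a rearrangement of the relations stated above, written under the same definitions:

```latex
\frac{\bar{\beta}_t}{\bar{\alpha}_t}
  = \frac{2 \ln(1.25/\delta_{ijk}) \, (\Delta f_{ijk})^2}{\epsilon_{ijk}^2}
\quad\Longrightarrow\quad
\epsilon_{ijk}
  = \Delta f_{ijk} \sqrt{\frac{2 \ln(1.25/\delta_{ijk}) \, \bar{\alpha}_t}{\bar{\beta}_t}} .
```

Since $\bar{\alpha}_t$ decreases and $\bar{\beta}_t$ increases with $t$, choosing a later time step yields a smaller $\epsilon_{ijk}$ for a fixed $\delta_{ijk}$, i.e., a tighter privacy budget.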
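The anonymization procedure, i.e., the forward process up to step $t$ followed by $t$ reverse steps, can be sketched as follows. All names are hypothetical, the cumulative schedule `alpha_bar` is an assumed placeholder, and the per-step "models" simply clip their input instead of performing learned denoising.

```python
import math
import random


def clip(v, lo=-1.0, hi=1.0):
    return min(hi, max(lo, v))


def anonymize(x0, t, alpha_bar, step_models, rng):
    """Forward-noise x0 to step t, then apply f_{theta_t}, ..., f_{theta_1}."""
    a = math.sqrt(alpha_bar[t - 1])
    b = math.sqrt(1.0 - alpha_bar[t - 1])
    # x_t / a equals x0 plus Gaussian noise of variance (1 - alpha_bar_t) / alpha_bar_t.
    x_t = [a * v + b * rng.gauss(0.0, 1.0) for v in x0]
    x = list(x_t)
    for s in range(t, 0, -1):       # postprocessing with the reverse process
        x = step_models[s](x)
    return x_t, x


T = 200
alpha_bar = [0.99 ** s for s in range(1, T + 1)]              # assumed schedule
step_models = {s: (lambda x: [clip(v) for v in x]) for s in range(1, T + 1)}
x_t, x_hat0 = anonymize([-1.0, 0.2, 1.0], t=50, alpha_bar=alpha_bar,
                        step_models=step_models, rng=random.Random(0))
sigma2_t = (1.0 - alpha_bar[49]) / alpha_bar[49]              # variance at t = 50
```

The postprocessing operates only on the already perturbed image $x_t$, which is why it can restore the image class without touching the original image again.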

4. Numerical Experiments

4.1. Preparation of Head MR Images

The institutional review board of the University of Tokyo Hospital approved the use of head MR images (T1WI) taken in the hospital for the present retrospective study. From November 2006 to December 2017, high-resolution volumetric fast spoiled gradient-echo MR imaging was performed with a Signa EXCITE and a Discovery MR750 scanner (GE Healthcare Japan, Tokyo, Japan) (repetition time, 6.4 ms; echo time, 2.0 ms; inversion time, 450 ms; field-of-view, 25 cm; flip angle, 15 deg; acquisition matrix, 256 × 256; number of excitations, 0.5; and voxel dimensions, 0.98 × 0.98 × 1.0 mm³) using an 8-channel head coil. From January 2018 to April 2021, high-resolution volumetric magnetization-prepared rapid gradient-echo MR images were acquired with a Biograph mMR scanner (repetition time, 1660 ms; echo time, 2.4 ms; inversion time, 910 ms; field-of-view, 25 cm; flip angle, 8 deg; acquisition matrix, 256 × 256; number of excitations, 1; and voxel dimensions, 0.98 × 0.98 × 1.0 mm³) using a 16-channel receiver coil (these protocols are the same as in [24]). We extracted 1327 head MR images (volumes) from a large set (including both normal and abnormal cases) of images taken with the above protocols. The resolution of each two-dimensional MR image (sagittal slice) was 256 × 256. We stacked these two-dimensional images, with padding, to create a three-dimensional image of 256 × 256 × 256 size. These head MR images were processed without skull stripping. The images were divided into training (1224) and test (103) datasets. Each image was normalized so that the whole pixel range of all the images was covered, and we then mapped this range onto [−1, 1] for the training.
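One way to read the normalization step is as a single linear map, determined by the global intensity range of the whole dataset, onto [−1, 1]. The sketch below follows that reading, which is an assumption; the function names and the toy intensity values are hypothetical.

```python
def global_range(volumes):
    """Minimum and maximum intensity over all volumes in the dataset."""
    lo = min(min(v) for v in volumes)
    hi = max(max(v) for v in volumes)
    return lo, hi


def to_unit_range(volume, lo, hi):
    """Map intensities linearly from [lo, hi] onto [-1, 1]."""
    scale = 2.0 / (hi - lo)
    return [(x - lo) * scale - 1.0 for x in volume]


# Toy flattened "volumes" with made-up intensities.
volumes = [[0.0, 120.0, 800.0], [30.0, 4095.0, 512.0]]
lo, hi = global_range(volumes)
normalized = [to_unit_range(v, lo, hi) for v in volumes]

n_voxels = 256 ** 3     # pixels per 256 x 256 x 256 volume
```

Using one global range keeps intensities comparable across volumes and guarantees that every pixel satisfies the bound of −1 to 1 assumed in the sensitivity calculation.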

4.2. Training of DDPMs

We trained the models in an unsupervised manner with the DDPM network described in Appendix A, without the time step information ($t$). We set the total number of time steps to $T = 200$; hence, we trained 200 models ($\theta_1, \theta_2, \ldots, \theta_{200}$). Each model for a fixed time step contains 95 million trainable parameters, and all the models together contain 19 billion trainable parameters. We trained the models efficiently in a serpentine folding manner. Specifically, we first trained the model $\theta_1$; when this was finished, we initialized the next model $\theta_2$ with the parameters of $\theta_1$ and then trained it. We define one epoch as the process in which all the models ($\theta_1, \theta_2, \ldots, \theta_{200}$) are updated once. We trained the models for four epochs. All the computations were executed on a single node of the supercomputer Wisteria/BDEC-01 (Fujitsu, Tokyo, Japan) at the University of Tokyo (one node contains eight A100 GPUs with 40 GB of memory each).
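The training order can be sketched as a nested loop over epochs and time steps. This is a schematic under stated assumptions: `train_one_model` is a placeholder for the actual optimization, and the behavior after the first epoch (each model resuming from its own parameters) is an assumption, since the text specifies only the warm start from the preceding model.

```python
def train_one_model(params, t):
    """Placeholder for one round of optimizing theta_t on data at time step t."""
    updated = dict(params)
    updated["updates"] += 1
    updated["last_step"] = t
    return updated


def train_unrolled(T, epochs, init_weights):
    thetas = {}
    for epoch in range(epochs):
        for t in range(1, T + 1):
            if epoch == 0:
                # Warm start: theta_1 from the initial weights,
                # theta_t (t > 1) from the just-trained theta_{t-1}.
                source = init_weights if t == 1 else thetas[t - 1]["weights"]
                start = {"weights": list(source), "updates": 0}
            else:
                # Assumption: later epochs resume each theta_t from its own state.
                start = thetas[t]
            thetas[t] = train_one_model(start, t)
    return thetas


thetas = train_unrolled(T=200, epochs=4, init_weights=[0.0])
total_params = 200 * 95_000_000     # 200 submodels x 95 million parameters
```

The last line reproduces the parameter count quoted in the text: 200 submodels of 95 million parameters each give 19 billion parameters in total.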

5. Results

5.1. Unconditional Image Generation and Visual Evaluation

Using the trained model, we first unconditionally generated head MR images [i.e., $(\epsilon, \delta) = (0, 0)$]. Three medical doctors (radiologists) evaluated the quality of those images (six volumes) and of real images (six volumes) to show the capability of the model to generate realistic but fictional high-resolution volumetric medical images. They evaluated the appearance of anatomical structures and the contrast of the cortical white matter in five stages on the basis of the criteria shown in Table 1. Table 2 and Table 3 show the results of the evaluation of real and fake cases, respectively. Moreover, in Figure 2, we show three representative slices (i.e., axial, coronal, and sagittal slices) of a head MR image generated from pseudo-random noise, i.e., from $t = 200$ in (17) (we selected case 1 in Table A1, Table A2 and Table A3). The medical images obtained in the present study were cropped and enlarged to enhance their visibility.

5.2. Conditional Image Generation and Equivalent Privacy Budget

Because we clarified in the previous subsection that the model can generate realistic but fictional head MR images, we now adopt the model to anonymize real head MR images. Specifically, we consider cases with nonzero privacy budgets, i.e., $(\epsilon, \delta) \neq (0, 0)$, for a real head MR image (this image corresponds to $x$ in Section 3.3) and generate anonymized fictional head MR images (these images correspond to the postprocessed $\tilde{x}$ in Section 3.3).
Figure 3 shows three representative slices of the real image to be LDP-processed, three representative slices of the LDP-processed image, and three representative slices of the LDP-processed and postprocessed image when the total privacy budgets are $\epsilon = 4.90 \times 10^{8}$ and $\delta = 0.168$. These budgets are obtained as follows. In the case of $t = 50$, we have $\sigma^2 = 0.175$. Additionally, the $l_2$-sensitivity $\Delta f_{ijk}$ is always 2 because we set $-1 \le (x_0)_{ijk} \le 1$. Therefore, for a given $\delta_{ijk} = 10^{-8}$, we have $\epsilon_{ijk} = 2.92 \times 10^{1}$ per pixel, and the total privacy budgets for an image of 256 × 256 × 256 size are $\epsilon = 4.90 \times 10^{8}$ and $\delta = 0.168$.
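The budget arithmetic quoted above can be reproduced with a short calculation. The sketch below uses only the values stated in the text ($\sigma^2 = 0.175$, $\Delta f_{ijk} = 2$, $\delta_{ijk} = 10^{-8}$, and 256 × 256 × 256 pixels); the function name is hypothetical.

```python
import math


def eps_per_pixel(sigma2, delta_ijk, sensitivity):
    """Solve sigma^2 = 2 ln(1.25/delta) * sensitivity^2 / eps^2 for eps."""
    return sensitivity * math.sqrt(2.0 * math.log(1.25 / delta_ijk) / sigma2)


n_voxels = 256 ** 3                       # 16,777,216 pixels per volume
eps_ijk = eps_per_pixel(sigma2=0.175, delta_ijk=1e-8, sensitivity=2.0)
eps_total = n_voxels * eps_ijk            # composition: sum of per-pixel epsilons
delta_total = n_voxels * 1e-8             # composition: sum of per-pixel deltas

print(f"eps_ijk={eps_ijk:.3g}, eps_total={eps_total:.3g}, delta_total={delta_total:.3g}")
```

To three significant figures, the computed values agree with the per-pixel budget of 2.92 × 10¹ and the total budgets of 4.90 × 10⁸ and 0.168 reported in the text.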

6. Discussion

6.1. Novelty

We realized state-of-the-art unconditional and conditional high-resolution volumetric image generation with this improved DDPM. The new architecture with low memory consumption for our proposed DDPM enabled this (see Section 3 and Appendix A for details). Moreover, the novel LDP and postprocessing algorithms that can generate semantically different images in the same class (head MR images) based on the DDPM were proposed and validated.

6.2. Quality Evaluation by Medical Doctors

Global and large structures, e.g., the lateral ventricle, are generated well unconditionally with the proposed DDPM. However, relatively small structures, e.g., the hippocampus and the cortical white matter contrast, are not generated very well, as commonly indicated by the three medical doctors in the evaluation results. Nevertheless, most of the anatomically important structures scored more than 2.0 on average. Since a score of 2 means “slightly identifiable” in this study, we believe that our proposed method can reconstruct these important structures properly. In the clinical domain, the corticomedullary contrast and the volumes of the hippocampi are important in disease diagnosis and prognosis prediction (e.g., Alzheimer disease [25]). Therefore, we need to improve the depiction accuracy of these structures. Note that such a clinically relevant evaluation of AI-generated volumes was rarely performed in previous studies and is thus one of our contributions in this paper.
As the difficult structures (corticomedullary contrast and hippocampi) are slightly visualized in the results, additional postprocessing, e.g., deep-learning-based filtering methods [26], could also be applied to improve the depiction of these structures. Generally, however, simple noise reduction postprocessing cannot recover structures that are not represented in the original generated volume. In other words, for evaluating the quality of generated medical images, the ordinary contrast-to-noise ratio (CNR) or structural similarity index measure (SSIM) is not enough, and the faithful recreation of the relevant anatomical structures must be confirmed by medical experts.
On the other hand, the statistical variety of generated images is another quality measure of generative models, and it should be evaluated by further experiments. This is one of our future works. However, it would be difficult to fairly evaluate the variety of medical images generated by a given model, because generated medical images should not only be diverse but also resemble the distribution of a real population of human beings. The ordinary inception score (IS) and Fréchet inception distance (FID) rely on another classifier model, which is rarely customized for medical images. Inventing a new methodology to evaluate the variety of generated medical images will be another future work.

6.3. Limitation

First, without model parallelization or high-capacity-memory GPUs, a straightforward scale-up of the present model to a resolution of 512 × 512 × 512 is not easy. Second, we showed LDP-processed images with limited deviations $\sigma_t^2$, but we can set arbitrary deviations in theory. This requires retraining the diffusion model; otherwise, the model will generate a suboptimal image.

6.4. Future Works

Increasing the number of time steps ($T$) in the DDPM is expected to significantly improve the quality of generated images, including the contrast of the cortical white matter; this is included in our future works. Moreover, we plan to apply the proposed DDPM to other conditional image generation tasks, e.g., aging prediction and ultra-sparse-view CT [9].

7. Conclusions

We improved the diffusion models so that they can, for the first time, unconditionally generate high-resolution volumetric (256 × 256 × 256) medical images. Moreover, on the basis of this new diffusion model, we proposed another formulation of image anonymization with which the processed images can satisfy provable Gaussian local differential privacy and with which we can generate images semantically different from the original image but belonging to the same class. Furthermore, we validated the formulation with high-resolution volumetric medical image anonymization. This method assumes no specific class of images, making it potentially applicable to any type of natural image. Low-resolution medical images are not well-suited for practical medical systems, and traditional standard diffusion models have struggled to model high-resolution volumetric medical images. In this study, we overcame this challenge and paved the way for applying deep generative models to practical medical systems. We believe that this improvement of the DDPM and the formulation of LDP algorithms combined with the DDPM can contribute to the secure sharing of practical images upstream of data processing.

Author Contributions

Conceptualization, H.S.; methodology, H.S.; software, H.S.; validation, H.S.; formal analysis, H.S.; investigation, T.N., T.K. and Y.N. (Yuta Nakamura); resources, T.Y.; data curation, Y.N. (Yukihiro Nomura); writing—original draft preparation, H.S.; writing—review and editing, H.S., S.H., T.N., T.K., Y.N. (Yuta Nakamura), Y.N. (Yukihiro Nomura), T.Y. and O.A.; visualization, H.S.; supervision, T.Y. and O.A.; project administration, H.S.; funding acquisition, H.S. and S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Japan Science and Technology Agency (JST), CREST Grant Number JPMJCR21M2, including the FY2023 AIP challenge program (Establishment of the diffusion models for high-resolution volumetric images; PI: H. Shibata), Japan.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board (or Ethics Committee) of the University of Tokyo Hospital (protocol code: 1461-(9) and date of approval: 16 September 2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The code presented in this study is available on reasonable request from the corresponding author. The data are not publicly available due to privacy protection.

Acknowledgments

The Department of Computational Diagnostic Radiology and Preventive Medicine, the University of Tokyo Hospital, is sponsored by HIMEDIC Inc., and Siemens Healthcare K.K. This research was conducted using the FUJITSU Supercomputer PRIMEHPC FX1000 and FUJITSU Server PRIMERGY GX2570 (Wisteria/BDEC-01) at the Information Technology Center, the University of Tokyo.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

    The following abbreviations are used in this manuscript:
CT      Computed Tomography
DDPM    Denoising Diffusion Probabilistic Models
GAN     Generative Adversarial Networks
LDP     Local Differential Privacy
MR      Magnetic Resonance
SGD     Stochastic Gradient Descent

Appendix A. Network Architecture of the Proposed DDPM

Figure A1 illustrates the network architecture adopted in this study. We implemented a code on the basis of one of the PyTorch implementations [27] of the pixel space DDPMs for two-dimensional images [3] so that it can handle three-dimensional images and run on eight NVIDIA A100 GPUs (NVIDIA Corporation, Santa Clara, CA, USA) with 40 GB of memory for each in a data-parallel manner. This code combines the attention mechanism [28], a linear attention mechanism, and U-Net [29] enabling local and global feature extraction and generation. All the Conv2D operations were converted into Conv3D operations. We adopted the SiLU activation function. The previous network of attention mechanisms was also converted into the equivalent three-dimensional network. We set the number of input image channels to 1. The depth of the three-dimensional U-Net in our code is 8. The Res-Net in different depths of the U-Net has the channel sizes of 4, 8, 16, 32, 64, 128, 256, and 512. For attention mechanisms, we set the dimension of each head to 8 and the number of heads to 4. For linear attention mechanisms, we set the dimension of each head to 8 and the number of heads to 2.
Figure A1. We adopted a U-Net that includes attention to model the function $f_\theta$ (for details about the function $f_\theta$, refer to the main text).

Appendix B. All the Results of Evaluation by Three Medical Doctors

In Table A1, Table A2 and Table A3, we show all the evaluation results by medical doctors A, B, and C, respectively, for the six fake cases at a resolution of 256 × 256 × 256.
Table A1. Results of visual evaluation by medical doctor A.
Case (fake)                        1  2  3  4  5  6
Brain regions
  Anterior commissure              4  4  4  4  3  2
  Posterior commissure             3  2  2  3  1  2
  Cerebral aqueduct                2  1  2  2  1  1
  Tegmentum of midbrain            3  3  3  3  3  3
  Cerebellar hemisphere sulcus     1  2  2  2  2  3
  Cerebral peduncle                4  4  4  4  3  3
  Corpus callosum                  4  4  4  4  4  4
  Third ventricle                  4  3  3  3  3  3
  Fourth ventricle                 4  5  4  5  4  5
  Lateral ventricle                5  5  5  5  5  5
Cortical white matter contrast
  Hippocampus                      1  2  3  1  1  2
  Frontal lobe                     3  2  3  1  2  2
  Occipital lobe                   2  2  2  1  1  1
  Temporal lobe                    2  2  3  1  2  2
  Parietal lobe                    2  2  2  1  3  2
  Basal ganglia                    2  3  3  2  3  2
Other regions
  First cervical vertebra (C1)     1  2  1  4  1  1
  Second cervical vertebra (C2)    2  2  2  5  2  2
  Optic nerve                      1  5  3  4  2  2
  Extraocular muscles              4  4  5  5  3  3
Table A2. Results of visual evaluation by medical doctor B.
Case (fake)                        1  2  3  4  5  6
Brain regions
  Anterior commissure              3  2  2  1  2  2
  Posterior commissure             2  1  2  2  1  1
  Cerebral aqueduct                1  1  2  2  1  1
  Tegmentum of midbrain            3  3  3  3  3  3
  Cerebellar hemisphere sulcus     2  3  2  3  3  3
  Cerebral peduncle                3  3  3  3  2  3
  Corpus callosum                  4  4  4  5  4  4
  Third ventricle                  5  4  4  4  4  4
  Fourth ventricle                 5  4  4  4  4  4
  Lateral ventricle                5  4  4  4  4  4
Cortical white matter contrast
  Hippocampus                      2  2  3  2  3  2
  Frontal lobe                     3  1  3  1  1  1
  Occipital lobe                   3  2  2  1  1  1
  Temporal lobe                    3  2  2  2  3  2
  Parietal lobe                    3  2  2  1  2  1
  Basal ganglia                    2  2  3  2  2  2
Other regions
  First cervical vertebra (C1)     1  2  2  2  2  1
  Second cervical vertebra (C2)    2  2  3  3  2  2
  Optic nerve                      1  4  2  4  2  2
  Extraocular muscles              3  3  2  4  2  2
Table A3. Results of visual evaluation by medical doctor C.
Case (fake)                        1  2  3  4  5  6
Brain regions
  Anterior commissure              4  4  4  2  3  2
  Posterior commissure             2  3  2  3  2  2
  Cerebral aqueduct                2  2  2  2  2  3
  Tegmentum of midbrain            2  2  2  2  3  3
  Cerebellar hemisphere sulcus     3  4  4  3  3  3
  Cerebral peduncle                5  5  5  5  5  5
  Corpus callosum                  5  5  5  5  5  5
  Third ventricle                  5  4  5  5  5  4
  Fourth ventricle                 5  3  3  3  4  4
  Lateral ventricle                5  5  5  5  5  5
Cortical white matter contrast
  Hippocampus                      3  3  3  4  3  3
  Frontal lobe                     2  3  3  3  3  3
  Occipital lobe                   3  3  3  3  4  3
  Temporal lobe                    3  4  4  4  4  3
  Parietal lobe                    3  3  3  3  4  3
  Basal ganglia                    2  3  3  4  4  4
Other regions
  First cervical vertebra (C1)     3  2  5  5  5  5
  Second cervical vertebra (C2)    3  2  3  3  4  4
  Optic nerve                      2  5  2  4  2  2
  Extraocular muscles              4  3  2  4  3  3
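The per-structure averages discussed in Section 6.2 can be recomputed from these tables. The sketch below does so for three structures, copying their scores from Tables A1, A2 and A3; the choice of structures and the helper name `mean_score` are for illustration only.

```python
# Scores for fake cases 1-6, copied from Tables A1 (doctor A), A2 (B), and A3 (C).
scores = {
    "Lateral ventricle": {
        "A": [5, 5, 5, 5, 5, 5], "B": [5, 4, 4, 4, 4, 4], "C": [5, 5, 5, 5, 5, 5],
    },
    "Hippocampus": {
        "A": [1, 2, 3, 1, 1, 2], "B": [2, 2, 3, 2, 3, 2], "C": [3, 3, 3, 4, 3, 3],
    },
    "Cerebral aqueduct": {
        "A": [2, 1, 2, 2, 1, 1], "B": [1, 1, 2, 2, 1, 1], "C": [2, 2, 2, 2, 2, 3],
    },
}


def mean_score(by_doctor):
    """Average over all doctors and all cases for one structure."""
    values = [s for case_scores in by_doctor.values() for s in case_scores]
    return sum(values) / len(values)


means = {region: mean_score(by_doctor) for region, by_doctor in scores.items()}
for region, m in means.items():
    print(f"{region}: {m:.2f}")
```

Averaging over doctors and cases in this way is one possible aggregation; the main text may aggregate differently in Table 3.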

References

  1. Dwork, C. Differential privacy. In Proceedings of the International Colloquium on Automata, Languages, and Programming ICALP 2006, Venice, Italy, 10–14 July 2006; pp. 1–12.
  2. Dwork, C.; Kenthapadi, K.; McSherry, F.; Mironov, I.; Naor, M. Our data, ourselves: Privacy via distributed noise generation. In Proceedings of the Advances in Cryptology-EUROCRYPT 2006: 24th Annual International Conference on the Theory and Applications of Cryptographic Techniques, St. Petersburg, Russia, 28 May–1 June 2006; pp. 486–503.
  3. Ho, J.; Jain, A.; Abbeel, P. Denoising diffusion probabilistic models. Adv. Neural Inf. Process. Syst. 2020, 33, 6840–6851.
  4. Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. In Proceedings of the Advances in Neural Information Processing Systems 2014, Montreal, QC, Canada, 8–13 December 2014; Volume 27.
  5. Croft, W.L.; Sack, J.R.; Shi, W. Differentially private facial obfuscation via generative adversarial networks. Future Gener. Comput. Syst. 2022, 129, 358–379.
  6. Croft, L.; Sack, J.R.; Shi, W. Obfuscation of images via differential privacy: From facial images to general images. Peer Netw. Appl. 2021, 14, 1705–1733.
  7. Song, Y.; Sohl-Dickstein, J.; Kingma, D.P.; Kumar, A.; Ermon, S.; Poole, B. Score-Based Generative Modeling through Stochastic Differential Equations. In Proceedings of the International Conference on Learning Representations, Addis Ababa, Ethiopia, 26–30 April 2020.
  8. Rombach, R.; Blattmann, A.; Lorenz, D.; Esser, P.; Ommer, B. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 10684–10695.
  9. Shibata, H.; Hanaoka, S.; Nomura, Y.; Nakao, T.; Takenaga, T.; Hayashi, N.; Abe, O. On the Simulation of Ultra-Sparse-View and Ultra-Low-Dose Computed Tomography with Maximum a Posteriori Reconstruction Using a Progressive Flow-Based Deep Generative Model. Tomography 2022, 8, 2129–2152.
  10. Dinh, L.; Krueger, D.; Bengio, Y. Nice: Non-linear independent components estimation. arXiv 2014, arXiv:1410.8516.
  11. Dinh, L.; Sohl-Dickstein, J.; Bengio, S. Density estimation using real nvp. arXiv 2016, arXiv:1605.08803.
  12. Kingma, D.P.; Dhariwal, P. Glow: Generative flow with invertible 1x1 convolutions. Adv. Neural Inf. Process. Syst. 2018, 31, 10236–10245.
  13. Khader, F.; Müller-Franzes, G.; Tayebi Arasteh, S.; Han, T.; Haarburger, C.; Schulze-Hagen, M.; Schad, P.; Engelhardt, S.; Baeßler, B.; Foersch, S. Denoising diffusion probabilistic models for 3D medical image generation. Sci. Rep. 2023, 13, 7303.
  14. Bieder, F.; Wolleb, J.; Durrer, A.; Sandkuehler, R.; Cattin, P.C. Memory-Efficient 3D Denoising Diffusion Models for Medical Image Processing. In Proceedings of the Medical Imaging with Deep Learning, Nashville, TN, USA, 10–12 July 2023.
  15. Dorjsembe, Z.; Odonchimed, S.; Xiao, F. Three-dimensional medical image synthesis with denoising diffusion probabilistic models. In Proceedings of the Medical Imaging with Deep Learning, Zurich, Switzerland, 6–8 July 2022.
  16. Sun, L.; Chen, J.; Xu, Y.; Gong, M.; Yu, K.; Batmanghelich, K. Hierarchical amortized GAN for 3D high resolution medical image synthesis. IEEE J. Biomed. Health Inform. 2022, 26, 3966–3975.
  17. Fan, L. Image pixelization with differential privacy. In Proceedings of the IFIP Annual Conference on Data and Applications Security and Privacy, Bergamo, Italy, 16–18 July 2018; pp. 148–162.
  18. Li, T.; Clifton, C. Differentially private imaging via latent space manipulation. arXiv 2021, arXiv:2103.05472.
  19. Liu, B.; Ding, M.; Xue, H.; Zhu, T.; Ye, D.; Song, L.; Zhou, W. DP-Image: Differential Privacy for Image Data in Feature Space. arXiv 2021, arXiv:2103.07073.
  20. Abadi, M.; Chu, A.; Goodfellow, I.; McMahan, H.B.; Mironov, I.; Talwar, K.; Zhang, L. Deep learning with differential privacy. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, Vienna, Austria, 24–28 October 2016; pp. 308–318.
  21. Song, Y.; Ermon, S. Generative modeling by estimating gradients of the data distribution. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; Volume 32.
  22. Jabri, A.; Fleet, D.J.; Chen, T. Scalable Adaptive Computation for Iterative Generation. In Proceedings of the International Conference on Machine Learning, PMLR, Honolulu, HI, USA, 23–29 July 2023; pp. 14569–14589.
  23. Dwork, C. Differential privacy: A survey of results. In Proceedings of the International Conference on Theory and Applications of Models of Computation, Xi’an, China, 25–29 April 2008; pp. 1–19. [Google Scholar]
  24. Fujita, S.; Mori, S.; Onda, K.; Hanaoka, S.; Nomura, Y.; Nakao, T.; Yoshikawa, T.; Takao, H.; Hayashi, N.; Abe, O. Characterization of Brain Volume Changes in Aging Individuals With Normal Cognition Using Serial Magnetic Resonance Imaging. JAMA Netw. Open 2023, 6, e2318153. [Google Scholar] [CrossRef] [PubMed]
  25. Leandrou, S.; Petroudi, S.; Kyriacou, P.A.; Reyes-Aldasoro, C.C.; Pattichis, C.S. Quantitative MRI brain studies in mild cognitive impairment and Alzheimer’s disease: A methodological review. IEEE Rev. Biomed. Eng. 2018, 11, 97–111. [Google Scholar] [CrossRef] [PubMed]
  26. Chen, Z.; Pawar, K.; Ekanayake, M.; Pain, C.; Zhong, S.; Egan, G.F. Deep learning for image enhancement and correction in magnetic resonance imaging—State-of-the-art and challenges. J. Digit. Imaging 2023, 36, 204–230. [Google Scholar] [CrossRef] [PubMed]
  27. Denoising Diffusion Probabilistic Model, in PyTorch. Available online: https://github.com/lucidrains/denoising-diffusion-pytorch/releases/tag/1.8.5 (accessed on 12 March 2024).
  28. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems 2017, Long Beach, CA, USA, 4–9 December 2017; Volume 30. [Google Scholar]
  29. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015: 18th International Conference, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
Figure 1. We extend the denoising diffusion probabilistic model (DDPM) to handle high-resolution volumetric images, and furthermore, propose and validate a novel method to remove noise induced by local differential privacy (LDP) using DDPM.
Figure 2. One of the unconditionally generated images (Case 1 in Table A1). (a) Axial slice. (b) Coronal slice. (c) Sagittal slice.
Figure 3. Results of applying local differential privacy (LDP). (a–c) Axial, coronal, and sagittal slices of one of the real images. (d–f) Axial, coronal, and sagittal slices of the LDP-processed real image. (g–i) Axial, coronal, and sagittal slices of the LDP-processed and postprocessed real image.
Table 1. Evaluation criteria.

| Stage | Criteria |
|---|---|
| 1 | The structure is invisible. |
| 2 | The structure is slightly identifiable. |
| 3 | The structure is visible but not sufficient. |
| 4 | The structure is visible as in real cases with the resolution of 256 × 256 × 256. |
| 5 | The structure is well visible as in real cases with the resolution of 256 × 256 × 256. |
Table 2. Visual evaluation results for six real cases. Columns A, B, and C give each medical doctor's score averaged over the six cases; Ave. is the average of A, B, and C.

| Doctor | A | B | C | Ave. (256³) |
|---|---|---|---|---|
| Brain regions | | | | |
| Anterior commissure | 5.0 | 5.0 | 5.0 | 5.0 |
| Posterior commissure | 5.0 | 5.0 | 5.0 | 5.0 |
| Cerebral aqueduct | 5.0 | 5.0 | 5.0 | 5.0 |
| Tegmentum of midbrain | 5.0 | 5.0 | 5.0 | 5.0 |
| Cerebellar hemisphere sulcus | 5.0 | 5.0 | 4.7 | 4.9 |
| Cerebral peduncle | 5.0 | 5.0 | 5.0 | 5.0 |
| Corpus callosum | 5.0 | 5.0 | 5.0 | 5.0 |
| Third ventricle | 5.0 | 5.0 | 5.0 | 5.0 |
| Fourth ventricle | 5.0 | 5.0 | 5.0 | 5.0 |
| Lateral ventricle | 5.0 | 5.0 | 5.0 | 5.0 |
| Cortical white matter contrast | | | | |
| Hippocampus | 5.0 | 5.0 | 4.8 | 4.9 |
| Frontal lobe | 5.0 | 5.0 | 4.8 | 4.9 |
| Occipital lobe | 5.0 | 5.0 | 4.8 | 4.9 |
| Temporal lobe | 5.0 | 5.0 | 5.0 | 5.0 |
| Parietal lobe | 5.0 | 5.0 | 5.0 | 5.0 |
| Basal ganglia | 4.8 | 4.8 | 4.8 | 4.8 |
| Other regions | | | | |
| First cervical vertebra | 5.0 | 5.0 | 5.0 | 5.0 |
| Second cervical vertebra | 5.0 | 5.0 | 5.0 | 5.0 |
| Optic nerve | 5.0 | 5.0 | 4.8 | 4.9 |
| Extraocular muscles | 5.0 | 5.0 | 5.0 | 5.0 |
Table 3. Visual evaluation results for six fake cases. Columns A, B, and C give each medical doctor's score averaged over the six cases; Ave. is the average of A, B, and C.

| Doctor | A | B | C | Ave. (256³) |
|---|---|---|---|---|
| Brain regions | | | | |
| Anterior commissure | 3.5 | 2.0 | 3.2 | 2.9 |
| Posterior commissure | 1.8 | 1.5 | 2.3 | 1.9 |
| Cerebral aqueduct | 1.5 | 1.3 | 2.2 | 1.7 |
| Tegmentum of midbrain | 3.0 | 3.0 | 2.3 | 2.8 |
| Cerebellar hemisphere sulcus | 2.0 | 2.7 | 3.3 | 2.7 |
| Cerebral peduncle | 3.2 | 2.8 | 5.0 | 3.7 |
| Corpus callosum | 4.0 | 4.2 | 5.0 | 4.4 |
| Third ventricle | 3.2 | 4.2 | 4.7 | 4.0 |
| Fourth ventricle | 4.5 | 4.2 | 4.7 | 4.5 |
| Lateral ventricle | 5.0 | 4.2 | 3.7 | 4.3 |
| Corticomedullary contrast | | | | |
| Hippocampus | 1.7 | 2.3 | 3.2 | 2.4 |
| Frontal lobe | 2.3 | 1.7 | 2.8 | 2.3 |
| Occipital lobe | 1.5 | 1.7 | 3.2 | 2.1 |
| Temporal lobe | 2.0 | 2.3 | 3.7 | 2.7 |
| Parietal lobe | 2.0 | 1.8 | 3.2 | 2.3 |
| Basal ganglia | 2.5 | 2.2 | 3.3 | 2.7 |
| Other regions | | | | |
| First cervical vertebra | 1.7 | 1.7 | 4.2 | 2.5 |
| Second cervical vertebra | 2.5 | 2.3 | 3.2 | 2.7 |
| Optic nerve | 2.8 | 2.5 | 2.8 | 2.7 |
| Extraocular muscles | 4.0 | 2.7 | 3.2 | 3.3 |
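
The Ave. column in Tables 2 and 3 can be reproduced from the per-doctor columns. The following is a minimal sketch, not the authors' code: the function name `row_average` and the choice of three example rows are assumptions made for illustration, and the scores are copied from Table 3. Because the published per-doctor values are themselves rounded to one decimal place, a recomputed average could in principle differ from the published one in the last digit, although it agrees for the rows shown here.

```python
# Per-doctor scores (A, B, C) copied from three rows of Table 3.
scores = {
    "Anterior commissure": (3.5, 2.0, 3.2),
    "Corpus callosum": (4.0, 4.2, 5.0),
    "Hippocampus": (1.7, 2.3, 3.2),
}


def row_average(values, ndigits=1):
    """Mean of the per-doctor scores, rounded to the table's precision.

    Hypothetical helper for illustration; the paper does not describe
    how the averages were rounded.
    """
    return round(sum(values) / len(values), ndigits)


# Recompute the Ave. column for the selected structures.
averages = {region: row_average(v) for region, v in scores.items()}

for region, ave in averages.items():
    print(f"{region}: {ave}")
```

The same helper applies unchanged to the rows of Table 2.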
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Cite as: Shibata, H.; Hanaoka, S.; Nakao, T.; Kikuchi, T.; Nakamura, Y.; Nomura, Y.; Yoshikawa, T.; Abe, O. Practical Medical Image Generation with Provable Privacy Protection Based on Denoising Diffusion Probabilistic Models for High-Resolution Volumetric Images. Appl. Sci. 2024, 14, 3489. https://doi.org/10.3390/app14083489