Collaborative Screening of COVID-19-like Disease from Multi-Institutional Radiographs: A Federated Learning Approach

Abdel-Basset, Mohamed; Hawash, Hossam; Abouhawwash, Mohamed

doi:10.3390/math10244766

Open AccessArticle

Collaborative Screening of COVID-19-like Disease from Multi-Institutional Radiographs: A Federated Learning Approach

by

Mohamed Abdel-Basset

¹,

Hossam Hawash

¹ and

Mohamed Abouhawwash

^2,3,*

¹

Department of Computer Science, Faculty of Computers and Informatics, Zagazig University, Zagazig 44519, Egypt

²

Department of Mathematics, Faculty of Science, Mansoura University, Mansoura 35516, Egypt

³

Department of Computational Mathematics, Science and Engineering (CMSE), College of Engineering, Michigan State University, East Lansing, MI 48824, USA

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(24), 4766; https://doi.org/10.3390/math10244766

Submission received: 11 November 2022 / Revised: 27 November 2022 / Accepted: 5 December 2022 / Published: 15 December 2022

(This article belongs to the Topic Application of Deep Learning Method in 6G Communication Technology)

Download

Browse Figures

Versions Notes

Abstract

:

COVID-19-like pandemics are a major threat to the global health system have the potential to cause high mortality across age groups. The advance of the Internet of Medical Things (IoMT) technologies paves the way toward developing reliable solutions to combat these pandemics. Medical images (i.e., X-rays, computed tomography (CT)) provide an efficient tool for disease detection and diagnosis. The cost, time, and efforts for acquiring and annotating, for instance, large CT datasets make it complicated to obtain large numbers of samples from a single institution. However, owing to the necessity to preserve the privacy of patient data, it is challenging to build a centralized dataset from many institutions, especially during a pandemic. Moreover, heterogeneity between institutions presents a barrier to building efficient screening solutions. Thus, this paper presents a fog-based federated generative domain adaption framework (FGDA), where fog nodes aggregate patients’ data necessary to collaboratively train local deep-learning models for disease screening in medical images from different institutions. Local differential privacy is presented to protect the local gradients against attackers during the global model aggregation. In FGDA, the generative domain adaptation (DA) method is introduced to handle data discrepancies. Experimental evaluation on a case study of COVID-19 segmentation demonstrated the efficiency of FGDA over competing learning approaches with statistical significance.

Keywords:

deep learning; internet of medical things; COVID-19-like pandemics; federated learning; domain adaption

MSC:

68T05; 68U01; 68W50

1. Introduction

The Internet of Medical Things (IoMT) has been an essential contributor to the enhancement of the accuracy, reliability, and productivity of healthcare services provided in smart cities. The scientific community has expressed a significant amount of enthusiasm over the computerization of the IoMT in relation to conventional medical facilities and medical care services [1]. Through the establishment of a seamless connection between patients and medical staff, as well as the transmission of diagnostic and treatment data, the IoMT has the potential to eliminate unnecessary actions and responsibilities of health organisations [2].

The social and physiological problems that have been brought on by pandemic diseases pose a threat to the global health system. Every time a pandemic disease breaks out, it presents the world’s healthcare systems with new and difficult problems and responsibilities. It is widely believed that COVID-19 is the most dangerous and deadly pandemic disease that is having a severe impact on healthcare all over the world. Therefore, in order to battle pandemics caused by COVID-19-like viruses, it is necessary to have an IoMT system that can collect and evaluate diagnostic and treatment data pertaining to patients. In this regard, the screening and diagnosis of COVID-19-like pandemics is a first line of defence in the fight against a pandemic disease [3]. As a result of this, a significant amount of research attention has recently been dedicated to the screening and detection of COVID-19 from radiological images such as computed tomography (CT), X-rays, and ultrasound frames as a supplement to the RT-PCR test [4].

In recent years, Deep Learning (DL) has shown impressive performance in diagnosis activities (i.e., identification, localization, categorization, etc.) utilising x-ray images. Several convolutional neural networks (CNNs) have been trained to distinguish between COVID-19 patients, other types of pneumonia patients, and normal cases based on radiographic images [5]. In contrast, segmentation methods attempt to autonomously localize/detect and segment diseased areas in chest radiography images, with the primary goal of reducing the workloads associated with performing manual segmentation and helping to overcome the issues of inter- and intra-observer variance. Most current DL research, however, takes for granted that all data can be found in a single, centralised hub. Cloud servers are commonly used in cloud-based IoMT to do this. Nevertheless, it is challenging, if not unachievable, to have a big training dataset at a central area due to the decentralized architecture of the IoMT environment (i.e., institution, hospital, etc.). In the IoMT setting, the labelled data are typically shared throughout multiple organisations. Therefore, it is time-consuming and fraught with latency concerns to move these data to a centralised place. There is also the possibility that hospitals and clinics will not divulge patients’ personal health data. Because of this, radiographic image screening for COVID-19-like pandemics in IoMT is not something that can be accomplished with centralised DL [6].

In response, Google announced Federated Learning (FL) to facilitate a data-private collaborative learning solution. With FL, multiple coworkers can train a machine/deep learning network locally using their own data and then send their training updates to a centralised server, where they will be consolidated into a global model [7]. The global updates are then transmitted by the aggregation server to all participating universities for use and further education. Since the data are stored locally by the participating universities in the IoMT, their confidentiality is protected throughout the learning process. Fog computing, a middleware interface to cloud computing, was developed to move data storage and processing closer to the point of origin [8]. It is a paradigm shift that directs the development of new fields of study in content distribution, data dissemination, and service management. More promising IoMT applications can be achieved through fog computing by establishing interdependencies of network elements using existing practise and methodology [9]. In this model, fog servers located in different parts of the world handle the various computing and communication tasks. Thus, this paper recommends utilising FL and fog computing to strengthen IoMT capabilities for collaborative screening of COVID-19-like pandemics from radiography images [10].

Despite the numerous studies aimed at enhancing the AI method’s capability in screening COVID-19-like pandemics, the application of these techniques remains restricted, particularly for multi-site data, for the explanations that follow [11]. First, patients are anxious about the security of their data, and healthcare centres are afraid that sharing patients’ data may cause them to lose the trust of their patients. Concern that rival organisations would use their data for their own commercial or scholarly advantage is a common source of anxiety for healthcare organisations from a purely business perspective. Second, the medical images that come from a variety of institutions are often drawn from a wide variety of data distributions [12]. This fluctuation poses a risk to the training stability and places restrictions on the capacity to achieve the greatest screening performance.

This work was inspired by the recent achievement of deep learning in COVID-19 diagnosis, and it helps solve the problems listed above. First, this paper presents an FL framework that enables interactive screening of COVID-19-like diseases from medical imagery in fog-assisted IoMT. The concept of fog computing is presented to provide the training phase with more robust and convenient resources.

Second, multi-institutional samples are exploited to align the source domain with the target using a novel task-agnostic domain aligner that is regularized via a single discriminator that first checks the distribution of the source information with those of the target and later enforces the two domains to match each other, aiming to reduce the impact of domain shift in the global updates of the proposed FL framework. Third, to preserve the privacy of data through the domain adaption, an intelligent local differential privacy (LDP) mechanism is redesigned to protect the privacy of discriminator’s and generators’ information during the communication of learning updates between client side and server side. Finally, the effectiveness of the suggested framework was validated by empirical analysis and evaluation using a specific example of multi-source COVID-19 segmentation as a test subject. The results showed significant performance increases.

The remainder of this study is structured as follows: Related studies are overviewed in Section 2. The design of the proposed system is given Section 3. The proposed FL framework is discussed in Section 4. The experimental configurations are given in Section 5. Then, the discussion of empirical results is introduced in Section 6. Section 7 summarizes and concludes our work at the end of the manuscript.

2. Literature Review

In this part, a comprehensive assessment of the recent and relevant research literature for the radiographic image-based diagnosis of COVID-19-like pandemics is provided.

Deep learning for screening of COVID-19-like diseases

The recent literature contains many studies for screening COVID-19-like disease from medical images. These studies cover one or more tasks such as classification, segmentation, localization, reconstruction, etc.

To classify COVID-19 from other pneumonia, the authors of [13] presented an open-source framework, called CovidCTNet, that integrated multiple DL models to realize accurate classification from CT images. In a similar way, the authors of [14] presented a lightweight, approachable, professional DL model for classifying different types of infection from lung ultrasound images. The authors of [15] proposed three models (i.e., DenseNet, InceptionV3, and Inception-ResNetV4) for the diagnosis of different pneumonia from X-ray radiographs of patients. Moreover, the authors of [16] presented a multi-kernel-size spatial-channel attention technique for detecting COVID-19 from chest X-ray radiographs. The design of this method comprises three phases: (1) feature extraction; (2) pairing of parallel multi-kernel-size attention blocks to gather and learn the cross-channel and cross-spatial interrelatedness in manifold spaces using many convolutional kernels of diverse magnitudes to attain channel- and spatial-attentive feature maps; and (3) classification decision block.

For segmentation tasks, the authors of [17] developed an automated approach for segmenting respiratory parenchyma in CT radiographs and analyzing consistency characteristics from the segmented pulmonary parenchyma areas to support radiotherapists in COVID-19 diagnosis. To achieve this, the study presented a segmentation model that incorporates a 3D V-Net shape-deformation block based on a spatial transform network for pulmonary parenchyma segmentation from lung CT scans.

B.: Federated Learning for Domain Adaption

Recently, multiple research efforts have been devoted to promoting collaborative training for screening COVID-19-like pandemics from image data. The authors of [18] presented a data-driven FL framework, called Auto-FedAvg, to immediately learn the model aggregation updates from data making use of the Dirichlet distribution, in a way adaptive to the distribution of training data during learning. The authors of [19] presented an effective reinforcement learning (RL) mechanism for searching for the optimal hyperparameter of FL during training on multi-site medical images. The authors of [20] focused on managing both local and global drifts while also introducing a new harmonising framework, in which the authors proposed mitigating the local update drift by normalising the amplitudes of pictures that have been translated into the frequency domain to approximate a uniform imaging environment. Second, using the harmonised features, the authors built a client weight disturbance that leads each local model to a flat optimum. A smooth optimum is one in which a neighbourhood region surrounding the local optimal solution has a loss that is consistent over the entire neighbourhood. In addition, the authors of [21] proposed an FL approach that can perform inside and outside model personalization via an insubstantial-gradient method to make use of the local customized model by gathering global updates for general expertise and local updates for client-given optimization. Moreover, the authors of [22] identified and solved a novel issue setting known as federated domain generalization to enable learning a federated model from several remote source domains in such a way that it can directly generalise to domains that have not been encountered before. Episodic Learning in Continuous Frequency Space (ELCFS) was proposed to allow each client to leverage multi-source data distributions while adhering to the difficult restriction of data decentralisation.

To sum up, Table 1 presents an insightful conclusion about the literature studies discussed above.

3. System Design

This section describes the system model for screening of COVID-19-like pandemics from distributed multi-institutional medical images in IoMT. Expanding from centralized computing at the cloud, fog computing accommodates geographically dispersed fog servers so as to bring computational resources closer to participating institutions. Figure 1 depicts the proposed system model. Notably, the system model’s three main tiers each comprise a different set of devices. The three tiers are described as follows.

Edge tier: In IoMT, the edge tier comprises medical devices and instruments used to capture medical/patient information necessary for diagnosis. This includes implanted sensors, laptops, scanners, mobile devices, etc. These devices may have various quantities of data and various computational resources, capacities, power capabilities, acquisition techniques, and protocols leading to an many gaps. Some of these data might be private and cannot be revealed by medical institutions because of privacy concerns. Thus, these data are securely moved over the secure communication channel to a local fog server to be safely stored.
Fog tier: It acts as a bridge to connect cloud servers to edge devices, as it has more robust computational resources than the edge devices and is closer to the edge of the IoMT network. Hence, it can alleviate the resource-restriction issue of edge devices, while affording better communication efficiency than a cloud environment. In our IoMT system, the fog tier takes the responsibility of aggregating institutional patients’ data (e.g., CT or X-rays) from the neighbouring edge devices to be safely stored, annotated, and prepared without threatening the privacy of the data. Then, the fog servers also act as the training participants that host the local DL models that are collaboratively trained using their corresponding local institutional data. In each round of federated learning, the participating fog nodes upload their local parameters $W_{L}$ to the cloud for aggregation. Once the aggregation is complete, they receive the global parameters $W_{G}$ from the cloud and update the local parameters $W_{L}$ accordingly. To protect privacy, the fog servers apply differential privacy to the local parameters before uploading them.
Cloud tier: It contains the cloud server that acts as the training coordinator, sometimes called a parameter or aggregation server. It is primarily responsible for initializing global model parameters before training, broadcasting initial values to the local models $ℳ_{L}$ on the fog nodes, and selecting the training participants. More importantly, it also aggregates the local parameters/gradients $W_{L}$ from the participating fog nodes to calculate the new global parameters $W_{G}$ . It then broadcasts the global parameters back to the participants.

4. Methodology

In this section, we provide a comprehensive overview of the FL framework for cross-site screening of COVID-19 from CT scans at multiple institutions using IoMT, while protecting patients’ right to privacy. For the sake of brevity, we will use COVID-19 infection segmentation and/or quantification as an example of a typical use case for COVID-19 screening in this and subsequent sections. We explain how data privacy is established and how various data domains can be aligned in the next section.

4.1. Problem Formulation

Given matrix

D_{i}

representing the CT data held by site

i

,

N

sites

{S_{1}, . . . . S_{N}}

seek to train their DL models by combining their corresponding CT data

{D_{1}, . . . ., D_{N}}

. For COVID-19 analysis from CT scans, the number of scans at each site is insufficient to build and train an efficient DL approach. Traditionally, this issue could be solved by joining all data in a single site and employing

D = {D_{1} \cup D_{2} . . . . \cup D_{N}}

to train

ℳ_{B l e n d}

. Some of these CT data may be annotated, and others not. The input space

X

, ground-truth (GT) space

Y

, and sample-identity space

I

are the main constituents of the entire training data. For multi-site COVID-19-lesion segmentation,

D_{i}

consists of CT data aggregated from institution

S_{i}

, and X represents acquired CT features, which, with ground-truth

Y

, are used to segment and quantify the infection area. These datasets have a common feature space, yet their samples differ. For instance, heterogeneous sites have diverse patients. Nevertheless, the CT features are extracted and learned from the same pipeline. Consequently, the data distribution can be formulated as in (1),

X_{i} = X_{j}, Y_{i} = Y_{j}, I_{i} = I_{j} \forall D_{i} D_{j} i \neq j

(1)

which is a type of horizontal FL in which a variety of datasets have feature intersection and trivial sample-space intersection.

Rules and policies prohibit the sharing of data by clinical organizations. An FL scheme is a training schema in which institutions

S_{i}

cooperatively train a model

ℳ_{F E D}

without disclosing their data

D_{i}

. In our case, a centralized cloud server is used to compute the global model

ℳ_{G}

, while fog participants (i.e., from various institutions) employ the identical DL architecture for segmentation. The model is trained using the local data at each fog server, and the training parameters are regularly updated and transferred to the centralized server. The transmission of parameters between the fog nodes and the cloud coordinator server makes the model and data prone to parameter-reversal attacks. As soon as all parameters reach the server, the server performs average aggregation to compute the new global parameters

W_{G}

, then broadcasts them back to the local participants.

4.2. Privacy-Maintaining FL

The basic FL system has the phases of local upgrading and server communication in distributed optimization. This section presents the proposed privacy protection segmentation along with the steps of training, using Dice loss to train segmentation at any institution

n

,

L = 1 - \frac{2 \sum_{x_{i}} z_{i} \cdot y_{i}}{\sum_{x_{i}} z_{i} + \sum_{x_{i}} y_{i}},

(2)

where

z_{i}

and

y_{i}

are the model prediction and GT, respectively, for input

x_{i}

, and the training set is arbitrarily selected from the feature space

X_{n}

and GT space

Y_{n}

.

4.3. Local Differential Privacy

LDP [29,30] is a popular method for privacy protection in artificial intelligence (AI) and constitutes a robust standard for privacy assurance for data-driven AI solutions. LDP seeks to offer a limit ϵ such that adversaries might learn almost nothing more about a participant than if it were missing from the training data since the participant’s critical data are almost unrelated to the model’s outputs.

ϵ

signifies the level of privacy that can be managed by various parties. Several studies have tried to maintain DP at the data level in a centralized training scenario. To safeguard data from a reversal attack that exploits the local parameters

W_{L}

to deduce the training data, the DP was introduced to inject noise into the local parameters before sending them to the parameter server for aggregation. In this way, an adversary becomes unable to infer anything about the private patient data in the IoMT environment.

With a definite mapping method

f : D \to ℝ^{m}

, the sympathy

s_{h}

is the supreme absolute difference

{|| f (D) - f (D^{'}) ||}_{1}

, where

{|| D - D^{'} ||}_{1} = 1

, which represents a single element of variance separating

D

and

D^{'}

[14]. Herein, the

f

is a function that calculates the parameters

W

in the segmentation model. Presenting “noise” through the training procedure (i.e., input samples, parameters, or segmentation results) can restrain the granularity of common information and guarantee to realize the LDP for all

S \subseteq R a n g e (f)

as follows:

P r [f (D) \in S] \leq e^{ϵ} P r [f (D^{'}) \in S],

(3)

P r [f (D) \in S] \leq e^{ϵ} P r [f (D^{'}) \in S] + δ,

(4)

where δ is the failure probability and

ϵ

denotes the privacy budget. The lesser the value of δ, the tighter the distribution of the data production by

f

in both datasets. This work considers two LDP techniques to realize efficient privacy assurance [31] by incorporating noise in the shared network parameters.

First, Gaussian-based LDP [32] injects noise

N (0, s_{f}^{2} σ^{2})

with average

0

and

s_{f}^{2} σ^{2}

variance of a Gaussian distribution for the task

f (D)

as follows:

\tilde{f} (D) ≅ f (D) + N (0, s_{f}^{2} σ^{2})

(5)

where the sensitivity

s_{f}^{}

of the random function

f

is defined as follows:

Δ f = \max_{D, D^{'}} {|| f (D) - f (D^{'}) ||}_{1}

(6)

Hence, the random function

f

will satisfy the definition of LDP with

(ϵ, δ)

when

δ \geq \frac{4}{5} e x p (- {(σ ϵ)}^{2} / 2)

and

ϵ < 1

. This implies that the parameter of Gaussian noise

σ

can be associated with the privacy parameters

ϵ,

and

δ

.

Second, Laplace-based LDP [33] positions a Laplace distribution around zero, with scale

b

that represents the corresponding probability density function,

L a p (b) = L a p (x | b) = \frac{1}{2} e x p (- \frac{| x |}{b}) .

(7)

The Laplace variance is

σ^{2} = 2 b^{2}

. This technique includes noise

L a p (s_{f} / ϵ)

in

f (D)

with universal sympathy

s_{f}

, and conserves LDP with

(ϵ, 0)

. Hence, we associate the parameter

b

with the privacy parameter

ϵ

. In the COVID-19 segmentation scenario, f is implemented using the DL segmentation network, and the sympathy

s_{f}

cannot be calculated; hence we set

s_{f}

to 1.

To sum up, the notion behind LDP techniques is to combine a specific quantity of noise in the query output while maintaining the value of the original gradient. This noise is typically determined based on the privacy arguments

(ϵ, δ)

. Therefore, it regulates the parameters according to privacy obligations by associating the privacy and noise parameters.

4.4. Domain Adaption for Multi-Site FL-Based Screening

Despite the improved, efficient privacy realized by FL, there is an extra challenge in segmenting COVID-19 infections from CT scans that have heterogeneous data presented in the IoMT system, which causes a domain shift between institutions [34]. The key notion is that DA techniques can improve the segmentation performance of multi-site data in the IoMT and remain efficient even if noise is applied for privacy protection, particularly for institutions whose data distributions are completely different from those of other institutions. The following section presents the proposed DA technique, in which adaptation is performed at the level of learned knowledge representation.

4.5. Federated Generative Domain Adaption (FGDA)

CT scans are institutionally maintained in FL setups in order to safeguard patients’ right to confidentiality. In order to meet the requirements of the DA challenge, we will attempt to generalise from disparate source domains to the common domain space of target data. It is impossible to train a specialised segmentation model that can use both the source domain and the target domain simultaneously because an IoMT system constrains the exchange of data. These restrictions make it impossible to use both domains at the same time. We solve this problem by employing an adversarial alignment methodology [35,36,37] with two components, namely an institutional domain-relevant feature extractor and a universal discriminator, by means of segmentation models and by separating optimization into two stages that are independent of one another. Then the institutional extractor

G_{s}

is trained for the source institution

D_{s}

, and the institutional extractor

G_{t}

for the target institution

D_{t}

. For every pair (

D_{s}

,

D_{t}

), an adversarial domain discriminator

D

is trained to align the distributions. Once these are trained to recognize the domain from which the features originated, the discriminator

D

is confused by feature generators (

G_{s}

,

G_{t}

). Hence, in the situation of privacy preservation,

D

can use the noisy output representations generated from

G_{s}

and

G_{t}

with no leakage of private CT data. The discriminators of source domains receive inputs

M \circ G_{t} (x^{t})

and

M \circ G_{s} (x^{s})

, where

M (\cdot)

is the noise producer (See Figure 2). Thus, data penetration at the target institution is prohibited during discriminator training at the source institution. With

X^{S} and X^{T}

indicative of the source and domain data, correspondingly, the source domain can be perceived from the others using the following formula:

L_{D i s c r i m i n a t o r} (X^{S}, X^{T}, G_{s}, G_{t}) = - E_{x^{s} ~ X^{S}} [l o g (D_{s} (G_{s} (X^{s})))] - E_{x^{t} ~ X^{T}} [l o g (1 - D_{s} (M \circ G_{t} (X^{t})))] .

(8)

L_{Discriminator}

is kept constant, and

L_{Generator}

is updated as follows:

L_{G e n e r a t o r} (X^{S}, X^{T}, G_{s}, G_{t}) = - E_{x^{s} ~ X^{S}} [l o g (D_{s} (G_{s} (X^{s})))] - E_{x^{t} ~ X^{T}} [l o g (D_{s} (M \circ G_{t} (X^{t})))]

(9)

Once the training of the FL model is combined with an alignment component, the divergence between the source and target domains can be reduced. Algorithm 1 presents the implementation of FGDA.

Algorithm 1: FGDA
Input: (1) $X = {X_{1}, . . ., X_{N}}$ , COVID-19 CT data from $N$ institutions; (2) $G_{w_{G}} = {G_{w_{G_{1}}}, . . ., G_{w_{G_{N}}}}$ $represents institutional generators at N sites, with parameters w_{G_{i}}$ ; (3) ${S G}_{w_{S}} = {{S G}_{w_{s_{1}}}, . . ., {S G}_{w_{s_{N}}}}$ $represents segmentation models with parameters w_{S_{i}}$ ; (4) $D_{w_{D}} = {D_{w_{D_{1}}}, . . ., D_{w_{D_{N}}}}$ $represents discriminators at N sites, with parameters θ_{D_{i}}$ ; (5) $Y = {Y_{1}, . . ., Y_{N}}$ represents COVID-19 infection GTs; (6) M(·) is the noise originator; (7) K is the number of optimization phases; (8) $τ$ is a universal updating step, i.e., for every optimization phase, the universal and institutional parameters are communicated in $τ$ steps; (9) ${G_{{\bar{w}}_{1}}, C_{w_{c}}}$ is the universal model.
1:	$Initialize parameters {w_{G}, w_{C}, w_{D}}$
2:	$for k = 1$ to $K$ do:
3:	t = 0
4:	$for i = 1, j = 1$ to $N$ do:
5:	${(X_{i}^{S}, Y_{i}^{S})}_{i = 1}^{N}$ sample from source institution
6:	${(X_{j}^{T})}_{j = 1}^{N}$ sample from target institution
7	$Calculate gradient with Dice loss (Equation (2)) to update w_{G_{i}}^{(k)} {and w}_{S_{i}}^{(k)}$
8	Domain Adaption:
9	$Upgrade w_{D_{i}}^{(k)}$ , { $w_{G_{i}}^{(k)}, w_{G_{j}}^{(k)}$ } with Equation (8) and Equation (8), respectively
10:	Terminate for loop
11:	$t \leftarrow t + 1$
12:	If $t % τ = 0$ then:
13:	${\bar{w}}_{G}^{(k)} = \frac{1}{N} \sum_{N}^{} (w_{G_{i}}^{(k)} + M (w_{G_{i}}^{(k)}))$
14:	${\bar{w}}_{S}^{(k)} = \frac{1}{N} \sum_{N}^{} (w_{S_{i}}^{(k)} + M (w_{S_{i}}^{(k)}))$ universal parameter upgrade after $τ$ steps
15:	for n = 1 to N do:
16:	$w_{G}^{(k)} = {\bar{w}}_{G}^{(k)}$
17:	$w_{S}^{(k)} = {\bar{w}}_{S}^{(k)}$
18:	Terminate for loop
19:	Terminate if
20:	Terminate for loop
21:	$Return : universal model {G_{{\bar{θ}}_{G}},_{} {S G}_{{\bar{θ}}_{S}}}$ .

5. Experiments and Analysis

5.1. Dataset Description

We assessed the performance of the proposed framework on heterogeneous multi-source data using three publicly available COVID-19 CT datasets: (1) 829 slices from nine CT volumes with 372 infected slices, which we refer to as Institution 1; (2) COVID-19-CT-Seg [38], comprising 20 public COVID-19 CT volumes from the Coronacases Initiative and Radiopaedia, with more than 1,800 annotated slices, which we refer to as Institution 2; and (3) a 50 CT volume published in the MosMedData dataset [39], aggregated from municipal hospitals in Moscow, Russia, which we refer to as Institution 3.

5.2. Evaluation Metrics

We assess the segmentation performance of the proposed model according to three performance indicators: the Dice similarity coefficient (DSC), normalized surface distance (NSD), and Jaccard Similarity (JS). The mathematical definition of the above indicators is given as follow:

\begin{matrix} DSC = \frac{2 * T P}{(F P + T P) + (T P + F N)} \end{matrix}

(10)

\begin{matrix} JS = \frac{T P}{F P + T P + F N} \end{matrix}

(11)

NSD = \frac{2 | \partial S \cap B_{\partial S}^{τ} | + 2 | \partial S \cap B_{\partial G}^{τ} |}{| \partial S | + | \partial G |}

(12)

In the last formula, the symbols

B_{\partial S}^{τ} and B_{\partial G}^{τ}

represent the boundary area of segmentation maps and GT, respectively.

5.3. Implementation Setup

All the simulation experiments were performed on a Dell server equipped with NVIDIA Quadro GPU, Intel (R) Xeon (R) CPU E5-2670 0@ 2.60 GHz CPU (32 processors), 256 GB memory, and Windows 10 64-bit operating system. The implementation of the FL framework was coded with the PySyft and Pytorch libraries running on Python 3.7. The standard BiSeNet [40] was employed as the segmentation network. The output of the BiSeNet was a segmentation map indicating the COVID-19-infection regions. Dice loss was used as the cost function, and 5-fold cross-validation was adopted for training. The Adam optimizer was applied to the segmentation network. The learning rate of 1 × 10⁻³ was decreased by 0.5 after every 25 epochs and stabilized after the 75th epoch. Through all epochs, an institutional upgrade was carried out several times, depending on the interaction stride

R

, to replace a one-time upgrade, and 100 steps were used per epoch.

5.4. Numerical Results

To validate the efficiency and effectiveness of the proposed FL framework (FL-Cov) at fine-tuning the COVID-19 segmentation performance of multi-institutional CT data in IoMT, we compared it (with

τ = 15

and

α = 0.01)

to four non-federated learning schemas. The segmentation network was trained on each institutional dataset (Independent). The segmentation model was used on samples from one institution and tested on CT data from another institution (Cross). All data from different institutions were aggregated in a single location and used for training and testing (Blend). An ensemble DL framework was constructed using private models at diverse institutions (Ensemble). The Ensemble schema took the mean of outcomes of the independent schema trained inside the institution and a Cross schema trained using data from different institutions. The Independent and Cross schemas usually protect the privacy of CT data but cannot include data from other institutions. The Blend schema utilizes all CT data from various institutions but cannot ensure data privacy. The segmentation performance of the Blend schema is expected to improve compared to FL-Cov, as it uses a complete set of CT data.

To achieve a fair comparison, we selected the optimal parameters for diverse learning schemas by changing the network as little as possible. Considering the heterogeneity of the data distribution, we attempted to employ the previously discussed DA methods to fine-tune the segmentation performance of FL-Cov as discussed in the FGDA schemes. As a consequence of applying randomization techniques to FL-Cov, the generated feature representation distorted by Gaussian noise

ε_{n} (0, 0.01 σ)

was used as the input to discriminators

D

. The universal network was the concatenation of the generator

G

and segmentation network. The parameters of the private

G and C

are only shared with the universal network. We first trained the

G and S G

models for 10 epochs and broadcast the generative loss (Equation (8)). The entire framework was then trained using the settings employed for FL-Cov.

The segmentation results are presented in Figure 3, where the training data used in the Cross schema are written as “Cross-<dataset site>.” The test result on the training dataset is not reported, as all data from this site were used for training. When testing CT scans from the remaining institutions, we do not report a standard deviation (std) for Cross learning experiments. The segmentation results for the remaining schemas are stated in the form of “mean (std).” According to the comparison of the average DSC and NSD, we spotlight the optimal DSC and NSD scores in Figure 3. It can be noted that the Cross-learning schema realizes higher DSC and NSD values than the independent schema with less training data. This is obvious for training on Institution 1 data. When Cross-training is performed on Institution 3 data, the performance is improved by 1.3% and 0.7% over the independent learning schema on Institution 1 and Institution 2, respectively. DSC and NSD of the Ensemble schema do not improve over segmentation results of the independent schema, possibly because it might not take advantage of decisions taken by a variety of segmentation networks (i.e., private models), which impairs its predictive capability.

With respect to FL-Cov, average DSCs show great improvements of 2% for Institution 1, 2% for Institution 2, and 3.5% for Institution 3 over the Cross-learning schema for every institution, and DSC improvements of 2–5% over the Independent and Ensemble schemas. Similarly, the average NSDs achieved by FL-Cov present significant improvements (2.1% for Institution 1, 2.5% for Institution 2, 3.25% for Institution 3) over Ensemble learning. NSD of FL-Cov was greatly improved (2–3%) over the Independent and Ensemble schemas. Compared to FL-Cov, FGDA improved DSC on Institution 1 and Institution 2 by 1.7% (p-value: 0.04416) and 0.7% (p-value: 0.02227), respectively, and reduced the DSC of Institution 3 by 1%. Similarly, the NSD of FGDA was improved by 1.9% (p-value: 0.03562) and 0.2% (p-value: 0.01463) on Institution 2 and Institution 3, respectively, but decreased by 1.5%. Compared to Ensemble settings, FGDA improved DSC by 3.4% (p-value: 0.04085) on Institution 1, and realized equal and less DSC on Institution 1 and Institution 2, respectively. Similarly, the FGDA improved the NSD by 1% on both Institution 1 (p-value: 0.03735) and Institution 2 (p-value: 0.03678) and was comparable on Institution 3 when compared to FL-Cov. The observed behaviour of the DA technique might depend on the data distribution at different institutions.

5.5. Comparative Analysis

To validate the efficiency of the proposed framework, a set of fair experimental comparisons were made between the proposed solution and cutting-edge approaches. The experimental comparisons on each institutional COVID-19 dataset were performed and the corresponding results are given in Table 2. It is notable that the proposed FGDA can overcome all the competing methods with considerable margins (DSC: 1.34, NSD: 1.7, JS: 1.43) in all performance metrics in the case of Institution 1. Similarly, the proposed FGDA can overcome rall the competing methods with considerable margins (DSC: 1.27, NSD: 2.07, JS: 2.29) in all performance metrics in the case of Institution 2. Moreover, the proposed FGDA can overcome all the competing methods with considerable margins (DSC: 1.12, NSD:1.28, JS:2.14) in all performance metrics in the case of Institution 3.

To validate the robustness of the proposed FGDA against competing learning approaches, the statistical paired t-test was performed with a threshold of p-value < 0.05 (Confidence Interval: 95%). Scipy, which is a public scientific tool [41], was employed to perform this experiment, and the resultants p-values are presented in Table 2. The p-value < 0.05 means that the results from FGDA are statistically significant. It is notable that almost all p-values < 0.05 This further demonstrates the efficiency of the FGDA and supports our findings in the comparative analysis.

5.6. Privacy Analysis

This section discusses and evaluates the impact of the privacy budget on the performance of the proposed framework. In Figure 4, we plot the relationship between the privacy budget of Gaussian DP and segmentation performance. Similarly, in Figure 5, we plot the relationship between the privacy budget of Laplace DP and segmentation performance. It is obvious that there is a significant tradeoff between the performance and privacy of the model, and it can be noted that the performance considerably decreased after the privacy budget exceeded 0.001. This tradeoff necessitates a careful choice of the amount of injected noise during training, which might negatively affect the domain-adaption process.

5.7. Convergence Analysis

This subsection analyses the impact of the synchronisation stride

τ

on segmentation performance. To determine the optimal value

τ

, the performance of the proposed FGDA was evaluated under different values of

τ

and the segmentation results are presented in Figure 6. It is worth noting that the performance starts convergence after

τ = 20

, which indicates rapid convergence. This also indicates less communication overhead during the training, as little server–fog interaction is required.

5.8. Stability Analysis

The previous experiments only explored an IoMT federated application that trained ten participants. However, real-world IoMT applications often include thousands of nodes. Experiments were performed to investigate the scalability of the proposed framework. Figure 7 shows the relationship between segmentation performance and the number of fog nodes, ranging from three to 15. It can be seen that when more participants are engaged in federated training, the segmentation performance improves, then stabilizes after the number of fog nodes reaches ten participants. This further explains the ability of the proposed framework to be efficiently trained and applied in a large-scale IoMT environment.

6. Limitations

Though the experimental trials demonstrate that the communication stride, used to regulate the number of times the parameters of the private and global network are upgraded, did not influence the segmentation performance, it is difficult to conclude that the stride parameter is unrelated. Moreover, an extra number of stride values must be scrutinized based on the application. Also, we employed applied methods to study and analyse different LDP techniques (i.e., Gaussian and Laplacian). Nevertheless, the sympathy of the mapping function

f : D \to R^{m}

(i.e., the BiSeNet in our situation) is difficult to estimate. Thus, we could not specify the limit. Recent literature [23,24,25,26,27,28,29,30] has established that Laplace and Gaussian noise higher than a particular degree might offer perfect protection from data recovery invasion. Based on certain datasets or applications, it is possible to experimentally approximate an appropriate noise degree according to the attacking viewpoint.

Approaches such as a forest of randomization trees can be used to further improve performance in the Ensemble schema. According to the system model, a central coordinator is responsible for aggregating the local parameters, which makes the overall system vulnerable to a single point of failure and server-side attacks. The number of FL contributors could have an influential role, particularly when an averaging policy is employed to upgrade the global model. Integrating a further-improved network-election mechanism and incentive-updating policy might help to avoid the inclusion of erroneous private networks in updating procedures.

7. Conclusions and Future Work

To conclude, this study takes advantage of fog and cloud computing to design a collaborative FL framework for screening COVID-19-like pandemics from multi-institutional medical images in IoMT networks. The privacy of local parameters is protected using local differential privacy. A domain adaption method (i.e., FGDA) is introduced to tackle the problem of heterogeneous data distribution of the IoMT system. FGDA lessened the impact of heterogeneity and showed acceptable performance. The results showed that FGDA extensively improved segmentation performance on multi-institutional data. The FGDA could be generalized as a medical imaging tool to screen COVID-19-like pandemics.

In future work, several directions can be explored to realize improvements: (1) experiment with the proposed framework on different medical images and diseases; (2) extend the framework to semi-supervised training to enable learning from unlabeled CT scans, and (3) using a decentralized blockchain ledger to improve the security and privacy of federated training.

Author Contributions

Conceptualization, M.A.-B. and H.H.; methodology, M.A.-B. and H.H.; software, M.A-B. and H.H.; validation, M.A.-B., H.H. and M.A.; formal analysis, M.A.-B., H.H. and M.A.; investigation, M.A.-B. and M.A.; resources, M.A.-B., M.A. and H.H.; data curation, M.A.-B., H.H. and M.A.; writing—original draft preparation, M.A.-B. and H.H.; writing—review and editing, M.A.-B., H.H. and M.A.; visualization, M.A.-B., M.A. and H.H.; supervision, M.A.-B.; project administration, M.A.-B., H.H. and M.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Peng, Y.; Tang, Y.; Lee, S.; Zhu, Y.; Summers, R.M.; Lu, Z. COVID-19-CT-CXR: A Freely Accessible and Weakly Labeled Chest X-Ray and CT Image Collection on COVID-19 From Biomedical Literature. IEEE Trans. Big Data 2021, 7, 3–12. [Google Scholar] [CrossRef]
Rahman, A.; Hossain, M.S.; Alrajeh, N.A.; Alsolami, F. Adversarial Examples—Security Threats to COVID-19 Deep Learning Systems in Medical IoT Devices. IEEE Internet Things J. 2021, 8, 9603–9610. [Google Scholar] [CrossRef]
Mansour, R.F.; Escorcia-Gutierrez, J.; Gamarra, M.; Gupta, D.; Castillo, O.; Kumar, S. Unsupervised Deep Learning based Variational Autoencoder Model for COVID-19 Diagnosis and Classification. Pattern Recognit. Lett. 2021, 151, 267–274. [Google Scholar] [CrossRef]
Abbas, A.; Abdelsamea, M.M.; Gaber, M.M. Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. Appl. Intell. 2021, 51, 854–864. [Google Scholar] [CrossRef]
Zhong, A.; Li, X.; Wu, D.; Ren, H.; Kim, K.; Kim, Y.; Buch, V.; Neumark, N.; Bizzo, B.; Tak, W.Y.; et al. Deep metric learning-based image retrieval system for chest radiograph and its clinical applications in COVID-19. Med. Image Anal. 2021, 70, 101993. [Google Scholar] [CrossRef]
Zhang, C.; Xie, Y.; Bai, H.; Yu, B.; Li, W.; Gao, Y. A survey on federated learning. Knowl. Based Syst. 2021, 216, 106775. [Google Scholar] [CrossRef]
Li, Q.; Wen, Z.; Wu, Z.; Hu, S.; Wang, N.; Li, Y.; Liu, X.; He, B. A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection. IEEE Trans. Knowl. Data Eng. 2021. [Google Scholar] [CrossRef]
Imteaj, A.; Thakker, U.; Wang, S.; Li, J.; Amini, M.H. A Survey on Federated Learning for Resource-Constrained IoT Devices. IEEE Internet Things J. 2022, 9, 1–24. [Google Scholar] [CrossRef]
Rahman, S.A.; Tout, H.; Ould-Slimane, H.; Mourad, A.; Talhi, C.; Guizani, M. A Survey on Federated Learning: The Journey From Centralized to Distributed On-Site Learning and Beyond. IEEE Internet Things J. 2020, 8, 5476–5497. [Google Scholar] [CrossRef]
Aledhari, M.; Razzak, R.; Parizi, R.M.; Saeed, F. Federated Learning: A Survey on Enabling Technologies, Protocols, and Applications. IEEE Access 2021, 8, 140699–140725. [Google Scholar] [CrossRef]
Yin, X.; Zhu, Y.; Hu, J. A Comprehensive Survey of Privacy-preserving Federated Learning. ACM Comput. Surv. 2021, 54, 1–36. [Google Scholar] [CrossRef]
Nguyen, D.C.; Ding, M.; Pathirana, P.N.; Seneviratne, A.; Li, J.; Poor, H.V. Federated Learning for Internet of Things: A Comprehensive Survey. IEEE Commun. Surv. Tutor. 2021, 23, 1622–1658. [Google Scholar] [CrossRef]
Javaheri, T.; Homayounfar, M.; Amoozgar, Z.; Reiazi, R.; Homayounieh, F.; Abbas, E.; Laali, A.; Radmard, A.R.; Gharib, M.H.; Mousavi, S.A.J.; et al. CovidCTNet: An open-source deep learning approach to diagnose COVID-19 using small cohort of CT images. NPJ Digit. Med. 2021, 4, 29. [Google Scholar] [CrossRef]
Awasthi, N.; Dayal, A.; Cenkeramaddi, L.R.; Yalavarthy, P.K. Mini-COVIDNet: Efficient Lightweight Deep Neural Network for Ultrasound Based Point-of-Care Detection of COVID-19. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2021, 68, 2023–2037. [Google Scholar] [CrossRef]
Albahli, S.; Ayub, N.; Shiraz, M. Coronavirus disease (COVID-19) detection using X-ray images and enhanced DenseNet. Appl. Soft Comput. 2021, 110, 107645. [Google Scholar] [CrossRef]
Fan, Y.; Liu, J.; Yao, R.; Yuan, X. COVID-19 Detection from X-ray Images using Multi-Kernel-Size Spatial-Channel Attention Network. Pattern Recognit. 2021, 119, 108055. [Google Scholar] [CrossRef]
Zhao, C.; Xu, Y.; He, Z.; Tang, J.; Zhang, Y.; Han, J.; Shi, Y.; Zhou, W. Lung segmentation and automatic detection of COVID-19 using radiomic features from chest CT images. Pattern Recognit. 2021, 119, 108071. [Google Scholar] [CrossRef]
Xia, Y.; Yang, D.; Li, W.; Myronenko, A.; Xu, D.; Obinata, H.; Mori, H.; An, P.; Harmon, S.; Turkbey, E.; et al. Auto-FedAvg: Learnable Federated Averaging for Multi-Institutional Medical Image Segmentation. Available online: http://arxiv.org/abs/2104.10195 (accessed on 15 April 2021).
Guo, P.; Yang, D.; Hatamizadeh, A.; Xu, A.; Xu, Z.; Li, W.; Zhao, C.; Xu, D.; Harmon, S.; Turkbey, E.; et al. Auto-FedRL: Federated Hyperparameter Optimization for Multi-Institutional Medical Image Segmentation. Available online: http://arxiv.org/abs/2203.06338 (accessed on 15 March 2022).
Jiang, M.; Wang, Z.; Dou, Q. HarmoFL: Harmonizing Local and Global Drifts in Federated Learning on Heterogeneous Medical Images. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 28 February–1 March 2022; Volume 36, pp. 1087–1095. [Google Scholar] [CrossRef]
Jiang, M.; Yang, H.; Cheng, C.; Dou, Q. IOP-FL: Inside-Outside Personalization for Federated Medical Image Segmentation. Available online: http://arxiv.org/abs/2204.08467 (accessed on 15 April 2022).
Liu, Q.; Chen, C.; Qin, J.; Dou, Q.; Heng, P.-A. FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space. Available online: http://arxiv.org/abs/2103.06030 (accessed on 15 March 2021).
Li, X.; Gu, Y.; Dvornek, N.; Staib, L.H.; Ventola, P.; Duncan, J.S. Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results. Med. Image Anal. 2020, 65, 101765. [Google Scholar] [CrossRef]
Han, T.; Gong, X.; Feng, F.; Zhang, J.; Sun, Z.; Zhang, Y. Privacy-preserving multi-source domain adaptation for medical data. IEEE J. Biomed. Health Inform. 2022. [Google Scholar] [CrossRef]
Zhang, J.; Wan, P.; Zhang, D. Transport-Based Joint Distribution Alignment for Multi-site Autism Spectrum Disorder Diagnosis Using Resting-State fMRI. In International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer: Cham, Switzerland, 2020; pp. 444–453. [Google Scholar] [CrossRef]
Yang, D.; Xu, Z.; Li, W.; Myronenko, A.; Roth, H.R.; Harmon, S.; Xu, S.; Turkbey, B.; Turkbey, E.; Wang, X.; et al. Federated semi-supervised learning for COVID region segmentation in chest CT using multi-national data from China, Italy, Japan. Med. Image Anal. 2021, 70, 101992. [Google Scholar] [CrossRef]
Wang, Z.; Liu, Q.; Dou, Q. Contrastive Cross-Site Learning with Redesigned Net for COVID-19 CT Classification. IEEE J. Biomed. Health Inform. 2020, 24, 2806–2813. [Google Scholar] [CrossRef] [PubMed]
Ahmed, S.M.; Raychaudhuri, D.S.; Paul, S.; Oymak, S.; Roy-Chowdhury, A.K. Unsupervised Multi-source Domain Adaptation without Access to Source Data. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 10103–10112. [Google Scholar] [CrossRef]
Rodríguez-Barroso, N.; Stipcich, G.; Jiménez-López, D.; Ruiz-Millán, J.A.; Martínez-Cámara, E.; González-Seco, G.; Luzón, M.V.; Veganzones, M.A.; Herrera, F. Federated Learning and Differential Privacy: Software tools analysis, the Sherpa.ai FL framework and methodological guidelines for preserving data privacy. Inf. Fusion 2020, 64, 270–292. [Google Scholar] [CrossRef]
Wei, K.; Li, J.; Ding, M.; Ma, C.; Yang, H.H.; Farokhi, F.; Jin, S.; Quek, T.Q.S.; Poor, H.V. Federated Learning With Differential Privacy: Algorithms and Performance Analysis. IEEE Trans. Inf. Forensics Secur. 2020, 15, 3454–3469. [Google Scholar] [CrossRef] [Green Version]
Chaudhuri, K.; Imola, J.; Machanavajjhala, A. Capacity bounded differential privacy. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019; pp. 3474–3483. [Google Scholar]
Dwork, C.; Roth, A. The Algorithmic Foundations of Differential Privacy. Found. Trends® Theor. Comput. Sci. 2013, 9, 211–407. [Google Scholar] [CrossRef]
Dwork, C.; McSherry, F.; Nissim, K.; Smith, A. Calibrating Noise to Sensitivity in Private Data Analysis. In Proceedings of the Theory of Cryptography Conference, Chicago, IL, USA, 7–10 November 2006; pp. 265–284. [Google Scholar] [CrossRef] [Green Version]
Sheller, M.J.; Reina, G.A.; Edwards, B.; Martin, J.; Bakas, S. Multi-institutional deep learning modeling without sharing patient data: A feasibility study on brain tumor segmentation. In International MICCAI Brainlesion Workshop; Springer: Cham, Switzerland, 2018; Volume 11383, pp. 92–104. [Google Scholar] [CrossRef]
Hong, H.; Li, X.; Pan, Y.; Tsang, I.W. Domain-Adversarial Network Alignment. IEEE Trans. Knowl. Data Eng. 2022, 34, 3211–3224. [Google Scholar] [CrossRef]
Zuo, Y.; Yao, H.; Zhuang, L.; Xu, C. Margin-Based Adversarial Joint Alignment Domain Adaptation. IEEE Trans. Circuits Syst. Video Technol. 2021, 32, 2057–2067. [Google Scholar] [CrossRef]
Yang, J.; Xu, R.; Li, R.; Qi, X.; Shen, X.; Li, G.; Lin, L. An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation. In Proceedings of the AAAI 2020—34th AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; pp. 12613–12620. [Google Scholar] [CrossRef]
Ma, J.; Wang, Y.; An, X.; Ge, C.; Yu, Z.; Chen, J.; Zhu, Q.; Dong, G.; He, J.; He, Z.; et al. Toward data-efficient learning: A benchmark for COVID-19 CT lung and infection segmentation. Med. Phys. 2021, 48, 1197–1210. [Google Scholar] [CrossRef]
Morozov, S.P.; Andreychenko, A.E.; Blokhin, I.A.; Gelezhe, P.B.; Gonchar, A.P.; Nikolaev, A.E.; Pavlov, N.A.; Chernina, V.Y.; Gombolevskiy, V.A. MosMedData: Data set of 1110 chest CT scans performed during the COVID-19 epidemic. Digit. Diagn. 2020, 1, 49–59. [Google Scholar] [CrossRef]
Yu, C.; Wang, J.; Peng, C.; Gao, C.; Yu, G.; Sang, N. BiSeNet: Bilateral segmentation network for real-time semantic segmentation. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Springer: Berlin/Heidelberg, Germany, 2018; Volume 11217, pp. 334–349. [Google Scholar] [CrossRef] [Green Version]
Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0 Contributors. SciPy 1.0 Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]

Figure 1. Systematic diagram for the proposed federated system for screening of COVID-19-like pandemics from multi-institutional medical images in IoMT.

Figure 2. Illustration of the domain alignment in the FGDA.

Figure 3. Illustration of quantitative results from comparison of different training scenarios in terms of DSC(upper), NSD (middle), and JS (bottom).

Figure 4. Impact of Gaussian DP on the segmentation performance in terms of DSC (left), NSD (middle), and JS (right).

Figure 5. Impact of Laplace privacy budget on performance in terms of DSC (left), NSD (middle), and JS (right).

Figure 6. Impact of communication rounds on segmentation performance in terms of DSC (left), NSD (middle), and JS (right).

Figure 7. Stability analysis of the proposed framework under different numbers of participating nodes.

Table 1. Summary of the related works.

Methods	Task	COVID-like	Data	Multi-Domain	Privacy	Unlabeled Target	Federated	Source
Auto-FedAvg [18]	Segmentation	✓	CT	✓	✗	✗	✓	Free
HarmoFL [20]	Segmentation	✗	MRI	✓	✗	✗	✓	Exist
IOP-FL [21]	Segmentation	✗	MRI	✓	✗	✗	✓	Exist
FedDG [22]	Segmentation	✗	MRI	✓	✗	✗	✓	Exist
Auto-FedAvg [18]	Segmentation	✓	CT	✓	✗	✗	✓	Free
Fed MoE [23]	Classification	✗	fMRI	✓	✓	✗	✓	Exist
Fed Align [23]	Classification	✗	fMRI	✓	✓	✗	✓	Exist
MS-SFDA [24]	Classification	✗	fMRI	✓	✗	✓	✗	Free
TMJDA [25]	Classification	✗	fMRI	✓	✗	✗	✗	Exist
FED [26]	Segmentation	✓	CT	✗	✗	✗	✓	Exist
COVID-Net [27]	Classification	✓	CT	✗	✗	✗	✗	Exist
DECISION [28]	Classification	✗	RGB	✓	✗	✓	✗	Exist

Table 2. Statistical significance of proposed FGDA against other learning schemas using DSC and NSD measures.

	Institution 1				Institution 2				Institution 3
	DSC	NSD	JS	p-Value	DSC	NSD	JS	p-Value	DSC	NSD	JS	p-Value
Fed MoE [23]	86.14	82.28	89.11	1.87 × $10^{- 7}$	86.17	86.44	89.34	1.34 × $10^{- 8}$	88.96	89.36	91.52	3.46 × $10^{- 9}$
MS-SFDA [24]	86.36	82.38	87.91	1.34 × $10^{- 2}$	86.19	86.54	88.93	2.06 × $10^{- 3}$	89.60	87.71	91.46	7.09 × $10^{- 10}$
Fed Align [23]	87.46	82.51	89.34	6.11 × $10^{- 6}$	87.56	85.69	89.41	1.27 × $10^{- 6}$	88.02	88.63	93.58	2.88 × $10^{- 3}$
FED [26]	84.98	81.07	85.40	3.73 × $10^{- 9}$	85.06	84.12	84.52	5.41 × $10^{- 7}$	84.19	82.89	81.60	6.33 × $10^{- 11}$
COVID-Net [27]	86.58	83.22	89.24	4.56 × $10^{- 7}$	86.43	86.07	89.44	3.19 × $10^{- 9}$	87.03	89.46	93.14	7.86 × $10^{- 5}$
FGDA	88.8	84.92	90.77	/	88.83	88.61	91.73	/	90.72	89.91	93.66	/

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Abdel-Basset, M.; Hawash, H.; Abouhawwash, M. Collaborative Screening of COVID-19-like Disease from Multi-Institutional Radiographs: A Federated Learning Approach. Mathematics 2022, 10, 4766. https://doi.org/10.3390/math10244766

AMA Style

Abdel-Basset M, Hawash H, Abouhawwash M. Collaborative Screening of COVID-19-like Disease from Multi-Institutional Radiographs: A Federated Learning Approach. Mathematics. 2022; 10(24):4766. https://doi.org/10.3390/math10244766

Chicago/Turabian Style

Abdel-Basset, Mohamed, Hossam Hawash, and Mohamed Abouhawwash. 2022. "Collaborative Screening of COVID-19-like Disease from Multi-Institutional Radiographs: A Federated Learning Approach" Mathematics 10, no. 24: 4766. https://doi.org/10.3390/math10244766

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Collaborative Screening of COVID-19-like Disease from Multi-Institutional Radiographs: A Federated Learning Approach

Abstract

1. Introduction

2. Literature Review

3. System Design

4. Methodology

4.1. Problem Formulation

4.2. Privacy-Maintaining FL

4.3. Local Differential Privacy

4.4. Domain Adaption for Multi-Site FL-Based Screening

4.5. Federated Generative Domain Adaption (FGDA)

5. Experiments and Analysis

5.1. Dataset Description

5.2. Evaluation Metrics

5.3. Implementation Setup

5.4. Numerical Results

5.5. Comparative Analysis

5.6. Privacy Analysis

5.7. Convergence Analysis

5.8. Stability Analysis

6. Limitations

7. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI