Performance Analysis of Adopting FSO Technology for Wireless Data Center Network

AlGhadhban, Amer; Abdulhussain, Sadiq H.; Alazmi, Meshari; Almalaq, Abdulaziz

doi:10.3390/systems11090482

Open AccessArticle

Performance Analysis of Adopting FSO Technology for Wireless Data Center Network

¹

Electrical Engineering Department, College of Engineering, University of Ha’il, Ha’il 55476, Saudi Arabia

²

Department of Computer Engineering, University of Baghdad, Al-Jadriya, Baghdad 10071, Iraq

³

Computer Science Department, College of Computer Science and Engineering, University of Ha’il, Ha’il 55476, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Systems 2023, 11(9), 482; https://doi.org/10.3390/systems11090482

Submission received: 6 May 2023 / Revised: 29 August 2023 / Accepted: 12 September 2023 / Published: 20 September 2023

Download

Browse Figures

Versions Notes

Abstract

:

Free Space Optical Communication (FSO) is a promising technology to address wired Data Center Network (DCN) challenges like power consumption, low scalability and flexibility, congestion and cabling. Scholars have developed indirect line-of-sight (LoS) FSO schemes by reflecting the FSO beams via switchable mirrors. These schemes have introduced extra overhead delay to establish indirect LoS links, defined herein as the rack-to-rack FSO link setup process. The purpose of this work is to study and model this setup process with the consideration of the DC workloads. We found that the process involves a sequence of i.i.d random variables that contribute differently to its delay. Also, the process shows a statistical characteristic close to M/M/K. However, the number of FSO links, K, is random with time, which necessitates careful modeling. Finally, the PDF of the process total response time is close to the hypoexponential distribution, and it maintains its main characteristics even with different distributions for the service time.

Keywords:

performance analysis; optical wireless communication; FSO; data center

1. Introduction

Computation-intensive and bandwidth-hungry applications brought a revolution in Data Center Networks (DCNs) to support the continuously growing network traffic considering the network performance requirements [1,2]. The recent findings by Microsoft [1] and Facebook [2] showed that the DC’s racks exchanged a biased pattern of traffic workload. They found that few racks exchange the majority (>

80 %

) of the overall traffic whereas the remaining racks exchange less traffic or no traffic at all. Thereby, the DC links are either underutilized or overutilized while their uniform capacity and fixed topology prohibit them from optimally satisfying the workload capacity requirements. Optical Wireless Data Center Networks (OWDCNs) emerged as the alternative to conventional wired DC networks. OWDCNs have the agility to allocate the capacity where it is needed. Also, OWDCNs offer other benefits such as a lower number of cables and less maintenance overhead, power consumption and heat dissipation [1,3,4,5].

In recent works, researchers have proposed different use cases of optical wireless communication (OWC). One of the emerging technologies of OWC is FSO, in which a modulated light beam propagates in free space with no fibers involved. Therefore, FSO combines the edibility of wireless communication and the high speed/high bandwidth of optical communication. Due to its proven features, FSO has been widely used to tackle the aforementioned challenges. FireFly [4] and ProjecToR [1] are mainly designed to transmit all the DC workloads. F4Tele [5,6] utilizes OWC to build a dedicated network for management traffic. Umair et al. [7] proposed a wireless network for SDN traffic. In this context, Zhou et al. [8] built a separate wireless network for facilities traffic.

Intuitively, a normal DCN has thousands of Top of Rack (ToR) switches, and the physical dimension and processing capacity of a rack is not enough to install or process thousands of transceivers to communicate with every DC rack. Moreover, inadequate FSO links are unable to deal with multitudinous racks simultaneously. Thus, a line-of-sight (LoS) FSO link between every individual rack and others is hard to structure, and indeed this challenge exists for the wired network. This is also the same for normal DC traffic, control traffic, network management traffic and facility messages. To tackle this challenge, researchers such as those working on FireFly [4], ProjecToR [1], F4Tele [5] and others [9] establish indirect LoS FSO links on demand by exploiting emerging technologies (such as ceiling mirrors, disco balls, switchable mirrors and digital micromirror devices). Although these schemes solved the traffic workload challenges and increased the DC communication performance, a reasonable amount of time is needed to establish the indirect LoS FSO lightpath. From this, we understand that studying and analyzing the delay of establishing the FSO links, lightpaths, to transmit the traffic from source to destination is mandatory.

Thus, in this work, we aim to study and analyze all possible factors that contribute to building an indirect LoS FSO link between DC racks. The process of establishing the link involves a sequence of nonuniform processing steps. Every step is performed on a different device with unequal processing operations and service times, starting from examining the existence of a lightpath between the source and destination and ending at swiveling the FSO link gears (the FSO gears are reconfigurable and the CU can changes their directions both vertically and horizontally [1,4,5]) (transceivers and switching mirrors) toward the designated destination. This introduces a random process of arbitrary random variables which requires a deliberate analysis analytically and empirically. Additionally, the process of establishing an indirect FSO link is launched upon the arrival of a new flow (

f_{n e w}

) (the new flow is a flow with a destination that cannot be reached by any of the existing FSO links from its source rack) and terminated according to the controlling unit (CU) commands. In previous work, we studied the delay involved in the SDN flow setup [10], where OpenFlow switches communicate with the SDN controller to build an end-to-end path. Normally, OpenFlow switches need to be configured with proper configuration commands to route incoming packets to the right destination.

1.1. Motivations

Next-generation DCs are being modified by considering the FSO and Radio Frequency (RF) wireless communication to support the exponential growth of data. A huge amount of data is stored in the servers, and the number of servers reaches hundreds of thousands of servers to accommodate and process the massive data simultaneously [1,2]. In term of installation overhead and costs, FSO network doesn’t require the overhead and costs of building the ducts and pulling the wires through them. Fiber optic cables are inflexible and fragile. Also, fiber optic cables are prone to damage and cut during the construction and maintenance. In term of scalability, FSO networks can be expanded easily by adding enough number of transceivers at the edges without modifying the network infrastructure [1,4,9]. Moreover, FSO technology offers high-speed link capacity up to Terabyte per second [11].

1.2. Related Works

In order to resolve the weaknesses and limitations of wired DCs, researchers have attempted to reap the benefits of wireless communication technologies [9,12,13,14]. Researchers have classified DC traffic according to its size as large (elephant) and small (mice). Also, they have classified it according to its service: network management traffic and data traffic. The OWDCN researchers exploited these classifications in their schemes. The following paragraphs summarize their findings, particularly the FSO-related works as they are our interest herein. F4Tele [5] is introduced to build an FSO-based network dedicated for network management (NM) traffic. Rather than sending the NM traffic over the same network of the data traffic, the author attempts to utilize FSO technology to build a dedicated network to transfer it from data racks to the NM racks.

Similarly, the authors of ProjecToR [1] attempted to exploit the DC traffic communication pattern in the topology structure and the traffic scheduling, where few racks are overloaded and the majority of racks are underutilized. To facilitate this, they leveraged digital micromirror devices and disco balls to speed up the switching of FSO links. The digital micromirror device can direct FSO beams toward tens of thousands directions, while it needs 12 µs to switch between these directions. The authors of [15] introduced a new OWDCN solution by utilizing a nanosecond semiconductor optical amplifier and wavelength selectors and an arrayed waveguide grating router. The solution has been thoroughly investigated by using detailed sets of experiments and hardware. The authors of [16] proposed and evaluated a novel OWDCN architecture named ROTOS based on reconfigurable optical ToR switches. The wavelength capacity and beam directions are configured on demand from a centralized unit.

On the other hand, other researchers attempted to build an overlay FSO network dedicated to network management traffic. However, they encountered multiple challenges. The network management racks do not hold enough physical and processing capacities to serve thousands of FSO beams. Instead, the authors of [5] proposed a new traffic scheduling method compatible with the network management traffic workload. Moreover, the author of [6] attempted to reduce the number of FSO links between the data racks and management racks. Since the DCs show a skewed traffic distribution, the author attempts to shuffle the racks to regulate this distribution. The solution groups multiply the racks into one cluster, and every cluster has a dedicated FSO transceiver toward the management rack. This method simplifies the flow scheduling mechanism and unfairness. Similarly, the authors of [17] introduced an FSO scheme for the facility traffic.

1.3. Paper Objectives and Novel Contributions

The main contributions of this work can be summarized as follows:

•: This paper attempts to understand and model the process of establishing the R2R indirect LoS FSO link in wireless Data Center Networks.
•: The establishment of an R2R indirect LoS FSO link involves a sequence of nonuniform processing steps. Every step is performed on a different device with unequal processing operations and service times. This introduces a random process of arbitrary random variables which requires a deliberate analysis analytically and empirically.
•: According to recent data center traffic studies by Microsoft [1] and Facebook [2], the DC has short and long flows as well as short-term and long-tern rack-to-rack traffic directions. This article considered the variability in these traffic workloads. The first scenario attempts to present a model compatible with short flows and long-term traffic-direction workloads. In contrast, the second scenario considers a model suitable for long flows and short-term directions. Although the first scenario is more practical and easier for modeling, it is expected to face high power consumption challenges and it suffers from inefficient resource utilization. However, in the second scenario, the R2R FSO links are terminated after flow completion, which is suitable for power conservation and utilization solutions.
•: The flows in DCs could be forwarded to the same destination or otherwise. The probability that a flow is going to be forwarded to the same destination relies on the number of established FSO links S, where $S \in {0, 1, 2, 3, . . ., K}$ and K is the maximum number of FSO links that can be launched from an individual rack. In the second scenario, S is a stochastic variable that has multiple factors contributing to its state.

The remaining of the paper is organized as follows: Section 2, discusses the problem statement of the indirect LoS R2R FSO link setup process and where link management techniques contribute. Section 3, describes the mathematical description of the system model and their two scenarios. Section 4, the mathematical analysis including delay analysis of new link setup time, system capacity, and blocking probability are explained. The performance evaluation of results is revealed in Section 5, and finally, a conclusion is drawn in Section 6.

2. Problem Statement

DCs have thousands of servers grouped into almost identical racks, which means every rack has the same number of servers. The racks communicate with each other through a switch (also known as a top-of-rack (ToR) switch) installed on top of every rack. The ToR switches are connected through optical fiber cables with hundreds of intermediate high-speed switches. This wire-based structure encounters maintenance and development challenges and clear deficiencies in allocating the optimal capacity to serve forwarding flows. Recent studies on DC traffic characteristics have suggested the development of new alternatives for existing wired technologies. The wired DCs have a rigged structure and uniform distribution of the communication and processing resources that hinder them from efficiently coping with the requirements of the DC traffic workloads. On the other hand, emerging wireless technologies (e.g., FSO and mmWave) have the necessary features to be the superior alternative.

The DC servers exchange inter-rack and intra-rack (Local) traffic. The inter-rack traffic carries data, e.g., search queries, and control, e.g., syslog messages, traffic. When a new flow arrives at the ToR switch, conventionally, a path table is examined to determine the forwarding port and FSO link for this flow. However, this is not the end of the journey. The data center has thousands of racks and the ToR switch has a limited number of outgoing ports (FSO transceivers), which eliminates the possibility of building a direct link with every rack. To tackle this challenge, researchers have developed two things: indirect LoS FSO link mechanisms and on-demand FSO link-scheduling algorithms [1,4,5]. When the ToR switch does not have an FSO link to serve a flow, it establishes a new link to serve it. The establishment of a new FSO link involves further processing steps, which introduce extra delay and overhead. This process goes through a series of services starting from the ToR switch, then the control channel (CU) and finally the FSO link gears (mirrors and transceivers). Each service needs time to process the request. The ToR switches need time,

t_{T s}

, to process the request, read its switching table and forward it through the control channel to the CU. The control channel need time,

t_{c h}

, to transmit and propagate the request, and this depends on its data rate. When the request is received by the CU, it needs time,

t_{c u}

, to execute the optimization algorithm to find the optimal path between the source and destination ToR switches. Then, the CU instructs the FSO gears. These gears needs time,

t_{M s}

, to change their directions by spinning the transceivers and changing the switching-mirrors state. The flow at the ToR switch waits for all of these times to complete. The delay

D_{L s}

(the unit could be microseconds up to seconds depending on the adopted technology) for the setup of the new FSO link is given by

D_{L s} = t_{T s} + t_{c h} + t_{M s} + t_{c u}

(1)

3. R2R FSO Link Setup Process: Two Scenarios

The ToR switch can establish a limited number, K, of rack-to-rack (R2R) FSO links. This limitation is due to the finite processing capacity and number of outgoing ports. When a flow arrives at the ToR switch, it could find an R2R FSO link at its destination or wait for the ToR switch to establish a new link for it. In this work, we attempt to study and model this waiting time.

The ToR switch needs to establish a rack-to-rack (R2R) FSO link to serve a new flow,

f_{n e w}

. This process starts at the ToR switch by sending an R2R link establishment query to the CU. Since there are multiple numbers of choices to establish the R2R link, this creates a well-known integer linear programming problem: the multicommodity flow problem (also known as routing and wavelength assignment). The CU is expected to run one of the well-known resource allocation optimization algorithms to solve the integer linear program and find the optimal selection to set up the path (the multicommodity flow model is a nondeterministic polynomial-time-complete problem which can be solved by heuristic approaches). Accordingly, the CU has three tasks: (1) run the algorithm to find the optimal path, (2) command the selected path gears including FSO transceivers as well as the switching mirrors to establish the link and (3) provide the ToR switches with the necessary information to forward the flows via the right outgoing ports. Finally, the ToR switch adds a new entry to its path table about this link.

On the other hand, the newly installed link is going to be used by any subsequent flows that are going to the same destination rack,

f_{o l d}

. In this case, these flows,

f_{o l d}

, would not encounter the R2R FSO link setup process. The likelihood of this happening depends on the number of currently established R2R FSO links S, where

S \in {0, 1, 2, 3, . . ., K}

and K is the maximum number of FSO links that can be launched from an individual rack. The question is how many R2R FSO links exist, S, when a flow arrives at the ToR switch? The value of S could be static, where the system is configured to always have K FSO links. In this scenario, the value of S is K. Alternatively, the system is configured to establish the R2R FSO links on demand. In this scenario, S is a dynamic random value that varies with time. The configuration of the R2R FSO links changes the system characteristics and the system modeling accordingly.

These two configuration scenarios are expected to exist together at the same DC. In the first scenario, every ToR switch in the DC establishes and retains K FSO links all the time, regardless of their utilization. At the beginning, the system establishes these FSO links with random racks and then changes their directions according to the CU instructions. In the case of zero utilization “no traffic”, the links turn to idle mode. This scenario is suitable for small flows (mice flows) as well as long-term directions (high utilized racks). On the other hand, in the second scenario, the R2R FSO links are established on demand and are terminated immediately after the forwarding flows complete their transmission. When there is no flow, the system has no established link. This scenario is suitable for large flows (elephant flows) as well as short-term directions (low utilized racks). These two scenarios cover the DC workload requirements as described above and in [1,2] and are expected to be used together in the same DC.

3.1. Problem Formulation: First Scenario

The ToR switch maximum capacity is K FSO transceivers, and these transceivers can be used to build only K FSO links. The arrival flow is going to experience the waiting time in the case that the K FSO links are connected with racks other than its destination. Since the system has K servers and FSO channels, as shown in Figure 1, the closest model to it is M/M/K. However, the main difference between the R2R FSO link setup system and M/M/K is in the sharing ability of the FSO channels. These channels could be shared by all the flows that are going to the same destination. The FSO links have the physical layer electronics to be shared by multiple flows [9]. Table 1 illustrates the significant notations.

The M/M/K model reaches the waiting state when the number of customers in the system is larger than the number of servers. In contrast, due to the sharing ability of the FSO links, some of the flows (customers) in the presented system are not going to wait for other flows to complete their services. The flow enters the waiting queue when the ToR switch has no link to its destination. The question is how does the presented system get to the waiting state? At state 0, every ToR switch has K FSO links to randomly chosen destination racks. A new flow enters the waiting state when it carries a destination that differs from all the K destinations. In this case, the FSO link setup process is triggered, and then one of the K FSO links is re-established toward the new destination rack. The subsequent flows are either being served by these K links or by requesting to re-establish one of them.

In order to make the presented model similar to the M/M/K model, the transient event from one state to another needs to be clearly defined. In the presented system, the flows with the same destination share the same FSO link, and no transient event would happen for them. Contrary, the flow with a new destination would trigger the R2R FSO link setup process, which makes a transient event. Thereby, the transient event happens when a flow with a new destination rack arrives at the ToR.

The flows with different racks,

f_{n e w}

, have an arrival rate of

λ_{n e w} = (1 - p m) * λ

(2)

where pm is the probability that a flow matches one of the existing FSO links. This probability is clearly related to the ToR transceiver capacity (number of outgoing ports) and the total number of racks in the DC. The ToR switch has K R2R FSO links, and the DC has M racks. The matching probability, pm, can be calculated from

p m = \frac{K}{M}

(3)

To clarify the impact of

λ_{n e w}

(please note that this symbol is the arrival rate and it is not a wavelength) on the system, we need to assume that the system has only

f_{n e w}

. In this case, the transient state event happens with every arrival until all the K FSO links are established and the system reaches its full capacity. The subsequent arrivals need to wait for the other flows to be complete. This system presents similar characteristics to the M/M/K model, assuming the arrival rate follows the Poisson distribution and the channel service times follow an exponential distribution. Additionally, the waiting time has an extra component which is the R2R FSO link setup process changing the service time distribution into a general distribution and the model into M/G/K. The average waiting time is

{\bar{W}}_{n e w} = D_{L s} + {\bar{W}}^{M / G / K} .

(4)

When the flow arrival rate of

f_{o l d}

is considered,

λ_{o l d} = p m * λ

(5)

These flows,

f_{o l d}

, are not going to wait for the other flows including

f_{n e w}

or the FSO link setup process. The system immediately transmits them with their sisters sharing the same FSO link, and

μ_{f s o}

is the service time of a single FSO link:

{\bar{W}}_{o l d} = \frac{λ_{o l d}}{μ_{f s o}} .

(6)

On the other hand, during the waiting time of

f_{n e w}

for the system to find for them a link, a new

f_{o l d}

flow could arrive, which prolongs their waiting time

{\bar{W}}_{n e w}

because the system needs to wait for them to complete their service before using their FSO link:

{\bar{W}}_{n e w} = D_{L s} + {\bar{W}}^{M / G / K} + \frac{λ_{o l d}}{μ_{T s}} .

(7)

where the mean waiting time of M/G/K according to Lee and Longton [18] is given by

{\bar{W}}^{M / G / K} = (\frac{C^{2} + 1}{2}) \cdot {\bar{W}}^{M / M / K} .

(8)

where

C^{2}

is the square coefficient of variations of the ToR mean service time

μ_{T s}

. In this case, the total waiting time when the matching probability,

P r_{m a t c h}

, is considered is

\bar{W} = {\bar{W}}_{o l d} + (1 - p m) {\bar{W}}_{n e w} .

(9)

3.2. Problem Formulation: Second Scenario

The mathematical model of the second scenario is similar to the model of the first scenario except that the matching probability needs to be considered. Both scenarios have a statistical characteristic close to M/G/K. However, the variable K in first scenario is constant, while it is stochastic in this scenario. To make this clear, when a new flow arrives at the ToR switch, the number of FSO channels is always K in the first scenario and unknown,

K (t)

, in the second scenario. The following bullets describe its characteristics:

K as a constant value is no longer valid in this scenario because it represents the maximum capacity of the ToR switch; instead, the symbol k is used, which is Z∈ $[0, K]$ .
K increases with a mismatch when a flow of a different destination, $f_{n e w}$ , arrives at the ToR switch.
K decreases when an FSO link is terminated.

Accordingly, k is considered an independent and identical distributed (i.i.d.) random variable which impacts the modeling of the matching probability, pm. In order to calculate the pm, the system needs to know how many R2R FSO links exist when a flow arrived. Moreover, to find the total response time of this system we need to find the pm distribution. As explained above the first contributing factor into k and matching probability is the arrival-rate. The second factor is the FSO link termination. The arrivals could be

f_{n e w}

or

f_{o l d}

. The value of k as well as the probability are increased with the arrival of

f_{n e w}

, and decreased when an FSO link is terminated. The time between any two events is defined herein with

τ

. This time

τ

could be the lifetime of an individual FSO link is the time from establishing the FSO link until destroying it, or the inter-arrival time between two flows.

During

τ

three events could happen. Arrival of

f_{n e w}

which triggers the R2R FSO link setup process and k increases accordingly. Arrival of

f_{o l d}

to be forwarded through this FSO link or other links. Finally, a termination of this link or one of existing FSO links. When there is no link, the arrival flow is definitely new, and

τ

starts. In order to model the relation between these events,

τ

is discretized into small instants of time,

τ_{0}

. Only one event could happen in a single instant. Discretizing the time enables modeling the main factors of the matching probability by utilizing discrete-time Markov chain model (DTMC). The DMTC based model is shown in Figure 2. In this model one of the aforementioned events could happen at,

τ_{0}

.

The DTMC based model helps to get the pm probability through modeling the probabilities of these three events. First, the probability that the arrived flow is

f_{o l d}

which means it would be forwarded through one of the ToR established links, p

\cdot \frac{S}{K}

. Second, the probability that it is

f_{o l d}

and doesn’t match with any of the established links,

(1 - \frac{S}{K})

p which triggers the R2R FSO link setup process. Third, the probability that there is no arrival and instead a link termination event happens

(1 - p)

. The probability p examines all the time instants,

τ_{0}

, until arrival-event happens or a link terminated. From this we understand that

τ

is the time between two events because when there is no arrival during

τ

, the inprocess flow is complete and the FSO link terminates. This probability, p, is Geometric which alternates between two states whether having an arrival event or not. When there is no arrival at the whole time of

τ

, the link termination event arising. It is clear that from literature the probability of having a flow arrival at

τ_{0}

follows the Poisson distribution.

According to the DTMC model, the state probabilities are as follows:

p = P [X \leq τ / τ_{0}] = \sum_{n = 1}^{\frac{τ}{τ_{0}}} p_{τ_{n}} \cdot {(1 - p_{τ_{n}})}^{n}

(10)

\begin{matrix} (1 - p) = P [X > τ / τ_{0}] \end{matrix}

(11)

\begin{matrix} p_{τ_{n}} = (λ τ_{n}) \cdot e x p (- λ τ_{n}) \end{matrix}

(12)

\begin{matrix} P_{0} + P_{1} + P_{2} + P_{3} + . . . + P_{K} = 1 \end{matrix}

(13)

\begin{matrix} P_{1} = \frac{p P_{0}}{(1 - p)} \end{matrix}

(14)

\begin{matrix} P_{k} = \frac{(1 - \frac{k - 1}{M} \cdot p) P_{k - 1} - p \cdot (1 - \frac{k - 2}{M}) P_{k - 2}}{(1 - p)} \end{matrix}

(15)

\begin{matrix} P_{K} = \frac{p \cdot (1 - \frac{k - 1}{K}) P_{k - 1}}{(1 - \frac{k - 1}{K} \cdot p)} \end{matrix}

(16)

Figure 2. DTMC state diagram of the

P r_{m a t c h}

for the second scenario.

Figure 2. DTMC state diagram of the

P r_{m a t c h}

for the second scenario.

Finally, the Markov model contains the impact of the arrival rate and FSO link lifetime within the state probability

P_{S (τ)}

. Since the matching probability in this scenario is increased with the increase in the number of links and from the total probability theory,

P r_{m a t c h}

will be

\begin{matrix} p m = \sum_{n = 1}^{S} \frac{n}{K} \cdot P_{S (τ)} \end{matrix}

(17)

4. Mathematical Analysis

In this section, we present an analytical expression for the new link setup delay experienced by all classes of the links. We are interested in obtaining the mean waiting time

\bar{W}

and its second moment

\bar{W^{2}}

that will be experienced by all the flows. We first derived the mean results of the waiting time of all the components involved in the R2R FSO link setup process. The first component is the flow processing time at the ToR switch, assuming the arrival rate follows a Poisson distribution. Similarly, the southbound channel (control channel) is between the ToR switch and the CU. However, the service times,

μ_{1}

and

μ_{2}

, of them follow an exponential distribution, and they are different in terms of speed, where

μ_{1} < μ_{2}

. On the other hand, the CU and the FSO gear service times present an arbitrary distribution due to the involvement of diverse processing services, such as executing the FSO wavelength assignment algorithm and performing it physically on the FSO link gears. Consequently, the mathematical analysis of the CU as well as the FSO gear setup service time used in this section considers standard derivation steps of the

M / G / 1

model, and Table 1 has the definitions of the main notations. The incoming paragraphs define the definitions of other notations that are being used herein. The waiting time for the matching probability is calculated as follows:

\bar{W} = {\bar{W}}_{T s} + (1 - p m) [{\bar{W}}_{c u} + {\bar{W}}_{c h} + {\bar{W}}_{M s}]

(18)

Since the general distribution was considered for the service time of the CU, its average waiting time

\bar{W_{c u}}

is

{\bar{W}}_{c u} = \sum_{i = 1}^{N} X_{i} + R

(19)

where

X_{i}

is the service time of the path/link i while R is the residual time. The mean residual service time

\bar{R}

appearing in

{\bar{W}}_{c u}

can be derived by the same kind of graphical

t r i a n g l e t r i c k

as in the case of the (P-K) mean value formula. This residual time’s first instant can be expressed as

\bar{R} = \frac{1}{2} [ρ \cdot \frac{\bar{X^{2}}}{\bar{X}}]

(20)

The second moment of the waiting time is derived by the implementation of the additional algebraic manipulations:

\bar{W^{2}} = \bar{N} \cdot Var (X) + {[(1 + \frac{æ}{1 - æ}) \bar{R}]}^{2} + Var (R)

(21)

where

V a r (R) = \bar{R^{2}} - {\bar{R}}^{2}

. Equation (21) is procured by raising both sides of (19) to the second power and taking the mean. Note that the variables of

\bar{W^{2}}

are all known except that we need to evaluate

\bar{R^{2}}

. Thereby, the law of total expectation, which states that E[Y] = E[E[Y|X]], is employed to obtain

\bar{R^{2}} = \frac{1}{3} (λ \cdot \bar{X^{3}})

(22)

The average time that a flow f spends in the system is given by

{\bar{T}}^{f} = {\bar{W}}^{f} + {\bar{X}}^{f}

(23)

where the average response time of the CU and subsequent services when the matching probability are considered, and

\begin{matrix} {\bar{T}}^{f} = & 1 / (μ_{T s} - λ) + (1 - P r_{m a t c h}) \cdot [\frac{{\bar{R}}_{c u}}{(1 - ρ_{c u})} + {\bar{X}}_{c u}^{f} \\ 1 / (μ_{c h} - λ) + \frac{{\bar{R}}_{G s}}{(1 - ρ_{G s})} + {\bar{X}}_{G s}^{f}] \end{matrix}

(24)

The total response time distribution is

\begin{matrix} f (t) = & T {(μ_{1}, λ)}_{m / m / 1} + (1 - P r_{m a t c h}) \cdot [T {(μ_{2}, λ)}_{m / m / 1} \\ + T {(μ_{3}, λ)}_{m / m / 1} + T {(μ_{4}, λ)}_{m / m / 1}] . \end{matrix}

(25)

For the sake of accurate representation, we use different service times for each step in the R2R FSO link setup journey, where

μ_{1} < μ_{2} < μ_{3} < μ_{4}

, and these are the service times of the ToR switch, CU (the control channel) and the time to set up the FSO gears.

System Capacity

When it comes to real DC networks, the flow duration varies depending on its service. For instance, data centre web-search workloads present flows with different size and length compared to the data-mining workloads [19]. Thus, the flow waiting-time should be limited by a specific level of service time. Otherwise, some of the new incoming flows will wait and spend more time in the setup queue for an uncertain time which exceeds their duration. As a result, in this part, we attempt to figure out how much capacity the system has and how many flows it can handle at a certain response time quality of service

L_{Q o S}

.

K^{*} = a r g m a x_{K} \{\sum_{i = 1}^{K} L_{i}^{f} \leq L_{Q o S}\} .

(26)

As indicated in (25), the system response time is a random variable,

L_{i}

, where

\{i = 1, 2, 3, . . ., Z\}

is the flows’ response time index. In the case of the exponential distribution, the response-time equation will be,

\begin{matrix} L_{i} \approx & E x p (- μ_{1}) + (1 - p m) \cdot [E x p (- μ_{2}) + \\ E x p (- μ_{3}) + E x p (- μ_{4})] \end{matrix}

(27)

The newly arrived flow needs to wait for the flows ahead waiting in the queue. The summation of their response times is,

E = \sum_{i = 1}^{Z} L_{i}^{f}

. The value of E needs not to exceed the system QoS constraint, and the system blocks the subsequent flows. The question is what is the probability an arrival flows is going to be blocked. In order to get the blocking probability we need to find a closed formula for the distribution of the system response times L. In literature if

X_{i}

where

\{i = 1, 2, 3, . . ., U\}

is i.i.d Exponential R.V. with the constant parameter

μ

, the probability density function (PDF) of their sum is Erlang distribution with U and

μ

parameters

E (U, μ_{i})

. However, the R2R FSO link setup process has different service-time rates for each service, Exponential R.V., in its process. According to above mathematical analysis, we find the distribution of the total response time of R2R FSO link setup process in (25) take a shape close to an Hypo-exponential distribution with large U. Contrary, Erlang and Gamma distributions tend to take the bell shape as the value of U increases. Therefore, the approximated distribution is Hypo-exponential

H y p o (μ_{1}, . . . . . ., μ_{l})

,

\begin{matrix} \sum_{i = 1}^{U} T_{i}^{f} \approx \sum_{i = 1}^{U} μ_{i} \cdot E x p (- μ t) \{\prod_{j = 1, j \neq i}^{K} \frac{μ_{j}}{μ_{j} - μ_{i}}\} . \end{matrix}

(28)

The blocking probability is defined as, the probability that an R2R FSO link setup request needs a time to response exceeding the QoS time constraint

L_{Q o S}

,

P [L > t = L_{QoS}]

. The value of the threshold value, QoS time constraint, is obtained from the response time distribution L explained above. The threshold value can be expressed by the maximum number of R2R FSO link setup requests,

U^{*}

, that the system can handle before the

L_{Q o S}

value is exceeded. This can be approximated to the following equation where

\bar{L}

is the average response time,

5. Evaluation Results

The evaluation study is divided into two main parts. First, the validity of our propose model. In this part, we study the model with different distribution functions in both the simulation and analytical parts. Also, the model approximation to Hypo-Exponential distribution is examined. Second, we evaluated every delay part in the R2R FSO link setup process individually. In order to perform these evaluation parts a simulation model for the indirect LoS R2R FSO link setup process is designed by using Matlab. Both the simulation and the mathematical model runs alongside in the same Matlab code, and both of them read the same variables. Also, the matching probability (pm) is considered for both. The number of flows is 0.5 million and the average arrival-rate is

λ

=

\frac{f l o w}{m s e c .}

.

5.1. Model Validation

Initially, as a proof of concept and to examine the validity and robustness of our simulation, we compared the analytical and simulation results for different versions of M/G/1 while using different distributions (exponential, Gaussian and Erlang-2 distributions) as a service time for one of the delay parts, the CU; the results are shown in Figure 3. Figure 4 shows the PDF of the total response time when the aforementioned three distributions are used. The simulation results overfit the mathematical model results, which infers the high accuracy of the proposed model. On the other hand, in the above mathematical analysis, we conclude that the proposed mathematical model is close to a hypoexponential distribution. In order to validate this assumption, we used the QQ plot method to measure how much the proposed model is close to a hypoexponential distribution. The QQ plot results show that the model is very close to it, which confirms the accuracy of our assumption. The results are portrayed in Figure 5.

The analyzed system has four components that contribute differently to its total response time. For the sake of the accurate presentation of a real DC, these components are modeled by using different service times. The control channel has a service time that is faster than the other components, while the service time of the FSO link gears is the slowest. In order to study their individual impact on the total response time, we scaled every component independently by a set of values,

\{1, 5, 9\}

, while the matching probability is defined over another set of values,

\{0.1, 0.5, 0.9\}

, to match the PDF range. The results are presented in Figure 6, Figure 7, Figure 8, Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13. The x-axis of these figures shows the load,

ρ

, and the y-axis shows the response time in milliseconds (msec). Figure 6, Figure 8, Figure 10 and Figure 12 display the simulation results while Figure 7, Figure 9, Figure 11 and Figure 13 display the analytical results. The results demonstrate that every value in the scaling set is evaluated independently with every value in the matching probability set.

5.2. Impact of the ToR Switch

The ToR switch resides at the head of every outgoing flow. The flow (the first packet of every flow) needs to be examined by the ToR switch before forwarding it to the R2R FSO link setup process or to one of the established R2R FSO links. This service time dominates the other service times, and in our model, it is the first part before the matching probability. This dominating impact is clearly shown in Figure 6 and Figure 7. The results show that whenever the service time is scaled up, the total delay and response time declines accordingly. Moreover, the significance of this service time is clearly illustrated with large scaling values. With these scaling values, five and nine, the impact of the other service times is minor. The results showed that when using the scale value of five, the total response time grows slowly from about 1.7 ms to 2.4 ms at the maximum load. In contrast, the total response time shows a steady increase from 2.2 ms to 12.5 ms for the scale value one even though the matching probability is 0.9. As expected, the matching probability demonstrated its impact clearly, and these findings demonstrate that the ToR switch is the critical point in the R2R FSO link setup process.

Figure 6. Simulation results when the ToR switch service time is scaled by different scale values and matching probability (pm).

Figure 7. Analytical results when the ToR switch service time is scaled by different scale values and matching probability (pm).

5.3. Impact of the Control Channel

The control channel is the communication media used by the ToR switches to communicate with the CU. It is one of the delay parts in the R2R FSO link setup process. When the new flow has not been matched by existing FSO links, the ToR switch uses this channel to request the CU to build a new R2R FSO link for this new flow. In contrast, when the matching probability is high, the new flow is going to be served by already-established FSO links, and most likely the incoming flows are not going to wait for the link to be established. In this case, the impact of the control channel delay is clearly exhibited with small matching probabilities. Figure 8 and Figure 9 show that when the matching probability is 0.1 of all the scales attempted, one, five and nine, the response-time curves score the slowest results, indicated by the top circle in the figure. In contrast, the figures show that as long as the matching probability is increasing, the speed of the response times is increasing as well. This behavior is clearly illustrated when the matching probability is 0.9, as the response-time curves score the fastest results.

Figure 8. Simulation results when the control channel service time is scaled by different scale values and matching probability (pm).

5.4. Impact of the Controlling Unit

The CU is the master mind of managing the R2R FSO links and their flows. As explained in the previous subsection, the CU handles the R2R FSO links setup requests. The control channel, the CU, and the FSO gear service are requested when the new flow is not matched by one of the existing FSO links. As a result, their relation with the matching probability is almost the same. At high pm values, the impact of the CU delay on the response time is close to the delay of the ToR switch, particularly when the scale value equals one. The average delay, when pm = 0.5 and the scale value is five, is about 3 ms. The results are shown in Figure 10 and Figure 11.

Figure 9. Analytical results when the control channel service time is scaled by different scale values and matching probability (pm).

Figure 10. Simulation results when the CU service time is scaled by different scale values and matching probability (pm).

Figure 11. Analytical results when the CU service time is scaled by different scale values and matching probability (pm).

5.5. Impact of the R2R FSO Gears

The R2R FSO gears include the FSO transceiver and the switching mirrors. The delay of spinning the transceiver and changing the transparent mode of the SM affects the R2R FSO link setup time. Similar to other delay parts of the R2R FSO link setup process, the delay of the FSO gears was evaluated with the same evaluating procedures. The total response time decreases when the service time of the R2R FSO gears is scaled up, as well as when the pm value is high. The results are shown in Figure 12 and Figure 13.

Figure 12. Simulation results when the R2R FSO link gears service time is scaled by different scale values and matching probability (pm).

Figure 13. Analytical results when the R2R FSO link gears service time is scaled by different scale values and matching probability (pm).

6. Conclusions

Researchers have adopted several mechanisms to build indirect FSO links and schedule algorithms to route flows through the best route. Changing the direction of the indirect FSO link involves multiple delays caused by the processing time of the ToR switch and the CU, the southbound channel and the time to setup the FSO link gears (transceivers and mirrors). In this paper, we modeled and studied the time needed to establish the indirect FSO link during different traffic loads. We found that three services (the CU, FSO gears and matching probability) are the main contributors to the process, particularly the matching probability, which can be improved physically and analytically. Physically, we can expand the FSO infrastructure including the ToR switches’ processing capacity and ports. Analytically, the DC traffic workloads’ distributions have patterns that can be learned and reflected in the FSO design structure. Also, we found that the distribution of the indirect FSO link setup system is close to a hypoexponential distribution, and it maintains its main characteristics even when we use different distributions for the service time.

Author Contributions

Conceptualization, A.A. (Amer AlGhadhban), S.H.A., M.A. and A.A. (Abdulaziz Almalaq); methodology, A.A. (Amer AlGhadhban); software, A.A. (Amer AlGhadhban) and S.H.A.; validation, A.A. (Amer AlGhadhban), S.H.A. and A.A. (Abdulaziz Almalaq); formal analysis, A.A. (Amer AlGhadhban); investigation, A.A. (Amer AlGhadhban), S.H.A., M.A. and A.A. (Abdulaziz Almalaq); resources, A.A. (Amer AlGhadhban), M.A. and A.A. (Abdulaziz Almalaq); data curation, A.A. (Amer AlGhadhban); writing—original draft preparation, A.A. (Amer AlGhadhban), S.H.A. and M.A.; writing—review and editing, A.A. (Amer AlGhadhban), S.H.A. and M.A.; visualization, A.A. (Amer AlGhadhban); supervision, A.A. (Amer AlGhadhban) and A.A. (Abdulaziz Almalaq); project administration, A.A. (Amer AlGhadhban), M.A. and A.A. (Abdulaziz Almalaq); funding acquisition, A.A. (Amer AlGhadhban), M.A. and A.A. (Abdulaziz Almalaq). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Scientific Research Deanship at the University of Ha’il, Saudi Arabia, through project number BA-2024.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ghobadi, M.; Mahajan, R.; Phanishayee, A.; Devanur, N.; Kulkarni, J.; Ranade, G.; Blanche, P.A.; Rastegarfar, H.; Glick, M.; Kilper, D. Projector: Agile reconfigurable data center interconnect. In Proceedings of the 2016 ACM SIGCOMM Conference, Florianopolis, Brazil, 22–26 August 2016; ACM: New York, NY, USA, 2016; pp. 216–229. [Google Scholar]
Roy, A.; Zeng, H.; Bagga, J.; Porter, G.; Snoeren, A.C. Inside the Social Network’s (Datacenter) Network. SIGCOMM Comput. Commun. Rev. 2015, 45, 123–137. [Google Scholar] [CrossRef]
Terzi, C.; Korpeoglu, I. 60 GHz wireless data center networks: A survey. Comput. Netw. 2021, 185, 107730. [Google Scholar] [CrossRef]
Hamedazimi, N.; Qazi, Z.; Gupta, H.; Sekar, V.; Das, S.R.; Longtin, J.P.; Shah, H.; Tanwer, A. FireFly: A Reconfigurable Wireless Data Center Fabric Using Free-space Optics. In Proceedings of the ACM SIGCOMM, Chicago, IL, USA, 17–22 August 2014; pp. 319–330. [Google Scholar]
AlGhadhban, A. F4Tele: FSO for data center network management and packet telemetry. Comput. Netw. 2021, 186, 107711. [Google Scholar] [CrossRef]
AlGhadhban, A. FSO Clusters for Data Center Network Management and Packet Telemetry. In Proceedings of the SIGCOMM ’20 Poster and Demo Sessions, Virtual Event, 10–14 August 2020; Association for Computing Machinery: New York, NY, USA, 2021; pp. 9–11. [Google Scholar] [CrossRef]
Umair, Z.; Qureshi, U.M.; Cheng, Y.; Jia, X. An Efficient Wireless Control Plane for Software Defined Networking in Data Center Networks. IEEE Access 2019, 7, 58158–58167. [Google Scholar] [CrossRef]
Zhou, X.; Zhang, Z.; Zhu, Y.; Li, Y.; Kumar, S.; Vahdat, A.; Zhao, B.Y.; Zheng, H. Mirror mirror on the ceiling: Flexible wireless links for data centers. ACM SIGCOMM Comput. Commun. Rev. 2012, 42, 443–454. [Google Scholar] [CrossRef]
Hamza, A.S.; Deogun, J.S.; Alexander, D.R. Wireless Communication in Data Centers: A Survey. IEEE Commun. Surv. Tutor. 2016, 18, 1572–1595. [Google Scholar] [CrossRef]
AlGhadhban, A.; Shihada, B. Delay analysis of new-flow setup time in software defined networks. In Proceedings of the NOMS 2018—2018 IEEE/IFIP Network Operations and Management Symposium, Taipei, Taiwan, 23–27 April 2018; pp. 1–7. [Google Scholar] [CrossRef]
Fernandes, M.A.; Monteiro, P.P.; Guiomar, F.P. Single-Wavelength Terabit FSO Channel for Datacenter Interconnects Enabled by Adaptive PCS. In Proceedings of the 2021 Optical Fiber Communications Conference and Exhibition (OFC), San Francisco, CA, USA, 6–11 June 2021; pp. 1–3. [Google Scholar]
Katayama, Y.; Takano, K.; Kohda, Y.; Ohba, N.; Nakano, D. Wireless data center networking with steered-beam mmWave links. In Proceedings of the IEEE Wireless Communications and Networking Conference (WCNC) 2011, Cancun, Mexico, 28–31 March 2011; pp. 2179–2184. [Google Scholar] [CrossRef]
Hamza, A.S.; Deogun, J.S.; Alexander, D.R. Free space optical data center architecture design with fully connected racks. In Proceedings of the IEEE Global Communications Conference (GLOBECOM) 2014, Austin, TX, USA, 8–12 December 2014; pp. 2192–2197. [Google Scholar] [CrossRef]
Peng, M.; Li, Y.; Jiang, J.; Li, J.; Wang, C. Heterogeneous cloud radio access networks: A new perspective for enhancing spectral and energy efficiencies. IEEE Wirel. Commun. 2014, 21, 126–135. [Google Scholar] [CrossRef]
Zhang, S.; Xue, X.; Tangdiongga, E.; Calabretta, N. Low-latency optical wireless data-center networks using nanoseconds semiconductor-based wavelength selectors and arrayed waveguide grating router. Photonics 2022, 9, 203. [Google Scholar] [CrossRef]
Xue, X.; Yan, F.; Prifti, K.; Wang, F.; Pan, B.; Guo, X.; Zhang, S.; Calabretta, N. ROTOS: A reconfigurable and cost-effective architecture for high-performance optical data center networks. J. Light. Technol. 2020, 38, 3485–3494. [Google Scholar] [CrossRef]
Zhu, Y.; Zhou, X.; Zhang, Z.; Zhou, L.; Vahdat, A.; Zhao, B.Y.; Zheng, H. Cutting the Cord: A Robust Wireless Facilities Network for Data Centers. In Proceedings of the 20th Annual International Conference on Mobile Computing and Networking, MobiCom ’14, Maui, HI, USA, 7–11 September 2014; Association for Computing Machinery: New York, NY, USA, 2014; pp. 581–592. [Google Scholar] [CrossRef]
Lee, A.; Longton, P. Queueing processes associated with airline passenger check-in. J. Oper. Res. Soc. 1959, 10, 56–71. [Google Scholar] [CrossRef]
Benson, T.; Anand, A.; Akella, A.; Zhang, M. Understanding data center traffic characteristics. ACM SIGCOMM Comput. Commun. Rev. 2010, 40, 92–99. [Google Scholar] [CrossRef]

Figure 1. The new flow arrives at the ToR switch to be served by a rack-to-rack (R2R) FSO link connected to the required destination rack.

Figure 3. Validating our simulation and model with different distributions.

Figure 4. PDF of the proposed model with different distributions.

Figure 5. QQ plot evaluation results between the proposed model and hypoexponential distribution.

Table 1. Summary of important notations.

Symbol	Definition
$p m$	The probability that a flow is matched by existing R2R FSO link (matching probability).
K	The maximum number of R2R FSO links that a ToR switch can establish.
M	The total number of racks in a DC.
$P_{S (τ)}$	Probability of having S R2R FSO links during $τ$ .
$f_{n e w}$	The flows that cannot be served by existing FSO links (unmatched flows).
$f_{o l d}$	The flows that can be served by existing FSO links (matched flows).
$λ_{n e w}$	Poisson arrival rate of unmatched flows $f_{n e w}$ .
$λ_{o l d}$	Poisson arrival rate of matched flows $f_{o l d}$ .
$γ$	The sum of $λ$ s that arrived at the CU from other data-plane devices.
$X_{i}$	Service time random variable for flow i.
R	Residual service time.
$W_{n e w}$	Waiting time of $f_{n e w}$ .
$W_{o l d}$	Waiting time of $f_{o l d}$ .
$W_{T s}$	Flow waiting time for the ToR switch to complete its process.
$W_{c n}$	Flow waiting time for the CU to complete its process.
$W_{c h}$	Flow waiting time for the control channel to complete its process.
$W_{M s}$	Flow waiting time for the FSO link gears to complete their process.
$μ_{T s}$	The service rate of the ToR switch.
$μ_{c h}$	The control channel bit rate.
$μ_{c n}$	The service rate of the CU.
$μ_{M s}$	The processing speed of FSO gears.
${\bar{L}}^{f}$	Average response time of an R2R FSO link setup process

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

AlGhadhban, A.; Abdulhussain, S.H.; Alazmi, M.; Almalaq, A. Performance Analysis of Adopting FSO Technology for Wireless Data Center Network. Systems 2023, 11, 482. https://doi.org/10.3390/systems11090482

AMA Style

AlGhadhban A, Abdulhussain SH, Alazmi M, Almalaq A. Performance Analysis of Adopting FSO Technology for Wireless Data Center Network. Systems. 2023; 11(9):482. https://doi.org/10.3390/systems11090482

Chicago/Turabian Style

AlGhadhban, Amer, Sadiq H. Abdulhussain, Meshari Alazmi, and Abdulaziz Almalaq. 2023. "Performance Analysis of Adopting FSO Technology for Wireless Data Center Network" Systems 11, no. 9: 482. https://doi.org/10.3390/systems11090482

APA Style

AlGhadhban, A., Abdulhussain, S. H., Alazmi, M., & Almalaq, A. (2023). Performance Analysis of Adopting FSO Technology for Wireless Data Center Network. Systems, 11(9), 482. https://doi.org/10.3390/systems11090482

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Performance Analysis of Adopting FSO Technology for Wireless Data Center Network

Abstract

1. Introduction

1.1. Motivations

1.2. Related Works

1.3. Paper Objectives and Novel Contributions

2. Problem Statement

3. R2R FSO Link Setup Process: Two Scenarios

3.1. Problem Formulation: First Scenario

3.2. Problem Formulation: Second Scenario

4. Mathematical Analysis

System Capacity

5. Evaluation Results

5.1. Model Validation

5.2. Impact of the ToR Switch

5.3. Impact of the Control Channel

5.4. Impact of the Controlling Unit

5.5. Impact of the R2R FSO Gears

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI