Next Article in Journal
Experimental Study on Splitter Plate for Improving the Dielectric Recovery Strength of Low-Voltage Circuit Breaker
Next Article in Special Issue
Combined Access Barring Scheme for IoT Devices Using Bayesian Estimation
Previous Article in Journal
Generation of 3-D Grid Multi-Scroll Chaotic Attractors Based on Sign Function and Sine Function
Previous Article in Special Issue
An Adjusted Free-Market-Inspired Approach to Mitigate Free-Riding Behavior in Peer-to-Peer Fog Computing
 
 
Article
Peer-Review Record

A Prediction-Based Model for Consistent Adaptive Routing in Back-Bone Networks at Extreme Situations

Electronics 2020, 9(12), 2146; https://doi.org/10.3390/electronics9122146
by Qianru Zhou 1 and Dimitrios Pezaros 2,*
Reviewer 1: Anonymous
Reviewer 2: Anonymous
Reviewer 3: Anonymous
Electronics 2020, 9(12), 2146; https://doi.org/10.3390/electronics9122146
Submission received: 12 November 2020 / Revised: 6 December 2020 / Accepted: 8 December 2020 / Published: 15 December 2020
(This article belongs to the Special Issue Future Networks: New Advances and Challenges)

Round 1

Reviewer 1 Report

The authors analyze the problem of path (in)stability under extreme congestion, and propose a solution to switch to backup paths when congestion triggers. Contrary to one of the claims, the issue of route flapping (with or without congestion) has been extensively studied in the literature in the past, and a number of proposed solutions exists. These are not discussed or referred in the paper. Additionally, the paper has a number of issues, in my opinion, that should be addressed:

1) The notation is sometimes inconsistent, e.g. in equations (3), (4) and (5) there is some confusion regarding t and Δt in the differential congestion indicators. More importantly, the DPCI (1) is presented as a Markov process, but it is actually not Markovian unless further assumptions are made on L(t). Fortunately, the Markovian property is not exploited at all in the paper.

2) The cumulative DPCI presented in (6)-(9) is basically a moving average, which is not novel. It is rather a well known mechanism for smoothing a volatile signal. The key observation is, as the authors recognize, that these quantities depend crucial of a manual parameter, θ, acting as a hysteresis factor, and its effect is not discussed at all in the paper.

3) From an implementation point of view, how is latency measured between two endpoints in a path? I guess it's simply the RTT in the transport protocol (e.g., TCP) but this is not done so in the experiments, where ping is employed. Are ping probes necessary in parallel to measure latency?

4) Given that, increasingly, delay-based congestion control algorithms are being used in the Internet, and these suffer from less congestion episodes than loss-based congestion control protocols, to what extent is the proposed method appropriate in modern networks?

5) The experiments presented use the Abilene topology as an example, and quite old traffic traces. While the topology might be acceptable (even if the current topology of Internet is more 'flat' the 20 years ago), the traffic traces are outdated and not representative of typical applications today.

6) In Figs. 5-8, it is said that 'different parameters θ' are compared. Which ones? No mention or discussion accompanies in the main text.

Overall, the support and evidence that the proposed metric for switching paths is robust needs improvements: the formulation needs to be refined, the experiments need more detail, the traffic models seem to be unresponsive to congestion, and the main concepts that support the proposal are not particularly novel (a moving average of delay measurements). All of these requires further clarification.

 

Author Response

The authors would like to thank the reviewer for his/her helpful and insightful comments. We have carefully improved the quality of the manuscipt by taking all the comments into account. The modification of the manuscript and the responses to the reviews point-to-point are described as follows in red. 

The authors analyze the problem of path (in)stability under extreme congestion, and propose a solution to switch to backup paths when congestion triggers. Contrary to one of the claims, the issue of route flapping (with or without congestion) has been extensively studied in the literature in the past, and a number of proposed solutions exists. These are not discussed or referred in the paper. Additionally, the paper has a number of issues, in my opinion, that should be addressed:

The existing work on reduce route flapping are mainly route dampening and route aggregation. These methodologies try to reduce route flapping passively, in other words, they didn't change the flapping route itself, but execute some additional calculation and operation on these flapping routes. Besides, to our knowledge, the existing works did not report experiment results on the same use case as ours, nor did they consider various congestion situations when traffic load is 5, 7.5, and 20 times of bandwidth. By taking history latency of the path and competing path into consideration, the proposed method achieve more stability while achieving acceptable channel situation improvement (get a stable route last for more than 3000 seconds at longest, more than 1000 seconds on average). In the revised paper, more decriptions are added to the paper on Page 2, the second paragraph of Section 2, marked in red.

1) The notation is sometimes inconsistent, e.g. in equations (3), (4) and (5) there is some confusion regarding t and Δt in the differential congestion indicators. More importantly, the DPCI (1) is presented as a Markov process, but it is actually not Markovian unless further assumptions are made on L(t). Fortunately, the Markovian property is not exploited at all in the paper.

The notations t is used when describing continuous-time process, while notation Δt for discrete-time process, we have added more declaration in Page 4, Line 105, 110, and 114. 

In the revised manuscript, we have removed the description about Markovian property.

2) The cumulative DPCI presented in (6)-(9) is basically a moving average, which is not novel. It is rather a well known mechanism for smoothing a volatile signal. The key observation is, as the authors recognize, that these quantities depend crucial of a manual parameter, θ, acting as a hysteresis factor, and its effect is not discussed at all in the paper.

Indeed the proposed model has a "moving" property, however, it is not moving average, as defined in Section 3. The stability of parameter θ has been addressed in detail in Experiment results presentaton on Section 4. In the revised paper, more details of the effect and importance of parameter θ have been added on Page 3, second paragraph in Section 3.1.1. 

3) From an implementation point of view, how is latency measured between two endpoints in a path? I guess it's simply the RTT in the transport protocol (e.g., TCP) but this is not done so in the experiments, where ping is employed. Are ping probes necessary in parallel to measure latency?

We use RTT measured by ping as the latency, as presented in Section 4, from page 6 to 7. In the revised manuscript, more detail was added on Page 7, Line 207-215, and marked in red. 

4) Given that, increasingly, delay-based congestion control algorithms are being used in the Internet, and these suffer from less congestion episodes than loss-based congestion control protocols, to what extent is the proposed method appropriate in modern networks?

In our experiment, we choose delay measurement as the metric, for according to Hayes et.al [1], delay provide more timely feedback than packet loss information on the state of network. From our experiment data, packet-loss rate is in direct proportion to latency when channel is extremely congested. We think it is an interesting direction for future work.

[1] D. A. Hayes and D. Ros, "Delay-based Congestion Control for Low Latency," 2013.

5) The experiments presented use the Abilene topology as an example, and quite old traffic traces. While the topology might be acceptable (even if the current topology of Internet is more 'flat' the 20 years ago), the traffic traces are outdated and not representative of typical applications today.

The focus of this paper is on a backbone network topology rather than trying to capture the Internet-wide, cross-domain structure. Indeed the traffic dataset is dated in 2003, which is a little outdated, but it only serves as a baseline, and we has tried out different congestion situations when traffic load is 5, 7.5, and 20 times of channel bandwidth, and different experiment results suggest the same evidence of the effect of proposed PSI. 

6) In Figs. 5-8, it is said that 'different parameters θ' are compared. Which ones? No mention or discussion accompanies in the main text.

In the revised paper, the captions of Fig. 5-8 are revised. 

 

Reviewer 2 Report

The authors have proposed a solution based on prediction-based model for consistent adaptive routing in back-bone networks that is capable of measuring the consistency of path latency difference. By learning the history latency of all optional paths, PSI is able to predict the onset of an obvious and steady channel deterioration, and make the decision to switch path. The topic is really interesting specially for dense edge networks with cloud integration. The proposed solution is supported with the results. However, I would like to ask authors to add the following information. 

  1. The authors have mentioned the use of Docker platform, however very less information is provided about the use, hosting and containerization. This information should be added in detail. 
  2. Since the author's main focus is latency. What will be the impact on latency if cloudlets (Docker) are hosted near to the edge nodes as compared to a central cloud (Docker)? Resources, optimization, density of the network nodes should be discussed.  
  3. The authors should compare their results with already available techniques, lets say in Fig. 4, with AI based predictive routing. That will give the readers an idea about the complexity of the algorithms vs. performance. 

Author Response

The authors would like to thank the reviewer for his/her helpful and insightful comments. We have carefully improved the quality of the manuscipt by taking all the comments into account. The modification of the manuscript and the responses to the reviews point-to-point are described as follows. 

The authors have proposed a solution based on prediction-based model for consistent adaptive routing in back-bone networks that is capable of measuring the consistency of path latency difference. By learning the history latency of all optional paths, PSI is able to predict the onset of an obvious and steady channel deterioration, and make the decision to switch path. The topic is really interesting specially for dense edge networks with cloud integration. The proposed solution is supported with the results. However, I would like to ask authors to add the following information. 

1. The authors have mentioned the use of Docker platform, however very less information is provided about the use, hosting and containerization. This information should be added in detail. 

In the revised manuscript, more details of the Docker platform is provided on Page 6-7, Line 187-191.    2. Since the author's main focus is latency. What will be the impact on latency if cloudlets (Docker) are hosted near to the edge nodes as compared to a central cloud (Docker)? Resources, optimization, density of the network nodes should be discussed.  
An edge-based cloudlet approach would definitely help reducing latency of particular traffic flows if they exploited locality characteristics but the backbone network could/would still experience high congestion levels that would impact all traffic aggregates over its connecting POPs. In this study, we have focused on improving the latency of such aggregates.   3. The authors should compare their results with already available techniques, lets say in Fig. 4, with AI based predictive routing. That will give the readers an idea about the complexity of the algorithms vs. performance.    In the revised paper, the comparization between AI-based approaches with our proposed one on Page 3, second paragraph, marked in red. 

 

Reviewer 3 Report

The authors of this paper study the problem of selecting paths that stay consistently optimal for a long term in extremely congested situations. To solve that issue, a model that can measure the consistency of path latency difference is proposed and evaluated.

The work is quite interesting and deserves publication. However, some minor comments should be taken into consideration:

1) First of all, it is necessary to revise the manuscript since there a lot of grammatical/syntax errors that should be taken into account. An indicative (not full) list is presented below

Page 2, line 33, revise the phrase: “Although numbers work has been…”

Page 93, the acronym RTT is not defined (although it is a quite well known acronym).

Page 3, line 104, revise the phrase: “…from the definition Eq. 1…”. Also, for a reader who is not familiar with the subject, it would be nice to explain why it is obvious that ω has the Markov property in (1).

Page 3, line 113, revise the phrase: “…we are able to describe the consistency of the relative path congestion situation use the…”

Page 4, line 117, revise the phrase: “…can be get from…”

Page 4, line 146, revise the phrase: “…indicators may becomes…and oscillations happens”

Page 5, line 147, revise the phrase: “They are depend on…”

Page 5, line 149, revise the phrase: “In our evaluation experiment, where the traffic data used…”

Page 5, line 163, revise the phrase: “…are 1, it happens when both…”

In line 165, the number of the Table is missing.

Line 172, revise the phrase: “…with topology the…”

In line 179, the acronym PSI should be used (and not Path Swap Indicator). The same applies to other lines as well.

Lines 180-181, revise the phrase: “…what is the proportions of path  i’s congestion situation is really worse…”

Line 184, revise the phrase: “…before the indicator change…”

Line 206, revise the phrase: “A tool to active measure…”

Line 210: the acronym OSPF should be used.

Line 213, revise the phrase: “…peak traffic severely exceed…”

Line 220, revise the phrase: “…and in each networks…”

Line 340, revise the phrase: “…we proposed an Markov…”

2) Some future directions should be included in the conclusion. In that case is it possible to discuss how difficult it is to incorporate the determination of call blocking probabilities in your method?

Author Response

The authors would like to thank the reviewer for his/her helpful and insightful comments. We have carefully improved the quality of the manuscipt by taking all the comments into account. The modification of the manuscript and the responses to the reviews point-to-point are described as follows in red. 

The authors of this paper study the problem of selecting paths that stay consistently optimal for a long term in extremely congested situations. To solve that issue, a model that can measure the consistency of path latency difference is proposed and evaluated.

The work is quite interesting and deserves publication. However, some minor comments should be taken into consideration:

1) First of all, it is necessary to revise the manuscript since there a lot of grammatical/syntax errors that should be taken into account. An indicative (not full) list is presented below

Page 2, line 33, revise the phrase: “Although numbers work has been…”

In the reivsed paper, the phrase has been revised, and marked in red. 

Page 93, the acronym RTT is not defined (although it is a quite well known acronym).

In the reivsed paper, the defination has been provided on Page 3, Line 102. 

Page 3, line 104, revise the phrase: “…from the definition Eq. 1…”. Also, for a reader who is not familiar with the subject, it would be nice to explain why it is obvious that ω has the Markov property in (1).

In the revised paper, the phrase has been revised. As suggested by another reviewer, we have removed the description about the Markovian properties from the paper.

Page 3, line 113, revise the phrase: “…we are able to describe the consistency of the relative path congestion situation use the…”

In the reivsed paper, the phrase has been revised, and marked in red.

Page 4, line 117, revise the phrase: “…can be get from…”

In the reivsed paper, the phrase has been revised, and marked in red.

Page 4, line 146, revise the phrase: “…indicators may becomes…and oscillations happens”

In the reivsed paper, the phrase has been revised, and marked in red.

Page 5, line 147, revise the phrase: “They are depend on…”

In the reivsed paper, the phrase has been revised, and marked in red.

Page 5, line 149, revise the phrase: “In our evaluation experiment, where the traffic data used…”

In the reivsed paper, the phrase has been revised, and marked in red.

Page 5, line 163, revise the phrase: “…are 1, it happens when both…”

In the reivsed paper, the phrase has been revised, and marked in red.

In line 165, the number of the Table is missing.

In the reivsed paper, the Table has been moved to Abbreviation.

Line 172, revise the phrase: “…with topology the…”

In the reivsed paper, the phrase has been revised, and marked in red.

In line 179, the acronym PSI should be used (and not Path Swap Indicator). The same applies to other lines as well.

In the reivsed paper, the phrase has been revised, and marked in red.

Lines 180-181, revise the phrase: “…what is the proportions of path  i’s congestion situation is really worse…”

In the reivsed paper, the phrase has been revised, and marked in red.

Line 184, revise the phrase: “…before the indicator change…”

In the reivsed paper, the phrase has been revised, and marked in red.

Line 206, revise the phrase: “A tool to active measure…”

In the reivsed paper, the phrase has been revised, and marked in red.

Line 210: the acronym OSPF should be used.

In the reivsed paper, the phrase has been revised, and marked in red.

Line 213, revise the phrase: “…peak traffic severely exceed…”

In the reivsed paper, the phrase has been revised, and marked in red.

Line 220, revise the phrase: “…and in each networks…”

In the reivsed paper, the phrase has been revised, and marked in red.

Line 340, revise the phrase: “…we proposed an Markov…”

In the reivsed paper, the phrase has been revised, and marked in red.

2) Some future directions should be included in the conclusion. In that case is it possible to discuss how difficult it is to incorporate the determination of call blocking probabilities in your method?

In the revised paper, future work has been added to the end of Section 5, on Page 16, Line 372-374. 

 

Round 2

Reviewer 1 Report

The authors have made an important effort in improving the contents and clarity of this second version of their manuscript, clarifying many of the previous concerns. However, there still remain some issues which induce doubts:

1) The mathematical notation is still confusing or sloppy. E.G., in (5) Δt is used both as an argument and as a (discrete) time index, and subindices are missing in the sum. the same happens in (6) and (7) and (8), and all along the text.

2) The thresholds are imprecise. In linee 133 θ = 20: in what units? Later, in line 162, θ_s+ = 6% is a relative number.Same in Table 3.

3) The quantities S_w in (8) and (9) are cumulative sums of 0-1 binary variables (s_w) over a time window T_s. Thus, their value can be greater than 1, depending on the evolution of the indicator variables. Therefore, why in Table 1 only values 0 and 1 are considered for I_Δt? This needs explanation.

4) Overall, the proposal in interesting, but the basis is just a a PI controller, where the integrator signals drive the decisions to switch between the primary and the backup paths. I believe the novelty is incremental.

Author Response

The authors would like to thank the reviewer for his/her helpful and insightful comments. We have carefully improved the quality of the manuscipt by taking all the comments into account. The modification of the manuscript and the responses point-to-point to the reviews are described as follows, marked in red. 

The authors have made an important effort in improving the contents and clarity of this second version of their manuscript, clarifying many of the previous concerns. However, there still remain some issues which induce doubts:

1) The mathematical notation is still confusing or sloppy. E.G., in (5) Δt is used both as an argument and as a (discrete) time index, and subindices are missing in the sum. the same happens in (6) and (7) and (8), and all along the text.  

In the revised paper, we have fixed the typos, as marked in red.  

2) The thresholds are imprecise. In linee 133 θ = 20: in what units? Later, in line 162, θ_s+ = 6% is a relative number.Same in Table 3.

Parameter θ represents the Latency Differential Sensitivity, in milliseconds. In the revised paper, we have added the units, as shown in Page 4, Line 131, marked in red. 

Parameter θ_s+/- is a different parameter representing Latency Consistency Sensitivity, and represent the frequency that S_w increase/decrease during a certain time period, as stated in Page 4, Line 143. In the revised paper, we have deleted the % of the values of θ_s+/-.

3) The quantities S_w in (8) and (9) are cumulative sums of 0-1 binary variables (s_w) over a time window T_s. Thus, their value can be greater than 1, depending on the evolution of the indicator variables. Therefore, why in Table 1 only values 0 and 1 are considered for I_Δt? This needs explanation.

In the revised paper, we have fixed the definition of S_w.

4) Overall, the proposal in interesting, but the basis is just a a PI controller, where the integrator signals drive the decisions to switch between the primary and the backup paths. I believe the novelty is incremental.

We claim our methodology differs from a PID controller since it is structured around a nested two layers of moving cumulative frequencies (Eqs. 1, 5, 6, 7, 8, 9) and a set of logical rules for definition strategy (Table 1), rather than the propotional-integral-derivative of PID control. In addition, PID controllers are used in linear and symmetric systems and, in particular, systems with constant time dynamic behaviour. On the contrary, route fluctuations due to extreme congested traffic load are not linear and exhibit time-varying dynamic behaviour. We trust our proposed methodology and its application domain are novel and, to the best of our knowledge, have not been previously used in solving network route fluctuation. 

 

 

 

Reviewer 2 Report

The authors have revised the manuscript and incorporated all the necessary information. I recommend to accept this article. 

 

Minor Correction: There are some citations on page 2, that are missing. The original manuscript file should be compiled correctly before creating the final manuscript.  

Author Response

The authors would like to thank the reviewer for his/her helpful and insightful comments. We have carefully improved the quality of the manuscipt by taking all the comments into account. The modification of the manuscript and the responses to the reviews are described as follows, marked in red. 

The authors have revised the manuscript and incorporated all the necessary information. I recommend to accept this article. 

Minor Correction: There are some citations on page 2, that are missing. The original manuscript file should be compiled correctly before creating the final manuscript.  

In the revised paper, we have fixed this error.

Round 3

Reviewer 1 Report

The authors have made important corrections and clarifications in the manuscript, which is now consistent and complete. Therefore, it is recommended for publication. Just a minor observation: line 72 still contains missing references.

Author Response

The authors would like to thank the reviewer for his/her insightful comments. Please kindly find the response to the comments below, marked in red.

 

The authors have made important corrections and clarifications in the manuscript, which is now consistent and complete. Therefore, it is recommended for publication. Just a minor observation: line 72 still contains missing references.

In revised paper, the authors have recompiled and double checked the manuscript. All references are updated through the submitted .bib and .tex file.

 

Back to TopTop