Concrete Cracks Detection and Monitoring Using Deep Learning-Based Multiresolution Analysis
Round 1
Reviewer 1 Report
This article presents a novel implementation for crack monitoring in concrete structures. Please find my comments as follows:
- In Abstract we have some typos
- Please improve Introduction with literature about CNN and related works.
- Please give some figures of NDT experimental images and also improve Discussion subsection and give more details about your approach.
- Improve Conclusions section
- Some minor comments: In general all the text has formatting issues - Section 3, paragraph formatting.- line 501 text typos- line 124 typo- Figure 10 formatting- lines 222-227- line 213
Author Response
Thank you for your thorough review of our paper.
Please find below our responses given point-by-point.
Please also refer to the revised manuscript whose changes are in red.
Author Response File: Author Response.pdf
Reviewer 2 Report
The article is devoted to the topic of automatic detection of hidden defects in concrete structures. As the main contribution, the authors declare a new method of automatic detection, based on the use of a convolutional neural network, the input of which is wavelet multiresolution analysis. The topic of the article is interesting, but at the beginning of this review I must state that the article does not meet the qualitative conditions so that it can be recommended for publication in the journal Electronis. In addition to unconvincing English, the main shortcomings of the article are the following:
(1) Unbalanced content structure of the article.
- Too much space is devoted today to a trivial discussion of the principles, advantages and disadvantages of wavelet analysis (eg lines 133-169) and neural networks (almost the whole chapters 2.3.1 and 2.3.2), although in the experimental part they used standard CNN AlexNet and ResNet50.
- The experimental part of the work is insufficiently described. How much data was obtained in the experiment described in section 2? What were the input and output signals? The authors give one example of a scalgram - was it the only entry into a convolutional neural network? How the data were distributed in the training and testing of neural networks, i. what was the training, evaluation and testing part like?
(2) I consider it a big mistake to use picture no. 2. Obviously this is a software-edited photo (even of poor quality), which was downloaded from the Internet !!!
(3) The methodological aspect of the experimental part is controversial. Although the authors compare the accuracy of detecting visible and invisible material defects, they are based on two incomparable image databases.
Author Response
Thank you for your thorough review of our paper.
Please find below our responses given point-by-point.
Please also refer to the revised manuscript whose changes are in red.
Author Response File: Author Response.pdf
Reviewer 3 Report
First of all, I would like to notice the paper is original research work without plagiarism suspicion. The non-destructive technique of concrete crack detection attracted much attention being a powerful tool for monitoring also. Ultrasonic wave propagation methods have good practical implementations and electronic devices based on that principle have reasonable cost and thus can be widely applied.
However, I suggest several comments could improve the readability of this paper.
1. I suggest spell and typos checking to avoid "on a n this paper" (line 16) and others.
2. Rethinking and reorganizing Section 2.3 is strongly recommended. On the strong background of the previous Section 2.2 (it is a well-written one), this Section looks like a student's lightweight discussion of deep learning technologies. And the title of subsection 2.3.1 "Neuronal Networks" is abstract without any regarding of the main context. Figures 8 and 9 are trivial examples and I think most readers will just slide that text with general pictures down. I strongly recommend giving real examples closely related to the main topic.
3. A similar problem concerns Figures 11, 12, 13. I think the readers of high-quality journal Electronics should be aware of RELU function and max-pooling operation in a general case. I recommend changing those figures to similar pictures that reflect operations with numerical values given from a really constructed ANN architecture.
4. And finally, I think the whole monitoring system structure can be presented in the resulting section. It can be not a system structure but a flow diagram, for example. This Figure can help to understand the proper functioning of the proposed monitoring procedure (or system).
Author Response
Thank you for your thorough review of our paper.
Please find below our responses given point-by-point.
Please also refer to the revised manuscript whose changes are in red.
Author Response File: Author Response.pdf
Reviewer 4 Report
An undoubted achievement is the creation of your own image database, but without making it public it is difficult to refer to the published results. We do not know what the scale of difficulties is in relation to the well-known base SDNET2018 dataset.
It is a good idea to use the F1 indicator defined in your work to evaluate the results. Figures 8 and 9 add nothing (are well known) and can be removed.
Figures 8 ,9, 11 add nothing (are well known) and can be removed.
Instead of the above-mentioned drawings, it would be useful to present a more spectacular presentation of the developed image database.
"The procedure for these images is described in Section 2. This is our main contribution here." - Section 2 - Has its own subtitle, wouldn't it be better to use it here.
Author Response
Thank you for your thorough review of our paper.
Please find below our responses given point-by-point.
Please also refer to the revised manuscript whose changes are in red.
Author Response File: Author Response.pdf
Reviewer 5 Report
- typo – a n? What is this trying to say?
- Which multi-resolution analysis is it based on?
- Which studied material?
- Several types? Which types?
- Describe dedicated wavelet
- Several scales? Which scales?
Describe your dataset. What the classifications are
- Top-1 accuracy?
- This closing sentence is currently irrelevant.
- Reference?
- mechanical behavior – What do you mean?
- Reference?
- What are the equally devastating factors. Mention them here if you bring them up.
- Reference?
- Reference?
- Clean up list formatting here. Honestly, I’d scrap line 81 – 91 all together.
- Space Figure3
- What sensors? Describe in detail. Is there a max/min width? – I see you describe it later. Reformat to bring them sensors up and describe them in the same paragraph
- Evolution is not the right word here.
- Space [24]of
- Describe this replacement in detail
- Unclear
- Reference?
- Why are these objectives important?
- Reference?
- Again. Make sure this process is described in detail. Somewhat repetitive.
- So-called? Are the called that or not? What does “so-called” add to your paper
- Reference?
- Called a wavelet, because
- Reference
- Such a way – what way. Be specific. Your paper is written in a way that the reader has to go and gather information from multiple other sources
- This section so far is long-winded and unclear. I recommend breaking the list down into subheaders and describing each item within the subheader
- in various fields do what?
- Reference
- This section should have it’s own subsection. Provide an introduction paragraph an closing paragraph of the main takeaways
- Are not all sections of your paper important?
- Neural Networks
- This section needs references throughout!! You’re missing over 30 sentences that require references.
- Absolutely not. Neural networks do not mimic the function of the human brain neurons. Neural networks are software. Neural networks are a function approximator. They use an action potential, but that is the only similarity to the human brain.
- No. Each neuron does not have a digital input and output. Each neuron has a weight modified through gradient decent. Can think of the weight like m in Y=mx+b
- What do you mean, behavior. Be specific.
- Transfer function? Do you mean activation function?
- Delete with or without loops.
- Which basic model? You’re describing a node below.
- Describe ReLu or Sigmoid. Those are the most common activation functions. Which did you use?
- Diffused? I don’t know what you mean.
- Each neuron has a set of parameters? What do you mean. This section does not have a single reference. What do you mean learning, or training?
- What do you mean? Which functions are more expensive than others?
- Type of connection? What other types of connections are there other than the propagation of latent features through the weights?
- Design? Do you mean architecture? Oh. No you don’t. Just say training a network requires…
- Just training. No one calls it the learning phase.
- No. The model never learns the output classes. It maps the weights to the provided labels using gradient descent with backpropagation.
- Exploitation? I’ve never heard this term before. Are you referring to validation? Or just running the model?
- No. You are not searching for weights. You are taking the derivative relative to your error to modify the weights in a direction suitable to reducing your loss.
- All examples is extremely rare (and frankly means you’re overfitting).
- Does “ this is basically” add anything your paper?
- Reference. What kind of diagnostics? Be specific
- This isn’t a different “configuration”.
- References.
- Why is this section here? It adds nothing as you move to overfitting
- It’s not a phenomenon. Bad dimensioning? That’s not the correct term
- Why?
- Remove learning by heart
- References. Why?
- Not be too important? What do you mean?
- Your going from list to list and your flow is extremely confusing. This is written like an undergrad taking notes.
- Is “by definition” necessary here? You said this already
- This is in French?
- You said this already
- You said this already
- You haven’t mentioned a differentiable function at all yet.
- This is your first mention of a loss function.
- What is cross-entropy loss?
- Your first mention of back-propagation.
- Reference
- CNNs have actually been around since 1979. See fukushima1979
- How do they “detect their features”. What makes CNNs unique? CNNs are the classifier. It makes no sense to say “then train a classifier” as if they’re different.
- Machine learning methods do not anything “by hand”. That is the definition of machine learning.
- “in fact” why is this here? What do you mean linear filtering? Describe in detail.
- What 3 operations? Don’t just reference a figure.
- In your example it is crack vs non-crack (which you never formally defined). But it can be whatever output you want.
- “brick” not appropriate.
- “dragging”??
- How are these filtered updated? What types of features do they learn? Is it the same at each layer of the network?
- “volume”??
- No. Depth is the number of layers.
- “pitch”?? I’ve never heard this term before.
- What is padding?
- BN Is not unique to CNNS.
- ReLU is not unique to CNNs. Why are they in this section?
- Also helps improve generalization
- Dropout is not unique to CNNS
- Describe bagging.
- “learn well”??
- What do you mean co-adaptation?
- Softmax is not unique to CNNs
- You haven’t described ResNet at all. Or transfer learning. Or referenced ResNet. You need to describe what transfer learning is. And also what skip connections are. And why you chose ResNet over other architectures.
- Remove intro paragraph here.
- Discussion?
Author Response
Thank you for your thorough review of our paper.
Please find below our responses given point-by-point.
Please also refer to the revised manuscript whose changes are in red.
Author Response File: Author Response.pdf
Round 2
Reviewer 1 Report
I am satisfied with your correction based on all reviewers' comments.
Some minor comments:
I suggest redesigning figure 3 (use modern style)
line 92 ... ?
All Section 3 paragraph formatting
line 618 numeric typos
line 634 rewrite the sentence
Reviewer 3 Report
The paper has been significantly improved. The authors have cleared the vague issues, have added relevant figures, and have done necessary comments. I think this paper can be recommended for acceptance in the present form.
Reviewer 5 Report
- What are the the crack types? How many classifications are you testing?
- “Machine is able to learn by itself”. What does this mean? Talk about the weights of a neural network modified through training using gradient descent with backpropagation
- “Most powerful deep learning architecture”. What do you mean most powerful? This terminology does not apply to neural networks as different architectures are often specialized for a specific task. CNNs are successful for images, but transformer models are also applicable to images and have excellent performance.
- No. CNNs do not automatically detect and extract features. Within the conv filters the model learn spatial patterning of pixel values representative of features that help solve the given classification task.
- What is pooling? Why is it valuable?
- …?
95. Cite additional papers more recent than 2012. - Unnecessary
- Don’t use the term “intelligent computer system”. It’s a function approximator.
- It is not and/or software. I’m not sure what you mean by this.
- All examples in the training set is unlikely.
- It’s not actually new. CNNs have been around since the 80s https://www.cs.princeton.edu/courses/archive/spr08/cos598B/Readings/Fukushima1980.pdf. AlexNet just brought modern computation to the approach
- particularity? rephrase
- …?
- Don’t put an ! in a paper.
- What is Keras?
- Avoid term such as “Nowadays”