1. Introduction
Concrete is one of the most important materials in the construction industry and is the most used structural element in the world. It can be defined as a composite material that consists of a binding medium within which particles or fragments of the aggregates are incorporated. It is usually composed of Portland cement, inorganic material, aggregates (granular materials divided into coarse and fine according to the fineness modulus of the particles), and water (responsible for the kneading and chemical reactions in the cement, causing it to harden and increase the strength of the material [
1,
2]).
Ethylene vinyl acetate (EVA) is a polymeric material made from oil and extensively utilized in the sport and footwear industry. In these sectors, the disposal can account for up to 20% of the total EVA consumption. Depending on the vinyl concentration, EVA becomes thermoset, rendering it unsuitable for recycling, as it cannot be melted or remolded. Consequently, the environmental impact of discarding EVA into the environment, like many other plastic materials, is a grave concern. Given the substantial volume of EVA waste, extensive storage spaces are required, often leading to the use of open landfills, which increases the risk of environmental contamination, including the emission of toxic gases resulting from EVA combustion [
3].
One viable solution to mitigate this environmental issue is the reuse of EVA in other industries, particularly in civil construction. EVA possesses favorable characteristics, such as good processability, thermal stability, impact resistance, fatigue resistance, resilience, toughness, and flexibility [
4,
5,
6]. As a result, numerous studies have been conducted to explore its potential as an aggregate in the civil construction sector, aiming to produce lightweight concretes with good acoustic and thermal performance [
7,
8,
9].
However, it is worth noting that one drawback of lightweight concrete is a reduction in its compressive and tensile strength [
10,
11]. To address this limitation, one of the alternatives is to incorporate fibers that optimize the distribution of internal stresses in the concrete, thereby bolstering its resistance to demands on the material [
2,
3,
12,
13].
As new materials are introduced and incorporated into production processes, the need arises to characterize these materials and comprehend how their internal characteristics influence the material properties. X-ray computed microtomography (Micro-CT) is one of the non-destructive analysis techniques employed for this propose. Micro-CT allows the acquisition of internal images of objects in two and three dimensions, enabling the analysis and characterization of materials through the use of digital image processing (DIP) techniques. In the literature, numerous studies demonstrate the frequent use of DIP to analyze and characterize features like pores in concrete samples. These studies not only show the properties, performance and efficiency of the concrete but also detect internal elements, such as fibers, in the cement matrix [
14,
15,
16,
17,
18,
19,
20,
21].
However, the characteristics of the constituent elements of lightweight concrete, namely, pores, EVA grains, and piassava fibers, exhibit a diffuse nature that poses challenges when classifying them using Micro-CT images and conventional DIP techniques [
22]. In this context, the utilization of a fuzzy-logic-based system becomes necessary, as it accommodates degrees of uncertainty associated with object detection, unlike crisp-logic-based systems [
23]. A fuzzy system considers the fuzzy nature of object detection [
16,
24,
25,
26].
This study aims to apply image processing techniques and a fuzzy inference system to analyze and classify lightweight concrete containing EVA and piassava fibers. The proposed methodology encompasses three key steps: (1) image segmentation; (2) feature extraction; and (3) development of a fuzzy system to classify the internal objects within the samples.
One of the primary challenges in this task is the absence of precise reference values for assessing the accuracy of the estimates related to different composition features. This is due to the spatial heterogeneity of specimens, stemming from the variable morphology and granularity of the concrete components. Although there is a concrete homogenization process during production, the volumetric density of each component within a specimen is not constant. Consequently, the composition of an extracted core may vary depending on its extraction location and have significant lower or higher volumetric fractions than the average expected from the concrete mix. Therefore, quantitative evaluation of estimation errors is not feasible; we can only assess weather estimated values are “similar” to the theoretical values expected based on the mix.
The primary contribution of this work is the presentation of a classification model based on the Takagi–Sugeno–Kang (TSK) inference system, wherein the inputs are fuzzy and the output is crisp. Additionally, this approach’s application in material characterization, specifically in lightweight concrete, is unprecedented.
The structure of this article is as follows:
Section 2 describes the materials used to carry out the research, along with the segmentation, feature extraction, and fuzzy inference system steps.
Section 3 presents and discusses the obtained results, and
Section 4 provides the conclusions of this research stage and offers suggestions for future studies.
3. Results
As briefly outlined in the introduction, in this problem the true values of the variables that are estimated with the developed method are not available, so it is not possible to calculate the error of the estimates necessary to evaluate the accuracy of the method. This is due to it being a core extracted from a highly heterogeneous concrete specimen that is scanned in the tomography scanner at core-size scale, so the actual composition of the core is unknown. For the analysis the specimen, volumetric compositions were used as reference value.
The difference between the values estimated in the core with our method and the reference values of the specimen consists of two components that are not measurable in practice; the first component is due to imprecision of the method used, and the second component is due to the difference between the composition of the core and the specimen.
In this context, the objective of the analysis was divided into three parts:
Quantification of the magnitude of the differences between estimated and reference values, supporting the establishment of the range of variation of the different measured variables. Therefore, when a core is analyzed and the percentage of each component is estimated, the interval to which its percentage belongs in the specimen can be inferred.
Identification of probable systematic deviations that may signal bias in the classification method. The identification and characterization of systematic deviations serves two purposes: (1) to signal the need to improve the method in specific processes, and (2) to carry out a calibration of the reading that corrects the identified deviation.
Assessment of the degree of correlation between the estimated and reference values. If there is no reasonable degree of correlation between the estimated and reference values, it is not feasible to infer the composition of the specimen from the estimated value in the core.
Accordingly, the reference values and estimates of the percentages of objects (EVA + pore + fiber) and gravel resulting from the first step of the pipeline (
Figure 3) are shown in
Table 4, and the reference values and the estimates of percentages of EVA, pore, and fiber, resulting from the last pipeline step (
Figure 7) are shown in
Table 5.
The estimated and reference percentages of gravel, EVA, pore, and fiber shown in
Table 4 and
Table 5 are plotted in
Figure 8 to obtain a qualitative assessment of the results. The rhombuses represent the reference values and the squares represent the estimated values of the percentages of gravel, pore, EVA, and fiber. In this type of radar or spider web graph, the plotted variable grows radially, such that the differences between the estimated and reference values are measured along the radius.
The method overestimated the content of gravel in all the samples with additives, while it underestimates the EVA content in most samples. Although a more detailed analysis needs to be performed, this suggests that in the current configuration, a part of the EVA is classified as gravel. If this is confirmed, changes should be made to correct this problem in segmentation with K-means (see
Figure 3). On the other hand, from a practical point of view, it represents a systematic deviation that can be corrected with adequate calibration.
Estimates of porosity and fiber content do not show a systematic deviation, indicating that the difference between the predicted and reference porosities are the smallest of the four variables studied, while in the case of fiber the largest differences are obtained. In the latter, the estimated values are not correlated with the reference values, which leads to such large differences. The probable cause of this lack of correlation between the estimated and reference values is that 1% of fiber is below the detection threshold of our method, such that the estimated values lack practical value. For this reason, subsequent analysis focuses on the gravel, EVA, and pore content. Determining the fiber detection threshold is a future study to be carried out.
Figure 9 shows the estimated values corrected by calibration and the reference values for crushed gravel and EVA contents. The calibration was performed by translation, that is, adding the constants
.
and
.
to the estimated percentages of gravel and EVA, respectively. The constants were calculated using the formula
where
and
are estimated and reference values of the variable
v in core
i, respectively, and
n is the number of cores studied. The corrected estimates of the variable
v are calculated as
, fulfilling
.
In addition to enabling correction of the systematic deviation in the measured variables, the applied calibration procedure enables measurement of the size of the absolute deviation; that is, we know that our method overestimates the gravel content by approximately 12% and underestimates the EVA content by approximately 2%. In the remainder of the article, when referring to the estimated gravel and EVA contents, we use the calibrated values. The porosity estimates were not calibrated. In
Table 6, we list the estimated and reference values that will be the object of subsequent analysis in this study.
The differences between estimates and reference values of the volumetric contents in the lightweight concrete components are not only out of the imprecision of the developed component quantification method. Nevertheless, it is necessary to establish a resolution concept for the method from the difference between the estimates and the reference values. In other words, we will assume that resolution of the method increases when the difference between estimates and reference values decreases. Thus, the higher the resolution, the lower the likely range of variation in the composition of the specimen studied around the values estimated from the analysis of its core with the method developed.
It is important to emphasize that even in the ideal case of null imprecision of the method of analysis of the core, there will be a difference between the estimates and the reference values due to the compositional difference between the core and the specimen, such that the ideal resolution does not imply the absence of uncertainties.
To compare the estimates with the references, it is necessary to establish a relative measurement that considers the difference in absolute value between the estimate and its reference in relation to the reference value. According to this, we define the mean absolute percentage difference (MAPD) using the same notation as Equation (
10), of the form (the formulation is similar to the mean absolute percentage error (MAPE)).
The MAPD calculated for each variable measured in our method is an adequate measure to characterize the quality of our method in the range of variation of the composition of the specimens, that is, with EVA contents ranging between 0% and 25%, fiber content ranging between 0% and 1%, and volumetric gravel content equal to 44.2% for EVA and fiber contents (see
Table 2).
However, MAPD has a natural limitation, which is that it can only handle cases in which the reference variable is not null, to avoid division by zero. Therefore, the case studies in which were not included in the calculation of the mean differences or in the variation of the differences in what follows. That is, this analysis leaves out four (CR, EF5, EF15, EF25, and EG15) of the nine test cases for fiber and one test case (RC) for EVA.
Table 7 presents the mean absolute percentage difference (MAPD) between the estimated and reference percentages of the volume of pores, gravel, and EVA, considering the nine samples (except EVA where the reference concrete was not included, leaving only eight samples). The MAPD for fiber was not calculated for the reasons explained above.
MAPD varies between 11.3% for the gravel volume and 31.0% for the EVA content, which defines an acceptable range, taking into account that the composition of the cores scanned with Micro-CT can differ significantly from the composition of the specimen from which it was extracted. Establishing a dependence between MAPD and the quality Q of the estimates, such that , we would have qualities ranging between 69.0% and 88.7%. It is worth highlighting that in this study, we describe the results of the first version of the method, which may be subject to future improvements.
Furthermore, it was found that the absolute percentage difference between the reference percentages and those estimated for EVA, pore, and gravel, decreases with the increase in the content of additives (EVA + fiber), as shown in
Figure 10. That is, the quality is superior when analyzing concretes with more content of light aggregates, which is the objective in civil construction practice: to add the largest quantity of light aggregates possible without compromising the mechanical strength characteristics of the structures.
The percentage decrease rates of the relative differences (coefficient a, in the linear model equation y = a*x + b) vary between for gravel content and for EVA volume. In other words, the relative difference between the reference values and those estimated for the volume of gravel decreases 0.475%, on average, for each 1% increase in the EVA + fiber content, decreasing 2.842%, on average, in the case of the EVA volume.
These rates of decrease are statistically supported by the significant anti-correlation observed, with coefficients varying between
for pore content and
for EVA volume. The standard deviation of the differences in relation to the linear model was confined to the range of
for gravel content to
for EVA content. The values of the rates of decrease in the relative differences between the estimated and the reference values, as well as the correlation with the content of additives (EVA + fiber) and the standard deviation in relation to the adjusted linear model, are listed in
Table 7. These results confirm that the relative difference between estimates and references decreases as the proportion of light aggregates increases.
From a practical point of view, it is also very important that there is a good correlation between the estimated and reference values. To assess the degree of correlation between the reference volumes and the estimates of the lightweight concrete components in the nine specimens studied, we used three different metrics:
The linear correlation coefficient
, defined as:
where
is the covariance of the data, and
and
are the means of the reference and estimated values, respectively;
is the root of the standard deviation of the reference values; and
is the root of the standard deviation of the estimated values.
The coefficient varies between −1 and 1 where the positive sign indicates that the independent variable y increases when the independent variable x increases and vice versa for the negative sign. However, absolute values close to 1 indicate that the points align, forming a line, increasing the dispersion of the points in relation to the line when the absolute value of decreases. In the present study, positive values close to 1 indicate a good alignment of the reference and estimated values, which is one of the requirements for the application of the developed method.
The root of normalized root mean square deviation nRMSD >= 0, defined as
where
is the root mean square deviation and
is the mean of the reference values.
The RMSD serves to aggregate the magnitudes of the differences between the estimated and reference values for several specimens in a single measurement. It is a widely used measurement to assess the accuracy of predictions, but it depends on the scale. For this reason, in this study, we used the version normalized by the mean value of the reference, which generally varies between 0 and 1 in different applications, although it can exceed 1 in cases with a very low correlation between the estimated and expected data. In our case, we require low values of , much lower than 1.
The concomitant use with is due to the fact that although both measurements measure the distance between the estimated and reference values, they accomplish this differently: with bivariate and univariate as they capture different characteristics of the compared data.
Finally, after correcting for systematic deviations, the estimated values must be proportional to the reference values and vice versa, and the proportionality coefficient must be close to 1. In other words, the coefficient of the proportional model
must be close to 1. Applying minimum squares, the formula for
a was obtained, such that
Including the proportionality coefficient is strictly necessary, as neither or are affected by the dependence slope or the displacement of the line that represents the relationship between the estimated and reference values. In other words, two linear dependence functions and , with and , can have the same and correlation coefficients. This indicates that these two metrics do not carry information on the type of linear dependence that relates the two variables. In this case, the dependence between the reference values and the estimated values being proportional is a requirement to validate the developed method, which implies that considering the null intercept, , the linear coefficient of the regression is close to 1, .
The calculated metrics are shown in
Table 8, and in
Figure 11 we plot the reference values on the vertical axis and the estimated values for the pore, gravel, and EVA contents on the horizontal axis, together with the proportional model obtained by regression in each case.
Analyzing the component with the best of each of the metrics, it can be noted that the EVA estimates have the closest coefficient to 1 (
), the porosity estimates have the highest correlation (
), while the gravel volume estimates have the lowest
. Under these conditions, it is difficult to conclude which of the three compositional variables was the best estimated. Therefore, we introduce a quality metric of estimates,
, which considers the three basic metrics in an integrated manner. The quality is defined according to the harmonic mean of the deviations from the optimal value of each metric, as follows:
This metric varies between 0 and 1 and gives equal weight to all the integrated metrics, serving as an indicator to compare the performance of concrete characterization methods.
It can be seen that the developed method achieves significant quality values, varying between .8732 for gravel content and .9247 for porosity, with a mean quality of .
Despite the satisfactory obtained results, some drawbacks can be considered in the proposed methodology. First, the descriptor set can be increased to include some other features, such as color and texture. Also, a hierarchical classification between the descriptors must be made since now all features have the same weight.