3.1. Project Dataset
The database for this research includes 28 completed projects related to the construction of European Road Corridor 10 through the Republic of Serbia. This corridor traverses Serbia from the northern border with Hungary towards the south. Near the city of Niš, Corridor 10 branches into two routes: the southern branch leading towards Greece and the capital, Athens, and the eastern branch extending towards Asia Minor via Bulgaria. The construction of the Southern branch, which spans 74.2 km, commenced in 2010 and concluded in 2024, encompassing 10 sections and resulting in 10 contracts awarded through internationally standardized tender procedures. Additionally, two contracts were executed for parallel non-commercial roads intended for the local population, whose existing roads were affected by the new highway’s construction.
The eastern branch of Corridor 10, stretching 86.5 km, also began construction in 2009 and was completed by 2024. This section involved 16 contracts, similarly finalized following international financial institutions’ rules. The projects on this route included constructing a full-profile highway, seven tunnels, and numerous bridges. The associated contracts were signed under the Conditions of Contract for Construction, Multilateral Development Bank, Harmonized Edition, June 2010. The tender procedures adhered to guidelines set by the World Bank, EBRD Procurement Rules, and the procurement guidelines effective at the time of the finance contracts. The primary procurement system employed was a two-envelope system with post-qualification, while a pre-qualification system was used for only four procurements.
Project Characteristics
In project analysis, it is crucial to thoroughly examine specific aspects that can significantly impact the execution and success of each individual project.
In order to obtain a higher degree of accuracy when determining the probability of the occurrence of certain risk events, a list containing 25 project parameters was taken into consideration.
The list of parameters in this research was created based on a review of the available literature and a specially organized expert interview called Risk Management, which was conducted during a two-day interview and discussion. A total of 14 engineers, experts from various fields with many years of experience in road infrastructure projects in Serbia and foreign countries, participated in the expert interview. All interviewed experts had also participated in some of the projects subjected to this analysis, as representatives of the employer, engineers, or technical consultants.
The following characteristics provide a comprehensive overview of the key elements essential for evaluating and managing projects. For the analyzed 28 completed projects, the following project characteristics were known:
Accepted Contract Amount [EUR]—This represents the amount accepted in the letter of acceptance for the execution and completion of the works and the remedying of any defects. A letter of acceptance is a letter of formal acceptance, signed by the employer, of the letter of tender, including any annexed memoranda comprising agreements between and signed by both parties—employer and contractor. Values in euros were used for quantification;
Time for Completion [days]—This represents the time for completing the works, as stated in the appendix to tender, calculated from the commencement date. The number of days was used for quantification;
Landslides along the route [1, 2, 3, or 4]—As part of the project for the building permit, previous geological and geotechnical surveys and analysis were carried out. On the basis of these results, geological processes and phenomena were discovered, such as gully lines with erosion processes, conditionally stable slopes with occasional sliding and notch erosion, and unstable active slopes with shedding landslides and over-consolidation. Based on the types of these processes and the number of occurrences in the sections, each project was classified into the following four categories: 1—no landslides; 2—sporadic landslides (there are landslides nearby, but outside the section area); 3—moderate number of landslides (less than five landslides in the section area); 4—significant number of landslides (five or more landslides in the section area);
Archaeological sites along the route [1, 2, 3, or 4]—For all the projects that were processed, there was an obligation to prepare an environmental and social impact assessment, as well as an environmental and social management plan, within which, among other things, immovable cultural assets were defined. Based on the preconditions obtained from Institute for Protection of Cultural Monuments and whether the statutorily protected archaeological sites will be directly affected by the construction works, each project was classified into the following four categories: 1—no recorded cultural assets; 2—existing cultural assets nearby, but outside the section area; 3—necessary preliminary archaeological research (geomagnetic methods, georadar prospecting, and conducting archaeological control probes); 4—necessary archaeological excavation work;
Population density in the future route zone [n/km2]—For all the projects that were processed, there was an obligation to prepare an environmental and social impact assessment, as well as an environmental and social management plan, within which, among other things, demographic characteristics were defined. Population density in the future route zone was defined as the number of inhabitants divided by the zone of influence (radius of 5 km around the section);
Difference between the highest and lowest points on the route [m]—This represents the difference between the highest and lowest points of the terrain read from the longitudinal profile in relation to the vertical alignment of the road defined in the project for the building permit. These data show the topographic characteristics of the terrain that indicate the degree of complexity of the project. Three types of terrain were defined: plain (≤50 m), hilly (50–150 m), and mountainous (≥150 m);
Section length [km]—This represents the length of horizontal alignment from the layout plan of the road defined in the project for the building permit;
Percent of length of embankments on route [%]—This represents the ratio of the total length of all the embankments in a section in relation to the section length, expressed as a percentage. The length of the embankments is read from the longitudinal profile in relation to the vertical alignment of the road defined in the project for the building permit;
Percent of length of bridges on route [%]—This represents the ratio of the total length of all the bridges in a section in relation to the section length, expressed as a percentage. The length of the bridges is read from the longitudinal profile in relation to the vertical alignment of the road defined in the project for the building permit;
Percent of length of cuts on route [%]—This represents the ratio of the total length of all the cuts in a section in relation to the section length, expressed as a percentage. The length of the cuts is read from the longitudinal profile in relation to the vertical alignment of the road defined in the project for the building permit;
Percent of length of tunnels on route [%]—This represents the ratio of the total length of all the tunnels in a section in relation to the section length, expressed as a percentage. The length of the tunnels is read from the longitudinal profile in relation to the vertical alignment of the road defined in the project for the building permit;
Maximum height of cuts [m]—This refers to the situation when the elevation of the ground is higher than the elevation of the road alignment, and represents the highest distance between the terrain read from the longitudinal profile perpendicular to vertical alignment, and the vertical alignment of the road read from the longitudinal profile defined in the project for the building permit. This information indicates if it is justified to build a cut instead of a tunnel or gallery;
Maximum height of embankments [m]—This refers to the situation when the elevation of the road alignment is higher than the elevation of the ground, and represents the highest distance between the terrain read from the longitudinal profile perpendicular to vertical alignment, and the vertical alignment of the road read from the longitudinal profile defined in the project for the building permit. This information indicates if it is justified to build an embankment instead of a bridge;
Predominant material category along the route [A-1, A-2, A-3, A-4, A-5, A-6, or A-7]—A soil classification system was developed by the American Association of State Highway and Transportation Officials (AASHTO), and is used as a guide for the classification of soils and soil aggregate mixtures for highway construction purposes. Previous geological and geotechnical surveys and analyses were carried out as part of the project for the building permit. Based on these results, a geological and geotechnical layout and longitudinal plan, as well as cross-sections, were defined. Taking these data into account, the predominant material category along the route can be determined;
Number of collisions (box culverts, overpasses, watercourses, or utilities) [n]—Read from layout plan defined in the project for the building permit, the number of collisions with local roads (box culverts and overpasses), watercourses (bridges), railways (underpasses and overpasses), and electrical utilities (telecommunication utilities and local sewerage systems) was counted;
Type of foundation [shallow or deep]—Shallow foundations are applied in places where the soil can adequately support the intended load and include the following types: strip foundations, footings, isolated foundations, or foundation slabs. Deep foundations are applied in places where the soil is of low resistance to considerable depth and these include the following types: pile foundations, caisson foundations, box caisson foundations, and well foundations. Based on the longitudinal profile defined in the project for the building permit, the predominant type of foundation along the route can be determined;
Those whose contractual obligation is to prepare the project for execution [employer or contractor]—The employer is required to provide the project for the construction permit as the documentation necessary for the tender for contracts that will be signed under the Conditions of Contract for Construction, Multilateral Development Bank, Harmonized Edition, June 2010. However, the project for execution is not necessary to be provided by the employer, and if he provides it, then that saves time but increases the liability for risks related to design errors;
Level of land expropriation completion at the time of tender announcement [%]—This represents the ratio of the number of parcels that completed land expropriation in relation to the total number of parcels for which the land expropriation needs to be completed, expressed as a percentage. The total number of parcels for which the land expropriation needs to be completed is defined in the expropriation design of the project for the building permit;
Whether the designer is a state-owned company [yes or no]—This information indicates the expected quality of project documentation as well as the possibility of holding responsible designers accountable;
Number of amendments and clarifications to tender documents [n]—These data indicates the number of corrections and additions to the tender documentation, which inevitably affects the duration of the tender procedure;
Number of submitted bids [n]—In addition to indicating the interest of bidders, it also affects the time for review of all submitted bids and a higher probability of filing appeals against the decision to award the contract;
Whether the price adjustment for changes in cost is contracted [yes or no]—According to the regulations of certain banks, the price adjustment scale is not introduced for contracts whose time for completion is less than 18 months. If the price adjustment for changes in cost applies, then the amount payable to the contractor shall be adjusted for rises or falls in the cost of labor, goods, and other inputs to the works, by the addition or deduction of the amounts determined by the formulae prescribed in the contract;
Percent of contractual advance payment [%]—This represents the ratio of advance payment to the accepted contract amount, expressed as a percentage. This directly affects the contractor’s cash flow;
Those whose contractual obligation is to provide borrow pits for material [employer or contractor]—These data are defined in the bill of quantities and affect who will be responsible if the risk event arises;
Those whose contractual obligation is to provide a material disposal area [employer or contractor]—These data are defined in the bill of quantities and affect who will be responsible if the risk event arises.
Experts evaluated each project characteristic using a Likert scale ranging from 1 to 7, where the ratings reflected the degree of impact each characteristic had on EoT and ICP. Based on these evaluations, a multi-criteria analysis was conducted using the weighted linear combination (WLC) method. This approach integrates various criteria into a single score by normalizing the values of each characteristic and weighting them according to their importance. The resulting rankings of project characteristics will serve as inputs for the Sugeno fuzzy system, providing detailed insights into priority aspects for optimization and decision-making. The WLC procedure is shown below:
where EoT
norm is the normalized rating of experts for EoT and ICP
norm is the normalized rating of experts for ICP.
Based on the weighted values shown above, an overall score (E
s) can be calculated:
where ω
1, and ω
2 are weighting factors that reflect the relative importance of each parameter. For the given example, both factors are equally important: ω
1 = ω
2 = 1/2.
Table 2 shows the obtained score values and the order of project characteristics.
The WLC analysis enabled the prioritization and ranking of the project’s characteristics based on their significance. This expert evaluation helps in identifying and prioritizing the critical attributes that can impact project timelines and costs, ensuring that the fuzzy logic model is informed by relevant and substantiated data. These results were then submitted to a panel of experts for validation. The experts unanimously agreed with the ranking of the project characteristics and recommended that the 10 most significant characteristics be considered in further analysis. Consequently, these top 10 characteristics will be used as inputs for the Sugeno fuzzy system, enhancing the decision-making process for future project evaluations. Concrete values from each project were utilized to develop the fuzzy model. However, the detailed project characteristics for each individual project are not disclosed. Instead, only the results of descriptive statistics are presented, providing an overview of the approximate parameters of the projects. This approach ensures that while the fuzzy model is informed by real data, the confidentiality of the source database is maintained, as it cannot be publicly published. This balance between data utilization and confidentiality is crucial for preserving the integrity and privacy of the project information. The results of the descriptive statistical analysis for these characteristics are presented in
Table 3.
In further analysis, the values of ICP and EoT are expressed as percentages. This approach was adopted for the practical application of the model and facilitates its use in various projects, which may have variable ICP and EoT values.
Table 4 presents the descriptive statistics for EoT and ICP, which represent the output values from the models.
3.2. Case Study of Modeling and Predicting ICP and EoT in Road Infrastructure Projects in the Republic of Serbia
In this case study, a Sugeno fuzzy logic system was used for modeling and predicting Increasing Contract Price (ICP) and Extension of Time (EoT) in road infrastructure projects, based on a set of input and output variables. This approach was chosen as suitable for this task due to its computational efficiency and its ability to provide accurate mathematical models using linear and polynomial approaches for output modeling.
The data used in this case study were collected from 28 real and highly significant road infrastructure projects in the Republic of Serbia, which has over 45,000 km of roads, of which about 1000 km are built auto roads [
56]. Out of these, data from 25 completed projects were used for training the FIS, while data from 3 completed projects were used for validating the FIS.
The structure of the FIS (
Figure 1), created for these purposes using Matlab 2024r, consists of 10 input variables (
Table 3) and 2 outputs: Output 1: ICP; Output 2: EoT (
Table 4).
This problem requires solving several different mathematical tasks. It is necessary to define input and output variables, membership functions of input variables, a set of rules, and the mathematical form of the rules, and to perform aggregation and defuzzification. Generally, for an FIS with 10 inputs and 2 outputs, the tasks can be defined as follows [
57,
58]:
Defining input and output variables: ten inputs from , ; two outputs: and ;
Determining membership functions for each input, which involves defining membership functions that will describe the different values of the input variable in fuzzy terms (e.g., low, medium, or high). For example, if we use Gaussian membership functions, for input
we will have:
where
is the center and
is the standard deviation of the membership function. This process is repeated for all inputs.
Defining the rule set: Rules combine input membership functions to generate outputs. Generally, a rule might look like this:
where
,
,
and
are coefficients that are adjusted during training.
Mathematical form of the rules, which can be mathematically expressed as weighted membership functions. For example:
where
represents the weight, or the strength of activation of rule
.
Aggregation and defuzzification: Outputs are obtained by aggregating the weighted outputs of all rules:
where
M is the total number of rules.
In
Figure 2, the input variables in the FIS are shown, including their membership functions and value ranges.
Gaussian membership functions were used for the inputs, which allowed for modeling uncertainty in the input data. Fuzzy numbers, particularly Gaussian ones, have been used to model input variables to effectively address the inherent uncertainty and imprecision of information available in the initial phases of project analysis and development [
59,
60]. For example, while the accepted contract amount and scheduled time for completion might appear as parameters that should be defined with precise values, in practice, these values are often subject to changes due to unforeseen circumstances, justifying the use of fuzzy logic. Additionally, the projects under investigation are large in every sense, making it difficult to precisely define all characteristics in advance and ensure they remain unchanged upon project completion. This complexity and scale often lead to uncertainties and variations, further complicating the task of accurate prediction and planning. Fuzzy logic proves particularly useful when the information accompanying projects is imprecise or incomplete, which is common in the initial planning phase of a project. Gaussian membership functions were selected due to their ability to model real phenomena appropriately, allowing the model to more accurately represent how uncertainty in input data affects key project output variables, contributing to more reliable predictions. Gaussian membership functions are smooth and continuous, without sharp transitions or discontinuities, which allows for more stable system behavior. Unlike, for example, triangular or trapezoidal functions, Gaussian functions do not have sharp edges, which reduces the possibility of sudden changes in the system output. This is useful in applications where gradual changes are needed, such as control systems or prediction models, as is the case in this study. These functions often better reflect real physical processes compared to other membership functions. This approach also facilitates the integration of various types of data.
The outputs (ICP and EoT) were modeled using linear functions, which facilitated precise and mathematically defined modeling of the results.
The rules were formulated as “IF-THEN” statements using mathematical functions, providing flexibility and precision in modeling. The genfis2 function was used, as detailed below:
Specifically, the training data are organized in a matrix where the first ten columns (A to J) represent the input data, while two columns (K and L) represent the output data. The parameter 0.5 determines the clustering parameter, which affects the number of rules generated in the FIS. This way, 25 fuzzy rules were generated, and it was clear that the system performed better with a larger amount of input data for training, meaning greater coverage of the variables by the fuzzy rules.
One example of fuzzy inference generated by the FIS for specific values is shown in
Figure 3.
In
Figure 4, the surface plot of the fuzzy logic system (FIS) in MATLAB is shown. The three-dimensional view allows for the analysis of dependencies between combinations of two input variables and one output variable in the Sugeno FIS. Since there are a large number of variables and their combinations, only one representative example is presented, specifically for the first input variable (Input 1: accepted contract amount) and the last input variable (Input 10: number of collisions (box culverts, overpasses, watercourses, and utility lines)) with respect to the first output variable (Output 1: ICP). The X-axis displays the first input variable with a wide range of values, the Y-axis shows the tenth input variable with a narrower range of values, and the Z-axis represents the values of the first output variable.
The colors on the surface represent different levels of the output variable, ranging from blue (low level) to yellow (high level). The surface has a complex shape with various slopes and curvatures, indicating intricate interdependencies between the input variables. Therefore, the surface is not flat, which also signifies nonlinear relationships between the input and output variables. This is expected, as nonlinearity is characteristic of many real-world systems. It can be said that the slopes and curvatures have relatively smooth transitions be-tween different parts of the surface, which means that the system is relatively stable without large, pronounced jumps or discontinuities. Monotonic trends can also be observed, where an increase in one or both input variables leads to an increase or decrease in the output variable.
The control surface below shows how the same combination of inputs affects the second output (
Figure 5).
In this case as well, nonlinearity can be observed, with maximum output values occurring when both inputs are at higher levels. The surface has relatively smooth transitions between different values, indicating a relatively stable system that does not show abrupt jumps in output values. Such stability is desirable as it ensures the appropriate consistency of the system in response to changes in inputs.
Both FIS systems show smooth transitions and stability without sharp jumps. Both surfaces have clear maximum and minimum points that depend on the combinations of inputs. The second surface has a more pronounced peak and a wider range of output values, indicating stronger interaction between the inputs and greater sensitivity to changes. The first surface is more consistent with gradual changes, which may mean that it is more stable in real-world applications.
A notable feature of the Sugeno FIS is its capability to handle nonlinear systems well, thus efficiently modeling complex dependencies. The surface plot can be used for further analysis and optimization of the fuzzy logic system to achieve the desired output results.
The convergence analysis theorem for the Sugeno fuzzy logic approach involves understanding how and why this method converges towards accurate predictions for Extension of Time (EoT) and Increasing Contract Price (ICP). This analysis is crucial for verifying the reliability and stability of the model’s predictions. The optimal and careful selection of 10 project characteristics (rather than using all 25) facilitates the practical application of the model. This is particularly important as many real-world projects may not have a complete dataset for all 25 characteristics. By focusing on the most relevant 10 characteristics, the model’s prediction accuracy is enhanced. This targeted approach not only improves the precision of the model but also broadens its applicability on a global scale, regardless of the specific domain or project type. While increasing the number of real projects in the dataset could potentially refine the model’s accuracy further, the evaluation based on the three selected projects demonstrates that the model predicts ICP and EoT results with considerable accuracy. A detailed evaluation of the model is provided in the following section.
Evaluation of the FIS
The evaluation of the generated FIS was carried out using data from three real road infrastructure projects in the Republic of Serbia.
Table 5 presents the average values of 25 project characteristics for the selected three projects to provide insight into the approximate values of real-world projects. The 10 characteristics specifically chosen for use in the fuzzy system are highlighted.
In
Table 6, the data for ICP and EoT from the real system and the outputs from the FIS are presented, along with the corresponding deviations.
Based on the obtained results, it can be concluded that the generated FIS, for the selected completed projects, makes certain deviations during prediction, which are approximately within the range of 10%. In practical terms, for project planning, this accuracy is at a fairly satisfactory level. Additional training of the FIS is a prerequisite for improving its prediction accuracy.