1. Introduction
Software quality is a measure of how well a software product meets the needs and expectations of its users. High-quality software is free from defects and errors that could affect its functionality or performance. It is more reliable and less likely to fail or cause errors, and easier to maintain and update, reducing the risk of bugs over time.
An important issue in software engineering is finding code smells in software before its delivery. In fact, defect prediction is a technique to control the schedule and cost of a software system. Detecting and fixing OOP defects early in the development cycle can save time and money by reducing the need for expensive rework or corrective action later in the project [1].
Code smells [2,3,4] refer to any symptom in an object-oriented program that may hinder software maintenance and evolution. The textual description of defects or code smells is very subjective and depends on the designer's/programmer's interpretation. As an example, consider the Feature Envy defect (i.e., a method defined in the wrong class, given the class attributes and the methods it invokes). Depending on a subjective interpretation, each designer could decide differently which methods are candidates for a Feature Envy defect. In fact, the selection of those methods is based on information such as "the number of communications with a given class". Depending on the context, the same value could be evaluated as high, medium, or even low.
One approach that has gained popularity for detecting design defects is the use of object-oriented metrics. In [5], the authors presented an overview of 3295 papers extracted from the most popular electronic databases related to the UML model-refactoring field. Seventeen percent of the studies used OO metrics and rules-based metrics to detect design defects. Unfortunately, a single metric may not be sufficient to capture all aspects of a system's quality and may lead to incomplete or inaccurate assessments of its overall quality; a combination of metrics, however, is a powerful heuristic that can identify and standardize the way code smells are defined and detected. The majority of works in the literature focus on combining metrics to generate a set of detection rules; they use standard object-oriented metrics as well as metrics defined in an ad hoc way [6,7,8]. The accuracy of rule-based metrics is directly affected by the selected metrics. Defining object-oriented rule-based metrics can be challenging. Not only does it require a thorough understanding of OOP concepts and their relationship with software quality attributes, but it also depends on selecting the rules with the best metrics as well as the best thresholds for each metric. The complexity of this combinatorial problem is significantly reduced when the number of metrics involved in a detection rule decreases.
In fact, there is no standard set of object-oriented rule-based metrics that everyone agrees upon. This can make it difficult to compare software systems assessed with different metrics. The selection of metrics is, par excellence, a Multi-Criteria Decision-Making (MCDM) problem if we consider the huge number of possible combinations of metrics and thresholds. Researchers thus face a twofold challenge: selecting the metrics and defining the respective thresholds, in order to find the most relevant set of rules-based metrics.
In this paper, we propose to use the fuzzy decision-making trial and evaluation laboratory (DEMATEL) method to determine the most influential criteria and to rank those criteria [9,10,11]. The goal of this work is to reduce the number of metrics combinations, leading to rules that are more accurate. This paper answers the question: "What is the best set of rules that can detect a specific defect?" Only rules with the most relevant metrics are considered. The results are validated against previously published work [12] and show an improvement in the detection of four design defects (i.e., the Blob, Feature Envy, Lazy Class, and Data Class defects) based on sixteen object-oriented metrics.
The paper is structured as follows. Section 2 defines the object-oriented metrics. Section 3 presents an overview of the Fuzzy DEMATEL method. Section 4 details the findings. Section 5 is dedicated to the validation, and Section 6 concludes the paper.
3. Fuzzy DEMATEL for Object-Oriented Metrics
The DEMATEL technique is an initiative of the Battelle Memorial Institute through the Geneva Research Centre [19]. It is a comprehensive method for illustrating the structure of complicated cause-and-effect relationships. It aims to find the critical attributes through a visual structural model [20].
We use the DEMATEL method to identify the most significant metrics, i.e., those that affect other metrics. The method converts the cause-and-effect relationships between metrics into a structural model and identifies the sets of cause metrics and effect metrics.
Using DEMATEL, we aim to decrease the number of metrics needed to measure the defects, which improves the effectiveness of the defect detection rules derived from the digraph map.
3.1. Research Methodology
This research is based on Fuzzy DEMATEL because of its suitability for evaluating expert answers. In fact, evaluating the importance of a metric is very subjective, depending on each expert's experience; it therefore cannot be assessed with crisp values. The concept of fuzzy sets [21] combined with the DEMATEL method handles the vagueness of expert answers well. To deal with this imprecise decision-making problem, we adopt the triangular fuzzy number identified by Akyuz and Celik [22]. This fuzzy representation is defined by a set of values (Lower, Medium, Upper). The influence of each metric is measured over a five-level fuzzy scale, as shown in Table 2.
After selecting the set of metrics and the experts in the object-oriented programming field, the experts evaluate the effect between metrics using pairwise comparisons, and we generate and normalize the Fuzzy Direct-Relation Matrix (FDRM) as an aggregation of all expert matrices. Then, we calculate the total-relation fuzzy matrix and, as the final step, obtain the classification of metrics based on their importance and influence. Finally, we validate our findings against our previous work [12] by refining the set of rules-based metrics identified in [12] according to the metrics' importance and influence found using Fuzzy DEMATEL. Figure 1 presents the main steps to apply Fuzzy DEMATEL.
3.2. Fuzzy Direct-Relation Matrix
Each expert generates a direct-relation matrix through pairwise comparisons, Me, where e represents the expert, M is an (n × n) non-negative matrix, and M(e, i, j) represents the direct impact of metric i on metric j for expert e. When i = j, the diagonal elements M(e, i, j) = 0. In Table 3, we present an example of the linguistic scores of one expert's evaluation for the Blob anti-pattern.
Based on Table 2, the fuzzy linguistic matrix is then converted into a fuzzy scaled direct-relation matrix M. Table 4 presents the aggregated fuzzy direct-relation matrix collected from the different experts' judgments.
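The aggregation step can be sketched as follows. This is a minimal illustration assuming the aggregate FDRM entry is the element-wise average of the experts' triangular fuzzy numbers; the exact aggregation operator is not reproduced in the text.

```python
def aggregate_fdrm(expert_matrices):
    """Aggregate expert matrices into a Fuzzy Direct-Relation Matrix.

    expert_matrices: list of n x n matrices of (lower, medium, upper) tuples,
    one matrix per expert, with zeros on the diagonal.
    """
    k = len(expert_matrices)
    n = len(expert_matrices[0])
    fdrm = [[(0.0, 0.0, 0.0)] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            # Element-wise average of each component of the fuzzy number.
            lo = sum(m[i][j][0] for m in expert_matrices) / k
            md = sum(m[i][j][1] for m in expert_matrices) / k
            up = sum(m[i][j][2] for m in expert_matrices) / k
            fdrm[i][j] = (lo, md, up)
    return fdrm
```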
3.3. Normalized Fuzzy Direct-Relation Matrix
The first step is the defuzzification of the fuzzy direct-relation matrix, based on the Best Non-fuzzy Performance (BNP) method [23]. It is a technique used to generate crisp values from fuzzy values. The BNP of a triangular fuzzy number N = (Lower, Medium, Upper) can be expressed as:
BNP = [(Upper − Lower) + (Medium − Lower)]/3 + Lower    (1)
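As a minimal sketch, the BNP defuzzification of a triangular fuzzy number reduces to its centroid:

```python
def bnp(lower, medium, upper):
    # Best Non-fuzzy Performance of a triangular fuzzy number (L, M, U):
    # BNP = [(U - L) + (M - L)] / 3 + L, which simplifies to (L + M + U) / 3.
    return ((upper - lower) + (medium - lower)) / 3 + lower
```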
Table 5 presents the crisp direct-relation matrix (CM). Using Formula (2), we transformed the CM into a normalized direct-relation matrix (NMR), as shown in Table 6. In the initial matrix (CM), aij denotes the degree to which criterion i affects criterion j.
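Formula (2) is not reproduced above; a common DEMATEL normalization divides every entry of the crisp matrix by its largest row sum, and the sketch below assumes that convention:

```python
def normalize(cm):
    # Normalize the crisp direct-relation matrix by its largest row sum,
    # so every entry of the result lies in [0, 1].
    s = max(sum(row) for row in cm)
    return [[a / s for a in row] for row in cm]
```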
3.4. Total-Relation Fuzzy Matrix
At this level, we generate the total-relation matrix (
TRM) as shown in
Table 7, using the Formula (3).
where
I is denoted as the identity matrix.
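The total-relation computation can be sketched as follows, using the standard DEMATEL closed form with a small Gauss-Jordan inverse (sufficient for matrices the size of a metric set):

```python
def mat_mul(a, b):
    # Plain matrix product.
    n, m, p = len(a), len(b), len(b[0])
    return [[sum(a[i][k] * b[k][j] for k in range(m)) for j in range(p)]
            for i in range(n)]

def mat_inv(a):
    # Gauss-Jordan inversion with partial pivoting.
    n = len(a)
    aug = [row[:] + [float(i == j) for j in range(n)] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0.0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def total_relation(nmr):
    # TRM = NMR (I - NMR)^-1 -- the standard DEMATEL total-relation matrix.
    n = len(nmr)
    i_minus_n = [[float(i == j) - nmr[i][j] for j in range(n)] for i in range(n)]
    return mat_mul(nmr, mat_inv(i_minus_n))
```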
3.5. Metrics of Cause and Effect Matrix
As presented in Table 8, the value of (D + R) represents the importance of a metric in rule detection: the higher the value of (D + R), the more important the metric, and the stronger the case for including it in the rule generation process. The value of (D − R) classifies the metrics into cause and effect metrics. D represents the sum of the rows of the total-relation matrix and R represents the sum of its columns. Using Formulae (4) and (5), with TRM = (TRMij), i, j ∈ {1, 2, ..., n}, Di = Σj TRMij (4) and Rj = Σi TRMij (5).
In Figure 2, we present the causal diagram. The horizontal axis represents (D + R) and the vertical axis represents (D − R). Considering the values of (D + R), it appears that some metrics are more important than others. We can split the set of metrics into three main groups, labeled Low Importance, Important, and High Importance.
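Formulae (4) and (5) amount to row and column sums of the total-relation matrix; a sketch of computing (D + R) and (D − R) for each metric:

```python
def cause_effect(trm):
    # D: row sums (influence given); R: column sums (influence received).
    # (D + R) measures a metric's importance; the sign of (D - R) places it
    # in the cause group (positive) or the effect group (negative).
    n = len(trm)
    d = [sum(trm[i][j] for j in range(n)) for i in range(n)]
    r = [sum(trm[i][j] for i in range(n)) for j in range(n)]
    return [(d[i] + r[i], d[i] - r[i]) for i in range(n)]
```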
4. Results of Fuzzy DEMATEL Method for Object-Oriented Metrics
Table 9 presents the groups identified in the metrics causal diagram. We classified the set of metrics into cause metrics and effect metrics. Within each group, metrics have importance levels (Low, Normal, and High). For example, in the cause group, the highest-ranked metrics are NIC and CM, and in the effect group, the highest-ranked metrics are ATFD, NOM, NOC, and NCC.
Cause metrics exert influence on the effect metrics. Considering the interdependence among the metrics, the detection rules should take into account both the cause metrics and their influence on the effect metrics [24]. Therefore, by selecting rules combining cause and effect metrics, we can improve the accuracy of design defect detection, giving priority to the metrics with the highest importance.
Based on the DEMATEL method, we classified the metrics along two orthogonal dimensions. The horizontal dimension represents the metrics' importance and the vertical dimension separates cause metrics from effect metrics. We can now restrict the detection rules; for example, we can exclude rules with low-importance metrics from the detection process. Based on Table 9, it becomes clear that detection rules should contain as many metrics as possible from the "Important" and "High Importance" groups of the horizontal dimension. These rules should also combine metrics from the vertical dimension (i.e., both cause and effect metrics). Based on this finding, an excellent detection rule combines metrics such as NIC and/or CM with metrics such as ATFD, NOM, NOC, and/or NCC.
The following section explores in depth the impact of the above finding on the accuracy of design defect detection.
5. Validation
The findings presented in the previous section are very important for the selection of metrics. In order to validate our results, we refer to our previous study [12], where we used the decision tree algorithm to generate defect rules. Applying our finding to that work by refining the set of rules-based metrics, and comparing the outcome with the results in [12], shows how Fuzzy DEMATEL improves the process of identifying the best set of rules-based metrics.
5.1. Reference Study
The first study we conducted [12] represents the reference study used to validate our findings. We experimented with four design defects: the Blob, Data Class (DC), Lazy Class (LC), and Feature Envy (FE) defects, using 15 object-oriented metrics:
The Blob anti-pattern, or God class [25], corresponds to a large controller class that depends on data stored in other classes. This is typically the case for large classes declaring many fields and methods, resulting in low cohesion.
A Data Class bad smell [24] corresponds to a class that stores data passively. This class contains data and no methods to operate on that data.
A Lazy Class bad smell corresponds to a class that is not doing enough to pay for itself. There is no need for additional classes that could increase the project complexity.
A Feature Envy bad smell corresponds to a method that uses another class excessively. It should belong to that class.
We tested five open-source projects: Xerces v2.7, ArgoUML 0.19.8, Lucene 1.4, Log4j 1.2.1, and GanttProject v1.10.2.
Table 10 summarizes the characteristics of the five projects.
5.2. Validation Methodology
In [12], the main objective of the study was to extract rules using the decision tree algorithm. The rules have the form, for a defect D: "IF metric1 is higher/lower than threshold1 AND metric2 is higher/lower than threshold2 … AND metricn is higher/lower than thresholdn THEN defect D is suspected". As an example, we present three rules, R1, R2, and R3, generated for the detection of the Data Class defect:
R1: IF ATFD <= 16.5 THEN Data class = Yes
R2: IF ATFD > 16.5 AND NOA > 11.25 THEN Data class = Yes
R3: IF ATFD > 16.5 AND NOA <= 11.25 AND NC > 251 THEN Data class = Yes.
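As an illustration, the three rules above can be encoded as predicates over a class's metric values; a class is flagged as a Data Class suspect when any rule fires. The dictionary-based encoding below is ours, not from [12]; the metric names and thresholds are those of R1-R3.

```python
# Rules R1-R3 for the Data Class defect, with the thresholds listed above.
RULES_DC = [
    lambda m: m["ATFD"] <= 16.5,                                         # R1
    lambda m: m["ATFD"] > 16.5 and m["NOA"] > 11.25,                     # R2
    lambda m: m["ATFD"] > 16.5 and m["NOA"] <= 11.25 and m["NC"] > 251,  # R3
]

def is_data_class_suspect(metrics):
    # metrics: mapping from metric name to its measured value for one class.
    return any(rule(metrics) for rule in RULES_DC)
```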
R1 and R2 combine few metrics, resulting in the generation of a huge number of suspect classes. Therefore, R3 is the most appropriate rule to consider as an illustrative example. In [12], we considered the number of metrics as the only criterion that matters for rule selection. In fact, the process of selecting rules was based on a parameter N fixed by the tester. This parameter is estimated through a series of tests on the base of examples; it represents the number of metrics in a detection rule. The value of N directly affects the accuracy of the detection. A small value generates a high number of false positives due to over-detection: we detect more defects than actually exist. Conversely, a high value of N generates a high number of false negatives: we detect very few defects compared to the existing ones.
Based on Fuzzy DEMATEL and the results shown in Table 9, we follow an alternative approach. For instance, R1 contains only one metric, from the effect group, so it cannot be considered an important rule. The decision not to consider rule R1 as important is based not on the number of metrics but on the fact that a rule should combine both cause and effect metrics. Rules R2 and R3, in contrast, combine important and highly important cause and effect metrics. As a matter of fact, R2 and R3 are important rules that DEMATEL suggests including in the evaluation process.
This experiment is based on the same set of rules generated by the decision tree algorithm proposed in [12]. In this paper, we select rules based on the metrics' importance presented in Table 9, instead of selecting rules based on the N parameter as in [12].
The rules selected for defect detection give priority to rules that contain both cause and effect metrics. We start by selecting rules that include the important and highly important metrics. If this yields too small a set of rules, we also include rules containing only cause or only effect metrics, still choosing rules with important and highly important metrics.
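A minimal sketch of this selection policy, assuming the group memberships of only the highest-ranked metrics from Table 9 (the full table is not reproduced here, and the `min_rules` cutoff is our illustrative parameter):

```python
# Highest-ranked metrics per group, taken from Section 4; the remaining
# group memberships from Table 9 are omitted for brevity.
CAUSE = {"NIC", "CM"}
EFFECT = {"ATFD", "NOM", "NOC", "NCC"}
IMPORTANT = CAUSE | EFFECT

def select_rules(rules, min_rules=2):
    # rules: mapping rule-id -> set of metric names used by the rule.
    # Priority 1: rules mixing cause and effect metrics.
    both = [rid for rid, ms in rules.items() if ms & CAUSE and ms & EFFECT]
    if len(both) >= min_rules:
        return both
    # Fallback: one-sided rules that still use important metrics.
    one_sided = [rid for rid, ms in rules.items()
                 if rid not in both and ms & IMPORTANT]
    return both + one_sided
```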
We use the precision and recall measures to validate our findings. To evaluate the correctness of the approach, we calculate the precision: the fraction of true design defects among the set of all detected defects (6). The recall is the fraction of correctly detected design defects over the set of expected defects (7).
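A sketch of Measures (6) and (7) over sets of class identifiers:

```python
def precision_recall(detected, expected):
    # Precision (6): true defects among all detected defects.
    # Recall (7): correctly detected defects among all expected defects.
    detected, expected = set(detected), set(expected)
    true_positives = detected & expected
    precision = len(true_positives) / len(detected) if detected else 0.0
    recall = len(true_positives) / len(expected) if expected else 0.0
    return precision, recall
```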
5.3. Fuzzy DEMATEL for Object-Oriented Metrics Results Discussion and Validation
The first step in the validation process of the Fuzzy DEMATEL approach is to select the set of rules for each defect. In fact, we use the same rules generated in the reference study [12], based on the projects PMD 5.4.3 (with 433 classes) and Nutch 1.12 (with 247 classes). However, we introduce two formulae to compare the rule-selection method used in the reference study (i.e., based on the parameter N) with the newly proposed one (i.e., based on Fuzzy DEMATEL).
The first, Formula (8), represents the ratio of the number of rules in the newly selected set to the number of rules in the original set.
The second, Formula (9), represents the similarity between the two sets of rules: the set of rules identified using the parameter N and the set of rules identified using the Fuzzy DEMATEL approach.
where:
R0 represents the original set of rules selected based on the parameter N;
R1 represents the set of rules selected based on metrics importance (Fuzzy DEMATEL).
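Since Formulae (8) and (9) are not reproduced above, the sketch below assumes ratio = |R1|/|R0| and similarity as the fraction of R1's rules also present in R0; the exact denominators may differ in the original formulae.

```python
def rules_ratio(r0, r1):
    # Formula (8) (assumed form): size of the new set over the original set.
    return len(r1) / len(r0)

def rules_similarity(r0, r1):
    # Formula (9) (assumed form): share of the newly selected rules that
    # already appeared in the original set.
    return len(set(r0) & set(r1)) / len(r1)
```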
As presented in Table 11, based on Fuzzy DEMATEL, we reduced the number of rules generated in [12] by about half. The degree of similarity varies depending on the defect. For example, the DC defect detection has a rule similarity of 0.462 with the rules used in [12], which means that more than 50% of the selected rules are new compared to [12]. For LC, the degree of similarity is 0.833, which means that we use almost the same rules as those selected based on the N parameter, but we reduced their number by eliminating the non-useful rules.
In Table 12, we present the detailed detection results by project and by defect. In fact, we significantly increased the precision and recall. The F1 score is higher when using rules selected based on the metrics' cause-and-effect relationships. Figure 3 clearly shows that the accuracy of the detection is improved when using rules selected based on metrics importance.
However, we notice that, for the LC defect only, the improvement in accuracy was due to the improvement in precision. In fact, in Figure 3, there is no big difference in the recall curves for the two selection methods. This is expected; as we can see in Table 11, the rule similarity is very high, which means that we use almost the same rules and therefore obtain a similar detection rate. However, the ratio defined by Formula (8) shows that we kept only approximately 60% of the rules selected based on the N parameter; the selection of rules based on their importance thus reduced the set of rules by about 40%. This has a direct impact on the precision: we use fewer rules, minimizing over-detection, and we decrease the number of false positive detections.
6. Conclusions
Object-oriented metrics offer quantitative measures of object-oriented software and can be a valuable tool for assessing its quality. By using these metrics, developers can identify areas for improvement and refactoring opportunities to optimize the software's performance, maintainability, and other important factors. However, developers need to carefully choose the right metrics, combine them in rules, and apply the rules consistently to identify code defects. This is a challenging task due to the number of metrics and the multiple possible thresholds for each one.
In this paper, we proposed applying a fuzzy multi-criteria decision-making approach, Fuzzy DEMATEL, to identify and select the most important object-oriented metrics for detecting design defects. Compared to the findings of our previous study [12], the results of the current work show that the new set of rules selected based on Fuzzy DEMATEL improves the defect detection accuracy. We are convinced that the metrics importance identified in this work is useful for the entire community of researchers. In fact, generating rules based only on important and highly important metrics reduces the number of metrics combinations, and consequently the number of rules, and improves the design defect detection process. This work represents the first step toward software refactoring. In future work, we will go a step further: after detecting the defects, we will correct them based on a set of refactoring rules.