1. Introduction
The membership function is used to define the fuzzy set
. The uncertainty model effectively by the fuzzy set theory define by Zadeh [
1]. The fuzzy set theory only focuses on one aspect of information, the containment or belongingness. Attansove defines the intuitionistic fuzzy set
[
2], which is the generalization of FS and model uncertainty effectively. The membership and non-membership functions are used to define
. Due to the consideration of non-membership function, the
is more effective than
for practical applications. The membership functions for the interval fuzzy set
and interval-valued intuitionistic fuzzy set
describe in intervals instead of single values [
3]. The experts give their preferences in the form of intervals in
and
. Due to intensive quantity and type of uncertainties, these approaches are not sufficient to cover all aspects [
4]. Molodtsov soft set theory model uncertainty by parametric point of view [
4]. Nowadays, many authors define the hybrid model of soft sets with
,
,
, and
[
5,
6,
7,
8,
9].
The picture fuzzy set (
) define by Coung is another generalization of
and
[
10]. The generalization in the sense that the membership, neutral and non-membership functions are used to define
. In
, the preferences of the experts describe precisely because it contains all aspects of assessment of information like yes, abstain, no and refusal. The addition of the representative’s functions in
should be less than or equal to one. This condition restricts the expert preferences domain. The hybrid model of
and soft set is obtained by Yang [
11]. Khan et al. [
12] define the generalized picture fuzzy soft set and applied them to decision-making problems. For study more about decision making, we refer to [
13,
14,
15,
16,
17,
18,
19].
Yager [
20,
21] defines the Pythagorean fuzzy sets
, which is the successful extension of intuitionistic fuzzy sets, by putting a new condition on positive membership
and negative membership functions
, i.e.,
. This new condition expand the domain of membership functions like if we have
and
, then we cannot deal it with intuitionistic fuzzy set because
but
and hence
applied successfully. The concept of Pythagorean fuzzy number
and detailed mathematical expression of
is presented by Zhang [
22]. To solve the multi criteria group decision-making problem with
, Peng defines the division and subtraction operations for
and also developed a Pythagorean fuzzy superiority and inferiority ranking method [
23]. Reformat and Yager applied the
in handling the collaborative-based recommender system [
24]. In [
25], Peng defines several distance, similarity, entropy and inclusion measures for
and their relations between them.
Ashraf [
26,
27,
28] defines the spherical fuzzy sets
, which is the successful extension of picture fuzzy sets and
, by putting a new condition on positive membership
, neutral membership
and negative membership functions
, i.e.,
. This new condition expand the domain of membership functions like if we have
,
and
, then we cannot deal it with picture fuzzy set because
but
and hence
applied successfully. In [
29], Rafiq proposed similarity measure based on cosine and cotangent functions for
and applied them to the pattern recognition. In [
30], multi-attribute group decision making problem is solved by symmetric sum based aggregation operators for spherical fuzzy sets.
Keeping in mind the importance of similarity measure and application in data mining, medical diagnosis, decision making and pattern recognition many authors work on this topic. A wide theory of similarity measures of fuzzy sets and intuitionistic fuzzy sets are presented in the literature [
31,
32,
33,
34].
The generalization of is specified on in a sense that the domains of membership, neutral and non-membership functions are grater in i.e., the experts give their judgments more freely. In , the preferences of the experts describe precisely because it contains all aspects of assessment of information like yes, abstain, no and refusal. The generalization of is specified on because it contains an extra degree of preferences: the neutral degree or neutral membership function.
The aim of this paper is to define the new similarity measures for and discuss the selection of mega projects for under developing countries. Since, it is important for under developing countries to select upcoming mega projects on priority which has less effect on their economy, environment, less maintenance cost has long term benefits, fewer peoples effects from that project and generate high revenue. Normally, the megaprojects are characterized by vast complexity (especially in organizational terms), large investment commitment, long-lasting impact on the economy, the environment, and society.
Some of the proposed similarity measures for
have some problems which are pointed out in
Section 4. To improve the idea of the similarity measure, we proposed the set-theoretic similarity and distance measures. The proposed similarity measure is then applied to the pattern recognition. The selection of mega projects for under developing countries is done by the proposed similarity measure.
The remaining paper is organized as follows: Introduction and preliminaries are presented in
Section 1 and
Section 2. In
Section 3, we proposed the set-theoretic similarity measures for
. In
Section 4, we provide some counterexamples for already proposed similarity measures. To support the proposed similarity measure a numerical example of selecting mega projects in under developing countries is presented in
Section 5. Comparison analysis and conclusion are presented in
Section 6 and
Section 7.
2. Preliminaries
In this section, we provide some basic definitions of , , , and . The already proposed similarity measures for are discussed.
A fuzzy set is defined by Zadeh [
1], which handles uncertainty based on the view of gradualness effectively.
Definition 1. [1] A membership function defines the fuzzy set over the , where particularized the membership of an element in fuzzy set . In [
10], Cuong defines the
, which is an extension of a fuzzy set and applicable in many real-life problems. The picture fuzzy set is obtained by adding an extra membership function, namely, the degree of the neutral membership in
. The information regarding the situation of type: yes, abstain, no and refusal can be model by using picture fuzzy set easily. Voting can be a good example of a picture fuzzy set because it involves the situation of more answers of the type: yes, abstain, no, refusal.
Definition 2. [10] A over the universe is defined aswhere , and are the degree of positive membership, neutral membership and degree of negative membership, respectively, such that . Definition 3. [26] A over the universe is defined aswhere , and are the degree of positive membership, neutral membership and degree of negative membership, respectively. Furthermore, it is required that . Then for , is called the degree of refusal membership of y in . For are said to spherical fuzzy value or spherical fuzzy number and each can be denoted by , where , and , with condition that . Therefore, the information regarding the situation of type: yes, abstain, no and refusal can be model more easily by using than . We can easily observed that the is an extension of . For example, if we have , and , then . However, , hence expand the domain of memberships functions.
In [
29], Rafiq defines some similarity measures for
based on cosine and cotangent functions.
Definition 4. [29] For two and in , a cosine similarity measure between and is defined as follows: Definition 5. [29] For two and in , similarity measures using cosine function between and are defined as follows:where ∨ is the maximum operation. Definition 6. [29] For two and in , a cotangent similarity measure between and are defined as follows:where ∨ is the maximum operation. Definition 7. [29] For two and in , a cosine similarity measure by using degree of refusal membership between and are defined as follows:where ∨ is the maximum operation. Definition 8. [29] For two and in , a cotangent similarity measure by using degree of refusal membership between and is defined as follows:where ∨ is the maximum operation. 3. A New Similarity Measures for
In this section, we define new similarity and distance measures for give their proof.
Definition 9. A distance measure between and is a mapping , which satisfies the following properties:
- (D1)
- (D2)
- (D3)
- (D4)
If then and .
Definition 10. A similarity measure between and is a mapping , which satisfies the following properties:
- (S1)
- (S2)
- (S3)
- (S4)
If then and .
Definition 11. For two and in , a new similarity measures is defined between and as follows: Example 1. Let be the universal set. We consider two and in , which are given as follows: Theorem 1. is the similarity measure between two and in .
Proof. To prove a similarity measure, we have to verify the four conditions of Definition 10 for .
Since for all
, we have
,
and
. Therefore for each
, we have
Therefore for all
, we have
Suppose
. We have to prove
. By definition of
,
Now we claim that
,
and
.
Suppose , since , there exists such that .
Similarly there exists such that and .
By hypothesis it follows that This implies that , which is not possible. This implies that , and . This implies that , and . Hence .
Converse, trivially follows from Definition 11.
is trivial.
For three
,
and
in
. The similarity measures between
,
and
,
are given as:
Suppose
. For all
, we have
,
and
. This implies that
,
and
. Then we have
We claim that for all
, we have
because
and
. Similarly, we have
By adding Equations (
18)–(
20), we have
Hence
. Similarly, we can prove
.
Hence from , we conclude that is the similarity measure between and . □
Definition 12. Two and are called -similar, denoted as , if and only if for .
Corollary 1. is reflexive and symmetric.
Proof. The reflexive and symmetric part follows from Theorem 1. □
The following example shows that the relation is not transitive.
Example 2. Let be the universal set. Let us define . We consider three , and in , which are given as follows:
Then and , but . Hence the relation is not transitive. Sometimes the alternatives under observations are not of equal importance, therefore, we defines weights of alternatives to signify their importance and defines weighted similarity measures between .
Definition 13. For two and in , a new weighted similarity measure is defined between and as follows:where are the weights of alternatives, but not all zero, . If , then we have Example 3. In Example 1, consider , , , and be the weights of , , , and , respectively. Then Theorem 2. is the similarity measure between two and in .
Proof. The proof is similar to the proof of Theorem 1. □
On the basis of new similarity measure , we define distance measures for .
Definition 14. For two and in , a new distance measures is defined between and as follows: Definition 15. For two and in , a new weighted distance measure is defined between and as follows:where are the weights of alternatives, but not all zero, . If , then we have Theorem 3. and are the distance measures between .
Proof. The proof is similar to the proof of Theorem 1. □
4. Applications in Pattern Recognition and Counter Examples
In this section, we provide some counter examples for already proposed similarity measures in the literature in pattern recognition. We have seen that the already proposed measures cannot classify the unknown pattern while set theoretic similarity measure classify the unknown pattern, which shows that our proposed similarity measure is applicable in pattern recognition problems.
Example 4. In this example, we have seen that the second condition of Definition 10 is not satisfied for cosine similarity measure (Definition 4), i.e., if and are two in with , , and , then . However, .
For example, let and in are Clearly, but . Hence is not effective for these cases and not reliable to find the similarity measure between . However, when we find the similarity measure by using , we get .
Example 5. Let and be two known patterns with class labels and , respectively, are given. The are used to represents the patterns in as follows:P is the unknown pattern which is given as follows:Our aim is to find out the class of unknown pattern P belongs to. However, when we use cosine similarity measure (Definition 5), we get the same similarity measure i.e., . Furthermore, when we use the cotangent similarity measure (Definition 6), we get the same similarity measures i.e., . Hence in this case we cannot decide the class of unknown pattern P by using and . However, when we find the similarity measure by using , we get and . Since , therefore, the unknown pattern P belongs to class . Example 6. Let and be two known patterns with class labels and , respectively, are given. The are used to represents the patterns in as follows:P is the unknown pattern which is given as follows:Our aim is to find out the class of unknown pattern P belongs to. However, when we use cosine similarity measure (Definition 5), we get the same similarity measure i.e., . Furthermore, when we use the cotangent similarity measure (Definition 6), we get the same similarity measures i.e., . Hence in this case we cannot decide the class of unknown pattern P by using and . However, when we find the similarity measure by using , we get and . Since , therefore, the unknown pattern P belongs to class . Example 7. Let and be two known patterns with class labels and , respectively, are given. The are used to represents the patterns in as follows:P is the unknown pattern which is given as follows:Our aim is to find out the class of unknown pattern P belongs to. However, when we use cosine similarity measure (Definition 7), we get the same similarity measure i.e., . Furthermore, when we use the cotangent similarity measure (Definition 8), we get the same similarity measures i.e., . Hence in this case we cannot decide the class of unknown pattern P by using and . However, when we find the similarity measure by using , we get and . Since , therefore, the unknown pattern P belongs to class . Example 8. Let and be two known patterns with class labels and , respectively, are given. The are used to represents the patterns in as follows:P is the unknown pattern which is given as follows:Our aim is to find out the class of unknown pattern P belongs to. However, when we use cosine similarity measure (Definition 7), we get the same similarity measure i.e., . Furthermore, when we use the cotangent similarity measure (Definition 8), we get the same similarity measures i.e., . Hence in this case we cannot decide the class of unknown pattern P by using and . However, when we find the similarity measure by using , we get and . Since , therefore, the unknown pattern P belongs to class . 5. Selection of Mega Projects in Underdeveloping Countries
The megaprojects are characterized by vast complexity (especially in organizational terms), large investment commitment, long-lasting impact on the economy, the environment, and society. So it is important to choose the best method for the selection of mega projects for under developing countries. Because it affects the lives of millions of peoples, take much time to develop and build, involve multiple public and private stakeholders, and have a long-lasting impact on the economy, the environment, and society. As we have seen that the proposed similarity measures have counter-intuitive cases for Examples 4–8. Therefore, the selection of mega projects for under developing countries is done by the proposed similarity measure.
It is important for under developing countries to select upcoming mega projects on priority which has less effect on their economy, environment, less maintenance cost has long term benefits, fewer peoples effects from that project and generate high revenue. For example, a country has to start the mega project and they get a loan from the world bank so the country has to think before spending the money because they have to refund after some time. The government has five projects in his focus like 1 million house construction, dam construction, orange metro train, invest in industry and power sector. This set can be represented as
U and the elements of
U represented as
, that is
To select the project on a priority basis, there are some parameters selected by experts from different fields to check the importance of projects like long term benefits, time, impact, revenue generated, costs and short term benefits. We represents this criteria as a set
W and the elements of
W represented as
, that is
We apply proposed technique for selecting upcoming mega projects on priority basis which is the classical multi attribute decision making problem. The weight vector for each attribute
is
. All the data collected in spherical fuzzy information is summarized in
Table 1. In
Table 1, we have seen that for each mega project
, experts interpret their evaluation in the form of
corresponding to each attribute (criteria).
To apply the proposed method, we calculate the ideal alternative (mega project)
from given data as follows
,
and
. Then the similarity measures between each alternative and ideal alternative are calculated. Heigh values of similarity measure more closer to the ideal alternative. In this case, the ideal alternative is
Then the similarity measures
between between each alternative
and ideal alternative
are calculated. The details of similarity measures presented in
Table 2 and the ranking of alternatives (mega projects) is given as follows:
The comparison between the already proposed similarity measures and proposed similarity measure is presented in
Table 2.
6. Comparison Analysis
A comparison between new proposed similarity measure and already proposed similarity measure for is conducted to illustrate the superiority of the new similarity measure.
We have seen from Example 4 that the second condition of Definition 10 is not satisfied for cosine similarity measure , i.e., even . Furthermore, we provide a general criteria when second condition of Definition 10 is not satisfied for cosine similarity measure . In Example 5, we have seen that the and can not classify the unknown pattern from known pattern. In Example 6, we have seen that the and can not classify the unknown pattern from known pattern. In Example 7, we have seen that the and can not classify the unknown pattern from known pattern. In Example 8, we have seen that the and can not classify the unknown pattern from known pattern.
However, in all Examples 4–8, the new similarity measure
classify the unknown pattern and hence successfully applicable to the pattern recognition problems. In
Section 5,
applied successfully to selecting the mega projects for under developing countries.
From
Table 3, we have seen that for different special cases, the already proposed similarity measures are not illegible for classification of unknown pattern but
applied successfully. For cases 1 and 2, the similarity measures
,
,
and
provide counter-intuitive cases. The similarity measures
,
,
and
provide counter-intuitive cases for 3 and 4 cases. The second axiom of similarity measure for
(Definition 4) is not satisfied for case 5. As we have seen in Example 4, that if we have membership, neutral and non-membership degrees for a set are equal but different from another set which has also same membership degrees, then the
has result 1. This is inconsistent with the definition of a similarity measure.
7. Conclusions
In this paper, we have defined new similarity measures for called set theoretic similarity measures. We define set theoretic similarity measure, weighted set theoretic similarity measure, set theoretic distance and weighted set theoretic distance measures and provide their proofs in this paper. We discuss some special cases (Examples 4–8) where already proposed similarity measure fails to classify the unknown pattern while the proposed similarity measure successfully applied to the pattern recognition problems. Furthermore, applied successfully to selecting the mega projects for under developing countries.
In the future direction, we will apply the set theoretic similarity measure to data mining, medical diagnosis, decision making, complex group decision making, linguistic summarization risk analysis, pattern recognition, color image retrieval, histogram comparison and image processing.