1. Introduction
The term chirality was introduced by Lord Kelvin to explain the relation between the optical isomers or enantiomers that differ in their three-dimensional disposition around an asymmetric carbon atom, which are related as the object and the non-superimposable mirror image. In chemistry, there is a solubility rule to explain the solubility of solutes and it states that “Like dissolves like,” which means polar solvents dissolve polar solutes and non-polar solvents dissolve non-polar solutes. The same is true for the chiral distinction. Only a chiral environment can distinguish chiral isomers. Biological systems are chiral in nature, owing to the homochirality of the building blocks of the macromolecules. Thus, chirality is the biosignature present in all biological macromolecules [
1]. The homochirality that is prevailing among the biological molecules is considered to have a cosmic origin [
2] in support of these propylene oxides, since a chiral molecule has been identified in interstellar space [
3]. The biological systems are enantio-selective, and the enantio-distinction plays a vital role in drug action, i.e., pharmacology [
4,
5]. The chiral distinction or enantio-selectivity is not a phenomenon relevant only to the pharmaceutical industry [
6] but extends to other areas such as the agriculture [
7], fragrance, and food industries [
8], and environmental toxicology [
9].
The QSAR approach has been extensively used to model various physiochemical properties, biological activities, and toxicological end points [
10,
11,
12,
13,
14,
15,
16,
17,
18]. In the QSAR approach for modeling and predicting the property/activity under consideration, computed molecular descriptors are preferred and have been successfully used over the years. The large set of molecular descriptors used in the QSAR approach under the name of topological indices are derived from their molecular graphs. In a molecular graph, the atoms form the vertices, and the bonds form the edges. The topological indices are therefore computed based on the connectivity of atoms in the molecular graphs. In the case of enantiomers, though they have the object and the non-superimposable mirror image relation, the molecular connectivity remains the same. That is, enantiomers have identical molecular graphs; therefore,
R- and
S- isomers will have the same numerical values for the molecular descriptors and fail to distinguish them. To overcome this limitation in modifications, which are generally regarded as chiral modifications, chiral corrections are applied in the computation of the topological indices. These modifications may be classified into two major types:
- (1)
including a chiral correction into the connectivity matrix and computing the chiral descriptors.
- (2)
applying a chiral correction factor to the topological indices computed based on a well-defined algorithm, such that new Chiral TI = conventional TI × correction factor.
The second approach modifies the existing topological index; therefore, they must be computed as the first step. The different methods on the numerical characterization of chirality have resulted in chirality descriptors that could distinguish enantiomers and diastereomers [
19,
20,
21,
22,
23,
24]. In most of these approaches, only one type of index is derived and of course they satisfy the primary objective in differentiating the enantiomers. The numerical characterization approach that provides just a pair of descriptors for a set of enantiomers has a limitation in its application to different biological responses for the same set of chiral molecules. The stereospecific recognition of the same set of molecules by two different receptors may be completely uncorrelated. The chiral descriptors might be successful in modeling one of these biological activities of a set of enantiomers but fail to model other properties related to a different receptor. In such a scenario, a single set of descriptors may be able to model one of the two responses. This was illustrated in the dopamine sigma receptor and D2 receptor affinities for seven pairs of 3HPPs, 3-(3-hydroxyphynyl)piperidine [
19]. These two biological responses are mutually uncorrelated (correlation coefficient r = 0.195). When a single set of chiral descriptors for these seven pairs of enantiomers were used, it could model the sigma receptor affinities but failed in modeling the D2 receptor affinities. In order to handle such a situation, a family of chiral descriptors was proposed [
19]. Among these descriptors, some were mutually perfectly correlated (r = 0.999), while some were uncorrelated (r < 0.1). This approach was like that of Kier and Hall [
16,
17,
19] in extending Randić connectivity indices [
20] to calculate a family of topological indices for a given set of molecules. Similar to Kier and Hall’s approach, this approach gave the Randić connectivity index new dimensions and extended its applicability. The QSAR approach, which is a new approach of calculating a large pool of chirality indices for chiral molecules, was expected to have wider applicability to many different properties, because the chiral descriptors provide a multidimensional space. The new type of chirality indices proposed to differentiate enantiomers and diastereomers were called the relative chirality indices (RCIs).
Readers may wonder how chirality, an intrinsically three-dimensional (3D) phenomenon, is characterized by a graph theoretic approach which is only two-dimensional (2D). Our approach of computing relative chirality indices may be looked upon as a method of transforming the 3D disposition of the four substituents attached to a chiral center into two directed graphs. The transition of CIP rules of ordering the four substituents attached to a chiral carbon into two directed graphs to represent
R- and
S- isomers is outlined in
Figure 1. The four groups A, B, C, and D are attached to the chiral carbon, and the order of priority according to Cahn–Ingold–Prolog (CIP) rules is A > B > C > D. While assigning the R/S configuration, group D (least priority) is kept away from the viewer and the other three groups are placed in the front (
Figure 1). This disposition is transferred into two directed graphs:
The two directed graphs differ in their adjacency matrix. The adjacency matrices for the two directed graphs are as follows:
In order to derive indices from the directed graph, a set of formulas were proposed to calculate chirality indices that could distinguish the enantiomers. The initial formulation [
25] was recently modified [
22] to compute a pack of several chirality indices for a given set of enantiomers. The formulas to calculate the chirality descriptors, relative chirality indices (RCI),
RRCI, and
SRCI are given below:
where δ
a, δ
b, δ
c, and δ
d are the weights assigned to the four substituents A, B, C, and D attached to the chiral carbon, while the CIP priority order is A > B > C > D.
This new approach enables the extension the of computation of RCIs to molecules with more than one chiral center. When a molecule has more than one chiral center, the RCI is computed for each center based on substituents attached, and the individual RCI values thus computed for each chiral center in a molecule is combined by taking a root mean square value to derive a single RCI for the chiral molecule (Equation (3)). Thus, RCIs can be calculated not only for enantiomers but also for diastereomers.
The calculation of the RCI and its application to model the biological activity of diastereomers was illustrated for the differential repellency (mosquito) of
SS 220 [
21], and recently, Natarajan et al. [
19] extended the application of the large pool of RCIs calculated to model two biological responses for a set of seven pairs of enantiomers of 3-(3-hydroxyphenyl)piperidines. The present paper is an extension of this approach to model the biological activities of a diverse data set of chiral molecules.
Physiological and pathological processes in the human body are regulated by a system of chemokine and chemokine receptors, a subfamily of human Class A G-protein coupled receptors (GPCRs) [
22,
23]. The chemokine receptors play a significant role in the migration and localization of leukocytes. To date, the protein data bank (PDB) has a repository of the structures of 22 chemokine receptors. Urvas and Kellenberger [
24] analyzed and compared their structures in a recent review. Out of the chemokine receptors, the CC-chemokine receptor 2 (CCR2) is the second most studied receptor. CCR2 is involved in various neurodegenerative disorders including Alzheimer’s disease, multiple sclerosis, and ischemic brain injury [
25,
26,
27,
28]. During the SARS-CoV-2 pandemic, the involvement of CCR2 in fighting the inflammation of lungs was extensively studied [
29,
30]. Hence, CCR2 has attracted attention as a therapeutic target for autoimmune diseases such as rheumatoid arthritis [
27], cancer [
25,
29], traumatic brain injury [
31], etc. In most of these, the overexpression of CCR2 is the main cause; therefore, the suppression or dampening by CCR2 antagonists is one of the therapeutic strategies. Owing to the therapeutic importance of CCR2, we considered developing QSAR models for CCR2 antagonists that are chiral using a large pool of RCIs computed based on various algorithms to assign weights to the four substituents attached to the chiral carbon.