1. Introduction
Leishmaniasis refers to a collection of diseases, caused by intracellular parasites of the
Leishmania genus [
1]. Transmission can occur in humans through the bite of Diptera, in the Psychodida family, which are hematophagous insects in the genera
Phlebotomus or
Lutzomyia [
2]. Leishmaniasis is endemic in at least 98 countries [
3] and is considered to be a neglected disease because it affects primarily low-income populations but remains a relevant public health problem, with considerable incidence and prevalence rates in Brazil [
4]. The World Health Organization considers leishmaniasis to be one of the six most neglected tropical diseases, affecting 2 million people annually, worldwide [
5]. More than 20,000 deaths are estimated to occur annually, due to disease complications [
6].
According to Akhoundi et al., 21 species of the genus Leishmania are known to be pathogenic to humans, including
L. major,
L. tropica,
L. aethiopica,
L. donovani,
L. infantum,
L. mexicana,
L. amazonensis,
L. braziliensis,
L. peruviana,
L. guyanensis,
L. panamemsis,
L shawi,
L. lainsoni,
L. lindenbergi,
L. martiniquensis,
L. siamensis, and
L. colombiensis. [
7]. Leishmaniasis can be categorized as tegumental and visceral leishmaniasis, and both can be further subdivided into other clinical forms of infection, defined by the location of the parasite in infected tissues and by the species, including cutaneous leishmaniasis (CL), mucocutaneous leishmaniasis (MCL), visceral or kala azalis leishmaniasis (VL), and post-kala azal dermal leishmaniasis (PKDL) [
8]. The most common form is CL, and more than 90% of all CL cases are distributed across three primary regions: (i) Afghanistan, Iran, Saudi Arabia, and Syria; ii) Algeria and Tunisia; and (iii) Brazil and Peru [
9].
CL symptoms present as one or more lesions, which are often painless, and purulence is uncommon. Lesions are typically located in exposed areas, such as the face and extremities, with ulcers and nodule/plaque appearances being common, and transmission occurs through the bites of mosquitos infected with the parasite [
10]. Although not fatal, multiple lesions tend to leave permanent scars that can lead to social stigmatization.
In the Americas, the primary etiological agent of CL is
L. braziliensis [
11], which generally encompasses severe clinical forms associated with skin, mucosal, mucocutaneous, and subcutaneous nodular lesions [
12,
13], and large genetic polymorphisms can cause visible deformities in the host, in addition to psychological, social, and economic impacts [
14]. CL is a chronic inflammatory disease that is widely distributed in Brazil and primarily caused by
L. braziliensis [
15].
In India, the Middle East, Central Asia, and North and West Africa, CL is primarily caused by the species
L. major [
12,
16], causing mild to severe skin disorders that can result in disfigurement if left untreated [
17]. In the Middle Eastern region and Israel, CL is frequently caused in humans by infection through wild mammal reservoirs, with
Phlebotomus papatasi being the primary vector for
L. major [
16]. Although
Leishmania parasites affect millions of people, in several countries, around the world, no human vaccine is currently available for the treatment of CL caused by
L. major, and the available treatments are expensive and have toxic side-effects. Thus, the identification of new therapeutic options against the disease remains necessary [
18].
Current treatments for leishmaniasis are based on chemotherapeutic drugs, which are often inefficient or harmful, due to the development of drug resistance and side effects, associated with the high toxicity index. In addition, no vaccines are currently available, although some vaccine candidates for the treatment of these diseases are currently in the pre-clinical and clinical testing phases [
19,
20].
Because
L. major and
L. braziliensis cause lesions in the host that can be disseminated to other sites and the exacerbated production of cytokines and chemokines that cause oxidative stress, trigger the amplification of the inflammatory response [
21], we chose liganans and neolignans because they have properties favorable to drug development. Factors in the host such as immunosuppressant, malnutrition, and co-infection or genetic and environmental factors are factors that aggravate the disease. In this sense, we chose lignans and neolignans to investigate antileishmania activity in enzymes important for the survival and proliferation of parasites, reducing injuries and decrease the inflammatory response [
22]. In addition, lignans are known for their anti-inflammatory and antioxidant activity, which could minimize the effects of the inflammatory response. In addition, a study by Pilkington [
23], evaluating the absorption, distribution, metabolism, excretion, and toxicity (ADMET) profiles of lignans found that more than 75% of lignans met all the requirements for drug-likeness. The study concluded that lignans show a high level of drug similarity.
Computational tools can contribute greatly to database creation, by predicting protein functions, modeling protein structures, simulating metabolic pathway kinetics, predicting biological activities, predicting toxicity, and predicting the affinities and flexibilities between receptors and ligands, which can facilitate the development and identification of drugs with the potential to treat various diseases and promote the development of efficacious drugs with reduced toxicity [
24,
25].
Therefore, this study aimed to use virtual screening and experimental validation to identify lignans with leishmanicidal potential, low toxicity, and selective activity against several Leishmania targets.
2. Results
2.1. Prediction of ADMET Properties
Various predictive parameters were determined for a set of 160 lignans and neolignans, to identify compounds with the best ADMET profiles for further examination using other methodologies. The results showed that among the 160 lignans and neolignanas, only 34 failed the Lipinski rule. Because the application of the Lipinski rule did not decisively filter the molecules, we used additional methodologies to select those compounds with the best profiles.
During the analysis of lipophilicity and water solubility, 148 compounds (92%) obtained good results, presenting consensus log
p values below <4.15 and/or at least two descriptors with the classification “Low solubility” (
Table S1). Then, the 148 compounds were submitted to pharmacokinetic analyzes. The results showed that 42 lignans (28.3%) had adequate pharmacokinetics (
Table S1).
Toxicity was assessed for the 42 lignans and we found that 33 (78%) of the 42 compounds with good pharmacokinetic action had low or no predicted risk for the development of mutagenicity, tumorigenesis, negative effects on the reproductive system, or irritability (
Table S2).
2.2. Quantitative Structure-Activity Relationship (QSAR) Modeling
To perform ligand-based virtual screening, two prediction models were built, using the random forest (RF) algorithm. To construct these models, molecular descriptors were calculated for the bank of molecules with known activity against L. major and L. braziliensis, obtained through the ChEMBL database. After validating the models, 33 lignans with excellent ADMET profiles were analyzed for leishmanicidal activity, using the prediction models.
The RF models were evaluated for their predictive powers, using the parameters of specificity, sensitivity, accuracy, positive predicted value (PPV), and negative predicted value (NPV), in addition to performance and robustness, such as the receiver operating characteristic (ROC) curve and Mathews correlation coefficient (MCC).
Table 1 describes the characteristics of the two models, in terms of predictive power and robustness, and
Figure 1 and
Figure 2 show the performances of the models.
The cross-validation results demonstrated that the generated models obtained excellent performance results, with an accuracy greater than 76%. The L. major ROC curve showed a value greater than 0.89 and the MCC value was 0.63, which indicated that the model demonstrated excellent classification, performance, and robustness rates. The ROC curve for the L. braziliensis model was greater than 0.86, with an MCC value equal to 0.87, which were also good results.
With the models created and demonstrated to have excellent performance, the lignan bank was then screened to select compounds that are potentially active against L. major and L. braziliensis.
The RF model was able to select 11 compounds with active potential, with probabilities ranging from 50% to 57%, for
L. major (
Table 2), whereas the model for
L. braziliensis was able to select 21 potentially active compounds, with probabilities ranging between 50% and 75% (
Table 3). Compounds 86 and 160 were considered to be most active for
L. major, and compounds 8, 60, 157, and 160 were considered to be the most active for
L. braziliensis.
2.3. Alignment of Protein Sequences
Sequence alignment was used to verify the similarities and identities of the enzymes selected in this study, across different species, which allowed the analysis of conserved regions and the identification of common residues in the active site. In addition, differences and structural similarities could be identified that might contribute to rational drug planning. Therefore, we investigated the shared amino acids in the active sites of various L. major and L. braziliensis enzymes.
The results showed that most of the enzymes shared greater than 80% identity between
L. major and
L. braziliensis, with 84% identity for Glycerol-3-phosphate dehydrogenase (GPDH); 85% for dihydroorotate dehydrogenase (DHODH); 72.13% for the Pteridine reductase 1 (PTR1); 82.69% for Trypanothione reductase (TR); and 85.69% for UDP-glucose pyrophosphorylase (UGPase) (
Figures S1–S5).
All of the amino acids in the active site of GPDH protein were conserved between species. According to Choe et al. [
26], interactions with the residues Trp44, Ile93, Phe101, Phe97, and Met46 of
L. major GPDH (
LmGPDH) were observed for the compound 2-bromo-6-chloro-purine. For the DHODH protein, researchers have described advances and perspectives in structural biology, which were used to identify and validate target sites for the development of drugs that target this enzyme [
27]. The study showed that DHODH is structurally conserved among
Leishmania species, except in
L. braziliensis, which showed changes in the amino acid sequence of the active site. In
L. braziliensis, Met104 is replaced with Ala138, and Cys150 is replaced with Tyr176 (
Figure S2). In the PTR1 enzyme, interactions were observed between the amino acids Arg17, Ser111, Asp181, Leu188, Leu226, Leu229, His241, Tyr283, and Arg287 and the compound (2~{
R})-2-[3,4-bis (oxidanyl) phenyl]-6-oxidanyl-2,3-dihydrochromen-4-one, indicating the location of the active site, according to a study performed by Pisa et al. [
28]. In this study, we noticed the replacement of the Leu229 residue with Phe229 in the
L. braziliensis PTR1. The enzymes TR and UGPase showed that the residues in the active site were highly conserved in both species.
2.4. Homology Modeling
In this study, six models were generated: L. major GPDH (LmGPDH) L. braziliensis GPDH (LbGDPH), L. braziliensis PTR1 (LbPTR1), L. major TR (LmTR), L. braziliensis TR (LbTR), and L. braziliensis UGPase (LbUGPase).
The reliabilities of the models were verified using several tools. One of the primary tools used was the Ramachandran graph, which represents all possible combinations of dihedral angles Ψ (psi) versus φ (phi) for each amino acid of a protein, except for glycine, which has no side chains, and models are considered to be reliable when more than 90% of amino acids are present in the permitted and/or favored regions (colored regions of the graph). Blank regions represent outliers, which have bad contacts. All generated models showed greater than 98% of amino acids in the permitted and favored regions (
Figure 3 and
Table 4). Highly reliable models were likely obtained due to the high similarity between the model sequences and the high resolution of the templates. All models were used for the following methodologies.
Verify 3D analyzes the compatibility of the 3-dimensional (3D) structure with its 1-dimensional (1D) amino acid sequence, based on the characteristics of the chemical environment, such as polarity and compares the results with good structures.
The method determines the environmental characteristics of each residue: (i) The total area of the side chain; ii) the fraction of the side chain area covered by polar atoms or water; and (iii) the local secondary structure. Then, the information is categorized and the structure of the 3D protein is converted into 1D, as a sequence, which represents the environment class of each residue in the structure of the folded protein. Then, the sequence is aligned and compared with sequences of good structures. A reliable model must have a 3D-1D score of more than 80%. All models obtained had scores above 80%, as shown in
Table 5.
The quality of the atomic contacts between the atoms of each residue was analyzed, using the module Coarse Packing Quality Control or Fine Packing Quality Control on the WHAT IF server, which compares the distribution of atom positions around each residue. A mean score of less than –5.0 indicates bad or unusual atomic contacts. All models generated by the homology presented average score values above −5.0, as shown in
Table 6.
2.5. Combined Analysis, Based on Ligand and Structure
The lignans and neolignans that were considered to be active based on the RF models for L. major and L. braziliensis were submitted to docking consensus evaluation, to increase the reliability of the method and decrease the number of false positives.
In total, 10 enzymes were used, four obtained from the PDB database and six based on homology. The docking results were generated using five different scoring functions and were validated by redocking the PDB ligand with the five types of enzymes for each species. More negative values indicated better predictions for most scoring function, except for the Goldscore and ChemPLP, which rank the best poses using the most positive values.
After docking, the results were standardized, and an average of all energy values was calculated for each lignan. Then, the lignans that obtained lower energy values than the PDB ligand in at least three of the scoring functions were used.
Further calculations were performed to obtain the lignans with the best active potential probabilities for each of the analyzed proteins, using the following formula:
where
EMLig is the average energy across all five scoring functions for each analyzed lignan,
EMMLig is the highest average energy obtained by the tested lignans, and
EMInib is the average energy across all five scoring functions for the ligand inhibitor, obtained from the crystallographic data of the test protein. Thus, only those compounds that obtained energy values equal to or greater than the interaction energy of the crystallographic inhibitor ligand were considered to be potentially active.
Among the 11 lignans analyzed for
L. major, all were considered potentially active against GPDH, 10 against DHODH, 9 against PTR1, 9 against TR, and all 11 against UGPase (
Table S3).
Among the 21 lignans analyzed using the consensus docking analysis for
L. braziliensis, 20 were potentially active against GPDH, all against DHODH, 19 against PTR1, 16 against TR, and 15 against UGPase (
Table S4).
Tables S3 and S4 show that the consensus docking analysis returned activity probabilities for the examined lignans that were much higher than that for PDB ligand for various enzymes in the two species.
A second consensus analysis was performed to select multitarget lignans, which demonstrate an active potential probability for more than one protein, in the RF analysis and. For this analysis, the following formula was used:
where
ProbDc is the average active potential probability based on the molecular coupling analysis,
ESP is the specificity value from the RF model, and
PActivity is the active potential probability value from the RF model. The combined probability is conditioned, such that only compounds with values above 0.5 are considered likely to be active. The combined probability values were calculated for the lignans for each enzyme studied. Finally, we analyzed which compounds were multitarget compounds.
Table 7 and
Table 8 show the combined probability values between the forecasting models and the molecular docking analysis.
Among the lignin and neolignans bank analyzed in this study, after the combined ligand- and structure-based analysis, and the identification of multitarget compounds, all of the lignans, except lignan 76, were considered to be potentially active against three or more enzymes (
Table 7 and
Table 8). Therefore, 20 molecules were considered to have potential multitarget activity against
L. major and/or L
. braziliensis, with 10 molecules common to both species. We also observed that the most likely targets, according to the combined analysis based on structure and ligand, are GPDH, PTR1, TR, and UGPase for
L. major. While for
L. braziliensis, the most likely targets are GPDH, PTR1, and UGPase.
Figure 4 shows the common lignans for
L. major and
L. braziliensis that were considered to be potentially active, based on the RF model, selected by the consensus analysis combined with fit values, and identified as multitarget. We also observed that
L. braziliensis showed higher rates of combined probability, indicating that lignans may be potential leishmanicides for this species.
2.5.1. Interaction Analysis
To analyze the interactions, we selected lignan
160 as the compound with the highest leishmanicidal potential because it is considered to be multitarget and obtained activity predictions for Leishmania species (
Figure 5 and
Figure 6).
GPDH
For the L. major GPDH enzyme, the interaction with lignan 160 consisted primarily of large numbers of hydrophobic bonds. Four hydrogen bonds were established, which consisted of bonds with the Ser24, Cys124, Lys126, and Glu301 residues. Four hydrophobic interactions were counted, corresponding with the Thr45, Ile47, Ile94, and Pro95 residues. We observed that although the lignan was located at the enzyme active site, it interacted with other amino acids that are not reported in the literature.
Similar to L. major, the L. braziliensis GDPH formed four hydrogen bonds with lignan 160, at the Ser24, Pro95, Thr96, and Glu301 residues. Two hydrophobic interactions, with Trp45 and Ile47, and a steric Val93 interaction were also observed.
DHODH
In L. major, lignan 160 formed several hydrogen bonds with the DHODH enzyme active site, providing increased stability to the complex. Links with the amino acids Gly101, Asn107, Ser130, Cys131, Ser130, Pro138, and Val140 and hydrophobic interactions with the amino acids Met104 and Cys150 were observed. According to the literature, lignan 160 interacted with two important amino acids responsible for catalytic activity: Met104 and Cys150.
In L. braziliensis, two hydrogen bonds were formed with residues Asn107 and Tyr150, and three hydrophobic interactions were observed with Ser130, Gln139, and Ala146.
PTR1
Hydrogen bonds with the amino acids Ans109, Asn181, and Gly225 were observed between lignan
160 and the active site of
L. major PTR1. A hydrophobic interaction with Phe113 and a steric interaction with Ser111 were also observed. Important interactions have been reported in the literature between the amino acids Arg17, Ser111, Asp181, Leu188, Leu226, Leu229, His241, Tyr283, and Arg287 and the compound (2~{
R})-2-[3,4-bis(oxidanyl) phenyl ] -6-oxidanil-2,3-dihydrochromen-4-one, indicating that this may be the location of the active site, as reported in a study by Pisa et al. [
25]. In the present study, we noticed the substitution of Leu229 residue with Phe229 in
L. braziliensis PTR1, and a hydrophobic interaction was observed at Phe229. In addition, three hydrophobic interactions, with the amino acids Leu188, Phe191, and Pro230, were also observed. According to the literature [
28], the Leu188 residue is conserved in Leishmania species.
TR
The amino acids in the TR active site formed several bonds with lignan 160. Hydrogen bonds were formed with the residues Val55 and Ala363 and hydrophobic interactions were observed with the residues Cys57, Ile199, and Pro336. Interactions with the amino acids Cys57 and Pro336 are considered important for enzymatic inhibition. In contrast, lignan 160 did not obtain good results in L. braziliensis.
UGPase
In L. major, lignan 160 established hydrogen bonds with the amino acids Arg248, His410, and Pro411. In addition, hydrophobic interactions with residues Arg373 and Val413 and steric interactions with the amino acids Asp62 and Val370 were observed. Most of these amino acids, such as Arg248, Arg373, His410, and Pro411, have been reported in the literature as important for protein inhibition.
In L. braziliensis, seven hydrogen bonds were observed between lignan 160 and the UGPase active site, providing increased stability. Links with the amino acids Glu60, Glu62, Ans301, Ser304, Arg374, Ser375, and Leu404 were observed, in addition to a hydrophobic interaction with His411.
2.5.2. Docking Validation
The docking results generated by the five scoring functions were validated by redocking the PDB ligand with the five proteins from Leishmania species.
The root-mean-square deviations (RMSDs) of the obtained fitting poses were calculated in comparison with the crystal structure. RMSD values of less than 2 Å indicate an ideal degree of screening reliability. Information regarding the starting structures and the redocking validation results are shown in
Table 9.
During the redocking analysis, most of the RMSD values were below 2.0 Å, and all five tested scoring functions positioned the ligand correctly at the active site. The Vina program generated only one ideal RMSD value. Thus, studies that use only the Vina program may generate many false-positive results. In addition, RMSD values for the poses obtained by AD4 could not be calculated because the program does not generate outputs for all ligand poses. Overall, the programs provided values that were considered to be satisfactory for the docking consensus validation.
2.5.3. Evaluation of Docking Programs
The performances of the programs used for docking analysis in this study were evaluated, by analyzing how each program ranked the compounds that were discarded from the study for having lower energy values than the PDB ligand in at least three docking programs. The workflow used to perform the step-by-step calculations of program error rates (E
r) is shown in
Figure 7. The calculation of E
r was used to analyze the probability that a given program would classify an inactive molecule as active, verifying its performance.
The results showed that the MVD, Gold, Vina, AD4, and Plants programs presented E
r values of 42%, 27%, 35%, 11%, and 40%, respectively. Therefore, AD4 had the highest hit rate (89%). Although AD4 is more restrictive for the selection of active compounds, the program was able to obtain excellent results when compared against other programs.
Table 10 shows the E
r value
per program/enzyme, revealing that MVD had a higher total E
r, with higher E
r values identified for DHODH and UGPase compared with those for the other programs.
2.6. Prediction of Drug Resistance
One of the great justifications for the development of new drugs against Leishmaniasis is the resistance of some species to commercialized drugs. To minimize the possible effects of resistance to a likely drug candidate, we searched the TritryDB database for single nucleotide polymorphisms (SNP). SNPs are mutations that are frequently present in over 1% of the species and may contribute to the development of drug resistance.
The results showed that only one SNP was identified in L. major PTR1, whereas in L. braziliensis, non-synonymous SNPs were identified in both DHODH and TR. No data were found for UGPase in either species.
Among the 14 SNPs identified, only four presented a polymorphic allele with relevant allele frequency, between 40% and 50%, as shown in
Table 11.
After identifying the primary SNPs that can cause drug resistance, we located the ancestral amino acids in the enzyme structure that are likely to be replaced by SNPs and determined whether these changes were near or in the active site of the target protein. We found that none of these amino acids were located near the active site of the studied proteins, which reduces the likelihood of drug resistance (
Figure 8).
2.7. Molecular Dynamics Simulations
We selected some lignans that were considered to have potential multitarget leishmanicidal activity against the studied species, with excellent ADMET properties, and examined them using molecular dynamics (MD) simulations. MD assesses the flexibility of enzymes and the stability of interactions, depending on the conditions, such as solvent used, ion concentration, pressure, and temperature. Therefore, the interactions between lignans and the crystallographic ligands were used to study the flexibility and conformational changes that affect the complexes during the MD simulation. RMSD was calculated for the Cα atoms in each complexed enzyme and the structures of each ligand, separately.
The RMSD analysis of the complexed GPDH enzyme showed conformations ranging from 0.2 to 0.4 nm in size during 10 ns, with high stability (
Figure 9A), except for GPDH complexed with compound 83, which showed a peak of instability at 5 ns but was quickly stabilized. The same pattern was observed for DHODH (
Figure 9B). PTR1 also showed stability during 10 ns, except for the protein complexed with compound
159 (
Figure 9C). The PTR1-157 complex was considered to be the most stable. TR complexed with lignan 20 showed greater stability than the complex containing the crystallographic ligand (
Figure 9D). All complexes with the UGPase protein showed stability after 2 ns (
Figure 9E). However, it was not possible to perform MD calculations for the crystallographic ligand of DHODH, due to problems with the parameterization of the ligand.
When we analyzed the flexibility of the ligands, we found that compounds
31,
83, and the PDB ligand complexed with GPDH enzyme were more stable (
Figure 10A). The same pattern was also found for complexes with DHODH (
Figure 10B). For the PTR1 enzyme, lignan
157 showed high instability, unlike the other molecules, which remained stable (
Figure 10C). Small peaks of instability were observed for TR compounds (
Figure 10D). Compounds 86 and the PDB ligand showed stability during 10 ns for the UGPase protein (
Figure 10E). In addition, lignan
44 achieved stability from 2 ns and lignan
160 achieved stability from 4 ns for UGPase.
To understand the flexibility of the residues and the amino acids that contribute to the conformational changes in the enzymes, the root-mean-square fluctuation (RMSF) values for each amino acid in each enzyme were calculated. High RMSF values suggest increased flexibility, whereas low RMSF values reflect decreased flexibility. Given that amino acids with fluctuations above 0.3 nm contribute to the flexibility of the protein structure, we found that residues at positions 1, 182, 285–290, 338–340, and 345–349 contribute to conformational changes in GPDH, with only one residue from the active site complexed to compound
83, favoring the alteration (
Figure 11A). Among the more than 300 amino acids present in DHODH, only amino acids 1, 10–25, 241–243, 250, 347, 348, and 351–354 contribute to conformational changes (
Figure 11B). In PRT1, we observed that residues 1–4, 76–79, 120–123, 125–130, 132–134, 168, and 231 favor structural changes (
Figure 11C). In TR, amino acids 74–88, 91, 173, 186, 351, 360, 403, 447, 448, 450, 452, 453, 456, and 458–463 favor structural changes (
Figure 11D). In UGPase, the amino acids 6, 7, 43, 53, 171, 180, 201, 265, 267–274, 342, 343–346, 348, 467, 468, 487, and 488 (
Figure 11E) favor structural changes. We found that none of the amino acids that affect structural conformations identified in DHODH, PTR1, TR, and UGPase are components of the active site.
2.8. Activity Test against L. major and L. braziliensis Axenic Promastigotes
We selected four lignans that achieved excellent results during the virtual screening process and isolated them. The compounds were evaluated for their potential to inhibit the growth of promastigotes forms of
L. major and
L. braziliensis. Only compound
156 was not tested in
L. major because it did not obtain good results in silico. The results showed that lignan (
159), epipinoresinol-4-O-β-D-glucopyranoside, displayed antileishmanial activity against
L. major promastigotes, with an inhibitory concentration (IC
50) of 36.51 µM (
Table 12). When investigated
against L. braziliensis promastigotes, compounds (
156) secoisolariciresinol, (
158) pinoresinol-4-O-β-D-glucopyranoside, (
159) epipinoresinol-4-O-β-D-glucopyranoside, and (
160) pinoresinol-4-O-β-D-apiofuranosyl-(1→2)-β-D-glucopyranoside inhibited growth with IC
50 values of 9.28, 36.35, 5.39, and 13.77 µM, respectively (
Table 12). The results showed that compounds
156,
158, and
159, showed excellent potential as growth inhibitors during the promastigote stage. In addition, we observed that compounds (
158), pinoresinol-4-O-β-D-glucopyranoside and (
159), epipinoresinol-4-O-β-D-glucopyranoside, which are epimers, showed significantly different biological activities, with compound (
159) epipinoresinol-4-O-β-D-glucopyranoside having the greatest potential against
L. braziliensis. This result confirms the docking study consensus and combined analysis, which revealed greater potential activity for molecule
159 than for molecule
158.
3. Discussion
The set of 160 Lignans and Neolignans from different subclasses (14 Furans, 10 Furofurans, 14 Dibenzylbutyrolactols, 22 Dibenzylbutanes, 21 Dibenzocycloocyadienes, 17 Aryltetralins, 3 Arylnaphthalenes, 8 Neolignans alkyl aryl ethers, 16 Neolignans benzofurans, and 9 Neolignans benzodiones) were used in this study. Lignans and Neolignans comprise a class of secondary metabolites, with diverse chemical structures, that are found in more than 70 families of plants and exhibit several significant and potent biological activities, including antioxidant [
21,
22], anti-inflammatory, hepatoprotective, anticancer [
22,
23], antimicrobial [
22], trypanocidal [
24,
25], neuroprotective [
26], and larvicidal [
27] activities.
Initially, the set of 160 Lignans and Neolignans were used by various predictive parameters, which were investigated using ADMET profiles. Thus, compounds with better ADMET profiles were subjected to further virtual screening methodologies.
The RF model was able to select 11 compounds with active potentials ranging from 50% to 57%, for L. major, and 21 potentially active compounds for L. braziliensis, with probabilities ranging between 50% and 75%. We observed the probabilities for biological activity were higher for L. braziliensis; therefore, we suggest that lignans may offer greater therapeutic potential against this species.
Virtual ligand-based screening is a method that is capable of evaluating and/or selecting compounds with desired properties, using chemical structures associated with known biological activity data to develop models, such as QSAR analyses [
29]. QSAR models contribute to planning and drug development by reducing the costs of new molecule development and reducing the number of animals necessary for experimental tests.
Docking is a virtual screening method, based on structure that can identify selective compounds and predict the mechanisms of action. We performed consensus docking for the molecules that were identified as potentially active by the RF models. The consensus docking method allows the elimination of potential false positives and ensures the increased reliability of the procedure. The results showed that all of the lignans identified as potentially active by the RF models achieved excellent results during the consensus docking analyses for different enzymes.
When analyzing the results of consensus docking, we observed that the same compounds were ranked among the first five positions in one docking program, whereas it was listed among the bottom positions in another program. In addition, when standardizing the docking results, we identified a compound that failed in three programs; however, when all the binding energies were averaged, that compound increased in rank. These results were due to a single program assigning this molecule much higher energy values than the other programs. Therefore, we used two criteria for the elimination of compounds during the consensus docking analysis: i) Compounds that failed in at least three docking programs; and ii) compounds that obtained average consensus docking energy values below that of the crystallographic ligand.
Understanding the performance of each docking program is necessary to verify the quality and reliability of the virtual screening process, regardless of whether a consensus approach is applied. Therefore, the use of a consensus strategy and the performance evaluation of the docking programs were essential for our structure-based virtual screening analysis.
A study performed by Chang et al. [
30] compared the virtual screening results obtained for HIV protease inhibitors, using AutoDock 4 and Vina. The authors concluded that both AutoDock 4 and Vina were able to select active compounds (AUC = 0.69 and 0.68, respectively;
p < 0.001) and that Vina was more scalable for the treatment of difficult coupling problems (i.e., larger and more flexible compounds) than AD4. Another study, recently performed by Ren et al. [
31], investigated the performance of several docking programs, to validate a scoring function strategy, and observed that among the evaluated programs, Plants and Vina obtained success rates of 53.6% and 48.5%, respectively. In contrast, the Gold program achieved success rates of up to 81% (within 2.0 Å of the experimental binding mode), according to Verdonk et al. [
32] and was more than 60% successful at predicting the correct poses for 84 evaluated protein-ligand complexes, in a study by Chaput and Mouawad [
33].
The combined analysis, based on both ligand and structure, allowed the selection of several potentially active and multitarget compounds against three or more enzymes, except lignan 76, which obtained good results against only two enzymes. We noticed that the combined probability values were better for the species L. braziliensis, for which most of the lignans showed ProbComb values between 60% and 80% of activity. These results were confirmed by in vitro assays, using L. braziliensis promastigotes, as all tested molecules inhibited parasite growth. In L. major, of the four compounds tested, only one achieved antileishmanial activity. We, therefore, suggest that a probability of activity greater than 60% is necessary to inhibit growth.
An interesting study by Stevanovic et al. [
34] found success in their study using homology detection methods and in silico screening to search for potential inhibitors of a new target, type 2 NADH dehydrogenase of
Leishmania infantum. According to the researchers, the selected compounds that exhibited favorable properties in the computational screening experiments were tested and the leishmanicidal activity was determined in amastigotes and wild-type axenic promastigotes of
L. infantum. The results showed that the identified compound, a substituted 6-methoxy-quinalidine, showed promising activity under the two cellular forms.
Although the isolated compounds have been tested only in amastigote forms, several studies indicate that the enzymes addressed in the study are expressed in amastigote and promastigote forms and there is proven biological activity for these enzymes in both forms. According to Choe et al. [
26], the enzyme GPDH is more expressed especially in the amastigote form, where fatty acids, instead of carbohydrates, are predominantly used as an energy source. According to Steiner et al. [
35], glucose UDP is essential for all organisms and in
Leishmania, several gluconjugates are expressed during the parasite’s life cycle, allowing survival and proliferation in the vector and in the mammalian host. Experimental data for the in vitro inhibition of
LmDHODH indicate that natural products can actually inhibit
LmDHODH against promastigotes and amastigotes [
36]. The same study identified several secondary metabolites that were able to inhibit
LmDHODH in vitro at concentrations of IC
50 27 μM, 30 μM and 31 μM. Compounds based on a structure-guided approach designed to have anti-leishmanial activity through the anti-folate mechanism, targeting
LmPTR1 in vitro was promising for promastigote and amastigote forms with IC
50 values of 4.2 μM and 3.3 μM, respectively [
37]. Da Sila et al. [
38] reported inhibitory activity of the compound ResAn2 for the target TR of
L. braziliensis against promastigotes and amastigotes with IC
50 values of 10.27 μM and 17.54 μM.
We also observed that three of the four isolated lignans, which showed inhibitory activity for
L. braziliensis, belong to the furofurans class. In addition, these lignans have glucose units attached to their structures, which can enhance their therapeutic action. According to Xu et al. [
39], furofurans are lignans originally formed by the enantioselective dimerization of two units of coniferyl alcohol derived from the biosynthetic shiquimate pathway. Furofuran lignans are known to have a diversity of structures due to bonding patterns, different substituents, and different configurations [
39]. This diversity contributes to a variety of biological activities, including anti-cancer [
40], antioxidants, anti-inflammatory, cytotoxic, antimicrobial [
39], and antiestrogenic [
41]. Therefore, this research brings new information about antileishmanial activity for this class of lignans.
5. Conclusions
Leishmaniasis is endemic in more than 90 countries, affecting low-income populations. Leishmaniasis is estimated to affect 2 million people annually, worldwide, and more than 20,000 deaths per year are due to complications from the disease. L. major and L. braziliensis are responsible for CL, which represents more than 90% of cases in several countries. Because current treatments can result in drug-resistance and are often associated with side effects, due to high toxicity index values, lignans have been investigated as a treatment alternative, particularly because many lignans have great ADMET profiles.
We used simple programs and rules to calculate the absorption, bioavailability, pharmacokinetics, and to select lignans with good properties. We were able to screen 33 promising lignans from a set of 160 compounds, which were subjected to several computational and experimental approaches to investigate their leishmanicidal potentials.
The generated predictive models obtained excellent performance results, with an accuracy greater than 81%, and selected 11 lignans with active potential probabilities ranging between 50% and 57%, for L. major. For L. braziliensis, an accuracy greater than 79% was achieved, and the model selected 21 lignans, with activity probabilities between 50% and 75%.
To investigate the mechanism of action for lignans and to evaluate their selectivity for five enzymes in Leishmania, a consensus docking analysis was performed, to guarantee the reliability of the RF model and to reduce the number of false positives. Eleven lignans were found to be potentially active against LmGPDH, 10 against LmDHODH, 9 against LmPTR1, 9 against LmTR, and 11 against LmUGPase. Among the 21 lignans analyzed in L. braziliensis, 20 were potentially active against LbGPDH, 21 against LbDHODH, 19 against LbPTR1, 16 against LbTR, and 15 against LbUGPase.
A combined analysis, based on both ligand and structure, was able to identify potentially active molecules, using both RF and multitarget models, resulting in the identification of 23 potentially active, multitarget molecules against L. major and/or L. braziliensis, with 10 compounds common to both species.
Due to concerns regarding the development of drug resistance during the treatment of Leishmaniasis, the present study investigated the presence of SNPs, which may contribute to the development of drug resistance. Among the 14 SNPs identified, only four presented polymorphic alleles with relevant frequencies, between 40% and 50%. We also found that none of the amino acids affected by the SNPs were located near the active sites of studied proteins, which reduces the likelihood of developing drug resistance.
MD simulations revealed that most of the studied enzymes are stable under various conditions, including various solvents, ions, temperatures, and pressure, with only small variations observed for some complexed compounds. Therefore, the binding affinity between proteins and ligands is unlikely to be affected by environmental changes. In addition, none of the amino acids responsible for the enzymatic conformational changes were in the active site, except for those in the DHODH-83 complex, which allows the active site to remain stable.
Four lignans with excellent ADMET profiles, which are considered to be potentially active and multitarget inhibitors for the studied enzymes, were isolated from Justicia aequilabris (Nees) Lindau [
65] and subjected to in vitro tests. Lignans were collected in the city of Puxinanã-PB, Brazil. The results showed that only lignan (
159) epipinoresinol-4-O-β-
d-glucopyranoside achieved antileishmanial activity against promastigotes forms of L. major, with an IC
50 value of 36.51 µM. In L. braziliensis, compounds (
156) secoisolariciresinol, (
158) pinoresinol-4-O-β-
d-glucopyranoside, (
159) epipinoresinol-4-O-β-
d-glucopyranoside, and (
160) pinoresinol-4-O-β-
d-apiofuranosyl-(1→2)-β-
d-glucopyranoside inhibited growth, with IC
50 values of 9.28, 36.35, 5.39, and 13.77, respectively. Compounds (
156) secoisolariciresinol, (
159) epipinoresinol-4-O-β-
d-glucopyranoside, and (
160) pinoresinol-4-O-β-
d-apiofuranosyl-(1→2)-β-
d-glucopyranoside showed excellent potential as growth inhibitors for the promastigote stage of the parasite. When compared with the values obtained from the biological activity prediction, using the RF models, we noticed that the probability of activity for L. major varied 50%–60%, whereas those values for L. braziliensis ranged 50%–75%. We suggest that it may be preferable to test compounds with probabilities of activity above 60% to obtain good results and that the computational approach can be used to guide experimental research.