Next Article in Journal
Activating Wnt/β-Catenin Signaling in Osteocytes Promotes Osteogenic Differentiation of BMSCs through BMP-7
Next Article in Special Issue
Past COVID-19: The Impact on IVF Outcomes Based on Follicular Fluid Lipid Profile
Previous Article in Journal
Substrate Recognition Properties from an Intermediate Structural State of the UreA Transporter
Previous Article in Special Issue
Protein Interactome Profiling of Stable Molecular Complexes in Biomaterial Lysate
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Hepatoprotective Activity of Lignin-Derived Polyphenols Dereplicated Using High-Resolution Mass Spectrometry, In Vivo Experiments, and Deep Learning

1
Skolkovo Institute of Science and Technology, 30 Bolshoy Boulevard bld. 1, 121205 Moscow, Russia
2
Moscow Institute of Physics and Technology, 141701 Moscow, Russia
3
N.N. Petrov National Medical Research Center of Oncology, 68 Leningradskaya Str., 197758 Saint Petersburg, Russia
4
Emanuel Institute for Biochemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(24), 16025; https://doi.org/10.3390/ijms232416025
Submission received: 11 November 2022 / Revised: 9 December 2022 / Accepted: 13 December 2022 / Published: 16 December 2022
(This article belongs to the Special Issue Molecular Research Using Omics Technologies for Human Health)

Abstract

:
Chronic liver diseases affect more than 1 billion people worldwide and represent one of the main public health issues. Nonalcoholic fatty liver disease (NAFLD) accounts for the majority of mortal cases, while there is no currently approved therapeutics for its treatment. One of the prospective approaches to NAFLD therapy is to use a mixture of natural compounds. They showed effectiveness in alleviating NAFLD-related conditions including steatosis, fibrosis, etc. However, understanding the mechanism of action of such mixtures is important for their rational application. In this work, we propose a new dereplication workflow for deciphering the mechanism of action of the lignin-derived natural compound mixture. The workflow combines the analysis of molecular components with high-resolution mass spectrometry, selective chemical tagging and deuterium labeling, liver tissue penetration examination, assessment of biological activity in vitro, and computational chemistry tools used to generate putative structural candidates. Molecular docking was used to propose the potential mechanism of action of these structures, which was assessed by a proteomic experiment.

1. Introduction

According to a recent analysis [1], acute and chronic liver diseases remain a significant public health threat in Europe with more than 1 billion affected people, with increasing incidence and prevalence of these diseases in industrialized countries [1,2,3,4]. Among the most common liver diseases worldwide is the non-alcoholic fatty liver disease (NAFLD). NAFLD is the leading cause of liver-related morbidity and mortality, [3] with more than 800 million people suffering from this disease. Therefore, an urgent task is the search and development of new hepatoprotective drugs targeting NAFLD with various mechanisms of action [5,6].
Natural and synthetic complex mixtures of low-molecular-weight polyphenolic compounds, including products of lignin oxidation, possess a wide range of biological activity and low toxicity, which makes them promising candidates for therapeutic applications [7,8]. It has been shown that polyphenols can act as traps for free radicals [8,9] and therefore prevent oxidative stress [10]. In addition to that, they possess an anti-inflammatory effect, which is manifested in the suppression of the synthesis of cytokines and interleukins of the pro-inflammatory spectrum [11]. Depending on the origin, natural mixtures can exhibit hepatoprotective properties, including the aforementioned prevention of oxidative stress (e.g., silibinin, catergen), stimulation of liver parenchyma regeneration [12], and alleviation of NADFL biochemical symptoms [13,14]. However, most of the papers published on the biological activity of complex mixtures are related to recording the biological response in experiments in vitro and in vivo, while their molecular mechanism of action remains unknown. The main reason is the high complexity of the analysis of multi-component systems at the molecular level and the lack of feasible opportunities to identify components responsible for their action.
The molecular complexity of nature-derived mixtures is usually analyzed in the framework of the dereplication concept [15]. The dereplication procedure is aimed at the quick identification of known chemotypes in the mixture by analyzing the spectra data and searching for components in biological activity and spectra databases. Dereplication strategies are based on the combination of advanced analytical methods including HPLC, MS, NMR, biological tests, etc., complemented with data mining techniques [15,16] Structural information is used to narrow a list of possible candidates from open-source libraries such as ChEMBL [17] and CoCoNut [18]. Yet, the combinatorial nature of highly degraded polyphenols prevents them from obtaining sufficiently narrow fractions [19]. Additionally, the application of tandem mass spectrometry within the dereplication strategy is almost impossible due to its time-consuming nature, even with the most powerful Fourier transform ion cyclotron resonance mass spectrometry (FTICR MS) techniques [20]. Instead, the integral structural parameters are investigated in order to build up a chemical foundation for biological properties [21,22]. In addition, the correlation of the abundance of molecular components with the biological response in a set of similar samples or during bulk fractionation can be used to prioritize components in a mixture [23,24]. Ultimately, this approach mostly misses structural information, preventing it from discovering the mechanism of action.
Alternatively, the administration of labeled mixtures facilitates the detection of components in tissues. For example, in previous reports, the application of tritium labeling allowed authors to quantitatively analyze tissue distribution and pharmacokinetic parameters for synthetic polyphenols administrated to mice [25]. The drawback of the approach is the total loss of qualitative information. In this study, we aimed to adapt a labeling strategy in order to detect individual components of the complex oxidized lignin derivative BP-Cx-1 [26] in mice liver tissue by FTICR MS as well as to explore their structural features by application of selective chemical tagging. Molecular and structural information obtained with these methods was used to apply deep learning to generate candidate structures for the detected components, suggest a molecular mechanism of hepatoprotective activity, and, finally, extract a more active fraction enriched with these components. Based on the data obtained in silico, an in vivo experiment on the evaluation of the hepatoprotective activity of BP-Cx-1 and its fraction was carried out in a model of NAFLD in mice with induced type 2 diabetes.

2. Results

2.1. Overview of the Study Pipeline

To assess which chemotypes can potentially lead to bioactivity, we applied the following computational methodology (Figure 1). In the first step, the molecular composition of a complex polyphenolic mixture was characterized using FTICR MS, and its bioactivity and the potential active components were elucidated by the extraction of D-labeled material from mice livers followed by MS analysis. Secondly, bulk fractionation of the parent material was conducted to obtain a fraction enriched with the detected components. Its hepatoprotective activity was compared to the parent material. Then, to indicate a potential mechanism of action, a combination of selective chemical labeling, FTICR MS, proteomics, and computational chemistry methods was applied. The Kelch-like ECH-associated protein 1 (KEAP1)–nuclear factor erythroid 2–related factor 2 (Nrf2) pathway, which plays an important role in NADFL, was chosen as a primary target for analyzing the chemical space of potential components. The ability of the active fraction to interact with KEAP1 was evaluated by LC-MS experiments, and the putative structural candidates were suggested using deep learning and molecular modeling.

2.2. Determination of BP-Cx-1 Components in Mice Liver

BP-Cx-1 was analyzed by FTICR MS before and after it was labeled. The labeling method was effective—most of the components were multiply labeled (Table S1, Figure S1). Despite NaOD provided for the carbon-skeleton labeling, back-exchange during metabolism might occur and deep labeling assured detection of components of interest after administration, tissue sampling, and extraction described in Methods. The set of extracts included control samples, samples after the administration of parent BP-Cx-1, and samples obtained after D-labeled BP-Cx-1 administration. As it is described in Methods, only the latter deuterium-contained samples were used for component detection. FTICR mass spectra were highly populated (Figure S2) and visually highly resembled, which indicated low intensities of BP-Cx-1-derived components. Still, applying the developed spectra-treating algorithm, 448 and 312 D-labeled were found in the liver extract after parenteral and oral administration, respectively, comprising 644 unique records. It should be noted that a number of these molecules were missing in the mass spectra of D-labeled BP-Cx-1. This might be explained by the reproducibility issue of direct-ESI measurements, in which charge suppression, combating ionization, and even the lifetimes of ion clouds depend on the mixture composition. After additional filtration of the data, the list of 208 molecular components of BP-Cx-1 was composed presenting formulae observed either after parenteral or oral administration. For structural characterization of BP-Cx-1 components detected in the liver, selective deuteromethylation, reducing, and mild bromination reactions were conducted giving reliable information on the number of carboxylic and carbonyl groups, as well as aromatic rings. Upon aggregating 133 out of 208 formulae, all three descriptors were obtained (Table S2). The distribution of these moieties over the van Krevelen diagram (Figure S3) shows that most of the BP-Cx-1 component contained at least one carbonyl group and most of the compounds contained two carboxylic groups, and they were mostly aromatic.

2.3. Fractionation of BP-Cx-1 for Enrichment of Targeted Component

We postulated that components found in the liver should be responsible for the hepatoprotective properties of BP-Cx-1. Therefore, it was promising to isolate the fraction enriched with these components. To do so, we performed solvent extraction in the Soxhlet apparatus with a gradual increase in binary solvent polarity: From 100% CH3OH to 25% CH3OH-75% H2O. All fractions were analyzed by FTICR MS. The 100% CH3OH and 25% CH3OH fractions were composed of the maximum and minimum numbers of formulae: 5927 and 3676, respectively. Number-averaged values obtained from mass-spectra were similar; all fractions were lacking oxidized compounds—(O/C)n values were below 0.3. The advantage of FTICR MS is that we can examine groups of compounds, which contribute to the number-average values. This is important since the presence of impurities (e.g., lipids) may affect the average values significantly. Correspondingly, we grouped compounds according to their atomic ratios and constrained aromaticity index excluding lipids, which are unlikely indigenous to the polyphenolic material (Tables S3 and S4). Here, it is clearly seen that, in fact, the gradient in the molecular composition is pronounced. It is obvious that the gradient extraction resulted in the shift from mostly unsaturated compounds to aromatic and, consequently, to condensed compounds dominating 50% and 25% CH3OH fractions. Schematically, this trend is designated on the van Krevelen diagram (molecular profiles) (Figure 2A). It is worth noting that for natural samples, such as humic substances, the shift in elemental composition during fractionation is usually explicit. This indicated high structural diversity of molecular constituents of BP-Cx-1 on the relatively narrow range of elemental compositions (in terms of O/C and H/C ratios), which corroborates well with the distribution of structural descriptors on van Krevelen diagrams (Figure S3). All fractions were examined as present and contributed to the total intensity of 208 molecular components, which were found in the liver in the previous step. The methanol fraction was the most enriched of those components (Figure 2B). At the next stage, the biological activity of parent BP-Cx-1 and the presumably most active fraction were examined.

2.4. Assessment of Biological Activity of Parent and Fractionated BP-Cx-1

General information on body weight, food and liquid consumption, and blood analysis can be found in Tables S5–S7. During the first two weeks, the weight of animals from groups with induced diabetes—negative control (group 2), under parental BP-Cx-1 treatment (group 3), and under BP-Cx-1 fraction treatment (group 4)—was comparable to that of the intact control (group 1). After the administration of streptozotocin (week 3), a decrease in body weight was observed for experimental groups 2–4 with no influence of the treatment. It was accompanied by a decrease in food consumption and an increase in water consumption. By the end of the experiment, the increase in hematocrit was observed for group 2. The treatment with both BP-Cx-1 and its fraction normalized the hematocrit level (Table S7). Unlike a routine blood test, blood chemistry analysis showed a significant deviation of many parameters (Table 1). Compared to group 1, all experimental groups showed an increase in the total protein level, mostly due to the increase in globulin content. The total protein level increased up to 61 g/L after the administration of streptozotocin, which may indicate inflammation. The treatment with an isolated fraction (group 4) almost normalized the total protein (~47 g/L) and significantly decreased the globulins content (down to ~24 g/L). All experimental groups showed a significant increase in liver enzymes indicating damage to liver functioning. The administration of BP-Cx-1 and its active fraction did not result in a significant decrease in AST and ALT levels. A similar trend was observed for glucose. The positive effect of the treatment was observed for pigment metabolism. Liver steatosis in group 2 was accompanied by increased bilirubin levels—up to ~18 µmol/L, in which 93% was accounted for by the conjugated bilirubin. The administration of fractionated BP-Cx-1 (group 4) decreased the total bilirubin level to ~15 µmol/L and conjugated bilirubin to ~13 µmol/L. Bilirubin is currently considered an endogenous antioxidant cytoprotectant [27] so its level is likely regulated by the administration of the exogenous polyphenol agent. This corroborates well with Group 3 (parent BP-Cx-1) with bilirubin levels comparable to group 2. Lipid exchange was also affected by inflammation. The cholesterol level increased in group 2 up to 3.5 µmol/L. Similar to bilirubin, only the isolated fraction of BP-Cx-1 (group 4) decreased the cholesterol level to 2.83 µmol/L. Despite the significant deviation of experimental groups from group 1, the positive therapeutic effect of the isolated fraction was pronounced. This supports the assumption of a leading role of liver-isolated components in the overall hepatoprotective effect of BP-Cx-1. In the next step, their molecular mechanism of action is studied.

2.5. Examination of BP-Cx-1 Interaction with KEAP1

The commercial KEAP1 protein was investigated using the conventional bottom-up proteomic approach and its sequence coverage was substantial (~40%) with post-translation modifications (PTM) resolved. In the discovered sequence, the electrophilic inhibitor could modify eleven cysteine residues, including Cys273, which, according to the literature, plays a major role in the KEAP1–Nrf2 interaction during oxidative stress. The challenge in the analysis of the BP-Cx-1 interaction with KEAP1 lies in the ambiguity of the mechanism and the unknown structure of any of the 208 liver-detected components. Consequently, the protein–ligand interaction was examined by the inhibition of KEAP1 alkylation and the comparison of relative intensities of the free-cysteine-containing peptides with and without incubation with BP-Cx-1. The results are presented in Figure 3.
It can be seen that after incubation, the relative intensities of cysteine-alkylated peptides varied from the control sample. The relative intensities significantly decreased for Cys226, Cys241, Cys249, and Cys273 as compared to other fragments. Changes in the relative intensities of alkylated peptides are the result of KEAP1’s interaction with BP-Cx-1. This interaction was not selective but there was clear targeting. It is likely that the environment of the interactive cysteine targeted BP-Cx-1 in these sites. It is particularly important that the Cys273 was one of the reactive sites and was a signal residue. Upon obtaining information about the biological activity of BP-Cx-1 both in vitro and in vivo, its possible molecular mechanism was further investigated using computational tools.

2.6. Generation of Structural Candidates for BP-Cx-1 Active Components Using Deep Learning

In this work, the deep learning method [28] based on the recurrent neural network architecture (RNN) was used for the generation of natural-like compounds structurally similar to compounds active against KEAP according to ChEMBL data (see Section 4). In total, 1.8 M unique valid SMILES strings were generated. The generated compounds comprise a wide range of various natural product-like scaffolds similar to those present in CoCoNut (Figure 4a) and the compounds interfering with the Nrf2 pathway (Figure 4b, Table S8). The distribution of occurrence of physico-chemical descriptors (Figure S4) among the generated compounds and compounds from CoCoNut and Nrf2 subsets is similar. Matching of the generated compounds to formulae obtained with FTICR MS and structural features of compounds determined the H/D exchange significantly reduced the number of putative structural candidates (1775 compounds). The compounds were then used to investigate their ability to interact with KEAP1 using molecular docking.

2.7. Molecular Docking of Structural Candidates to KEAP-1 Structure

While the KEAP1–Nrf2 interaction is widely acknowledged as an important target for the design of new therapeutics [29] for various chronic diseases, the structural biology of KEAP1 is not completely known. KEAP1 is a 624-amino-acid protein, which consists of five domains (Figure 5a). The X-ray structures in the apo form and with ligands are available only for the Kelch-like domain (e.g., PDB ID 6LRZ [30]) and BTB domains (e.g., PDB ID 4CXI [31]).
With the recent advances in predicting the 3D protein structures using deep learning [32] and the deposition of the large pool of predicted structures in the AlphaFold Protein Structure Database [33], a new avenue for structural biology studies has been paved. We retrieved the structure of the full-length (624 amino acids) KEAP1 predicted by AlphaFold (Figure 5a) and used it to investigate the ability of the generated compounds to interact with the protein. Unsurprisingly, Kelch-like and BTB domains of the model were structurally similar to the aforementioned available X-ray data (RMSD < 1 A for Cα atoms) differing mostly in loops’ conformation. Considering that the cysteine residues in human KEAP1 including Cys 151, 273, and 288 are highly reactive and play an essential role in the repression of Nrf2 activation, we addressed the question of whether the molecular components of the sample can prevent the binding of the electrophiles to these residues [34]. Proteomics experiments (see above) did not reveal signatures of covalent binding but rather allowed us to identify cysteines affected by the presence of the sample. Therefore, we hypothesized that putative compounds could bind to the protein in close proximity to these residues. In particular, we focused our attention on Cys273, since this residue is buried in the potentially “druggable” cavity between helixes formed by 298–311 and 262–275 residues. The binding of the compounds to this site can barricade access to Cys273 and thus prevent the alkylation of the cysteine moiety. Molecular docking simulations revealed that the generated compounds can indeed bind to this cavity (Figure 5b and Figure S5), anchoring to the site via the π–π stacking interaction with His274. The presence of positively charged amino acid residues framing the entrance to the site is in line with the mostly nucleophilic nature of the molecular components of the sample: Many components contain at least one −OH or −COOH group (−COO− at the physiological pH).
Figure 5. (a) A model of full-length KEAP1 predicted by AlphaFold2. A BTB domain (amino acids 61–179) is shown in cyan, an intervening region (IVR, amino acids 180–314) is shown in orange, and a Kelch domain (Kelch) is shown in red. N-terminal (NTR, amino acids 1–60) and C-terminal regions (CTR, amino acids 599–624) are shown in grey. Cysteine residues, which were investigated in the proteomics experiments, are shown in magenta (non-alkylated ones) and lime (alkylated). Domains are designated as in ref. [35]. (b) Docking pose of the generated compound (3,5,7-trihydroxy-2-[(4-hydroxy-3-(3-methylbut-2-en-1-yl)phenyl)methyl]-2,3-dihydro-4H-1-benzopyran-4-one), shown as ball-and-stick model) in the region close to Cys273. Positively charged amino acids (Lys312, His274, Arg269) forming the entrance to the cavity are shown. Visualization is made in VMD 1.9.3. [36].
Figure 5. (a) A model of full-length KEAP1 predicted by AlphaFold2. A BTB domain (amino acids 61–179) is shown in cyan, an intervening region (IVR, amino acids 180–314) is shown in orange, and a Kelch domain (Kelch) is shown in red. N-terminal (NTR, amino acids 1–60) and C-terminal regions (CTR, amino acids 599–624) are shown in grey. Cysteine residues, which were investigated in the proteomics experiments, are shown in magenta (non-alkylated ones) and lime (alkylated). Domains are designated as in ref. [35]. (b) Docking pose of the generated compound (3,5,7-trihydroxy-2-[(4-hydroxy-3-(3-methylbut-2-en-1-yl)phenyl)methyl]-2,3-dihydro-4H-1-benzopyran-4-one), shown as ball-and-stick model) in the region close to Cys273. Positively charged amino acids (Lys312, His274, Arg269) forming the entrance to the cavity are shown. Visualization is made in VMD 1.9.3. [36].
Ijms 23 16025 g005

3. Discussion

Numerous biological pathways are associated with NAFLD [37,38], and therefore, various molecular targets can be exploited for the design of NAFLD-targeting therapeutics [5]. While a thorough analysis of a sample’s mechanism of action assumes the analysis of its interaction with a panel of known molecular targets associated with NAFLD, such an analysis was outside the scope of this paper. Alternatively, in this paper, we suggest an example of a dereplication pipeline combining computational and experimental methods to decipher putative chemotypes behind the activity of the complex mixture under study. The research design presented in Figure 1 is advantageous in the case of a multicomponent mixture. In fact, many structures present in the lignin derivative may interact with KEAP1 or other NAFLD-related proteins. However, for future development, it is important to determine which component may penetrate the liver tissue in detectable amounts.
This was achieved in our investigation by the administration of D-labeled material. Components of BP-Cx-1 are characterized by structural ambiguity; consequently, it is a non-trivial task to detect them in tissue. The application of the H/D exchange facilitated the determination of BP-Cx-1 in the extract due to the low abundance of heavy isotopes in the organism. Thus, labeled (exogenous) and endogenous molecules were easily distinguished by mass spectrometry. We suggest that detected components are responsible for hepatoprotective activity. In support of this, a fraction enriched with these components demonstrated a higher protective effect compared to the parent material.
The major focus of the research was chemoinformatics approaches for structure elucidation in order to assess the molecular mechanism. While the mining of candidate structures in large repositories of chemical data can be an effective approach to dereplication [23,26], the number of compounds with biological activity assessed experimentally is limited, which leads to small coverage of potential chemotypes for various formulae. This problem was addressed by using methods of de novo structure generation [39]. These approaches allow the rapid generation of millions of compounds bearing chemical features and/or possessing (predicted) required bioactivity profiles. The generated structures did not exhaustively cover the chemical space of relevant structural isomers and were considered “probes” of chemotypes that are natural-like and similar to the compounds interfering with the Nrf2 pathway. Yet, 1.8 M structures were generated. In order to obtain a feasible list of candidates, we narrowed the structure space by the results of selective chemical tagging. This yielded 1775 structures, which were possible to handle without excessive calculation power. Indeed, molecular docking enabled us to suggest a binding site for these structures, which was supported by a proteomic experiment with KEAP1. Moreover, structural information on the BP-Cx-1 components suggests a blocking mechanism for Cys273 residue resulting from the π–π stacking interaction with adjacent His274. However, it should be emphasized that numerous sites on the surface of the KEAP1 model can be determined (Figure S6), and it is known that compounds can bind to several sites at once [30]. Therefore, further experimental investigation of mixture components’ binding modes via X-ray crystallography is required.
Overall, this pilot study provides a new strategy to study the mechanism of action of natural multicomponent mixtures. This requires a combination of chemical, chemoinformatic, and biological experiments. In the particular case of BP-Cx-1, the results are promising from a practical point of view. In fact, synthetic lignin derivatives have batch-to-batch reproducibility [40]. With the proven mechanism, it may be introduced as a bivalent drug: As a dietary fiber, which helps in the elimination of residues in the intestinal tract and improves the intestinal environment [41,42], and as a hepatoprotective agent, which prevents or reduces liver damage.

4. Materials and Methods

Solvents and other reagents used in this study were commercially available. Methanol of HPLC grade (Chimmed, Moscow, Russia) was used for the elution and dissolution of samples. High-purity distilled water (18.2 MΩ) was prepared using a Millipore Simplicity 185 system (Laverna XX1, Moscow, Russia). Bond Elut PPL (Priority PolLutant, Agilent Technologies, Santa Clara, CA, USA) cartridges (50 mg, 1 mL) were used for the isolation and purification of the parent and labeled samples. Raw mass spectrometric data for deuteromethylation and reducing reactions were obtained from a previous study [43]. H/D exchange of the parent BP-Cx-1 was performed according to the previously described procedure [44]. For the bromination of BP-Cx-1, 5-fold excess of NBS (Chimmed, Moscow, Russia) was added to the AcN (Chimmed, Moscow, Russia) solution and stirred overnight followed by PPL extraction.

4.1. Enumeration of Functional Groups and H/D Exchanges

Detailed information on FTICR MS measurements can be found elsewhere [45]. For the enumeration of COOH− and C=O groups, as well as the presence of an aromatic ring, a Python-based script was developed, which enabled fully automatic assignment [46]. In brief, the algorithm facilitates the extraction and enumeration of peak series with mass differences corresponding to the labeling reaction and filtration of the obtained results based on the heuristic chemistry-based rules depending on the reaction. Mass differences were 17.03448, 3.02193, 77.91051, and 1.00628 Da for singly charged ions corresponding to deuteromethylation, reducing, bromination, and H/D exchange. In all cases, the mass error (between peaks of a labeling series) was below 0.0003 m/z, which was optimized based on the performance of the FTICR MS instrument.

4.2. Fractionation of BP-Cx-1

Fractionation was performed by using different combinations of binary solvent mixtures from 100% CH3OH to 25% CH3OH-75% H2O in the Soxhlet extractor. For this, 50 g of the solid BP-Cx-1 batch was used. All fractions were analyzed by FTICR MS without further purification. The fraction of interest (100% CH3OH) was dried, yielding 5 g (105) of the solid material.

4.3. Animal Welfare

Female BALB/c mice (animal facility Stolbovaya, Moscow District, Russia) were quarantined for 14 days upon delivery. The animals were kept in T2 type IVC cages under artificial 12 h light/dark cycle conditions, 21 ± 2 °C, average humidity of 20–50%, and ad libitum access to laboratory chow (Laboratorkorm LLC, Moscow, Russia) and tap water. Experimental animals were handled under the Guide for the Care and Use of Laboratory Animals, 8th edition. The study protocol was approved by the Local Ethics Committee of the N.N. Petrov National Medical Research Center of Oncology (protocol no. 3/215; dated 22 September 2020). Experimental animals at the end of observation were euthanized by decapitation under isoflurane anesthesia.

4.4. Induction of NAFLD Accompanied by Type 2 Diabetes

NAFLD accompanied by type 2 diabetes was induced as follows [47,48]: Throughout the experiment (7 weeks), BALB/c female mice were put under a high-carbohydrate diet (5% aqueous fructose solution ad libitum) and a high-fat diet (standard chow + lard). Pancreatic lesions were induced via intraperitoneal streptozotocin (CarboSynth, San Diego, CA, USA) at a dose of 80 mg/kg (week 3). Seven days after this, blood glucose levels were measured in the animals. Animals were included in experimental groups provided their glucose concentrations were above 10 mmol/L (180 mg/dL)).
After randomization by glucose level, four animal groups of 8 mice each were formed: The intact control (group1), mice with induced type 2 diabetes and pancreatic lesions without treatment—negative control (group2), and mice under treatment of the parent BP-Cx-1 (group 3) and its methanol fraction (group 4). Both parent BP-Cx-1 and its fraction were administrated at a dose of 75 mg/kg body weight daily by gavage. Blood and serum samples were collected after euthanasia at the end of the experiment.

4.5. Extraction of BP-Cx-1 Components from Mice Liver Tissue

For the pharmacokinetic experiment, 29 Balb/c female mice were divided into 5 groups: Control (3), intravenous administration of parent and labeled BP-Cx-1 (3 and 10, respectively), and gavage administration (3 and 10, respectively). Intravenous and gavage doses were 20 µg/kg and 100 µg/kg, respectively. Intravenous administration was carried out using a plastic restrainer without anesthesia. All animals underwent euthanasia after 2 h following the maximum concentration of the material in tissue obtained previously by tritium labeling [25]. The organs were collected, and the liver was thoroughly cleaned of surrounding tissues and washed of blood in a cold solution of 0.9% sodium chloride (400 mL, the solution was changed after every 3 animals at one time point). Then, fluid residues were removed using filter paper, and livers were frozen for further extraction.
Before extraction, liver tissues were ground in a mortar in liquid nitrogen pooled according to the mice grouping and freeze-dried. First, lipids were removed by three-fold washing with a chloroform-methanol (2:1) solution. The solid residue was dried again and extracted via water with continuous shaking (3 h) followed by chloroform extraction to remove polar metabolites. The water residue was extracted by solid-phase extraction on PPL resin in two steps. First, the supernatant was passed at a native pH through the first cartridge to remove metabolites. The collected permeate was acidified to pH 2 and passed through the second cartridge targeting BP-Cx-1 components. Components were further eluted with methanol and kept in the freezer for FTICR MS analysis.

4.6. BP-Cx-1-KEAP Interaction

For the examination of the interaction between the BP-Cxp1 active fraction and KEAP-1, the conventional proteomic bottom-up approach was applied with and without overnight incubation of the protein with 0.6 mg/L of polyphenol material in Tris buffer (50 mM, pH 8). To both samples, dithiothreitol (DTT) was added as a reducing agent, and the solution was incubated for 1 h at 37 °C for the reduction of disulfide bonds in the KEAP-1 protein. Further, acetylation with iodoacetamide was conducted. Before trypsin digestion, 0.01% detergent was added to facilitate better protein coverage. Digestion was quenched by the addition of 1% FA. The tryptic peptides were analyzed in triplicate on a nano-HPLC Dionex Ultimate 3000 system (Thermo Fisher Scientific, Madison, WI, USA) coupled to a timsTOF Pro (Bruker Daltonics, Billerica, MA, USA) mass spectrometer. The sample volume was 2 µL per injection. HPLC separation was carried out using a packed emitter column (C18, 25 cm × 75 µm 1.6 µm) (Ion Optics, Parkville, Australia) by gradient elution. Mobile phase A was 0.1% formic acid in water; mobile phase B was 0.1% formic acid in acetonitrile. LC separation was achieved at a flow of 400 nL/min using a 40 min gradient from 4% to 90% of phase B. The obtained data were analyzed using PEAKS XPro software (BSI, North Waterloo, ON, Canada) using the following parameters: Parent mass error tolerance of 20 ppm and fragment mass error tolerance of 0.03 Da. Only the peptides with both trypsin-specific ends were considered for identification purposes. The alkylation of cysteine residues was set as a possible variable modification.

4.7. ChEMBL Bioactivity Data Mining

The MySQL version of ChEMBL (v. 29) [49,50] was retrieved and installed on a local MySQL server. The Python library “mysql-connector-python” (8.0.18) [51] was used to access data from the MySQL server. Data retrieval, standardization, and preprocessing were performed using pandas [52,53] and RDKit (v. 2019.03.3) [54] libraries in the Jupyter notebook interactive environment.
Assays with various types of activity data against Nrf2 were retrieved using a list of key substrings (“%nrf2%”, “%nfe212%”, “%nuclear%factor%erythroid%2%related%factor%2%”) from target_dictionary.pref_name, assays.description, assay.organism, and target_dictionary.organism fields. Retrieved assay entries were manually checked. Only compounds consisting of carbon, hydrogen, and oxygen were kept. Activity entries were standardized, and 1/0 (“active/inactive”) labels were assigned to each entry (see GitHub repository). Structures were standardized following the protocol from https://github.com/PharminfoVienna/Chemical-Structure-Standardisation (accessed on 8 Septmber 2021) with slight modifications. Stereochemistry was depleted from SMILES entries. The activity entries related to the same standardized SMILES were grouped, and the mean values were taken. Compounds with average activity values no less than 0.5 were designated as “active”, and related SMILES notations were retrieved. Finally, the dataset composed of 1942 smiles entries was used for tuning (see below). The complete protocols used for data retrieval and standardization are available in the Jupyter notebook in the GitHub repository (see below).

4.8. Generation of Structural Candidates Using Deep Learning

4.8.1. Generation of Compounds

The recurrent neural network with the architecture described by Segler, M.H.S. et al. [28] was used to generate putative structural candidates. The hyperparameters of the original network were used. At first, a model was trained on an entire CoCoNut dataset for 40 epochs. The model was then retrained on the dataset comprising compounds active against Nrf2 for 10 epochs. This fine-tuned model was used for sampling.

4.8.2. Filtering of Generated Compounds Using Labeling Data

Generated SMILES were checked for validity using RDKit, and duplicates were removed. Corresponding compounds were filtered according to the data from isotopic labeling experiments: The compound should have contained at least one ionizable group (-OH, -COOH) and its functional groups should have matched at least one out of three enumerated moieties: Aromatic, carboxylic, and carbonyl. The counts of relevant functional groups were calculated using RDKit.

4.8.3. Chemical Space Analysis

All procedures related to the analysis of chemical structure datasets were performed using either RDKit or DataWarrior v.5.5.0 [55]. Self-organizing maps were built in DataWarrior using FragFP fingerprints. Scaffold analysis was performed in DataWarrior.

4.9. Molecular Docking of Structural Candidates to KEAP-1 Structure

4.9.1. Preparation of Receptor

The model of the full-length KEAP-1 structure generated by AlphaFold2 (UniProt KB [56] identification code P57790) was retrieved from https://alphafold.ebi.ac.uk/ (accessed on 1 October 2021). The structure was minimized using the OpenMM [57] program via python script. The following parameters were used: A step size of the Verlet Integrator of 0.001 ps, 1000 steps of minimization, and an AMBER99SB force field. The quality of the minimized structure was checked using PROCHECK [58] and Prosa-Web (Figures S7 and S8) [59,60]. The parameters of the box for docking selected to be close to the Cys273 residue were chosen using AutoDock Tools [61]: The coordinates (−5, −18, −5) and dimensions of 20 × 20 × 20 A°.

4.9.2. Binding Site Detection

Binding site detection was performed using fpocket2 [62]. The default parameters were used.

4.9.3. Preparation of Ligands

Initial conformations of ligands were obtained using Open Babel 2.4.1. [63] The following options were used: gen3d, best (for more thorough conformation optimization), and -p 7.4 (adding hydrogens appropriate for pH 7.4). Tautomers and stereoisomers were not enumerated, and the default tautomer/stereoisomer generated by Open Babel was kept.

4.9.4. Molecular Docking

Molecular docking was performed using AutoDock Vina v.1.2.2. [64,65] with the exhaustiveness parameter equal to 32. All other parameters were left as default.

4.9.5. Visualization and Analysis

For visual analysis, VMD 1.9.3 [36] and YASARA [66] were used. Post-docking analysis was performed in python.

5. Conclusions

The investigation of the synthetic multicomponent polyphenolic agent, BP-Cx-1, showed its pronounced hepatoprotective activity in mice. The developed H/D exchange labeling approach enabled us to find 200 individual components in mice livers after its administration. This approach seems promising, evidenced by the increased activity of the isolated fraction, enriched with the detected components. The H/D exchange was analytically justified and maintained the properties of BP-Cx-1. The additional combination of labeling techniques, data mining, and deep learning enabled us to generate 1775 natural-like structures with potential hepatoprotective activity and molecular formulae corresponding to components found in the liver. Further application of molecular docking showed the possibility of interaction of the chosen structures with positively charged amino acid residues framing the cavity bearing Cys273. Further experimental investigation into mixture components’ binding modes via X-ray crystallography is required to prove this prediction.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms232416025/s1.

Author Contributions

Conceptualization, A.Z.; methodology, A.Z. and A.O.; biological experiments, A.S. (Alexander Semenov), E.E., and E.G.; data curation, G.R. and A.S. (Anastasia Sarycheva); experiments, S.S. and O.K.; proteomic experiments, A.E.B. and A.K., writing—original draft preparation, review and editing, E.N., E.I.F., and A.Z.; supervision, E.N. and E.I.F.; funding acquisition, A.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Russian Science Foundation grant 22-75-10140.

Institutional Review Board Statement

Experimental animals were handled under the Guide for the Care and Use of Laboratory Animals, 8th edition. The study protocol was approved by the Local Ethics Committee of the N.N. Petrov National Medical Research Center of Oncology (protocol no. 3/215; dated 22 September 2020).

Informed Consent Statement

Not applicable.

Data Availability Statement

The source code used in this study is published in the GitHub repository https://github.com/AxelRolov/hepato-derep-lignin (accessed 15 December 2022).

Acknowledgments

We are grateful to Alexander G. Brzhozovskiy and Maria I. Indeykina for their assistance in proteomics experiments.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ge, X.; Zheng, L.; Wang, M.; Du, Y.; Jiang, J. Prevalence Trends in Non-Alcoholic Fatty Liver Disease at the Global, Regional and National Levels, 1990–2017: A Population-Based Observational Study. BMJ Open 2020, 10, e036663. [Google Scholar] [CrossRef] [PubMed]
  2. Vento, S.; Cainelli, F. Chronic Liver Diseases Must Be Reduced Worldwide: It Is Time to Act. Lancet Glob. Health 2022, 10, e471–e472. [Google Scholar] [CrossRef] [PubMed]
  3. Riazi, K.; Azhari, H.; Charette, J.H.; Underwood, F.E.; King, J.A.; Afshar, E.E.; Swain, M.G.; Congly, S.E.; Kaplan, G.G.; Shaheen, A.-A. The Prevalence and Incidence of NAFLD Worldwide: A Systematic Review and Meta-Analysis. Lancet Gastroenterol. Hepatol. 2022, 7, 851–861. [Google Scholar] [CrossRef] [PubMed]
  4. Pinzani, M. Pathophysiology of Non-Alcoholic Steatohepatitis and Basis for Treatment. Dig. Dis. 2011, 29, 243–248. [Google Scholar] [CrossRef]
  5. Umemura, A.; Kataoka, S.; Okuda, K.; Seko, Y.; Yamaguchi, K.; Moriguchi, M.; Okanoue, T.; Itoh, Y. Potential Therapeutic Targets and Promising Agents for Combating NAFLD. Biomedicines 2022, 10, 901. [Google Scholar] [CrossRef]
  6. Pydyn, N.; Miękus, K.; Jura, J.; Kotlinowski, J. New Therapeutic Strategies in Nonalcoholic Fatty Liver Disease: A Focus on Promising Drugs for Nonalcoholic Steatohepatitis. Pharmacol. Rep. 2020, 72, 1–12. [Google Scholar] [CrossRef] [Green Version]
  7. Barapatre, A.; Meena, A.S.; Mekala, S.; Das, A.; Jha, H. In Vitro Evaluation of Antioxidant and Cytotoxic Activities of Lignin Fractions Extracted from Acacia Nilotica. Int. J. Biol. Macromol. 2016, 86, 443–453. [Google Scholar] [CrossRef]
  8. Cory, H.; Passarelli, S.; Szeto, J.; Tamez, M.; Mattei, J. The Role of Polyphenols in Human Health and Food Systems: A Mini-Review. Front. Nutr. 2018, 5, 87. [Google Scholar] [CrossRef] [Green Version]
  9. Balasaheb Nimse, S.; Pal, D. Free Radicals, Natural Antioxidants, and Their Reaction Mechanisms. RSC Adv. 2015, 5, 27986–28006. [Google Scholar] [CrossRef] [Green Version]
  10. Forman, H.J.; Zhang, H. Targeting Oxidative Stress in Disease: Promise and Limitations of Antioxidant Therapy. Nat. Rev. Drug Discov. 2021, 20, 689–709. [Google Scholar] [CrossRef]
  11. Fedoros, E.I.; Baldueva, I.A.; Perminova, I.V.; Badun, G.A.; Chernysheva, M.G.; Grozdova, I.D.; Melik-Nubarov, N.S.; Danilova, A.B.; Nekhaeva, T.L.; Kuznetsova, A.I.; et al. Exploring Bioactivity Potential of Polyphenolic Water-Soluble Lignin Derivative. Environ. Res. 2020, 191, 110049. [Google Scholar] [CrossRef] [PubMed]
  12. Madrigal-Santillán, E.; Madrigal-Bujaidar, E.; Álvarez-González, I.; Sumaya-Martínez, M.T.; Gutiérrez-Salinas, J.; Bautista, M.; Morales-González, Á.; García-Luna y González-Rubio, M.; Aguilar-Faisal, J.L.; Morales-González, J.A. Review of Natural Products with Hepatoprotective Effects. World J. Gastroenterol. 2014, 20, 14787–14804. [Google Scholar] [CrossRef] [PubMed]
  13. Yang, K.; Chen, J.; Zhang, T.; Yuan, X.; Ge, A.; Wang, S.; Xu, H.; Zeng, L.; Ge, J. Efficacy and Safety of Dietary Polyphenol Supplementation in the Treatment of Non-Alcoholic Fatty Liver Disease: A Systematic Review and Meta-Analysis. Front. Immunol. 2022, 13, 949746. [Google Scholar] [CrossRef]
  14. Abenavoli, L.; Larussa, T.; Corea, A.; Procopio, A.C.; Boccuto, L.; Dallio, M.; Federico, A.; Luzza, F. Dietary Polyphenols and Non-Alcoholic Fatty Liver Disease. Nutrients 2021, 13, 494. [Google Scholar] [CrossRef] [PubMed]
  15. Hubert, J.; Nuzillard, J.-M.; Renault, J.-H. Dereplication Strategies in Natural Product Research: How Many Tools and Methodologies behind the Same Concept? Phytochem. Rev. 2017, 16, 55–95. [Google Scholar] [CrossRef]
  16. Wolfender, J.-L.; Allard, P.-M.; Kubo, M.; Queiroz, E.F. Metabolomics Strategies for the Dereplication of Polyphenols and Other Metabolites in Complex Natural Extracts. In Recent Advances in Polyphenol Research; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2019; pp. 183–205. ISBN 978-1-119-42789-6. [Google Scholar]
  17. Gaulton, A.; Hersey, A.; Nowotka, M.; Bento, A.P.; Chambers, J.; Mendez, D.; Mutowo, P.; Atkinson, F.; Bellis, L.J.; Cibrián-Uhalte, E.; et al. The ChEMBL Database in 2017. Nucleic Acids Res. 2017, 45, D945–D954. [Google Scholar] [CrossRef]
  18. Sorokina, M.; Merseburger, P.; Rajan, K.; Yirik, M.A.; Steinbeck, C. COCONUT Online: Collection of Open Natural Products Database. J. Cheminform. 2021, 13, 2. [Google Scholar] [CrossRef]
  19. Hertkorn, N.; Frommberger, M.; Witt, M.; Koch, B.P.; Schmitt-Kopplin, P.H.; Perdue, E.M. Natural Organic Matter and the Event Horizon of Mass Spectrometry. Anal. Chem. 2008, 80, 8908–8919. [Google Scholar] [CrossRef]
  20. Leyva, D.; Tariq, M.U.; Jaffé, R.; Saeed, F.; Lima, F.F. Unsupervised Structural Classification of Dissolved Organic Matter Based on Fragmentation Pathways. Environ. Sci. Technol. 2022, 56, 1458–1468. [Google Scholar] [CrossRef]
  21. Verrillo, M.; Parisi, M.; Savy, D.; Caiazzo, G.; Di Caprio, R.; Luciano, M.A.; Cacciapuoti, S.; Fabbrocini, G.; Piccolo, A. Antiflammatory Activity and Potential Dermatological Applications of Characterized Humic Acids from a Lignite and a Green Compost. Sci. Rep. 2022, 12, 2152. [Google Scholar] [CrossRef]
  22. Zykova, M.V.; Brazovsky, K.S.; Veretennikova, E.E.; Danilets, M.G.; Logvinova, L.A.; Romanenko, S.V.; Trofimova, E.S.; Ligacheva, A.A.; Bratishko, K.A.; Yusubov, M.S.; et al. New Artificial Network Model to Estimate Biological Activity of Peat Humic Acids. Environ. Res. 2020, 191, 109999. [Google Scholar] [CrossRef] [PubMed]
  23. Orlov, A.A.; Zherebker, A.; Eletskaya, A.A.; Chernikov, V.S.; Kozlovskaya, L.I.; Zhernov, Y.V.; Kostyukevich, Y.; Palyulin, V.A.; Nikolaev, E.N.; Osolodkin, D.I.; et al. Examination of Molecular Space and Feasible Structures of Bioactive Components of Humic Substances by FTICR MS Data Mining in ChEMBL Database. Sci. Rep. 2019, 9, 12066. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Mikhnevich, T.A.; Vyatkina (Turkova), A.V.; Grigorenko, V.G.; Rubtsova, M.Y.; Rukhovich, G.D.; Letarova, M.A.; Kravtsova, D.S.; Vladimirov, S.A.; Orlov, A.A.; Nikolaev, E.N.; et al. Inhibition of Class a β-Lactamase (TEM-1) by Narrow Fractions of Humic Substances. ACS Omega 2021, 6, 23873–23883. [Google Scholar] [CrossRef]
  25. Badun, G.A.; Chernysheva, M.G.; Zhernov, Y.V.; Poroshina, A.S.; Smirnov, V.V.; Pigarev, S.E.; Mikhnevich, T.A.; Volkov, D.S.; Perminova, I.V.; Fedoros, E.I. A Use of Tritium-Labeled Peat Fulvic Acids and Polyphenolic Derivatives for Designing Pharmacokinetic Experiments on Mice. Biomedicines 2021, 9, 1787. [Google Scholar] [CrossRef] [PubMed]
  26. Fedoros, E.I.; Orlov, A.A.; Zherebker, A.; Gubareva, E.A.; Maydin, M.A.; Konstantinov, A.I.; Krasnov, K.A.; Karapetian, R.N.; Izotova, E.I.; Pigarev, S.E.; et al. Novel Water-Soluble Lignin Derivative BP-Cx-1: Identification of Components and Screening of Potential Targets in Silico and in Vitro. Oncotarget 2018, 9, 18578–18593. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Sedlak, T.W.; Saleh, M.; Higginson, D.S.; Paul, B.D.; Juluri, K.R.; Snyder, S.H. Bilirubin and Glutathione Have Complementary Antioxidant and Cytoprotective Roles. Proc. Natl. Acad. Sci. USA 2009, 106, 5171–5176. [Google Scholar] [CrossRef] [Green Version]
  28. Segler, M.H.S.; Kogej, T.; Tyrchan, C.; Waller, M.P. Generating Focused Molecule Libraries for Drug Discovery with Recurrent Neural Networks. ACS Cent. Sci. 2018, 4, 120–131. [Google Scholar] [CrossRef] [Green Version]
  29. Cuadrado, A.; Rojo, A.I.; Wells, G.; Hayes, J.D.; Cousin, S.P.; Rumsey, W.L.; Attucks, O.C.; Franklin, S.; Levonen, A.-L.; Kensler, T.W.; et al. Therapeutic Targeting of the NRF2 and KEAP1 Partnership in Chronic Diseases. Nat. Rev. Drug Discov. 2019, 18, 295–317. [Google Scholar] [CrossRef] [Green Version]
  30. Unni, S.; Deshmukh, P.; Krishnappa, G.; Kommu, P.; Padmanabhan, B. Structural Insights into the Multiple Binding Modes of Dimethyl Fumarate (DMF) and Its Analogs to the Kelch Domain of Keap1. FEBS J. 2021, 288, 1599–1613. [Google Scholar] [CrossRef]
  31. Cleasby, A.; Yon, J.; Day, P.J.; Richardson, C.; Tickle, I.J.; Williams, P.A.; Callahan, J.F.; Carr, R.; Concha, N.; Kerns, J.K.; et al. Structure of the BTB Domain of Keap1 and Its Interaction with the Triterpenoid Antagonist CDDO. PLoS ONE 2014, 9, e98896. [Google Scholar] [CrossRef]
  32. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly Accurate Protein Structure Prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef] [PubMed]
  33. AlphaFold Protein Structure Database. Available online: https://alphafold.ebi.ac.uk/ (accessed on 25 October 2021).
  34. Ooi, B.K.; Chan, K.-G.; Goh, B.H.; Yap, W.H. The Role of Natural Products in Targeting Cardiovascular Diseases via Nrf2 Pathway: Novel Molecular Mechanisms and Therapeutic Approaches. Front Pharm. 2018, 9, 1308. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Baird, L.; Swift, S.; Llères, D.; Dinkova-Kostova, A.T. Monitoring Keap1–Nrf2 Interactions in Single Live Cells. Biotechnol. Adv. 2014, 32, 1133–1144. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Humphrey, W.; Dalke, A.; Schulten, K. VMD: Visual Molecular Dynamics. J. Mol. Graph. 1996, 14, 33–38. [Google Scholar] [CrossRef]
  37. Musso, G.; Cassader, M.; Gambino, R. Non-Alcoholic Steatohepatitis: Emerging Molecular Targets and Therapeutic Strategies. Nat. Rev. Drug Discov. 2016, 15, 249–274. [Google Scholar] [CrossRef]
  38. Musso, G.; Gambino, R.; Cassader, M. Emerging Molecular Targets for the Treatment of Nonalcoholic Fatty Liver Disease. Annu. Rev. Med. 2010, 61, 375–392. [Google Scholar] [CrossRef]
  39. Meyers, J.; Fabian, B.; Brown, N. De Novo Molecular Design and Generative Models. Drug Discov. Today 2021, 26, 2707–2715. [Google Scholar] [CrossRef]
  40. Zherebker, A.Y.A.; Rukhovich, G.D.; Kharybin, O.N.; Fedoros, E.I.; Perminova, I.V.; Nikolaev, E.N. Fourier Transform Ion Cyclotron Resonance Mass Spectrometry for the Analysis of Molecular Composition and Batch-to-Batch Consistency of Plant-Derived Polyphenolic Ligands Developed for Biomedical Application. Rapid Commun. Mass Spectrom. 2020, 34, e8850. [Google Scholar] [CrossRef]
  41. Vinardell, M.P.; Mitjans, M. Lignins and Their Derivatives with Beneficial Effects on Human Health. Int. J. Mol. Sci. 2017, 18, 1219. [Google Scholar] [CrossRef] [Green Version]
  42. George, N.; Andersson, A.A.M.; Andersson, R.; Kamal-Eldin, A. Lignin is the Main Determinant of Total Dietary Fiber Differences between Date Fruit (Phoenix Dactylifera L.) Varieties. NFS J. 2020, 21, 16–21. [Google Scholar] [CrossRef]
  43. Zherebker, A.; Lechtenfeld, O.J.; Sarycheva, A.; Kostyukevich, Y.; Kharybin, O.; Fedoros, E.I.; Nikolaev, E.N. Refinement of Compound Aromaticity in Complex Organic Mixtures by Stable Isotope Label Assisted Ultrahigh-Resolution Mass Spectrometry. Anal. Chem. 2020, 92, 9032–9038. [Google Scholar] [CrossRef] [PubMed]
  44. Zherebker, A.; Perminova, I.V.; Kostyukevich, Y.; Kononikhin, A.S.; Kharybin, O.; Nikolaev, E. Structural Investigation of Coal Humic Substances by Selective Isotopic Exchange and High-Resolution Mass Spectrometry. Faraday Discuss. 2019, 218, 172–190. [Google Scholar] [CrossRef]
  45. Zherebker, A.; Shirshin, E.; Rubekina, A.; Kharybin, O.; Kononikhin, A.; Kulikova, N.A.; Zaitsev, K.V.; Roznyatovsky, V.A.; Grishin, Y.K.; Perminova, I.V.; et al. Optical Properties of Soil Dissolved Organic Matter Are Related to Acidic Functions of Its Components as Revealed by Fractionation, Selective Deuteromethylation, and Ultrahigh Resolution Mass Spectrometry. Environ. Sci. Technol. 2020, 54, 2667–2677. [Google Scholar] [CrossRef] [PubMed]
  46. Zherebker, A.; Rukhovich, G.D.; Sarycheva, A.; Lechtenfeld, O.J.; Nikolaev, E.N. Aromaticity Index with Improved Estimation of Carboxyl Group Contribution for Biogeochemical Studies. Environ. Sci. Technol. 2022, 56, 2729–2737. [Google Scholar] [CrossRef] [PubMed]
  47. Islam, M.S.; Wilson, R.D. Experimentally Induced Rodent Models of Type 2 Diabetes. In Animal Models in Diabetes Research; Joost, H.-G., Al-Hasani, H., Schürmann, A., Eds.; Methods in Molecular Biology; Humana Press: Totowa, NJ, USA, 2012; pp. 161–174. ISBN 978-1-62703-068-7. [Google Scholar]
  48. Furman, B.L. Streptozotocin-Induced Diabetic Models in Mice and Rats. Curr. Protoc. 2021, 1, e78. [Google Scholar] [CrossRef]
  49. Gaulton, A.; Bellis, L.J.; Bento, A.P.; Chambers, J.; Davies, M.; Hersey, A.; Light, Y.; McGlinchey, S.; Michalovich, D.; Al-Lazikani, B.; et al. ChEMBL: A Large-Scale Bioactivity Database for Drug Discovery. Nucleic Acids Res. 2012, 40, D1100–D1107. [Google Scholar] [CrossRef] [Green Version]
  50. ChEMBL Ftp Site. Available online: https://ftp.ebi.ac.uk/pub/databases/chembl/ChEMBLdb/releases/chembl_29/ (accessed on 1 September 2021).
  51. Mysql-Connector-Python Library. Available online: https://pypi.org/project/mysql-connector-python/ (accessed on 25 October 2021).
  52. McKinney, W. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July 2010; pp. 56–61. [Google Scholar]
  53. Team, T.P.D. Pandas Python Library. Available online: https://pandas.pydata.org/ (accessed on 25 October 2021).
  54. RDKit: Open-Source Cheminformatics. Available online: http://Www.Rdkit.Org (accessed on 25 October 2021).
  55. Sander, T.; Freyss, J.; von Korff, M.; Rufener, C. DataWarrior: An Open-Source Program for Chemistry Aware Data Visualization and Analysis. J. Chem. Inf. Model. 2015, 55, 460–473. [Google Scholar] [CrossRef]
  56. The UniProt Consortium; Bateman, A.; Martin, M.-J.; Orchard, S.; Magrane, M.; Agivetova, R.; Ahmad, S.; Alpi, E.; Bowler-Barnett, E.H. UniProt: The Universal Protein Knowledgebase in 2021. Nucleic Acids Res. 2021, 49, D480–D489. [Google Scholar] [CrossRef]
  57. Eastman, P.; Swails, J.; Chodera, J.D.; McGibbon, R.T.; Zhao, Y.; Beauchamp, K.A.; Wang, L.-P.; Simmonett, A.C.; Harrigan, M.P.; Stern, C.D.; et al. OpenMM 7: Rapid Development of High Performance Algorithms for Molecular Dynamics. PLoS Comput. Biol. 2017, 13, e1005659. [Google Scholar] [CrossRef] [Green Version]
  58. Laskowski, R.A.; Rullmann, J.A.C.; MacArthur, M.W.; Kaptein, R.; Thornton, J.M. AQUA and PROCHECK-NMR: Programs for Checking the Quality of Protein Structures Solved by NMR. J. Biomol. NMR 1996, 8, 477–486. [Google Scholar] [CrossRef]
  59. Wiederstein, M.; Sippl, M.J. ProSA-Web: Interactive Web Service for the Recognition of Errors in Three-Dimensional Structures of Proteins. Nucleic Acids Res. 2007, 35, W407–W410. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  60. Sippl, M.J. Recognition of Errors in Three-Dimensional Structures of Proteins. Proteins Struct. Funct. Bioinform. 1993, 17, 355–362. [Google Scholar] [CrossRef] [PubMed]
  61. Morris, G.M.; Huey, R.; Lindstrom, W.; Sanner, M.F.; Belew, R.K.; Goodsell, D.S.; Olson, A.J. AutoDock4 and AutoDockTools4: Automated Docking with Selective Receptor Flexibility. J. Comput. Chem. 2009, 30, 2785–2791. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Le Guilloux, V.; Schmidtke, P.; Tuffery, P. Fpocket: An Open Source Platform for Ligand Pocket Detection. BMC Bioinform. 2009, 10, 168. [Google Scholar] [CrossRef] [Green Version]
  63. O’Boyle, N.M.; Banck, M.; James, C.A.; Morley, C.; Vandermeersch, T.; Hutchison, G.R. Open Babel: An Open Chemical Toolbox. J. Cheminform. 2011, 3, 33. [Google Scholar] [CrossRef] [Green Version]
  64. Trott, O.; Olson, A.J. AutoDock Vina: Improving the Speed and Accuracy of Docking with a New Scoring Function, Efficient Optimization and Multithreading. J. Comput. Chem. 2010, 31, 455–461. [Google Scholar] [CrossRef] [Green Version]
  65. Eberhardt, J.; Santos-Martins, D.; Tillack, A.F.; Forli, S. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model. 2021, 61, 3891–3898. [Google Scholar] [CrossRef]
  66. Krieger, E.; Vriend, G. YASARA View—Molecular Graphics for All Devices—From Smartphones to Workstations. Bioinformatics 2014, 30, 2981–2982. [Google Scholar] [CrossRef]
Figure 1. The dereplication pipeline used in this work. H/D exchange was used to label lignin-derivative and find its component in liver by FTICR MS. The sample and its enriched fraction were tested on its ability to alleviate NADFL-related biochemical changes. LC-MS experiments assessing the ability of samples to interact with the KEAP-1 were then performed. Further, a recurrent neural network was trained on the natural compound structures from CoCoNut and fine-tuned on compounds possessing activity against KEAP1-Nrf2 system retrieved from ChEMBL. The network was used to sample various natural compound-like structures that were filtered by the relevance according to the experimental MS data including isotopic labeling. Finally, the ability of the filtered compounds to interact with the KEAP-1 was assessed using molecular docking simulations.
Figure 1. The dereplication pipeline used in this work. H/D exchange was used to label lignin-derivative and find its component in liver by FTICR MS. The sample and its enriched fraction were tested on its ability to alleviate NADFL-related biochemical changes. LC-MS experiments assessing the ability of samples to interact with the KEAP-1 were then performed. Further, a recurrent neural network was trained on the natural compound structures from CoCoNut and fine-tuned on compounds possessing activity against KEAP1-Nrf2 system retrieved from ChEMBL. The network was used to sample various natural compound-like structures that were filtered by the relevance according to the experimental MS data including isotopic labeling. Finally, the ability of the filtered compounds to interact with the KEAP-1 was assessed using molecular docking simulations.
Ijms 23 16025 g001
Figure 2. Van Krevelen diagrams for BP-Cx-1 fractions obtained by gradient extraction (A); summarized contribution of molecular components found in liver to the total intensity in mass-spectra of fractions (B).
Figure 2. Van Krevelen diagrams for BP-Cx-1 fractions obtained by gradient extraction (A); summarized contribution of molecular components found in liver to the total intensity in mass-spectra of fractions (B).
Ijms 23 16025 g002
Figure 3. Relative intensities of alkylated peptides containing free cysteine without (blue) and with (red) incubation with BP-Cx-1 determined by LC-MS.
Figure 3. Relative intensities of alkylated peptides containing free cysteine without (blue) and with (red) incubation with BP-Cx-1 determined by LC-MS.
Ijms 23 16025 g003
Figure 4. Comparison of the chemical spaces of natural products from CoCoNut database (only CxHyOz), compounds active against Nrf2 and generated compounds using self-organizing map (SOM) (100 × 100 neurons, FragFp fingerprints, see Materials and Methods for details). (a) The SOM built on CoCoNut compounds colored by most frequent scaffolds (see Table S7). Red—benzene ring, cyan—sterone ring, orange—coumarin, blue—flavone, green—tetrahydropyran, grey—compounds bearing other scaffolds. Markers for compounds bearing frequent scaffolds were enlarged for better visibility. (b) The SOM built on CoCoNut compounds (grey) on which generated compounds (dark blue) and compounds active against Nrf2 (magenta) were projected. Markers for CoCoNut compounds and compounds active against Nrf2 were enlarged for better visibility.
Figure 4. Comparison of the chemical spaces of natural products from CoCoNut database (only CxHyOz), compounds active against Nrf2 and generated compounds using self-organizing map (SOM) (100 × 100 neurons, FragFp fingerprints, see Materials and Methods for details). (a) The SOM built on CoCoNut compounds colored by most frequent scaffolds (see Table S7). Red—benzene ring, cyan—sterone ring, orange—coumarin, blue—flavone, green—tetrahydropyran, grey—compounds bearing other scaffolds. Markers for compounds bearing frequent scaffolds were enlarged for better visibility. (b) The SOM built on CoCoNut compounds (grey) on which generated compounds (dark blue) and compounds active against Nrf2 (magenta) were projected. Markers for CoCoNut compounds and compounds active against Nrf2 were enlarged for better visibility.
Ijms 23 16025 g004
Table 1. Biochemical blood test of experimental mice.
Table 1. Biochemical blood test of experimental mice.
GroupTotal
Protein, g/L
Albumin,
g/L
Globulins,
g/L
Amylase,
U/L
AST,
U/L
ALT,
U/L
144.4 ± 0.828.0 ± 0.4716.4 ± 0.90664 ± 34.7196 ± 16.370 ± 5.4
260.9 ± 3.3327.2 ± 1.1433.7 ± 2.741250 ± 64.5341 ± 10.7139 ± 37.0
352.4 ± 2.3725.5 ± 0.4233.3 ± 5.031725 ± 343.1287 ± 31.6137 ± 12.1
446.8 ± 1.5222.9 ± 0.5324.2 ± 1.551462 ± 133.6286 ± 21.7164 ± 12.5
GroupTotal
bilirubin, µmol/L
Conjugated
bilirubin, µmol/L
Conjugated bilirubin,
%
Cholesterol, mmol/LGlucose, mmol/LLDH,
U/L
110.9 ± 0.347.7 ± 0.3771 ± 1.31.67 ± 0.0163.38 ± 0.0661974 ± 156
218.4 ± 0.6817.2 ± 1.1393 ± 2.73.54 ± 0.2575.83 ± 0.5612985 ± 345
319.5 ± 1.7417.8 ± 0.4293 ± 7.23.32 ± 0.3326.30 ± 0.6002835 ± 319
415.4 ± 0.6713.5 ± 1.0787 ± 3.32.83 ± 0.0845.12 ± 0.4252839 ± 207
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Orlov, A.; Semenov, S.; Rukhovich, G.; Sarycheva, A.; Kovaleva, O.; Semenov, A.; Ermakova, E.; Gubareva, E.; Bugrova, A.E.; Kononikhin, A.; et al. Hepatoprotective Activity of Lignin-Derived Polyphenols Dereplicated Using High-Resolution Mass Spectrometry, In Vivo Experiments, and Deep Learning. Int. J. Mol. Sci. 2022, 23, 16025. https://doi.org/10.3390/ijms232416025

AMA Style

Orlov A, Semenov S, Rukhovich G, Sarycheva A, Kovaleva O, Semenov A, Ermakova E, Gubareva E, Bugrova AE, Kononikhin A, et al. Hepatoprotective Activity of Lignin-Derived Polyphenols Dereplicated Using High-Resolution Mass Spectrometry, In Vivo Experiments, and Deep Learning. International Journal of Molecular Sciences. 2022; 23(24):16025. https://doi.org/10.3390/ijms232416025

Chicago/Turabian Style

Orlov, Alexey, Savva Semenov, Gleb Rukhovich, Anastasia Sarycheva, Oxana Kovaleva, Alexander Semenov, Elena Ermakova, Ekaterina Gubareva, Anna E. Bugrova, Alexey Kononikhin, and et al. 2022. "Hepatoprotective Activity of Lignin-Derived Polyphenols Dereplicated Using High-Resolution Mass Spectrometry, In Vivo Experiments, and Deep Learning" International Journal of Molecular Sciences 23, no. 24: 16025. https://doi.org/10.3390/ijms232416025

APA Style

Orlov, A., Semenov, S., Rukhovich, G., Sarycheva, A., Kovaleva, O., Semenov, A., Ermakova, E., Gubareva, E., Bugrova, A. E., Kononikhin, A., Fedoros, E. I., Nikolaev, E., & Zherebker, A. (2022). Hepatoprotective Activity of Lignin-Derived Polyphenols Dereplicated Using High-Resolution Mass Spectrometry, In Vivo Experiments, and Deep Learning. International Journal of Molecular Sciences, 23(24), 16025. https://doi.org/10.3390/ijms232416025

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop