1. Introduction
For several decades the benefits of the dietary styles abundant in fruit, vegetables, and legumes have been emphasized in terms of longevity, healthy ageing, and low morbidity from degenerative diseases, such as type 2 diabetes, metabolic syndromes, cardiovascular disease, neurodegenerative and neurobehavioral disorders, and some types of cancers. As an example, the Mediterranean diet can be considered the archetype of a health-promoting lifestyle, including moderate physical activity, a high intake of low-fat dairy products and healthy lipids (from seafood), as well as low consumption of refined sugars, red meat, and saturated fats [
1,
2].
The huge phytochemical diversity of plant foods is one of the reasons why well-balanced vegetable-based diets consistently afford health benefits. Plants typically contain hundreds of phytochemicals arising from different secondary metabolic pathways and contribute to plant defense from competitors and pests [
3]. Varieties that humans selected for food use have lower contents of toxic and off-tasting phytochemicals and a higher content of bulk and micronutrients, especially of vitamins, essential fatty acids, amino acids, and antioxidants. Phenylpropanoids, a class of widespread phenylalanine derivatives including polyphenols (flavonoids, stilbenes, and proanthocyanidins), have been the most investigated dietary phytochemicals in the last decades, thus representing the paradigm of the relationship between food and health [
4,
5].
The amino acid Tryptophan (TRP) is an irreplaceable constituent of the human diet, since the human genome does not include the enzymes that are necessary for its de-novo biosynthesis, but only those that perform some biotransformations and those responsible for break-down of the indole nucleus (
Figure 1). An important class of neuro-transmitters, such as Serotonin, is derived from TRP, the source of which is only in the diet. A further biotransformed derivative of Serotonin, Melatonin (MEL), is also an important neurohormone, with multiple biological effects that include the circadian regulation of sleep.
Recently, we suggested that indole compounds, in particular MEL (a potent neurohormone produced in traces from TRP in the pineal gland of mammals), discovered in some typical Mediterranean foods, such as wine and nuts, may represent a new element in this scenario, further corroborating the protective effects of diets rich in plant products [
6,
7,
8].
In addition, the chemical feature of indole is widely represented in the phytochemicals’ structures, which range in complexity from simple derivatives of the amino acid TRP to the high chemical diversity of the huge family of indole alkaloids.
Of note in plants, phenylalanine and TRP share a common biosynthetic route together with tyrosine, the shikimic acid or aromatic amino acid pathway unique of plants [
3]. According to the USDA Food Composition Database (
https://ndb.nal.usda.gov/ndb/), the sources that contain very high levels of free TRP include soy seeds (0.68 g TRP/100 g), pumpkin seeds (0.58 g TRP/100 g), and spirulina (0.93 g TRP/100 g).
Some studies have pointed out the ability of Saccharomyces cerevisiae and other high producer yeast strains to produce MEL by biotrasformation of TRP. In mammals, MEL is involved in many physiological functions: the regulation of circadian rhythms and sleep-wake cycle, immunostimulation, antioxidant defense, and tumor biology. Plants produce MEL with some similar functions to increase stress resistance, regulate circadian rhythms, and promote seed germination and fruit maturation.
In this context, with the aim of using plants naturally rich in TRP as a substrate for yeast fermentation, we developed a multicomponent analytical method to measure in plant matrices 14 compounds that include the almost complete biotransformation pathways of TRP. Among these, nine have an intact indole nucleus and five have an oxidized open indole ring. We needed to specifically adapt suggestions from several published methods to fit the unique needs of our compound panel and examined plant matrices. Among the improvements is the choice of internal standards adapted for each compound class (amino acids, indoleamines, N-acetyl indoleamines, amino-benzoic, and pyridine derivatives). Following a well-trodden pathway, we propose “indolome” as the collective identifier for all metabolites in the TRP pathway, irrespective of whether they retain the indole nucleus or are converted to pyridine by the pyrrole-opening Kynurenine pathway and further recyclization.
This method has been applied to measure indole phytochemicals in soybeans, pumpkin seeds, sesame seeds, and spirulina, selected according to their botanical diversity and documented healthy effects. Different procedures were also tested in order to improve the metabolite extraction from each plant matrix.
This method will be useful for improving the knowledge on TRP-related compounds in plant foods and their biotransformation pathways. Moreover, it will help to select the most suitable plant source of TRP to be used as a natural substrate for yeast strains working as a “cell factory” for the production of antioxidants or bioactive compounds used for the production of “functional foods”.
2. Results and Discussion
2.1. Analytes Characterization by Fragment Ion Spectroscopy
To identify the best fragmentation transitions for use in detection by multiple reaction monitoring, a solution of each standard compound was directly infused into the ion source of the mass spectrometer with a built-in Hamilton syringe pump at a rate of approximately 10 µL/min. Typical conditions were exemplified for TRP. A 4 g/L (0.77 mol/L = 770 µM) water solution of TRP was diluted 1:1000 (770 nM) with an ESI ionization solvent of 50% methanol/water, containing 0.1% formic acid.
ESI source spectra were continuously scanned in the Q1 from 100 to 600 m/z in 2 s, and the relevant source parameters were optimized for the best signal-to-noise of the expected compound-related signals and optimal mass peaks shape and signal stability. The Ramp function of the Analyst software was employed to scan the declustering potential (DP) voltage of the instrument over values typically ranging from 0 (diffusion driven ion injection) to approximately 70 to 90 eV (voltage driven ion injection).
Collision induced fragment ion spectra of source-generated protonated molecules and first generation fragments were recorded by continuously scanning the Q3 mass filter over the relevant
m/
z range for the expected fragments and precursor ion. For fragmentation studies, gas pressure in the Q2 collision cell was selected at the low instrument setting. The
Ramp function of the Analyst software was employed to scan the collision energy (CE) voltage of the instrument over values typically ranging from 0 (decomposition of metastable precursors) to approximately 50 to 70 eV (highest collision energy that yields reasonable ion current and structurally meaningful fragments). The integrated, transmission-convoluted array of spectra was acquired as the “integrated fragment ion spectrum” of each examined ion precursor. As an example,
Figures S1–S4 show the integrated fragment spectra of protonated TRP, TRP D5, MEL, and MEL OCD3. Fragment abundance curves were extracted from the instrument data file as txt arrays of the fragment abundance vs. collision voltage from the Analyst software. All further elaboration was performed in custom Microsoft Excel spreadsheets, essentially according to Rubino et al. [
9,
10].
Figure S5 shows the elaboration of the fragment abundance curve for MEL and TRP (an operational surrogate of the “breakdown curve”, adapted to the specific task) to obtain the collision energy corresponding to the maximum yield of the selected fragment ion.
The optimization of MS/MS conditions as reported above yielded the values of the relevant mass spectrometry parameters that are reported in
Table 1 where, in addition to retention time, two independent MRM transitions (quantifier and qualifier) for each analyte are also reported.
To ensure specificity, however, we used the data-dependent acquisition coupling MRM with EPI (enhanced product ion) (MRM-EPI). When the MRM is triggered, the ion trap records the full spectra (MS-MS) for that precursor, so providing a confirmative spectrum of the parent ion generating it. We recorded the on-the-fly fragment ion spectra to confirm peak identity for each analyte in each matrix. To apply this procedure in a routine fashion is, however, unfeasible, and the use of more than one MRM transition was enough.
2.2. Analytical Performances
Figure 2 shows the final chromatographic separation of a standard mixture of the indole compounds (2 µg/mL for all analytes except for internal standards which are 0.4 µg/mL TRP D5, 20 ng/mL 5F TRY, and 22 ng/mL MEL OCD3). MEL levels were previously assessed in almond and pistachios by using an LC-MS/MS incorporating the isotopomer analogue, MEL OCD3, which ensured a specific fragmentation and an improved performance for quantification [
8].
Analytical performances were evaluated by studying parameters such as specificity, precision, accuracy, linearity, limit of detection (LoD), and limit of quantification (LoQ) (
Table 2). The specificity was assessed by extracting water and water added in IS in each validation run. The lack of interfering peaks at the retention times of the analytes was considered as a sign of an acceptable selectivity. Precision and accuracy were calculated using different replicates of samples on the same day (intraday) and in different working days (interdays). Accuracy was expressed as the relative error (RE%), while precision was measured as a coefficient of variation (CV%). A CV% and RE% below 15% were considered suitable. Accuracy and precision were determined at low (5 ng/mL), medium (500 ng/mL), and high (10,000 ng/mL) concentrations in sets of three replicates per day. Results of precision and accuracy are expressed as means. Ten-point calibration curves were exactly constructed in different ranges so as to match the expected concentration of each singular analyte in the plant matrix (1–50,000 ng/mL). Linearity was assessed by unweighted least squares regression and expressed by the coefficient of determination (R
2). LoD was considered as the concentration that generated a signal-to-noise ratio greater than 3 and LoQ was the lowest concentration that yielded a signal-to-noise ratio greater than 10 [
11].
The pre-analytical treatment of plant extracts used before LC-MS/MS analysis was based solely on proteins removal by methanol extraction. The use of HybridSPE-Phospholipids cartridges (Sigma-Aldrich) to remove the phospholipid’s ion-suppression as described by Alvarez-Fernandez et al. [
12] in yeast extracts, gave a loss on the TRP D5 recovery of more than 70%, so we decided to skip that passage. The samples’ treatment here described gave a process efficiency (peak area extracted samples/peak area unextracted samples) of >65%, considering both matrix effect and extraction efficiency, as calculated on the internal standard area (
Table 3).
2.3. Extraction Efficiency and Indolome Profile in Different Food Plants
A number of different extraction procedures are available for extraction of bioactive phytochemicals from plant matrices [
13,
14]. The extraction efficiency of indole compounds from plants is highly dependent on the chemical structure, the different composition of vegetal matrices, and the extraction procedure used.
Figure 3 shows the yield of TRP extraction from four different vegetal matrices, using four different extraction methods both at high and at room temperature. It is evident that the extraction with methanol at room temperature is not suitable for most of the plant materials used in this study, while the use of a Soxhlet apparatus at high temperature and with solvents of increasing polarity greatly improved the recovery. This was also shown when methanol alone was used at high temperature (Soxhlet 1). Despite these observations, the extraction procedure based on the use of water at room temperature was chosen as the reference method due to its low cost, sustainability, high efficiency of extraction, and short time needed. Except for sesame flour, water yielded TRP recovery in the same order of hot methanol. In
Figure 3, the graph titled “Total” summarizes the efficiency of TRP extraction among all plant materials and extraction procedures. All the main indole compounds related to the TRP metabolism, the “indolome”, were investigated, and the results obtained by each vegetal matrix after being log-transformed and autoscaled are reported in
Figure 4 as a heatmap and in
Figure S6 as a PCA plot. From
Figure S6 the chemical diversity of spirulina and sesame with respect to the other two matrices is evident. From the heatmap (
Figure 4) water-based extraction was less efficient for 5OME TRY, TRY, NAC TRY in spirulina and pumpkin, while it gave a higher recovery for sesame and soybean. KINA, KIN, and ANTA were also well extracted from almost all matrices by water at room temperature.
We were interested in a relative comparison among different extraction procedures on 14 analytes that are very different in polarity, molecular weight, and concentration. Of course, some of them could benefit from the hot Soxhlet extraction, while others were well extracted at room temperature without solvent. In the end, we concluded that water is a sufficiently strong extractant for indolome analytes that are less polar than TRP, such as the MEL analogues, and we chose this extraction with the best safety and sustainability profiles to deepen our study.
Figure 5 reports the detailed quali-quantitative composition of the indolome in the four vegetal sources after water extraction at room temperature. TRP is the most abundant compound in all the plant extracts, while 5OH TRP accounts for ≤1% in soybean and pumpkin and was undetectable in the other plants. Spirulina had the highest concentration of KIN, KINA, and ANTA with respect to the other matrices, whereas 3OH ANTA was undetectable. In sesame, all the TRP metabolites derived from the activity of tryptophane-2,3-dioxygenase (
Figure 1) are present, including 3OH ANTA. In addition, sesame extraction provided the highest yield of metabolites derived from tryptophan decarboxylase or from tryptophan hydroxylase activity (TRY, 5OH TRY, and NAC TRY) and likely lead to MEL production. However, under the conditions used in these experiments, MEL was almost undetectable in all matrices. The specificity of our analysis was ensured by the use of the deuterated internal standard MEL OCD3, which confirmed the absence of signals in the trace of MEL [MH]
+ 233.2 > 174.2, at the retention time of MEL OCD3 (236.2 > 177.2
m/
z). In any case, we cannot exclude the presence of MEL in these matrices at pg/mL trace levels lower than our LoQ (2 ng/mL). The use of a specific purification by SPE can yield an improved LoQ < 80-fold (data not shown).
2.4. A Possible Melatonin-Interfering Compound in Sesame
The sesame extracts contain an intense chromatographic peak on the trace at 233.2 > 174.2
m/
z at a retention time very close to that of MEL (5.32 vs. 5.60 min,
Figure 6). This peak was only found in sesame flour, but not in all the other examined matrices. This component could be erroneously identified as MEL if it is not properly resolved chromatographically from the authentic MEL peak, or when a deuterated internal standard of MEL is not used.
The use of MRM-EPI was crucial for excluding false positives on the MEL transition.
Figure 7 reports the fragment ion spectra recorded “on-the-fly” in the enhanced product ion (EPI) mode for the three compounds, namely, pure MEL, pure TEE (a known MEL isomer [
15]), and the spectra of the unknown peak 2 found in the sesame extract (
Figure 6). The constitutional isomer TEE [
15] is a known interfering compound in the analysis of MEL, the retention time of which is, in our system, sufficiently different as to separately identify the two compounds, even if their fragment spectra feature the same fragments, although with different relative intensities (
Figure 7A,B).
The spectrum of the unknown peak (
Figure 7Cfeatures the same precursor ion of MEL (233
m/
z) but features two main fragments that are up-shifted by one
m/
z unit (160 and 175
m/
z) from those of MEL and TEE (159, 174
m/
z). Two further signals, at 187 and 189
m/
z, are present only in the spectrum of the unknown peak.
It is difficult to speculate on the identity of the component that yielded peak A2 (
Figure 6), due to the high complexity of the sesame metabolome [
16]. However, a literature search shows that sesame contains a specific lignan, sesamolin (C
20H
18O
7; MW 370), a non-nitrogenous asymmetrical compound, the mass spectra of which contain a fragment at 233
m/
z [
17,
18]. This fragment can originate from low critical energy elimination processes para to a methylenedioxy aromatic system further bearing an aryl-alkyl ether, and yielding a 1,2,4-trioxy-benzene neutral fragment. This decomposition can occur through a charge-remote 1,2- or 1,6-rearrangement of a protonated or ionized precursor (
Figure S7). Depending on the desolvatation and declustering conditions of the ESI ionization used in our analytical method, in-source decomposition may produce the first-generation transition to 233
m/
z from sesamolin. However, the spectrum of sesamolin published by Dar et al. [
17] (
Figure 4B of reference), and reportedly acquired under ESI ionization, shows several differences with respect of that expected based on standard fragmentation rules of even-electron protonated molecules [
19,
20]. In fact, the published spectrum does not show the expected protonated molecule (371
m/
z), but does show signals at 369 and 370
m/
z that may be compatible with the molecular cluster of a [M − H]
+ species, but not with the expected MH
+. The reported spectrum also displays much more abundant fragmentation than seemingly compatible with “soft” ESI ionization, and the supplied instrumental details do not allow for a judgement on the energy of the processes in the recording of the spectrum. In addition, the observed fragments are likely derived from the overlap of odd-electron and even-electron decomposition pathways. Due to the current unavailability of a sesamolin standard, it is not possible to confirm or disprove the identification of the observed unknown peak in the sesame extract.
3. Materials and Methods
3.1. Reagent and Chemicals
The chemicals were all analytical grade and purchased by Sigma-Aldrich (Milan, Italy). All aqueous solutions were prepared using purified water in a Milli-Q (Millipore, Milan, Italy) device. Melatonin (MEL), 5-F Tryptamine (5F TRY; IS1),Tryptamine (TRY), 5-OH Tryptamine (5OH TRY), 5-OCH3 Tryptamine (5OME TRY), N-Ac Tryptamine (NAC TRY), N-Ac-5-OH Tryptamine (NAC 5OH TRY), Tryptophan-D5 (TRP D5; IS2), 3-OH Anthranilic acid (3OH ANTA), 3-OH Kynurenine (3OH KIN), Anthranilic acid (ANTA), Kynurenic acid (KINA), Kynurenine (KIN), Tryptophan (TRP), Tryptophan ethyl-ester (TEE), and 5-OH-Tryptophan (5OH TRP) standards was purchased from Sigma-Aldrich (Milan, Italy). The isotopomer Melatonin-OCD3 (MEL OCD3) was synthesized by Prof. Andrea Penoni (Dipartimento di Scienza e Alta Tecnologia, Università dell’Insubria, Varese).
3.2. Preparation of Standard Solution
Standard stock solutions of the analytes were prepared individually at a concentration 1 mg/mL in water, except for 3OH ANTA and KINA, which were dissolved in methanol/water 1:1. The mixture of internal standards (MIX IS) were prepared as described: 20 µL of each solution, 2 mg/mL TRP D5, 10 µg/mL 5F TRY, and 11 µg/mL MEL OCD3 brought to the final volume of 10 mL with methanol 0.5% formic acid. The stock solutions and the MIX IS (0.4 µg/mL TRP D5, 20 ng/mL 5F TRY, and 22 ng/mL MEL OCD3) were stored at 4 °C.
3.3. Plant Material
Organic flours from soybean, pumpkin, and sesame seeds were purchased from Fior di Loto and Molino Bongiovanni companies (Turin, Italy). A suitable specimen of each was dried in a ventilated oven set at 50 °C for 24 h before extraction. Organic spirulina was kindly provided by Spireat (Cremona, Italy), and pulverized in a mortar before use.
3.4. Extraction Procedure from Plant Samples
Various techniques were applied in extraction of the plant sample constituents in order to compare their efficiency. All procedures were carried out in dim light conditions to prevent metabolite degradation by photochemical reactions.
3.4.1. Soxhlet Extraction
Each sample (5 g) was weighed into a cellulose thimble and placed in a Soxhlet apparatus equipped with a 0.25 L round bottom flask and a water-cooled condenser. A three-step extraction was performed using solvents of increasing polarity (150 mL), i.e., petroleum ether (40 °C), dichloromethane (60 °C), and methanol (65 °C) for 8 h (4–6 cycles h
−1) (Soxhlet 3) [
13,
14]. In the Soxhlet apparatus, both the flours and the Spirulina were also subjected to one-step extraction using only methanol without previous defatting by nonpolar solvents (Soxhlet 1). All obtained extracts were stored at −20 °C until analysis
3.4.2. Methanol Extraction at Room Temperature
Methanol (3 mL/g) was added to flours and alga. The obtained mixtures were stirred for 20 min after 10 min sonication. Then, each sample was decanted and the supernatant centrifuged at 28,000 rpm for 10 min. The resulting extract was filtered through a 0.20 μm filter and combined with that of a second extraction before storage at −20 °C.
3.4.3. Aqueous Extraction at Room Temperature
Distilled water (3 mL/g) was added to flours and alga. The obtained mixtures were stirred for 20 min after 10 min sonication. Then, each sample was decanted and the supernatant centrifuged at 28,000 rpm for 10 min before storage at −20 °C.
3.5. Purification of Plant Extracts
A small aliquot of plant extract (10 µL) was added with 50 µL of MIX IS (20 ng TRP D5, 1 ng 5F TRY, and 1.1 ng MEL OCD3) and 100 µL of precipitating solution (methanol + 0.5% formic acid). After vortexing, 100 µL of solution was withdrawn and evaporated until dry. Then, it was reconstituted with 50 µL water + 0.5% formic acid, and 10 µL was directly injected in the LC-MS/MS.
3.6. LC-MS/MS Analysis for the Study of Indolic Compounds
The analytical system consisted of an HPLC coupled to a tandem mass spectrometer. The liquid chromatograph system was a Dionex 3000 UltiMate instrument with autosampler, binary pump, and column oven (Thermo Fisher Scientific, Waltham, MA, USA). Separation of 17 indolic compounds (14 analytes and 3 internal standards) was attained on a reversed-phase Zorbax Eclipse XDB-C18 (4.5 × 50 mm, 1.8 µm) analytical column preceded by a security guard cartridge. Linear gradient was obtained by mixing eluent A (water + 0.1% formic acid) and eluent B (methanol + 0.1% formic acid). The elution gradient was set as follows: 0–1 min (20% B), 1–5 min (20%–60% B), 5–7 (60% B), 7.0–7.2 (60%–95% B), 7.2–8.2 (95% B), 8.2–8.5 (95%–5% B), 10 (5% B). The flow rate was 0.4 mL/min and the column temperature was 40 °C. The tandem mass spectrometer was an AB Sciex 3200 QTRAP instrument with electrospray ionization TurboIonSpray™ source (AB Sciex, Milano, Italy). Instruments were managed with the proprietary manufacturer’s software and according to the manufacturer’s instructions. The analytical data were processed using Analyst software (version 1.2). The ion spray voltage was set at 5.5 kV and the source temperature was set at 500 °C. Nitrogen was used as a nebulizing gas (GS 1, 55 psi), turbo spray gas (GS 2, 60 psi), and curtain gas (30 psi). The collision-activated dissociation (CAD) was set to a low level. The dwell time was set at 70 ms, and the MS scan was performed in positive ion modes (ESI+). MS/MS experiments were conducted using nitrogen as the collision gas. Compound-dependent parameters were optimized via direct infusion. Multiple reaction monitoring (MRM) mode was used with a fixed dwell time of 60 ms for all transitions and analytes. Quantitative analysis was performed interpolating each peak area of analyte/area internal standards (IS) with calibration curves of each indole compounds.