1. Introduction
The aging population will become a major societal issue as the number of people over 60 years of age continues to grow. At the global level, estimates put the population over 60 years of age at 2 billion people in 2050 compared to 629 million today. At the same time, the risk of undernutrition and deficiency for the elderly affects between 4% and 10% of people living at home, between 15% and 38% of people living in nursing homes and up to 70% in hospitals [
1]. There is therefore a need to supplement the diet of the elderly. It is accepted that protein requirements for them are about 20% higher to maintain body protein mass [
1] and specific amino acids such as leucine play a beneficial and protective role in terms of muscle mass and function [
2]. In addition, a diet rich in protein and specific amino acids is important for maintaining muscle mass in the elderly, who are often subject to low-grade inflammation [
3]. It was shown that antioxidant supplementation improved the anabolic response to leucine of old muscle and reduced inflammation [
4]. Older people regularly suffer from sarcopenia. Sarcopenia is defined as a progressive and generalized decline in muscle mass, strength, and physical performance in geriatric patients. In addition, aging is accompanied by changes, including alterations and the deterioration of intestinal functions, such as less secretion of digestive fluids and enzymes, and a higher pH in the stomach [
3,
4,
5,
6]. As a result, food is less well assimilated by the body, so it is important that the proteins and amino acids provided through supplementation are highly bioavailable. To prevent sarcopenia, the most effective approach is a combination of physical exercise and an adequate intake of amino acids, and macro and micro nutrients [
7,
8,
9].
The digestibility of a protein depends on its origin and more precisely on its amino acid composition. The PDCAAS (which is protein digestibility corrected amino acid score) has been the reference in the notion of protein quality and recommended by the FAO and WHO for more than 20 years. The PDCAAS considers the essential amino acid profile of proteins—and thus makes it possible to identify the limiting factor—and the overall digestibility of the protein. The FAO has recommended replacing the PDCAAS with the digestible indispensable amino acid score (DIAAS). The new recommended method is more precise, as it considers the digestibility of each essential amino acid [
10]. Other methods approaching digestibility such as OPA (ortho-phthaldialdehyde) spectrophotometric assay or fluorescamin fluorimetric assay [
11] evaluate the degree of protein hydrolysis through the amount of NH
2 groups released.
The quality of a protein depends on its composition in essential amino acids but also on the anti-nutritional factors that may be present and thus limit protein digestion. With this new score of DIAAS, milk proteins reached high values (>100%), which is twice as high as that of peas (~64%) and three times as high as that of wheat (~40%) [
12,
13]. Proteins of vegetable origin are often incomplete in amino acids, for example cereals are deficient in lysine whereas legumes are deficient in sulphur amino acid. In addition, some antinutritional factors (ANFs)—such as phytate, phenolic compounds, tannins, and fibers—are present in plant foods and known to contribute to the low bioavailability of some minerals essential for elderly, like iron and zinc. Various processing technologies (mechanical treatments, soaking, germination, fermentation, and heating) can be applied to improve the mineral bioaccessibility [
14]. For example, heat treatments reduce the content of ANFs up to 40%, as reported for legumes [
15].
With plant proteins, it is thus necessary to combine them to balance essential amino acids. The PNNS (Programme National Nutrition Santé, French nutrition program) [
16] recommends the diversification of protein sources in order to meet the complementary nutritional requirements for overall balance (vitamins, minerals, etc.). This also leads to the improved use of natural resources, especially plant proteins. Continuing along these lines, our objective was to select protein ingredients (i.e., protein extracts) that would meet the nutritional requirements of the elderly, and be well digested in order to be effectively assimilated by the body.
In addition, bioactive peptides are generated from different food sources during gastrointestinal digestion. Bioactive peptides have a variety of targets such as immune, cardiovascular, digestive and endocrine systems in the human body [
17,
18,
19,
20]. In the case of personalized nutrition for elderly, such properties may be considered in the nutritional value of protein sources.
To this end, multivariate analysis is often used to integrate different types of data and extract meaningful information. Multivariate studies are increasingly being developed, particularly to study omics datasets, but are still rare, especially in the field of nutrition [
21,
22].
Therefore, the aim of this study is to develop a statistical analysis workflow from compositional and nutritional properties as an efficient tool to compare all the protein extracts, all things being equal.
3. Results
The results of the composition of each protein extract are summarized in
Table 2, which therefore provides the composition of macro and micronutrients as well as the oxidation of lipids and proteins of each protein extract. Plant extracts contain the most carbohydrates, and complex sample A5 (Pork liver) in particular has a very high protein and lipid oxidation rate. This powder also has a high iron content.
Table 3 shows the amino acids contained in the samples. In general, protein extracts from animal origin were those that contained the highest protein content. Protein extracts P1 (fava bean) and P2 (fermented fava bean) contained less protein and amino acids compared to the other extracts. Protein extracts D4 (whey protein), D5 (whey protein), D9 (whey protein), and P4 (pea) contained the highest leucine content, which is very important for maintaining muscle mass. It can be seen that samples A4 (pork collagen), D1 (cheese powder), D3 (60% micellar casein concentrate), P1 (fava bean), and P2 (fermented fava bean) were deficient in leucine and branched AA. Finally, the protein extracts that were the most deficient in aromatic amino acids were D2 (80% serum protein concentrate), P1 (fava bean) and P2 (fermented fava bean) whereas for sulfur amino acid it is the protein extracts A4 (pork collagen) and A7 (chicken broth) that were deficient.
3.1. Bioaccessibility and Bioactivity
According to
Table 4 and
Table 5, globally protein extracts of animal origin are more hydrolyzed in the gastric phase. However, protein extracts obtained from chicken or pork broth (A7 and A3, respectively) exhibited a low value of bioaccessibility measured through NH
2 group release, even under the value of plant or dairy powder. In the gastric compartment, protein hydrolysis is due to pepsin action. The cutting site of pepsin is after an aromatic amino acid. Therefore, in an attempt to possibly explain the difference due to protein source, the content of all aromatic amino acids was compared regarding the origin (plant, dairy, or animal). No significant difference was observed (data not shown). In a global manner, protein extracts of plant origin were less hydrolysed in the gastric and intestinal phase. D1 (cheese powder) and P1 (fava bean) were essentially found as less hydrolysed samples in the intestinal phase. Fermentation of fava bean (P2) did not modify this observation. Antioxidant bioactivity was recorded in the gastric phase. The animal samples obtained from pork liver (A5 and A6) exhibited the highest antioxidant activity, in the same range than fava beans (P1 and P2) and sunflower seeds (P9). Chicken and pork broth (A7 and A3, respectively) displayed a low antioxidant activity. Both plant and animal samples presented heterogenous antioxidant bioactivity, may be explained by the presence of phenolic compounds, able to chelate metals. On the contrary, the antioxidant activity of the dairy protein extract in the gastric phase was rather similar between samples, in average 40% less of the mean of plant or animal samples. Similar results were observed in the intestinal phase.
Many parameters are involved in the digestion process of protein extract. This marks the limit of reasoning variables per variables, with a large set of samples. Therefore, multivariate analysis is required to integrate different types of data and to deal with the data complexity to extract meaningful information.
3.2. Global Visualization of Variables and Protein Extracts
Heat map clustering allows grouping protein extracts according to their compositional characteristics (e.g., some AA, carbohydrates, etc.). For example, in
Figure 1A the protein extracts D1 (cheese powder), P1 (fava bean), and P2 (fermented fava bean) contain large amounts of total sugar, simple and complex carbohydrates, and Mg. These variables represent a carbohydrate cluster.
Figure 1A also shows that animal protein extracts are linked together, which means that in terms of composition they are similar extracts.
Figure 1B,C shows that animal samples appear to be better digested and develop more antioxidant activity in the gastric and intestinal phases. In addition, ABTS, FRAP, and DPPH are related, as are the results obtained by the Direct Detect
® method and fluorescamine. The interpretation of the data by heat map clustering is difficult because there are many variables, and our aim is to gather composition and digestion data. Thus, the number of variables must be reduced.
3.3. Reduction: Association of Quantitative Variables by Clustering
To reduce the number of variables we start by analyzing the variables using a hierarchical cluster analysis (HCA). In order to gather the variables that are related to each other in a cluster. This new group of variables is defined by a new name and represents a latent variable.
Figure 2A shows the classification of the composition data. This allows us to visualize which variables are related and thus bring them together to create new variables called latent variables. Nine latent variables were created from the 34 composition variables. In
Figure 2B,C, Direct Detect
® results are related to the results obtained by the fluorescamine method. Both methods measure peptide levels. The ABTS, DPPH, and FRAP methods measure the antioxidant activity of bioactive peptides and are linked together. As in the intestinal phase, three latent variables were created from seven variables in the gastric phase. To sum up, the 48 initial variables were merged into 15 latent variables.
3.4. Classification of Protein Extracts to Obtain Categorical Value to Each Latent Variable
The second step of the analysis consists of attributing for each protein extracts a categorical value for each cluster (latent variable) defined above. The k-means method was applied to the quantitative variables of a cluster, allowing classifying the protein extracts into classes. Then, a categorical value (high, medium, and low) was assigned to each protein extract.
Figure 3A corresponds to the results obtained for the latent variable Prot_HFDSEP, which consists of the content variables of histidine, phenylalanine, aspartic acid, serine, glutamic acid, and proline. The protein ingredients can be divided into two classes using the k-means method. The first class includes ingredients with a high level and the second class includes protein extracts with a low level in its components.
Figure 3B corresponds to the results obtained for the latent variable TIVLK which is composed of the content variables of threonine, isoleucine, valine, leucine, and lysine. The protein extracts are divided into three classes: high, average, and low level in these amino acids.
This method is used for all the latent variables created with HCA. All the data obtained are gathered in
Table 6.
3.5. Synthesis
The results obtained after the creation of latent variables by HCA and then class variables by the k-means method are summarized in
Table 6. From the 48 initial variables, 15 latent variables were created, with two or three levels (high, medium, or low).
In order to highlight associations between protein extracts and link with latent variable, a multivariate correspondence analysis (MCA) was conducted (
Table 6). If a class was only composed of two protein extracts the latent variable was not used for the MCA because it would be too discriminating. Thus, the latent variables vitA_Ox and CrKNa_GA were not considered.
The resulting graph is shown in
Figure 4. The percentage variance expressed is 21.38% for the first axis and 15.29% for the second axis. Both factors explained 36.67% of the total system inertia. Dimension 1 opposed individuals such as P3 (hemp), P2 (fermented fava bean), and P9 (sunflower) to individuals such as D8 (milk protein concentrate), D6 (calcium caseinate), and D5 (whey protein). The group in which individuals P3 (hemp), P2 (fermented fava bean), and P9 (sunflower) are found have the following characteristics: a high level of carbohydrates, a low level of TIVLK, a low level of protein and antioxidant activity (ORAC) in the intestine, a high level of antioxidant activity in the stomach and intestine. The individuals D8 (milk protein concentrate), D6 (calcium caseinate), and D5 (whey protein) shared a high level of intestinal peptide, a high level of protein and HFDSEP and a low level of antioxidant activity in the intestine. On the other hand, dimension 2 opposed individuals such as and D8 (milk protein concentrate), D6 (calcium caseinate), D1 (cheese powder), D3 (60% micellar casein concentrate), and D5 (whey protein) to individuals such as A7 (chicken broth) and A5 (pork liver). D8 (milk protein concentrate), D6 (calcium caseinate), and D5 (whey protein) are characterised as above. D1 (cheese powder) and D3 (60% micellar casein concentrate) are characterized by their high level of cysteine and tryptophan, whereas, A7 (chicken broth) and A5 (pork liver) share a high level of gastric peptides. However, some protein extracts remained undifferentiated, located near the centre of the graph, such as P4 (pea), P6 (soybean), and A2 (pork liver).
4. Discussion
All the protein extracts analysed contained several vitamins and organic and mineral nutrients; however, some contained more of certain minerals than others, or were less digestible, etc. We observed that the composition of the protein extracts was linked to their origin. For dairy products, calcium caseinate (D6) and casein rennet (D7) showed the highest level of calcium, as expected. Calcium is an important mineral for the development strong bones by proper intake in young people, and for keeping the bones of the elderly strong and healthy, by preventing a variety of bone-related illnesses, such as osteoporosis. The review of Philips et al. [
42] highlighted that whey proteins were the best for supporting muscle protein synthesis due to their high leucine content compared to milk and soy proteins. The absence or insufficient quantity of an essential amino acid is enough to disrupt protein synthesis. It is therefore the balance between the different essential amino acids (histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine) that will be the first factor of dietary protein quality. Recently Reynaud et al. [
13] evaluated the digestible indispensable amino acid scores (DIAAS) of pea emulsion (40–64%), tofu and soya milk (78–116%). Interestingly those results concerning soya products were comparable to those reported for milk (≥100%) [
12] and meat protein (80–99%) [
43]. However, no information about a possible bioactivity was reported.
Diets rich in plant foods are increasingly recommended to reduce the risk of cardiometabolic diseases because of strong evidence that fruit, vegetables, legumes, and seeds are protective. In this study, plant-based protein extracts presented slower hydrolysis in the upperpart of our in vitro digestive system than animal and dairy products, which cannot be explained by less aromatic amino acid content. Antinutritional factors could be involved such as phytic acid or protease inhibitors, but they were not analysed in this study. The literature showed that mechanical treatment, heating, and fermentation of plant-based products, to cite a few, could partly overcome this adverse effect [
15]. Moreover, Bax et al. [
44] reported that the speed of pepsin digestion in the gastric compartment was explained by an enhanced enzyme accessibility due to protein denaturation. According to Dangin et al. [
8] and Serafini et al. [
9], proteins have specific absorption rates based on amino acid composition, which characterizes the anabolic properties of ‘fast’ or ’slow’ protein absorption. The rapidity of absorption of dietary amino acids by the intestine is crucial, because it influences the rate of postprandial protein synthesis, and therefore muscle mass accretion. Moreover, the composition of plant-based protein extracts contained lower amounts of lysine, methionine and/or leucine. Besides the rate of digestion, plant-based protein extracts generally contained more complex sugars. Fibers are important in the diet because they contribute to the proper functioning of the gut, promoting transit and so forth, but some fibers can act as an anti-nutritional factor, i.e., reducing the digestibility of proteins.
In addition to their high nutritional value, proteins can be precursors of bioactive peptides released during digestion, acting locally or on other organs. These peptides are characterized by their beneficial properties on key body functions. Indeed, natural antioxidants are beginning to be considered for the treatment of cellular degeneration because they inhibit or delay the oxidation process by blocking both the initiation and propagation of oxidative chain reactions [
45]. All the protein extracts analyzed exhibited antioxidant activity with various degree. In the plant samples, fava bean and sunflower protein extracts (P1, P2, and P9, respectively) displayed high antioxidant bioactivity especially during the intestinal phase. One explanation could be the presence phenolic compounds, such as chlorogenic acid, predominant in sunflower kernel [
46].
Many parameters are involved in the digestion process of protein extract to determine its nutritional value. This leads to studying a few protein-based food or a limited number of variables in a same row in the literature. In the future, the promotion in the diet of plant-based protein foods will increase as a transition towards more sustainable food consumption, particularly a substitution of animal protein with plant-based protein sources [
47]. The in vitro approach developed in this study allows to screen samples for elderly before any in vivo determination of the true ileal digestibility of amino acids, for example, which requires significant resources [
13].
In order to compare the different protein extracts using all the variables, it was necessary to develop a statistical workflow. Secondly such an approach can also help to better combine different protein sources to improve the essential amino acid profile, bioactivity, digestibility, and fiber content.
The capacity to compile large amounts of different types of data has become possible thanks to the development of omics data sets. Multivariate analysis is often used to integrate different kinds of data and attempt to extract meaningful information. Graphical representation is also part of the challenge to aid interpretation [
48]. Basically, we started with a classical statistical analysis of data visualization (heat map) in two dimensions, and variable grouping was performed in order to reduce data. The HCA method builds the hierarchy from the individual elements (i.e., variables) by progressively merging clusters. In our study, three sets of variables were used and three HCAs were built: the first set on composition, the second on digestive variables in the gastric compartment, and the third on the same variables but in the intestinal compartment. Then, the next step was to determine which elements should be merged in a cluster. For the first set the 34 variables were merged into 9 clusters. For the second and third sets, the seven variables were merged into three clusters. Each cluster gave rise to a latent variable, also called categorical variable. The application of the k-means algorithm was then used to group the input data set into two or three partitions linked to intensity (low, medium, and high). Finally, a multiple correspondence analysis (MCA) was applied using the latent variables; it is the counterpart of principal component analysis but for categorical data.
Similar approaches have been developed to identify optimal enzymes and proteins to generate food protein-derived bioactive peptides [
49] or for the discovery of specific peptides [
21] in the field of bioinformatics. These approaches were based on the generation of in silico data, but both aimed to screen a large batch of proteins and enzymes in the work of Tu et al. [
49] and peptides in the works of Siow et al. [
21], which are the counterparts of the protein extracts used in our study. Recently, Gauglitz et al. [
50] reported the development of a statistical workflow to explore and visualize the similarities and dissimilarities of raw and processed food products. To do this, they used untargeted mass spectrometry (MS data) and molecular networking to reveal molecular changes due to processing. Principal coordinates analysis was used for clustering. Moreover, using beta diversity analysis of food types (yogurt, tea, coffee, meat, and tomato) and their processing, they were able to visualize the molecular relationship among all the samples analyzed. The term β-diversity was first introduced in the field of ecology and corresponds to the ratio between regional and local species diversity.
5. Conclusions
The objective of this study was to develop a workflow to screen different origins of protein extracts and identify their potentiality as high quality nutritional culinary aids for recipes for the elderly. As expected, the composition of the protein extracts was linked to their origin and the digestive properties highlighted that animal and dairy proteins released peptides more rapidly, at least in the upper part of the digestive tract. However, not only proteins were targeted (i.e., also minerals, vitamin, etc.). Therefore, the search to reduce variables from 48 to 13 latent variables without losing information was explored in the statistical workflow developed. Such an approach permitted identifying protein extracts capable of satisfying the criteria applied. MCA unveiled complementary protein extracts. Their combination could not only offer a culinary aid, combining micro and macro nutrients, but also high levels of easily digested proteins and essential amino acids.
Such an approach can also be applied to other ingredients and is compatible with the incorporation of technical-functional and/or sensory data. Finally, the addition of culinary aids should be validated through a hedonic study.