Next Article in Journal
An Explorative Study of Circularity Practices in Swedish Manufacturing Companies
Previous Article in Journal
Ownership Characteristics and Financial Performance: Evidence from Chinese Split-Share Structure Reform
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparative Analysis of Support Vector Machine Regression and Gaussian Process Regression in Modeling Hydrogen Production from Waste Effluent

by
SK Safdar Hossain
1,*,
Bamidele Victor Ayodele
2,
Syed Sadiq Ali
1,
Chin Kui Cheng
3 and
Siti Indati Mustapa
4
1
Department of Chemical Engineering, College of Engineering, King Faisal University, P.O. Box 380, Al-Ahsa 31982, Saudi Arabia
2
Department of Chemical Engineering, Universiti Teknologi PETRONAS, Seri Iskandar 32610, Perak, Malaysia
3
Centre for Catalysis and Separation (CeCaS), Department of Chemical Engineering, College of Engineering, Khalifa University of Science and Technology, Abu Dhabi P.O. Box 127788, United Arab Emirates
4
Institute of Energy Policy and Research, Universiti Tenaga Nasional, Kajang 43000, Selangor, Malaysia
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(12), 7245; https://doi.org/10.3390/su14127245
Submission received: 14 May 2022 / Revised: 10 June 2022 / Accepted: 10 June 2022 / Published: 13 June 2022
(This article belongs to the Section Energy Sustainability)

Abstract

:
Organic-rich substrates from organic waste effluents are ideal sources for hydrogen production based on the circular economy concept. In this study, a data-driven approach was employed in modeling hydrogen production from palm oil mill effluents and activated sludge waste. Seven models built on support vector machine (SVM) and Gaussian process regression (GPR) were employed for the modeling of the hydrogen production from the waste sources. The SVM was incorporated with linear kernel function (LSVM), quadratic kernel function (QSVM), cubic kernel function (CSVM), and Gaussian fine kernel function (GFSVM). While the GPR was incorporated with the rotational quadratic kernel function (RQGPR), squared exponential kernel function (SEGPR), and exponential kernel function (EGPR). The model performance revealed that the SVM-based models did not show impressive performance in modeling the hydrogen production from the palm oil mill effluent, as indicated by the R2 of −0.01, 0.150, and 0.143 for LSVM, QSVM, and CSVM, respectively. Similarly, the SVM-based models did not perform well in modeling the hydrogen production from activated sludge, as evidenced by R2 values of 0.040, 0.190, and 0.340 for LSVM, QSVM, and CSVM, respectively. On the contrary, the SEGPR, RQGPR, SEGPR, and EGPR models displayed outstanding performance in modeling the prediction of hydrogen production from both oil palm mill effluent and activated sludge, with over 90% of the datasets explaining the variation in the model output. With the R2 > 0.9, the predicted hydrogen production was consistent with the SEGPR, RQGPR, SEGPR, and EGPR with minimized prediction errors. The level of importance analysis revealed that all the input parameters are relevant in the production of hydrogen. However, the influent chemical oxygen demand (COD) concentration and the medium temperature significantly influenced the hydrogen production from palm oil mill effluent, whereas the pH of the medium and the temperature significantly influenced the hydrogen production from the activated sludge.

1. Introduction

In the past two decades, there has been an increasing quest for transition into a low carbon economy due to the adverse effect of the utilization of energy derived from fossil fuels on the environment [1]. The utilization of fossil-fuel-derived energy for various human activities often results in the emission of environmentally unfriendly gaseous emissions, with climate change being one of the most undesirable effects [2]. To keep global warming to no more than 2 °C, the Paris climate agreement, signed in December 2015 and also further substantiated at the recently concluded COP 26 summit held at Glasgow, aims to limit it to 1.5 °C [3]. This has pushed several urgent international concerns on the continuous utilization of fossil fuel-derived energy to the forefront. The topmost questions being presently addressed are: what should be performed to avoid dangerous thresholds of climate change, how each type of fossil fuel should be lowered, and how could the production of renewable energy be hastened to replace fossil fuels in the short term? Several strategies have been proposed to answer these questions [4]. According to Fawzy et al. [4], these strategies can be categorized into two, namely, the conventional and negative emissions. The conventional strategies for climate change mitigation include the development and deployment of renewable energy sources, increasing use of nuclear power, carbon capture, storage and utilization, and fuel switch and efficiency gain. The negative emission strategies deal with the activities such as soil carbon sequestration, afforestation, reforestation, bioenergy carbon capture, and storage.
As part of the conventional strategies, there is growing interest in the development and deployment of hydrogen energy due to its several benefits as a renewable energy source [5]. Some of the benefits include zero-emission when used in a hydrogen fuel cell, robust reliability in terms of its usage in tough climatic conditions, high efficiency compared to other forms of renewable energy sources, and lower operational cost compared to renewable energy sources such as photovoltaic solar cell [6]. Several technological pathways have been explored for hydrogen production. Some of these technological pathways are heavily reliant on the use of fossil fuel sources such as natural gas as feedstock. However, there is a growing research interest in producing hydrogen from waste sources based on the circular economy concept, as shown in Table 1.
Table 1. Summary of studies on hydrogen production from waste sources.
Table 1. Summary of studies on hydrogen production from waste sources.
SourcesTechnological ProcessHydrogen ProducedReference
Food wasteGasification 1.2 tonne/h[7]
Municipal solid wasteElectrolysis Gasification43.1 MNm3
183 MNm3
[8]
Garden wasteDark fermentation 97 mL of H2/g[9]
Household wasteHydrothermal gasification 39 wt% biohydrogen yield[10]
Waste activated sludgeAnaerobic fermentation46.8 ± 1.0 mL/g [11]
Organic solid wastePyrolysis reforming11.49 mmol/g[12]
Waste activated sludgeDark fermentation11.01 ± 0.30 mL/g[13]
Palm oil mill effluentDark fermentation4.62 LH2/L[14]
Palm oil mill effluentMicrobial electrolysis92.72% biohydrogen yield[15]
Palm oil mill effluentDark fermentation3.07 ± 0.66 H2 yield mol-H2/mol-acetate[16]
As shown in Table 1, food wastes, municipal solid wastes, garden wastes, household wastes, waste-activated sludge, organic solid wastes, and palm oil mill effluent have been employed for hydrogen production. The hydrogen was produced from various sources using technological processes such as gasification, electrolysis, dark fermentation, hydrothermal gasification, anaerobic fermentation, pyrolysis reforming, and microbial electrolysis. The amounts of hydrogen produced were observed to vary with the type of technological process used. Moreover, experimental investigations have shown that the hydrogen produced can also be influenced by various process conditions temperature, pH of the medium, influent COD concentration, light intensity, agitation speed, and so on. It is unclear from the literature the inter-relationship between the various parameters and to what extent these parameters influence hydrogen production. The constraints can be resolved using a data-driven modeling approach such as SVM and GPR.
A variety of processes have been modeled using SVM and GPR. SVM and GPR were used by Zhao et al. [17] to model hydrogen production from biomass via the water gasification process. Both the SVM and GPR displayed robust prediction performance with R2 of 0.976, and 0.954, respectively. The application of SVM for the identification of leak possibilities in hydrogen fuel cell vehicles has been reported by Tian et al. [18]. The SVM modeling resulted in 95% diagnosis accuracy at constant diagnosis time. SVM has been reported to be robust in the predictive modeling of solar thermal energy systems [19]. However, the authors observed that the SVM predictive accuracy was less than other machine learning algorithms, such as random forest. Yoon and Moon [20] reported that the GPR model accurately models energy consumption in a commercial building. The modeling of hydrogen-rich syngas production from co-gasification agriculture waste using SVM and GPR has been reported by Bahadar et al. [19]. The hydrogen-rich syngas was accurately predicted by the SVM and GPR as indicated by the high R2 (>0.9), indicating a strong correlation between the actual and the predicted values. To the best of the authors’ knowledge, the comparative analysis of SVM regression and GPR in modeling hydrogen production from waste effluent has not been reported in the literature. Therefore, this study focuses on the comparative analysis of SVM and GPR in modeling hydrogen production from palm oil mill effluent and activated waste sludge. The input variables sensitivity analysis on the predicted hydrogen production was also investigated.

2. Experimental Details and Model Configuration

Two sources of waste, namely, activated sludge and palm oil mill effluent, were employed for biohydrogen production. The details of the experimental design and description have been reported in Yin and Wang [21] and Chong et al. [22]. As reported by Yin and Wang [21], Erlenmeyer flasks of 100 mL were used for batch experimental runs for biohydrogen production. Silicone rubber stoppers were used to prevent gas from escaping, 1 mol/L HCl and 1 mol/L NaOH were used to alter the pH in each batch. The temperature of the batch reactor was maintained at each stipulated value in the range of 25–40 °C by using a reciprocal shaker at 100 r/min. Each of the experimental runs was performed using 10 mL of sludge, which was pre-cultured, and 10 mL of nutrients. The culture medium was anaerobically gassed for 3 min before incubation. The cumulative hydrogen production was analyzed from each batch run using a stipulated gas chromatograph. As reported by Chong et al. [22], the experimental hydrogen production was carried out in a fermenter of 3 L in volume. The pH, temperature, and COD levels were varied based on the experimental design when introducing the POME into the fermenter. The hydrogen produced was analyzed using a gas chromatograph.
SVM and GPR with different kernel functions were employed for modeling the hydrogen production from the two waste substrates. One major advantage of the SVM regression is that it allows the specification of how much error can be tolerated in the model, and it is robust in locating a suitable hyperplane in higher dimensions to fit the data. The objective function of the SVM regression is to minimize the weight function (w) of the input variables represented in Equation (1) and the constraints represented in Equation (2).
Min   1 2 w 2
y i     w i x i   ε
where y i , x i , and ε , targeted variable, the input variables, and the maximum error function, respectively. The architecture of the SVM regression is depicted in Figure 1. It consists of the input layer which supplies the input variables, the hidden layer, and the output layer.
Although the SVM possesses the practical merit of producing a fair estimation of its own uncertainty, the GPR, on the other hand, has the main practical advantage of accurately evaluating its own level of uncertainty. The GPR may be thought of as kernelized Bayesian linear regression, where the kernel parameterization is controlled by choice of covariance/kernel function and the data used to generate predictions. By estimating the parameters of the Gaussian distribution in a provided observed training data, the GPR predicts a posterior Gaussian distribution for targets over test points X. The probabilistic model of Gaussian process functions across the training and testing datasets (f and f*, respectively is provided by Equation (3). A block matrix of kernel similarity matrices makes up the right-hand covariance matrices in Equation (3).
f f * ~ N 0 ,   K   X , X K X * X K X , X *   K X * , X *
Different regression learners with kernel functions such as squared exponential (Equation (4)), exponential (Equation (5), and rotational quadratic (Equation (6)) were used and trained to compare their results.
k z i , z j θ )   =     σ j 2 exp   1 2 z i     z j T z i     z j σ j 2
k z i ,   z j θ )   =   σ j 2 exp   z i     z j T ( z i     z j σ j  
k z i , z j θ )   =     σ j 2   1   +   z i     z j T ( z i     z j 2 2 α σ j 2 α
where σ j ,   σ f , are the characteristic length scale and the signal standard deviation for a covariance function k z i , z j . z i   and   z j are d-by-1 vectors. α   is the positive-valued scale-mixture parameter.
Different features and k-fold validations, as well as hyperparameter settings, were used to train the SVM and GPR models in the MATLAB environment. All the models were trained with distinct feature frames and parameters in mind. The SVM and GPR model performance were evaluated using root mean square error (RMSE), coefficient of determination (R2), and mean absolute errors (MAE). The MSE is a measure of how closely a regression line follows a collection of data points. This is accomplished by squaring the distances between the points on the plots and the regression line. The RMSE is a measure of the residuals’ standard deviation (prediction errors). The RMSE is also useful in quantifying the amount of variance that exists around the models. It can be represented by Equation (7). The R2 is a statistical metric that assesses the extent to which variations in one variable can be explained by changes in another one (Equation (8)). The MAE of the models with regard to a test set is the average of all individual prediction errors on all instances in the test set (Equation (9)).
RMSE   =   Y ¨ i Y i 2 n
where Y ¨ i and Y i are predicted and actual value of the hydrogen, n is the number of observed runs.
R 2 = Expand   variation   in   Y Total   variation   in   Y   =   1 i   =   1 n y ^ i     y i 2 i   =   1 n y i     y i 2
where y ^ i depicts the mean of the actual values of the hydrogen produced, y i is the actual value of the hydrogen produced and y i is a given point on the regression line.
MAE   =   i   =   n n Y ¨ i       Y i n
The independent variable importance analysis was conducted to determine the influence of the input parameters on the model output. The relative importance of the input parameters for a single response variable can be determined by deconstructing the model weights. Finding all weighted connections between the nodes of interest can reveal the relative relevance (or degree of association) of a certain explanatory variable for the response variable. That is, the weights that relate a certain input node to the response variable are identified in the hidden layer. The weights determine the relative importance of the information that is processed in the network so that the weights minimize input variables that have no relationship to a response variable.

3. Results and Discussion

3.1. Performance Analysis of the SVM and GPR Models

Figure 2 portrays the actual and predicted hydrogen production from the palm oil mill effluent, the details of the predictor importance analysis based on the measured values, and the predictor’s importance analysis. As shown in Figure 1, the SVM models integrated with the linear (LSVM), quadratic (QSVM), and cubic (CSVM) kernel functions did not display a notable performance in the predicting the hydrogen production from the activated sludge. The predicted hydrogen production from the three models was not consistent with the actual values, as indicated in Figure 1. As summarized in Table 2, the LSVM, QSVM, and CSVM poorly predict hydrogen production. This can be confirmed by the R2 values of −0.01, 0.150, and 0.143 for LSVM, QSVM, and CSVM, respectively. This implies that a small fraction of the variations of the predicted hydrogen production is explained by the LSVM, QSVM, and CSVM. The incorporation of the exponential kernel function significantly improves the predictability of the SVM in modeling the prediction of hydrogen production. The performance of the SFVM model can be confirmed from the R2 of 0.980, which implies that the model can explain over 90% variant in the predicted hydrogen production with minimized errors, as indicated by the RMSE and MAE values in Table 2.
The GPR models incorporated with rotational, squared exponential, and quadratic, exponential kernel functions displayed superior performance compared to the SVM models. As shown in Figure 2a, the predicted hydrogen agrees with the observed values. This can further be confirmed by the R2 value of 0.999, which implies that over 90% variation in the predicted hydrogen can be explained by the models. The hydrogen production from the palm oil mill effluent is observed to be significantly influenced in the low-temperature range of 23–30 °C, as depicted in Figure 2b. This phenomenon is typical of photocatalytic reactions for hydrogen production, as reported by Hisatomi et al. [23]. Furthermore, a positive influence observed at pH of 5.7, 7, and 8 is an indication that the palm oil mill effluent at the stipulated pH could enhance hydrogen production. Karimi et al. [24] reported that hydrogen production from glycerol, ethanol, and methanol is positively influenced by pH of 7.5–8.5. Using palm oil mill effluent with a high concentration of COD was observed to promote high hydrogen production, as shown in Figure 2b. Zhu et al. [25]) reported that the use of waste molasses with high COD concentration significantly enhances hydrogen production. The overall level of importance analysis of the various parameters is depicted in Figure 2c. It can be seen that the reaction temperature has the greatest influence on the predicted hydrogen production from the palm oil mill effluent. This is followed by the influent COD concentration, while the pH has the least influence.
The performance of the SVM and GPR models in predicting hydrogen production from activated sludge is depicted in Figure 3. As shown in Figure 3a, the SVM incorporated with linear, quadratic, and cubic kernel functions did not show impressive performance in modeling the prediction of the hydrogen production from activated sludge. The poor performance of the LSVM, QSVM, and CSVM is depicted by the large deviation of the predicted hydrogen production and the actual values obtained from the activated sludge. With R2 of 0.04, 0.190, and 0.340 obtained using the LSVM, QSVM, and CSVM models, respectively, for predicting the hydrogen production, a large portion of the datasets cannot explain the variation in the model output (predicted hydrogen). Hence, the models are not suitable for modeling the relationship between the input parameters and hydrogen production. Moreover, large prediction errors indicated by the RMSE and the MAE values also confirm the poor predictability of the LSVM, QSVM, and CSVM models. However, the incorporation of the fine Gaussian kernel functions into the SVM significantly improves the performance, as indicated by the R2 of 0.980. This is an indication that 98% of the datasets can explain the variation in the model output. Hence, the predicted hydrogen production is consistent with the actual values obtained from the activated sludge.
All the three kernel functions incorporated into the GPR showed impressive performance in predicting the hydrogen production from the activation sludge. The predicted hydrogen production is in proximity to the actual values as indicated by the R2. R2 of 0.999, 0.999, and 0.990 were obtained for the RGPR, SEGPR, and DEGPR, respectively. This indicates that the datasets can explain the variation in the model output with minimized prediction errors, as indicated by the RMSE and MAE values. As indicated by the R2 and the prediction errors, the RGPR, SEGPR, and DEGPR models are robust in modeling the non-linear relationship between the input parameters and the process output. Based on the GRP models, the level of importance analyses depicted in Figure 3a,b revealed that the hydrogen production from the activated sludge is significantly influenced by the temperature, pH, and glucose concentration. In Figure 3b, a positive influence at a temperature of 25 °C implies an enhanced hydrogen production at the stipulated temperature. Furthermore, hydrogen production from the activated sludge is most favored at a pH of 5 and glucose concentration of 20 g/L. The overall analysis of the level of importance of the input parameters revealed that the pH of the medium plays a vital role in influencing the model output. This is followed by the temperature and the glucose concentration.
The three-dimensional plots showing the comparison between the actual and predicted hydrogen produced from palm oil mill effluent and activated sludge are depicted in Figure 4 and Figure 5. The interaction effect of the two most significant factors, which are: influent COD concentration and temperature for hydrogen produced from the palm oil mill effluent, as well as pH and temperature for hydrogen produced from activated sludge, were also displayed on the plots. In Figure 4a–g, the actual hydrogen produced was compared with the predicted values using LSVM, QSVM, CSVM, FGSVM, RGPR, SEGPR, and DEGPR models. Figure 4a,b revealed that the actual hydrogen (dome shape) is not consistent with the predicted values (flat shape) by the LSVM and QSVM. Improved consistency between the actual and predicted is, however, observed using the CSVM model as depicted in Figure 4c. Figure 4d–g show a high degree of consistency between the actual and predicted hydrogen from the palm oil mill effluent. The plots also show that the two factors significantly influence the hydrogen produced from the oil palm mill effluent. An increase in the influent COD concentration up to 50 mg/L resulted in a corresponding increase in the amount of hydrogen produced per day. Similarly, optimum hydrogen production from the palm oil mill effluent is favorable at 32 °C. The interaction between the influent COD concentration and the temperature peaked at maximum hydrogen production of 1.1 m3 hydrogen per day.
Similarly, in Figure 5a–g, the actual hydrogen produced from activated sludge was compared to the predicted values using LSVM, QSVM, CSVM, FGSVM, RGPR, SEGPR, and DEGPR models. Figure 5a,b show that the actual hydrogen (dome shape) does not match the LSVM and QSVM predicted values (flat shape). The CSVM model, as shown in Figure 5c, improves the consistency between the actual and predicted values. Figure 5d–g indicate that the actual and predicted hydrogen from the activated sludge is quite consistent compared to Figure 5a,b. The plots also illustrate that the two most significant factors (pH and temperature) have a considerable impact on the amount of hydrogen produced from the activated sludge. When the pH was increased to 8.2, the amount of hydrogen produced per liter increased proportionally. At 32 °C, optimal hydrogen production of the activated sludge was attained.

3.2. Comparison of the Best Model with Literature and Practical Implications of the Study

The appraisal of the best models utilized for modeling hydrogen production from palm oil mill effluent and activated sludge waste with those reported in the literature are summarized in Table 3. As summarized in Table 3, various data-driven modeling techniques such as multilayer perceptron neural network (MLPNN), Levenberg–Marquardt neural network, scaled conjugate gradient neural network (SCGNN), least-square SVM, backpropagation neural network (BPNN), Ensemble Tree, ANN incorporated with sigmoid function and ANN incorporated with hyperbolic tangent function has been employed in modeling various processes. The performance of the FGSVM, RGPR, SEGPR, and DEGPR used in modeling hydrogen production from palm oil mill effluent and activated sludge wastes is comparable with the models reported in the literature, as indicated by the R2 values. The R2 of 0.999 obtained for the RGPR, SEGPR, and DEGPR models are consistent with that reported for modeling hydrogen production from biomass gasification [22], commercial proton exchange membrane fuel cell [23], photocatalytic hydrogen production from ethanol [24], hydrogen production through water-gas shift reaction [26], and hydrogen production by catalytic steam methane reforming [27]. This is an indication of the sturdiness of the GPR in modeling non-linear complex processes. The slight variation in the performance of the SVM and GPR used in modeling formic acid electro-oxidation with the present study could be a result of different model inputs, the datasets, and the complexity of the process. Furthermore, the SVM algorithm performs poorly compared to the GPR, possibly due to the lack of inbuilt mechanisms to handle datasets where target classes overlapped.
Table 3. Comparison of the best model with literature.
Table 3. Comparison of the best model with literature.
S/NProcessModelPerformance MatrixReference
1Hydrogen production from Palm oil mill effluentRGPR,
SEGPR,
DEGPR
R2 = 0.999, RMSE = 0.00258
R2 = 0.999, RMSE = 0.00258
R2 = 0.999, RMSE = 0.00258
This study
2Hydrogen production from waste activated sludgeFGSVM
RGPR
SEGPR
R2 = 0.980, RMSE = 133.290
R2 = 0.999, RMSE = 3.222
R2 = 0.999, RMSE = 13.222
This study
3Hydrogen production via biomass gasificationSVM
KNN
Decision tree
R2 = 0.997, RMSE = 1.631
R2 = 0.991, RMSE = 0.991
R2 = 0.991, RMSE = 0.944
[28]
4Commercial proton exchange membrane fuel cellSVM
MLP
R2 = 0.977, RMSE = 0.050
R2 = 0.995, RMSE = 0.03
[29]
5Photocatalytic hydrogen production from ethanolLMNN
SCGNN
R2 = 0.998, RMSE = 1.32 × 10−15
R2 = 0.997, RMSE = 2.21 × 10−15
[30]
6Hydrogen production through water-gas shift reactionLSSVMR2 = 0.998, RMSE = 0.0069[31]
7Co-gasification techniques for producing hydrogenSQGPR
RQGPR
R2 = 0.980, RMSE = 0.0089
R2 = 0.980, RMSE = 0.0088
[32]
8PEM fuel cell systemsBPNNR2 = N.R, RMSE = 0.02334[33]
9Formic acid electro-oxidation SVM, Ensemble Tree, and GPRR2 = 0.820, RMSE = N.R
R2 = 0.830, RMSE = N.R
R2 = 0.850, RMSE = N.R
[34]
10Catalytic steam methane reforming to hydrogenANN with Sigmoid function
ANN with Hyperbolic Tangent function
R2 = 0.997, RMSE = 0.652
R2 = 0.984, RMSE = 0.348
[31]
N.R. Not reported.
The SVM and GPR-based models can be incorporated into the design process of the hydrogen production process from palm oil mill effluent and activated sludge waste in the case that a scale-up process is necessary. Having the knowledge of how each of the input parameters influences the process outputs utilizing SVM and GPR-based models will help to identify how resources should be correctly allocated. In addition, a suitable control measure may be incorporated into the whole operation to cut down on the amount of wasted energy and resources.

4. Conclusions

The bulk of wastewater has a large proportion of readily biodegradable organic components, which allows it to avoid the need for pre-treatment and lets it be utilized directly as a possible substrate in hydrogen production bioreactors. This helps to overcome the necessities for the pre-treatment stage. This study has demonstrated the application of SVM and GPR models incorporated with different kernel functions for modeling the production of hydrogen production from palm oil mill effluent and activated sludge. The incorporation of the linear, quadratic, and cubic kernel function to the SVM did not result in any significant improvement in the predictive performance of the hydrogen production from the palm oil mill effluent and activated sludge, as indicated by the low R2 values and high prediction errors based on the RMSE and MAE. The GPR models with the various kernel functions show better performance in modeling the prediction of the hydrogen production from the palm oil mill effluent and the activated sludge. The predicted hydrogen was consistent with the actual values. The input parameters were found to significantly influence the hydrogen produced from the two sources. The level of importance study revealed that all the input parameters are important in the process of producing hydrogen. However, the concentration of COD in the influent water and the temperature of the medium had a substantial impact on the amount of hydrogen that was produced from palm oil mill effluent. On the other hand, the pH of the medium and the temperature had a substantial impact on the amount of hydrogen that was produced by the activated sludge. This study could serve as a reference point in the case of a scale-up; real-time optimization of the hydrogen production from waste effluent will be made possible with a thorough understanding of how various process factors influence the process.

Author Contributions

Conceptualization, B.V.A. and S.S.H.; methodology, B.V.A.; software, B.V.A. and S.S.H.; validation, S.S.H., B.V.A. and S.S.A.; formal analysis, B.V.A.; investigation, B.V.A.; resources, S.S.H.; data curation, S.S.H.; writing—original draft preparation, B.V.A.; writing—review and editing, S.S.H., C.K.C. and S.I.M.; visualization, S.S.H. and S.I.M.; supervision, S.S.H.; project administration, S.S.H.; funding acquisition, S.S.H. All authors have read and agreed to the published version of the manuscript.

Funding

The authors acknowledge the Deanship of Scientific Research at King Faisal University for the financial support under the Research Group Support Track (Grant no. 1811012).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Viškovi, A.; Franki, V.; Baši, A. City-Level Transition to Low-Carbon Economy. Energies 2022, 15, 1737. [Google Scholar] [CrossRef]
  2. Mensah, I.A.; Sun, M.; Gao, C.; Omari-Sasu, A.Y.; Zhu, D.; Ampimah, B.C.; Quarcoo, A. Analysis on the nexus of economic growth, fossil fuel energy consumption, CO2 emissions and oil price in Africa based on a PMG panel ARDL approach. J. Clean. Prod. 2019, 228, 161–174. [Google Scholar] [CrossRef]
  3. Skytt, T.; Nielsen, S.N.; Jonsson, B.-G. Global warming potential and absolute global temperature change potential from carbon dioxide and methane fluxes as indicators of regional sustainability—A case study of Jämtland, Sweden. Ecol. Indic. 2020, 110, 105831. [Google Scholar] [CrossRef]
  4. Fawzy, S.; Osman, A.I.; Doran, J.; Rooney, D.W. Strategies for mitigation of climate change: A review. Environ. Chem. Lett. 2020, 18, 2069–2094. [Google Scholar] [CrossRef]
  5. Fan, M.-S.; Abdullah, A.Z.; Bhatia, S. Hydrogen production from carbon dioxide reforming of methane over Ni–Co/MgO–ZrO2 catalyst: Process optimization. Int. J. Hydrogen Energy 2011, 36, 4875–4886. [Google Scholar] [CrossRef]
  6. Ally, J.; Pryor, T. Life cycle costing of diesel, natural gas, hybrid and hydrogen fuel cell bus systems: An Australian case study. Energy Policy 2016, 94, 285–294. [Google Scholar] [CrossRef]
  7. Xu, Z.; Qi, H.; Yao, D.; Zhang, J.; Zhu, Z.; Wang, Y.; Cui, P. Modeling and comprehensive analysis of food waste gasification process for hydrogen production. Energy Convers. Manag. 2022, 258, 115509. [Google Scholar] [CrossRef]
  8. Borgogna, A.; Centi, G.; Iaquaniello, G.; Perathoner, S.; Papanikolaou, G.; Salladini, A. Assessment of hydrogen production from municipal solid wastes as competitive route to produce low-carbon H2. Sci. Total Environ. 2022, 827, 154393. [Google Scholar] [CrossRef]
  9. Ramprakash, B.; Incharoensakdi, A. Dark fermentative hydrogen production from pretreated garden wastes by Escherichia coli. Fuel 2022, 310, 122217. [Google Scholar] [CrossRef]
  10. Rajagopal, J.; Gopinath, K.P.; Neha, R.; Aakriti, K.; Jayaraman, R.S.; Arun, J.; Pugazhendhi, A. Processing of household waste via hydrothermal gasification and hydrothermal liquefaction for bio-oil and bio-hydrogen production: Comparison with RSM studies. J. Environ. Chem. Eng. 2022, 10, 107218. [Google Scholar] [CrossRef]
  11. Wang, Y.; Wei, W.; Dai, X.; Ni, B.J. Corncob ash boosts fermentative hydrogen production from waste activated sludge. Sci. Total Environ. 2022, 807, 151064. [Google Scholar] [CrossRef]
  12. Zhang, L.; Chao, B.; Zhang, X. Modeling and optimization of microbial lipid fermentation from cellulosic ethanol wastewater by Rhodotorula glutinis based on the support vector machine. Bioresour. Technol. 2020, 301, 122781. [Google Scholar] [CrossRef]
  13. Li, X.; Sui, K.; Zhang, J.; Liu, X.; Xu, Q.; Wang, D.; Yang, Q. Revealing the mechanisms of rhamnolipid enhanced hydrogen production from dark fermentation of waste activated sludge. Sci. Total Environ. 2022, 806, 150347. [Google Scholar] [CrossRef]
  14. Rosa, D.; Medeiros, A.B.P.; Martinez-Burgos, W.J.; Nascimento, J.R.D.; de Carvalho, J.C.; Sydney, E.B.; Soccol, C.R. Biological hydrogen production from palm oil mill effluent (POME) by anaerobic consortia and Clostridium beijerinckii. J. Biotechnol. 2020, 323, 17–23. [Google Scholar] [CrossRef]
  15. Krishnan, S.; Din, M.F.M.; Taib, S.M.; Nasrullah, M.; Sakinah, M.; Wahid, Z.A.; Kamyab, H.; Chelliapan, S.; Rezania, S.; Singh, L. Accelerated two-stage bioprocess for hydrogen and methane production from palm oil mill effluent using continuous stirred tank reactor and microbial electrolysis cell. J. Clean. Prod. 2019, 229, 84–93. [Google Scholar] [CrossRef]
  16. Mishra, P.; Singh, L.; Ab Wahid, Z.; Krishnan, S.; Rana, S.; Islam, M.A.; Sakinah, M.; Ameen, F.; Syed, A. Photohydrogen production from dark-fermented palm oil mill effluent (DPOME) and statistical optimization: Renewable substrate for hydrogen. J. Clean. Prod. 2018, 199, 11–17. [Google Scholar] [CrossRef]
  17. Zhao, S.; Li, J.; Chen, C.; Yan, B.; Tao, J.; Chen, G. Interpretable machine learning for predicting and evaluating hydrogen production via supercritical water gasification of biomass. J. Clean. Prod. 2021, 316, 128244. [Google Scholar] [CrossRef]
  18. Tian, Y.; Zou, Q.; Jin, Z.; Lin, Z. Data-driven diagnosis of the high-pressure hydrogen leakage in fuel cell vehicles based on relevance vector machine. Int. J. Hydrogen Energy 2022, 47, 12281–12292. [Google Scholar] [CrossRef]
  19. Bahadar, A.; Kanthasamy, R.; Sait, H.H.; Zwawi, M.; Algarni, M.; Ayodele, B.V.; Cheng, C.K.; Wei, L.J. Elucidating the effect of process parameters on the production of hydrogen-rich syngas by biomass and coal Co-gasification techniques: A multi-criteria modeling approach. Chemosphere 2022, 287, 132052. [Google Scholar] [CrossRef]
  20. Ahmad, M.W.; Reynolds, J.; Rezgui, Y. Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees. J. Clean. Prod. 2018, 203, 810–821. [Google Scholar] [CrossRef]
  21. Yoon, Y.R.; Moon, H.J. Energy consumption model with energy use factors of tenants in commercial buildings using Gaussian process regression. Energy Build. 2018, 168, 215–224. [Google Scholar] [CrossRef]
  22. Yin, Y.; Wang, J. Optimization of Hydrogen Production by Response Surface Methodology Using γ-Irradiated Sludge as Inoculum. Energy Fuels 2016, 30, 4096–4103. [Google Scholar] [CrossRef]
  23. Chong, M.-L.; Rahman, N.A.A.; Rahim, R.A.; Aziz, S.A.; Shirai, Y.; Hassan, M.A. Optimization of biohydrogen production by Clostridium butyricum EB6 from palm oil mill effluent using response surface methodology. Int. J. Hydrogen Energy 2009, 34, 7475–7482. [Google Scholar] [CrossRef] [Green Version]
  24. Hisatomi, T.; Maeda, K.; Takanabe, K.; Kubota, J.; Domen, K. Aspects of the water splitting mechanism on (Ga 1-xZn x)(N 1-xO x) photocatalyst modified with Rh 2-yCr yO 3 cocatalyst. J. Phys. Chem. C 2009, 113, 21458–21466. [Google Scholar] [CrossRef]
  25. Estahbanati, M.K.; Mahinpey, N.; Feilizadeh, M.; Attar, F.; Iliuta, M.C. Kinetic study of the effects of pH on the photocatalytic hydrogen production from alcohols. Int. J. Hydrogen Energy 2019, 44, 32030–32041. [Google Scholar] [CrossRef]
  26. Zhu, G.; Liu, C.; Li, J.; Gao, Y.; Jin, H. Effect of chemical oxygen demand (COD) on hydrogen production from waste molasses in continuous stirred tank reactor (CSTR). In Proceedings of the 2010 International Conference on Mechanic Automation and Control Engineering, Wuhan, China, 26–28 June 2010; pp. 4058–4061. [Google Scholar]
  27. Ozbas, E.E.; Aksu, D.; Ongen, A.; Aydin, M.; Ozcan, H.K. Hydrogen production via biomass gasification, and modeling by supervised machine learning algorithms. Int. J. Hydrogen Energy 2019, 44, 17260–17268. [Google Scholar] [CrossRef]
  28. Kheirandish, A.; Shafiabady, N.; Dahari, M.; Kazemi, M.S.; Isa, D. Modeling of commercial proton exchange membrane fuel cell using support vector machine. Int. J. Hydrogen Energy 2016, 41, 11351–11358. [Google Scholar] [CrossRef]
  29. Mageed, A.K. Modeling photocatalytic hydrogen production from ethanol over copper oxide nanoparticles: A comparative analysis of various machine learning techniques. Biomass Convers. Biorefinery 2021, in press. [Google Scholar] [CrossRef]
  30. Amiri, S.; Karimi, E. Estimation of catalytic hydrogen production through water-gas shift reaction using a machine learning technique. Energy Sources Part A Recover. Util. Environ. Eff. 2021, in press. [Google Scholar] [CrossRef]
  31. Ayodele, B.V.; Alsaffar, M.A.; Mustapa, S.I.; Adesina, A.; Kanthasamy, R.; Witoon, T.; Abdullah, S. Process intensification of hydrogen production by catalytic steam methane reforming: Performance analysis of multilayer perceptron-artificial neural networks and nonlinear response surface techniques. Process Saf. Environ. Prot. 2021, 156, 315–329. [Google Scholar] [CrossRef]
  32. Wang, Y.; Tang, M.; Ling, J.; Wang, Y.; Liu, Y.; Jin, H.; He, J.; Sun, Y. Modeling biohydrogen production using different data driven approaches. Int. J. Hydrogen Energy 2021, 46, 29822–29833. [Google Scholar] [CrossRef]
  33. Deng, H.; Hu, W.; Cao, D.; Chen, W.; Huang, Q.; Chen, Z.; Blaabjerg, F. Degradation trajectories prognosis for PEM fuel cell systems based on Gaussian process regression. Energy 2022, 244, 122569. [Google Scholar] [CrossRef]
  34. Hossain, S.S.; Ali, S.S.; Rushd, S.; Ayodele, B.V.; Cheng, C.K. Interaction effect of process parameters and Pd-electrocatalyst in formic acid electro-oxidation for fuel cell applications: Implementing supervised machine learning algorithms. Int. J. Energy Res. 2022. [Google Scholar] [CrossRef]
Figure 1. Architecture of the SVM regression showing the input, hidden, and the output nodes.
Figure 1. Architecture of the SVM regression showing the input, hidden, and the output nodes.
Sustainability 14 07245 g001
Figure 2. (a) Plots showing actual and predicted hydrogen from the palm oil mill effluent (b) Details of level of importance analysis (c) independent variable importance analysis.
Figure 2. (a) Plots showing actual and predicted hydrogen from the palm oil mill effluent (b) Details of level of importance analysis (c) independent variable importance analysis.
Sustainability 14 07245 g002
Figure 3. (a) Plots showing actual and predicted hydrogen from the activated sludge (b) Details of level of importance analysis (c) independent variable importance analysis.
Figure 3. (a) Plots showing actual and predicted hydrogen from the activated sludge (b) Details of level of importance analysis (c) independent variable importance analysis.
Sustainability 14 07245 g003
Figure 4. Effect of influent COD concentrations and temperature on actual and predicted hydrogen by (a) LSVM (b) QSVM (c) CSVM (d) FGSVM (e) RGPR (f) SEGPR and (g) DEGPR using palm oil mill effluent.
Figure 4. Effect of influent COD concentrations and temperature on actual and predicted hydrogen by (a) LSVM (b) QSVM (c) CSVM (d) FGSVM (e) RGPR (f) SEGPR and (g) DEGPR using palm oil mill effluent.
Sustainability 14 07245 g004aSustainability 14 07245 g004b
Figure 5. Effect of influent COD concentrations and temperature on actual and predicted hydrogen by (a) LSVM (b) QSVM (c) CSVM (d) FGSVM (e) RGPR (f) SEGPR and (g) DEGPR using activated sludge.
Figure 5. Effect of influent COD concentrations and temperature on actual and predicted hydrogen by (a) LSVM (b) QSVM (c) CSVM (d) FGSVM (e) RGPR (f) SEGPR and (g) DEGPR using activated sludge.
Sustainability 14 07245 g005aSustainability 14 07245 g005b
Table 2. Comparative analysis of the SVM and GPR in predicting the hydrogen production.
Table 2. Comparative analysis of the SVM and GPR in predicting the hydrogen production.
SubstratePerformance MatrixLSVMQSVMCSVMFGSVMRGPRSEGPRDEGPR
RMSE0.1480.1350.1110.0200.002580.002580.00258
Palm oil mill effluentR2−0.0100.1500.1430.9800.9990.9990.999
MAE0.1250.1130.0950.0190.001450.001450.00145
RMSE1000.1918.19830.3133.3913.22213.222173.26
SludgeR20.0400.1900.3400.9800.9990.9990.970
MAE786.29701.36598.64128.056.1026.10289.332
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Hossain, S.S.; Ayodele, B.V.; Ali, S.S.; Cheng, C.K.; Mustapa, S.I. Comparative Analysis of Support Vector Machine Regression and Gaussian Process Regression in Modeling Hydrogen Production from Waste Effluent. Sustainability 2022, 14, 7245. https://doi.org/10.3390/su14127245

AMA Style

Hossain SS, Ayodele BV, Ali SS, Cheng CK, Mustapa SI. Comparative Analysis of Support Vector Machine Regression and Gaussian Process Regression in Modeling Hydrogen Production from Waste Effluent. Sustainability. 2022; 14(12):7245. https://doi.org/10.3390/su14127245

Chicago/Turabian Style

Hossain, SK Safdar, Bamidele Victor Ayodele, Syed Sadiq Ali, Chin Kui Cheng, and Siti Indati Mustapa. 2022. "Comparative Analysis of Support Vector Machine Regression and Gaussian Process Regression in Modeling Hydrogen Production from Waste Effluent" Sustainability 14, no. 12: 7245. https://doi.org/10.3390/su14127245

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop