Next Article in Journal
Oil Price Volatility and MENA Stock Markets: A Comparative Analysis of Oil Exporters and Importers
Previous Article in Journal
Sequence-to-Sequence Deep Learning for Urban Water Demand Forecasting
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Hybrid Chemical and Data-Driven Model for Stiff Chemical Kinetics Using a Physics-Informed Neural Network †

1
Fariborz Maseeh Department of Civil Architectural and Environmental Engineering, The University of Texas at Austin, Austin, TX 78712, USA
2
Division of Applied Mathematics, Brown University, Providence, RI 02906, USA
3
Department of Industrial Engineering, University of Bologna, 40126 Bologna, Italy
*
Author to whom correspondence should be addressed.
Presented at the 3rd International Joint Conference on Water Distribution Systems Analysis & Computing and Control for the Water Industry (WDSA/CCWI 2024), Ferrara, Italy, 1–4 July 2024.
Eng. Proc. 2024, 69(1), 40; https://doi.org/10.3390/engproc2024069040
Published: 3 September 2024

Abstract

:
Models of chemical kinetic processes, comprising systems of stiff ordinary differential equations (ODEs), are essential for modeling important chemical reactions relevant to drinking water chemistry, such as disinfectant decay and disinfection byproduct formation. However, the accuracy of these models can be inhibited by (1) the challenge of fully describing the chemical reaction system, and (2) additional chemical reactions occurring in actual environmental settings that were not accounted for in the laboratory conditions used to develop and calibrate the models. This study proposes the use of a Physics-Informed Neural Network framework, utilizing the eXtreme Theory of Functional Connections (X-TFC) technique to create a hybrid chemical- and data-driven model that incorporates data and the underlying system of ODEs into the trained model in order to increase the accuracy of the predicted chemical concentrations.

1. Introduction

Key characteristics of water quality paramount for the protection of public health, such as decay of free chlorine, monochloramine, and formation of disinfection byproducts, can be modeled using a system of stiff ordinary differential equations (ODEs). Despite the widespread adoption of stiff chemical models, these may yield inaccurate results for two key reasons. First, some of the underlying chemical mechanisms occurring in the actual reaction scheme may not be discovered or fully described in the description of the chemical model [1]. Second, additional reactions that occur in actual environmental systems may not have been accounted for in the laboratory conditions in which the model was developed [2].
Therefore, we propose a Physics-Informed Neural Network (PINN) framework based on the eXtreme Theory of Functional Connections (X-TFC), which has been demonstrated to effectively model stiff systems of chemical equations [3,4]. Previous implementations for solving stiff ODEs use X-TFC as an equation-solving method rather than a predictive modeling tool [4]. Therefore, the purpose of this work is to (1) expand the capabilities of the X-TFC framework by including pH, a key parameter that affects the rate of many aquatic chemical reactions, as a second input variable in order to predict chemical concentrations over a range of water quality conditions; and (2) incorporate data into the model training process. The outcome is a model that relies on the underlying system of chemical equations and data in order to increase prediction accuracy. We demonstrate the proposed method using a subset of the chemical model used to simulate the decay of monochloramine, a disinfectant used by approximately 25% of water utilities in the United States [5].

2. Methods

2.1. X-TFC Overview

PINNs have been shown to have difficulty solving stiff systems of chemical equations with initial conditions due to competition between loss functions representing the initial conditions and gradient loss components [4]. Therefore, we employ the X-TFC framework, which has been shown to effectively solve stiff systems of chemical equations with high accuracy and speed. The key feature of the X-TFC framework is the use of a constrained expression to analytically satisfy the initial conditions, thus eliminating the competition between different components in the loss function.
Consider a system of constrained ODEs for different species and pH values with initial conditions:
d y i ( t ; p h ) d t = f i t , y i , , y m ; p h t t 0 , t f   y t 0 ; p h = y i , 0
where i = 1 , , m is the species, y i t is the concentration of a chemical species i at time t , t 0 is the initial time, t f is the final time, and y i , 0 and f i t , y i , , y m ; p h are the initial condition and rate expression of a chemical species i , respectively. In the X-TFC framework, the concentration of a species y i is represented with a constrained expression where y i y c e = g t ; p h + y t 0 ; p h g t 0 ; p h . Hence, when t = t 0 , the constrained expression is guaranteed to satisfy the initial conditions. The constrained expression is substituted back into the ODEs, resulting in a simplified unconstrained system of ODEs. The function g t ; p h is defined as the free function, which in the X-TFC framework is approximated using a single-layer neural network (NN).

2.2. Neural Network Architecture

The single-layer NN, g t ; p h , has two inputs: time and pH. The number of outputs is equal to the number of chemical species included in the chemical system. The number of nodes in the hidden layer is a tunable hyperparameter. The tanh activation function, which was shown to be effective, is used [4]. The values of t and p h are scaled to a domain of [0, 1] or [−1, 1] to coincide with the domain of the tanh activation function.

2.3. Neural Network Loss Functions

The loss function used to train the NN includes the gradient loss and, if partial data are available, the data loss components. The gradient loss incorporates information about the chemical kinetics by minimizing the mismatch between the time derivates, d y i ( t ; p h ) d t , and rate of change, f i t , y c e , i , , y c e , m ; p h . An additional data loss component is added for each species that has data available that minimizes the difference between the data, y ~ i t ; p h , and NN prediction, y c e , i t ; p h .

2.4. Neural Network Optimization

The NN is trained via the extreme learning machine technique, in which the values of input weights and biases at each hidden node are randomly sampled from a uniform distribution between [−1,1] and held constant [6]. The values of the output weights are tuned during model training. The NN is linear with respect to the output weights, enabling the use of iterative least squares regression to optimize the NN output weights extremely quickly.

3. Results

We demonstrate our proposed methodology using the first three reactions of the unified model of monochloramine decay [7]. The reaction scheme contains four chemical species, monochloramine (NH2Cl), dichloramine, (NHCl2), hypochlorous acid (HOCl), and ammonia (NH3) [7]. We train the proposed X-TFC model using the chemical equations for in-sample pH values and training data for a subset of species. Then, we evaluate the performance of the trained model using out-of-sample pH values not used in training and compare the performance to hold-out species for which data were not included in the model training. In the X-TFC model, and in the figures below, the concentrations of the acid and base species (HOCl and NH3) are represented by the total amount of conjugate acid and base for each species. The initial concentration of monochloramine was set as 4.22 · 10 5 mol/L, corresponding to 3.0 mg/L as Cl2, which is a typical monochloramine concentration for treatment plant effluent entering a water distribution system [8]. To generate data, we solved the system of equations with increased rate constants, which we consider to be the ground truth in this scenario. If implemented on a system with measured data, the measured data would be used instead.
Figure 1 shows the results of the trained X-TFC model, using data for TOTNH and TOTCl. In this figure, chemical model refers to the result of the chemical model with the original rate constants solved using ode15s, training data refers to data that were used to train the model for in-sample pH values, hold-out data refers to data for out-of-sample pH values, and hold-out species refers to species for which data were not included during training. Thus, the X-TFC model relies on the chemical model and data for a subset of chemical species to predict concentrations of species for which no data were incorporated into the training of the X-TFC model. The key finding is that the X-TFC model is able to accurately predict the concentrations of hold-out species NH2Cl for in-sample and out-of-sample pH values (solid lines align closely with dotted lines), even though no data were supplied for this species was included during model training.

4. Discussion and Conclusions

This work proposes a PINN-based modeling framework that improves the accuracy of chemical predictions for chemical systems, with multiple implications and opportunities for future work. We observed that the modeling framework is able to accurately predict the concentrations of hold-out species, as shown in Figure 1. However, the ability to predict hold-out species is dependent on the dynamics of the chemical system and which species are used for training data. The proposed modeling framework could be expanded in many different directions, for example, coupling X-TFC with hydraulic models of a water distribution system to approximate chemical concentrations at a particular location within the system. In addition, because the X-TFC model trains extremely quickly (the training time for the X-TFC model shown in Figure 1 was under 5 s), there is potential to use the X-TFC model as a data assimilation technique for real-time predictions of water quality at unmeasured locations within a water distribution system. Opportunities for future work include developing optimized X-TFC model hyperparameters, utilizing different optimization methods, and implementing additional chemical reaction schemes.
In the face of rising water quantity and quality concerns and the continuous development of chemical kinetic process models, the impact of the proposed X-TFC model framework is far-reaching [9]. In addition, systems of stiff differential equations are used in a variety of disciplines, such as atmospheric chemistry, biology, and epidemiology, showing promise for many use cases of a hybrid modeling framework in multiple domains.

Author Contributions

Conceptualization, M.F. and L.S.; methodology, M.F., M.D.F., E.S. and L.S.; software, M.F., M.D.F. and E.S.; formal analysis, M.F.; writing—original draft preparation, M.F.; writing—review and editing, M.D.F., E.S. and L.S.; supervision, L.S.; funding acquisition, L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the United States National Science Foundation under Grant 1953206.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Pham, H.T.; Wahman, D.G.; Fairey, J.L. Updated reaction pathway for dichloramine decomposition: Formation of reactive nitrogen species and N-nitrosodimethylamine. Environ. Sci. Technol. 2021, 55, 1740–1749. [Google Scholar] [CrossRef] [PubMed]
  2. De Santi, M.; Khan, U.T.; Arnold, M.; Fesselet, J.F.; Ali, S.I. Forecasting point-of-consumption chlorine residual in refugee settlements using ensembles of artificial neural networks. NJP Clean Water 2021, 4, 35. [Google Scholar] [CrossRef]
  3. Karniadakis, G.E.; Kevrekidis, I.G.; Perdikaris, P.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
  4. De Florio, M.; Schiassi, E.; Furfaro, R. Physics-informed neural networks and functional interpolation for stiff chemical kinetics. Chaos 2022, 32, 063107. [Google Scholar] [CrossRef] [PubMed]
  5. American Water Works Association. 2017 Water Utility Disinfection Survey Report. Available online: https://www.awwa.org/Portals/0/AWWA/ETS/Resources/2017DisinfectionSurveyReport.pdf?ver=2018-12-21-163548-830 (accessed on 12 March 2024).
  6. Huang, G.; Huang, G.B.; Song, S.; You, K. Trends in extreme learning machines: A review. Neural Netw. 2015, 61, 32–48. [Google Scholar] [CrossRef] [PubMed]
  7. Jafvert, T.J.; Valentine, R.L. Reaction Scheme for the Chlorination of Ammoniacal Water. Environ. Sci. Technol. 1992, 26, 557–586. [Google Scholar] [CrossRef]
  8. Austin Water Utility. Water Quality Report. Available online: https://www.austintexas.gov/sites/default/files/files/Water/WaterQualityReports/AW_Water_Quality_Report_Austin_2021.pdf (accessed on 12 March 2024).
  9. Brodfuehrer, S.H. Kinetics of Haloamines during Chloramination of Bromide-Containing Waters: Impact of Acid/Base Catalysis and Natural Organic Matter on Haloamine Formation and Decay. Ph.D. Thesis, University of Texas at Austin, Austin, TX, USA, 2022. [Google Scholar]
Figure 1. X-TFC model results using training data for TOTNH and TOTCl.
Figure 1. X-TFC model results using training data for TOTNH and TOTCl.
Engproc 69 00040 g001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Frankel, M.; De Florio, M.; Schiassi, E.; Sela, L. Hybrid Chemical and Data-Driven Model for Stiff Chemical Kinetics Using a Physics-Informed Neural Network. Eng. Proc. 2024, 69, 40. https://doi.org/10.3390/engproc2024069040

AMA Style

Frankel M, De Florio M, Schiassi E, Sela L. Hybrid Chemical and Data-Driven Model for Stiff Chemical Kinetics Using a Physics-Informed Neural Network. Engineering Proceedings. 2024; 69(1):40. https://doi.org/10.3390/engproc2024069040

Chicago/Turabian Style

Frankel, Matthew, Mario De Florio, Enrico Schiassi, and Lina Sela. 2024. "Hybrid Chemical and Data-Driven Model for Stiff Chemical Kinetics Using a Physics-Informed Neural Network" Engineering Proceedings 69, no. 1: 40. https://doi.org/10.3390/engproc2024069040

Article Metrics

Back to TopTop