Development and Optimization of a Novel Soft Sensor Modeling Method for Fermentation Process of Pichia pastoris
Abstract
:1. Introduction
2. Methods
2.1. Principle and Solution of Balanced Distribution Adaptation
2.2. Improving Particle Swarm—Least Squares Support Vector Machine Algorithm
2.2.1. Least Squares Support Vector Machine
2.2.2. Improved Particle Swarm Optimization
2.3. Soft Sensor Modeling Based on BDA-IPSO-LSSVM
2.4. Introduction of the Pichia pastoris Experimental Work
- Step 1:
- Carry out Pichia pastoris fermentation experiments, build the datasets, and normalize the dataset.
- Step 2:
- Establish the IPSO-LSSVM model: the first step is to determine the parameters of the LSSVM model, including regularization parameter and kernel width . IPSO-LSSVM is an optimization method based on the improved particle swarm algorithm, which automatically selects the optimal model parameters. The IPSO-LSSVM uses the IPSO algorithm proposed in this paper to automatically obtain regularization parameter and kernel width .
- Step 3:
- Train the IPSO-LSSVM model using labeled source domain data : labeled source domain data can be used to train the initial IPSO-LSSVM model, which can serve as the starting point for the iterative process, helping to improve the subsequent optimization results.
- Step 4:
- Obtain soft label for target domain data by iteratively inputting unlabeled data into the IPSO-LSSVM model: Since the target domain data are unlabeled, the unlabeled target domain data are input into the IPSO-LSSVM model obtained in step 3 to generate predicted values , which are then used as soft labels. The soft labels are then combined with the target domain data .
- Step 5:
- Compute the transformation matrix A using the source domain data, target domain data, and soft labels as inputs for the improved BDA algorithm: the improved BDA algorithm utilizes the source domain data , target domain data , and target domain data soft labels to compute a transformation matrix A that matches the source domain data and the target domain data, facilitating the transfer of knowledge from the source domain to the target domain.
- Step 6:
- Input the matched source domain and target domain into the IPSO-LSSVM model to obtain the actual predicted key parameters in the fermentation process of Pichia pastoris.
- The fermentation system was sterilized and the bacterial strain was cultured according to the requirements of the Pichia pastoris fermentation process. The medium was sterilized at 130 °C for 30 min, and the bacterial strain was inoculated by flame when the temperature dropped to 30 °C. The initial fermentation conditions were set: initial tank pressure control at 0.02~0.05 MPa; pH control at 5.0; temperature control at 28 °C; speed set at 300~400 rpm; and airflow velocity control at 150~300 L/M.
- We selected the stirring speed v, temperature T, airflow q, pH of the fermentation liquid, dissolved oxygen Do, and fermenter pressure P as auxiliary variables by the absolute relation degree method. All auxiliary variables were transmitted to the database through the distributed control system. The auxiliary variables in this experiment were sampled every 0.5 h.
- We selected different batches of Pichia pastoris fermentation data as data samples. Since the fermentation cycle of Pichia pastoris is 90 h, each batch contained 180 data samples. We used auxiliary variables as input variables, cell density and product concentration as output variables, and input the data into the established Pichia pastoris fermentation soft sensor model to complete the establishment of the soft sensor model of the Pichia pastoris fermentation process and realize real-time prediction of key biological parameters. The root mean square error (RMSE), coefficient of determination (R2), and mean absolute error (MAE) and floating point operations (GFLOPs) were used as performance evaluation indicators for the soft sensor model. The calculation formulas are as follows:
3. Result and Discussion
3.1. Result
3.2. Discussion
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
BDA | balanced distribution adaptation |
LSSVM | least squares support vector machine |
IPSO | improved particle swarm optimization |
LSTM | Long Short Term Memory |
GWO | Grey Wolf Optimization |
ABC | Artificial Bee Colony |
MMD | Maximum Mean Discrepancy |
H | Reproducing Kernel Hilbert Space |
KKT | Karush–Kuhn–Tucker |
PSO | Particle Swarm Optimization |
EPSO | Emotional Particle Swarm Optimization |
RMSE | Root Mean Square Error |
R2 | R-Square |
FLOPs | Floating Point Operations |
MAE | Mean Absolute Error |
Nomenclature
the source domain data | |
i-th input source domain data | |
i-th output source domain data | |
n | the amount of data in the source domain |
target domain data | |
j-th input source domain data | |
j-th output source domain data | |
m | the amount of data in the target domain |
the source domain future space | |
the target domain future space | |
the source domain label space | |
the target domain label space | |
the source domain marginal distribution | |
the target domain marginal distribution | |
the source domain conditional distribution | |
the target domain conditional distribution | |
the balance factor | |
the sample set of class c in the source domain | |
the sample set of class c in the target domain | |
C | the number of classes |
c | the c-th classes |
the number of samples in | |
the number of samples in | |
the z-th source domain i-th data | |
the z-th source domain 5-th percentile values | |
the z-th source domain 50-th percentile values | |
the z-th source domain 95-th percentile values | |
the membership degree of in class p in the source domain | |
the membership degree of in class q in the target domain | |
the matrices for maximum mean discrepancy | |
the regularization parameter | |
A | the transformation matrix |
X | the input matrix composed of and |
the Hilbert–Schmidt norm | |
E | the identity matrix |
the Lagrange multiplier | |
the weight vector | |
a nonlinear function that maps the data to a high-dimensional space | |
the regularization parameter | |
the error introduced by the samples | |
b | the constant bias |
the LSSVM optimization objective | |
the Lagrange multiplier for the i-th constraint | |
an l∗l identity matrix | |
the kernel matrix | |
the kernel width | |
the RBF kernel | |
The emotional state of the i-th particle in the t-th iteration | |
the increase of emotional state | |
the increase of emotional state | |
the position of the i-th particle during the t-th iteration | |
the global best position at the t-th iteration | |
the lobal worst position at the t-th iteration | |
the particle swarm | |
the i-th particle position | |
the global perception | |
The historical perception | |
k | a constant factor |
S(·) | the stimulus function |
the stimulus threshold | |
gBest | the historical best position of the particle swarm |
the historical position of the i-th particle | |
the upper limit of the particle search range | |
l | the lower limit of the particle search range |
v | The stirring speed |
T | temperature |
q | airflow |
Do | dissolved oxygen |
C | cell concentration |
p | fermenter pressure |
P | the concentrations of production |
the model i-th prediction data | |
i-th real data |
References
- Karbalaei, M.; Rezaee, S.A.; Farsiani, H. Pichia pastoris: A Highly Successful Expression System for Optimal Synthesis of Heterologous Proteins. J. Cell. Physiol. 2020, 235, 5867–5881. [Google Scholar] [CrossRef] [PubMed]
- Yang, Y.; Madden, K.; Sha, M. Human IgG Fc Production Through Methanol-Free Pichia pastoris Fermentation. BioProcess. J. 2022, 21, 1–10. [Google Scholar]
- Wu, J.; Zhang, X.; Yu, H.; Li, W.; Jia, Y.; Guo, J.; Zhang, L.; Song, X. Research Progress of High Density Fermentation Process of Pichia pastoris. China Biotechnol. 2016, 36, 108–114. [Google Scholar] [CrossRef]
- Zhu, X.; Rehman, K.U.; Wang, B.; Shahzad, M. Modern Soft-Sensing Modeling Methods for Fermentation Processes. Sensors 2020, 20, 1771. [Google Scholar] [CrossRef] [Green Version]
- Mohanty, S.; Khasa, Y.P. Nitrogen Supplementation Ameliorates Product Quality and Quantity during High Cell Density Bioreactor Studies of Pichia pastoris: A Case Study with Proteolysis Prone Streptokinase. Int. J. Biol. Macromol. 2021, 180, 760–770. [Google Scholar] [CrossRef]
- Chai, W.Y.; Teo, K.T.K.; Tan, M.K.; Tham, H.J. Fermentation Process Control and Optimization. Chem. Eng. Technol. 2022, 45, 1731–1747. [Google Scholar] [CrossRef]
- Wang, B.; Wang, X.; He, M.; Zhu, X. Study on Multi-Model Soft Sensor Modeling Method and Its Model Optimization for the Fermentation Process of Pichia pastoris. Sensors 2021, 21, 7635. [Google Scholar] [CrossRef] [PubMed]
- Shao, W.; Ge, Z.; Song, Z. Soft-Sensor Development for Processes With Multiple Operating Modes Based on Semisupervised Gaussian Mixture Regression. IEEE Trans. Control Syst. Technol. 2019, 27, 2169–2181. [Google Scholar] [CrossRef]
- Yuan, X.; Li, L.; Wang, Y. Nonlinear Dynamic Soft Sensor Modeling With Supervised Long Short-Term Memory Network. IEEE Trans. Ind. Inform. 2020, 16, 3168–3176. [Google Scholar] [CrossRef]
- Zheng, W.; Liu, Y.; Gao, Z.; Yang, J. Just-in-Time Semi-Supervised Soft Sensor for Quality Prediction in Industrial Rubber Mixers. Chemom. Intell. Lab. Syst. 2018, 180, 36–41. [Google Scholar] [CrossRef]
- Chang, S.; Zhao, C.; Li, K. Consistent-Contrastive Network With Temporality-Awareness for Robust-to-Anomaly Industrial Soft Sensor. IEEE Trans. Instrum. Meas. 2022, 71, 1–12. [Google Scholar] [CrossRef]
- Fan, A.; Huang, Y.; Xu, F.; Bom, S. Soft Sensing Regression Model: From Sensor to Wafer Metrology Forecasting. arXiv 2023, arXiv:2301.08974. [Google Scholar]
- Zhang, C.; Li, Z.; Sun, Y. Study on Soft Sensing of Glutamic Acid Fermentation Process Based on LS-SVM. In Proceedings of the 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA), Dalian, China, 24–26 June 2022; pp. 354–360. [Google Scholar]
- Han, T.; Liu, C.; Wu, R.; Jiang, D. Deep Transfer Learning with Limited Data for Machinery Fault Diagnosis. Appl. Soft Comput. 2021, 103, 107150. [Google Scholar] [CrossRef]
- Chai, Z.; Zhao, C.; Huang, B.; Chen, H. A Deep Probabilistic Transfer Learning Framework for Soft Sensor Modeling With Missing Data. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 7598–7609. [Google Scholar] [CrossRef]
- Xie, J.; Huang, B.; Dubljevic, S. Transfer Learning for Dynamic Feature Extraction Using Variational Bayesian Inference. IEEE Trans. Knowl. Data Eng. 2022, 34, 5524–5535. [Google Scholar] [CrossRef]
- Ren, J.-C.; Liu, D.; Wan, Y. VMD-SEAE-TL-Based Data-Driven Soft Sensor Modeling for a Complex Industrial Batch Processes. Measurement 2022, 198, 111439. [Google Scholar] [CrossRef]
- Hsiao, Y.-D.; Kang, J.-L.; Wong, D.S.-H. Development of Robust and Physically Interpretable Soft Sensor for Industrial Distillation Column Using Transfer Learning with Small Datasets. Processes 2021, 9, 667. [Google Scholar] [CrossRef]
- Wang, J.; Chen, Y.; Hao, S.; Feng, W.; Shen, Z. Balanced Distribution Adaptation for Transfer Learning. arXiv 2018, arXiv:1807.00516. [Google Scholar]
- Zhu, X.; Liu, W.; Wang, B.; Wang, W. A Soft Sensor Model of Pichia pastoris Cell Concentration Based on IBDA-RELM. Prep. Biochem. Biotechnol. 2022, 52, 618–626. [Google Scholar] [CrossRef]
- Tang, Y.; Rahmani Dehaghani, M.; Wang, G.G. Review of Transfer Learning in Modeling Additive Manufacturing Processes. Addit. Manuf. 2023, 61, 103357. [Google Scholar] [CrossRef]
- Kora, P.; Ooi, C.P.; Faust, O.; Raghavendra, U.; Gudigar, A.; Chan, W.Y.; Meenakshi, K.; Swaraja, K.; Plawiak, P.; Rajendra Acharya, U. Transfer Learning Techniques for Medical Image Analysis: A Review. Biocybern. Biomed. Eng. 2022, 42, 79–107. [Google Scholar] [CrossRef]
- Curreri, F.; Patanè, L.; Xibilia, M.G. RNN- and LSTM-Based Soft Sensors Transferability for an Industrial Process. Sensors 2021, 21, 823. [Google Scholar] [CrossRef]
- Zhuang, F.; Qi, Z.; Duan, K.; Xi, D.; Zhu, Y.; Zhu, H.; Xiong, H.; He, Q. A Comprehensive Survey on Transfer Learning. Proc. IEEE 2021, 109, 43–76. [Google Scholar] [CrossRef]
- Weiss, K.; Khoshgoftaar, T.M.; Wang, D. A Survey of Transfer Learning. J. Big Data 2016, 3, 9. [Google Scholar] [CrossRef] [Green Version]
- Pan, S.J.; Tsang, I.W.; Kwok, J.T.; Yang, Q. Domain Adaptation via Transfer Component Analysis. IEEE Trans. Neural Netw. 2011, 22, 199–210. [Google Scholar] [CrossRef] [Green Version]
- Zhou, X.; Sbarufatti, C.; Giglio, M.; Dong, L. A Fuzzy-Set-Based Joint Distribution Adaptation Method for Regression and Its Application to Online Damage Quantification for Structural Digital Twin. Mech. Syst. Signal Process. 2023, 191, 110164. [Google Scholar] [CrossRef]
- Wu, D.; Lawhern, V.J.; Gordon, S.; Lance, B.J.; Lin, C.-T. Driver Drowsiness Estimation From EEG Signals Using Online Weighted Adaptation Regularization for Regression (OwARR). IEEE Trans. Fuzzy Syst. 2017, 25, 1522–1535. [Google Scholar] [CrossRef] [Green Version]
- Alon, I.; Globerson, A.; Wiesel, A. On the Optimization Landscape of Maximum Mean Discrepancy. arXiv 2021, arXiv:2110.13452. [Google Scholar]
- Berlinet, A.; Thomas-Agnan, C. Reproducing Kernel Hilbert Spaces in Probability and Statistics; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2011; ISBN 978-1-4419-9096-9. [Google Scholar]
- Guo, H.; Cui, M.; Feng, Z.; Zhang, D.; Zhang, D. Classification of Aviation Alloys Using Laser-Induced Breakdown Spectroscopy Based on a WT-PSO-LSSVM Model. Chemosensors 2022, 10, 220. [Google Scholar] [CrossRef]
- Suykens, J.A.K.; Vandewalle, J. Least Squares Support Vector Machine Classifiers. Neural Process. Lett. 1999, 9, 293–300. [Google Scholar] [CrossRef]
- Jain, M.; Saihjpal, V.; Singh, N.; Singh, S.B. An Overview of Variants and Advancements of PSO Algorithm. Appl. Sci. 2022, 12, 8392. [Google Scholar] [CrossRef]
- Tao, X.; Li, X.; Chen, W.; Liang, T.; Li, Y.; Guo, J.; Qi, L. Self-Adaptive Two Roles Hybrid Learning Strategies-Based Particle Swarm Optimization. Inf. Sci. 2021, 578, 457–481. [Google Scholar] [CrossRef]
- Shami, T.M.; El-Saleh, A.A.; Alswaitti, M.; Al-Tashi, Q.; Summakieh, M.A.; Mirjalili, S. Particle Swarm Optimization: A Comprehensive Survey. IEEE Access 2022, 10, 10031–10061. [Google Scholar] [CrossRef]
- Ge, Y.; Rubo, Z. An Emotional Particle Swarm Optimization Algorithm. In Proceedings of the Advances in Natural Computation; Wang, L., Chen, K., Ong, Y.S., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; pp. 553–561. [Google Scholar]
- Gou, J.; Lei, Y.-X.; Guo, W.-P.; Wang, C.; Cai, Y.-Q.; Luo, W. A Novel Improved Particle Swarm Optimization Algorithm Based on Individual Difference Evolution. Appl. Soft Comput. 2017, 57, 468–481. [Google Scholar] [CrossRef]
- Kausik, B.N. Accelerating Machine Learning via the Weber-Fechner Law. arXiv 2022, arXiv:2204.11834. [Google Scholar]
- Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef] [Green Version]
- Sharma, A.; Sharma, A.; Choudhary, S.; Pachauri, R.; Shrivastava, A.; Kumar, D. A review on artificial bee colony and it’s engineering applications. J. Crit. Rev. 2020, 7, 4097–4107. [Google Scholar] [CrossRef]
RMSE | R2 | MAE | GFLOPs | |
---|---|---|---|---|
LSSVM | 2.3356 | 0.9425 | 2.1929 | 0.0014 |
PSO-LSSVM | 2.1830 | 0.9572 | 1.9425 | 0.0016 |
IPSO-LSSVM | 1.5902 | 0.9779 | 1.4154 | 0.0016 |
BDA-IPSO-LSSVM | 1.0485 | 0.9912 | 0.8554 | 0.0016 |
RMSE | R2 | MAE | FLOPs | |
---|---|---|---|---|
LSSVM | 0.1397 | 0.9773 | 0.1046 | 0.0013 |
PSO-LSSVM | 0.0973 | 0.9890 | 0.0887 | 0.0015 |
IPSO-LSSVM | 0.0569 | 0.9962 | 0.0508 | 0.0015 |
BDA-IPSO-LSSVM | 0.0368 | 0.9984 | 0.0322 | 0.0015 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, B.; Liu, J.; Yu, A.; Wang, H. Development and Optimization of a Novel Soft Sensor Modeling Method for Fermentation Process of Pichia pastoris. Sensors 2023, 23, 6014. https://doi.org/10.3390/s23136014
Wang B, Liu J, Yu A, Wang H. Development and Optimization of a Novel Soft Sensor Modeling Method for Fermentation Process of Pichia pastoris. Sensors. 2023; 23(13):6014. https://doi.org/10.3390/s23136014
Chicago/Turabian StyleWang, Bo, Jun Liu, Ameng Yu, and Haibo Wang. 2023. "Development and Optimization of a Novel Soft Sensor Modeling Method for Fermentation Process of Pichia pastoris" Sensors 23, no. 13: 6014. https://doi.org/10.3390/s23136014
APA StyleWang, B., Liu, J., Yu, A., & Wang, H. (2023). Development and Optimization of a Novel Soft Sensor Modeling Method for Fermentation Process of Pichia pastoris. Sensors, 23(13), 6014. https://doi.org/10.3390/s23136014