Digital Model of Automatic Plate Turning for Plate Mills Based on Machine Vision and Reinforcement Learning Algorithm
Abstract
:1. Introduction
- (1)
- Because of the severe environment in the rough rolling area, the traditional visual inspection method cannot effectively perceive the position and angle information of the billet.
- (2)
- The control process of plate turning has characteristics of nonlinearity, strong coupling; it is multivariable; and there is no unified operation standard or control logic, so it is difficult to use a mechanism model for direct control.
2. Billet Angle Measurement Method Based on Machine Vision
2.1. Detection Device
2.2. Image Defogging Processing
2.3. Adaptive Enhancement of Billet Image
2.4. External Rectangle Fitting Algorithm Based on Tukey Weight
2.5. Smoothing Treatment Method of the Billet Angle
3. Automatic Plate Turning Control Model
3.1. Reasoning Model for Setting Optimal Plate Turning Speed
3.2. Model of Setting Speed and Feedback Speed of Conical Roller Table Motor
3.3. Theoretical Model of Billet Rotation Speed
3.4. Building Simulation Environment of Automatic Plate Turning Based on Gym
- (1)
- State space:
- (2)
- Action space:
- (3)
- Transition of environmental state:
- (4)
- Design of the reward function:
3.5. Reinforcement Learning Algorithm Training
3.6. Online Strategy Optimization
4. Application of Automatic Plate Turning System
5. Conclusions and Prospect
5.1. Conclusions
- (1)
- The image defogging algorithm, improved image adaptive enhancement algorithm, external rectangle fitting algorithm based on Tukey weight, and angle smoothing algorithm were proposed in order to resolve the problem of billet angle detection in complex production environments. The fusion processing of the above algorithms can adapt to changes in influencing factors such as water vapor interference and partial occlusion in the rolling process, automatically eliminate water vapor interference and abnormal angle situations, and provide real-time and stable angle detection values for the automatic plate turning control system.
- (2)
- Using the big data of manual plate turning operations, the optimal roller table speed setting rules were obtained. The set speed and feedback speed models of the conical roller table motor and the theoretical model of billet speed were constructed to simulate the plate turning process. Based on reinforcement learning theory, the elements of reinforcement learning were defined for the motion model of plate turning. The forms of state space and action space were defined; the state was updated according to the state transition equation; and the reward function was designed. The reinforcement learning model for intelligent control of plate turning was constructed.
- (3)
- Three reinforcement learning algorithms, DQN, PPO, and SAC, were used to train the model in the simulation environment of automatic plate turning. The simulation results showed that the SAC algorithm has a faster convergence speed, fewer fluctuations of rewards, and reward value after convergence, and it has better online optimization performance than the DQN and PPO algorithms in automatic plate turning tasks.
- (4)
- In the testing process of the automatic plate turning system, the closed-loop control time, including image processing, data communication, and plate turning command setting, was less than 50 ms, and the plate turning angle detection error was less than ±2°. For the control range of 85°~95°, the rate of one-time plate turning was more than 98.5%, and the average plate turning time of each billet was greatly shortened compared with manual plate turning, and the fastest time could be shortened by more than 1 s.
5.2. Prospect
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Wang, G.D. Status and prospects of research and development of key common technologies for high-quality heavy and medium plate production. Steel Roll. 2019, 36, 1. [Google Scholar] [CrossRef]
- Jiao, Z.J.; He, C.Y.; Wang, L.X.; Cai, Y.L.; Wang, X.; Sun, X.D. Torque Model in Plate Rolling Process with Biting Impact Considered. ISIJ Int. 2021, 61, 239–247. [Google Scholar] [CrossRef]
- Jiao, Z.J.; Luo, J.Y.; Wang, Z.Q.; Xu, Z.P.; He, C.Y.; Zhao, Z. Research and application of the angular rolling technology for plate mill. Adv. Manuf. 2023, 11, 462–476. [Google Scholar] [CrossRef]
- Dong, Z.S.; Li, X.; Luan, F.; Zhang, D.H. Prediction and analysis of key parameters of head deformation of hot-rolled plates based on artificial neural networks. J. Manuf. Process. 2022, 77, 282–300. [Google Scholar] [CrossRef]
- Schausberger, F.; Steinboeck, A.; Kugi, A.; Jochum, M.; Wild, D.; Kiefer, T. Vision-Based Material Tracking in Heavy-Plate Rolling. IFAC-Pap. 2016, 49, 108–113. [Google Scholar] [CrossRef]
- Jiao, Z.J.; He, C.Y.; Zhao, Z.; Wu, Z.Q.; Wang, J. Research progress and application of high precision intelligent control system for plate rolling. Steel Roll. 2022, 99, 52. [Google Scholar] [CrossRef]
- Men, Q.L. An image identification based automatic plate turning scheme for wide heavy plate mills. Metall. Ind. Autom. 2010, 34, 55–60. [Google Scholar]
- He, C.Y.; Jiao, Z.J.; Wu, X.G.; Wang, J. Research on Control Method of Turning Plate Based on Image Processing Technology. In Proceedings of the 2015 6th International Conference on Manufacturing Science and Engineering, Guangzhou, China, 28–29 November 2015; Atlantis Press: Qingdao, China, 2015. [Google Scholar] [CrossRef]
- Wang, L. Fully automatic steel conversion and widening technology based on image processing system during the rolling process of wide and thick plates. Shanxi Metall. 2018, 41, 88. [Google Scholar] [CrossRef]
- Mukhopadhyay, P.; Chaudhuri, B.B. A survey of Hough Transform. Pattern Recognit. 2015, 48, 993–1010. [Google Scholar] [CrossRef]
- Liu, H. Survey for application of conventional artificial intelligence technologies in the steel & iron field. Metall. Ind. Autom. 2019, 43, 24–28. [Google Scholar]
- Jiao, Z.J.; Gao, S.W.; Liu, C.J.; Luo, J.Y.; Wang, Z.Q.; Lang, G.Y.; Zhao, Z.; Wu, Z.Q.; He, C.Y. Digital Model of Plan View Pattern Control for Plate Mills Based on Machine Vision and the DBO-RBF Algorithm. Metals 2024, 14, 94. [Google Scholar] [CrossRef]
- He, K.M.; Sun, J.; Tang, X.O. Single Image Haze Removal Using Dark Channel Prior. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 2341–2353. [Google Scholar] [CrossRef] [PubMed]
- Choi, D.; Yun, J.P.; Jeon, Y.; Kim, S.W. Pinhole detection in steel slab images using machine vision. IFAC Proc. Vol. 2009, 42, 397–401. [Google Scholar] [CrossRef]
- Yang, H.; Zhang, Y.; Gao, Y.J.; Zhou, P.; Wang, C.Z.; Huo, X.G. Development and Application of Steel Plate Contour Online Detection System Based on Machine Vision. Shandong Metall. 2022, 44, 49. [Google Scholar] [CrossRef]
- Qin, D.D.; Lu, J.; Song, Y.Q. The Research on Appearance Recognition and Visual Inspection Technology of Workpiece. Modul. Mach. Tool&Autom. Manuf. Tech. 2018, 9, 84–87. [Google Scholar] [CrossRef]
- Wang, R.Z.; Gao, Q.Y.; Wang, D.J. Research of Least Square Curve Fitting and Simplified Algorithm. Sens. World 2021, 27, 8–10. [Google Scholar] [CrossRef]
- Wu, Z.; Cai, P. An improved data preprocessing method in dynamic measurement. Chin. J. Sens. Actuacors 2010, 23, 558–561. [Google Scholar]
- Shang, X.L.; Tan, L.; Yu, K.P.; Zhang, J.; Kaur, K.; Hassan, M.M. Newton-interpolation-based zk-SNARK for Artificial Internet of Things. Ad Hoc Netw. 2021, 123, 102656. [Google Scholar] [CrossRef]
- Yu, Y. Reinforcement Learning from Offline Data: Approaches and Advances. China Basic Sci. 2022, 24, 35. [Google Scholar]
- Sun, H.N.; Liu, N.; Tan, L.; Du, P.; Zhang, B.C. Digital twin-based online resilience scheduling for microgrids: An approach combining imitative learning and deep reinforcement learning. IET Renew. Power Gener. 2024, 18, 1–13. [Google Scholar] [CrossRef]
- Udekwe, D.; Ajayi, O.; Ubadike, O.; Ter, K.; Okafor, E. Comparing actor-critic deep reinforcement learning controllers for enhanced performance on a ball-and-plate system. Expert Syst. Appl. 2024, 245, 123055. [Google Scholar] [CrossRef]
- Song, S.Y.; Hu, J.B.; Wang, Y.Y.; Han, X.L. Actor-Critic Learning Algorithm for Parameter Tuning of Sliding Mode Controller. Electron. Opt. Control 2020, 27, 24. [Google Scholar]
- Sindhuja, S.; Lovenia, J.D.L.; Lavanya, A.P.; Jawahar, G. Qualitative Research of First Order Linear Difference Equations. Int. J. Innov. Technol. Explor. Eng. 2019, 8, 1021–1025. [Google Scholar] [CrossRef]
- Cui, G.M.; Zhu, J.T. Hot continuous rolling intelligent PID gauge control based on reinforcement learning. Mod. Electron. Technol. 2022, 45, 78–82. [Google Scholar] [CrossRef]
- Zhang, J.; Jiang, X.; Shi, X.Y.; Cheng, J.; Zheng, Y.B. Offline reinforcement learning for eco-driving control at signalized intersections. J. Southeast Univ. 2022, 52, 762. [Google Scholar]
- Mnih, V.; Kavukcuoglu, K.; Silver, D. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef]
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal policy optimization algorithms. arXiv 2017, arXiv:1707.06347. [Google Scholar]
- Schulman, J.; Levine, S.; Moritz, P.; Jordan, M.; Abbeel, P. Trust region policy optimization. In Proceedings of the 2015 International Conference on Machine Learning, Lille, France, 7–9 July 2015; ACM Press: New York, USA, USA, 2015; pp. 1889–1897. [Google Scholar]
- Haarnoja, T.; Zhou, A.; Abbeel, P.; Levine, S. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. In Proceedings of the International Conference on Machine Learning 2018, Stockholm, Sweden, 10–15 July 2018; p. 1856. [Google Scholar]
State | Minimum Value | Maximum Value |
---|---|---|
Current angle | 0° | 180° |
Billet length | 1000 mm | 3000 mm |
Billet width | 1000 mm | 3000 mm |
Action | Meaning |
---|---|
1 | Acceleration stage curve with step length of 20 ms |
0 | Deceleration stage curve with step length of 20 ms |
Hyperparameter | Value |
---|---|
Q network learning rate | 2 × 10−3 |
Discount factor | 0.98 |
Replay buffer size | 10,000 |
Batch size | 64 |
Target network update parameters | 10 |
Number of neural network layers | 3 |
Number of hidden neurons per layer | 128 |
Activation function | ReLU |
Exploration factor | 0.01 |
Hyperparameter | Value |
---|---|
Actor network learning rate | 1 × 10−4 |
Critic network learning rate | 5 × 10−3 |
Discount factor | 0.95 |
Number of neural network layers | 3 |
Number of hidden neurons per layer | 128 |
Activation function | ReLU |
GAE parameters | 0.95 |
Estimate the clipping coefficient of advantage function | 0.2 |
Hyperparameter | Value |
---|---|
Actor network learning rate | 1 × 10−3 |
Critic network learning rate | 1 × 10−2 |
entropy regularity coefficient learning rate | 1 × 10−2 |
Discount factor | 0.98 |
Replay buffer size | 100,000 |
Number of neural network layers | 3 |
Number of hidden neurons per layer | 128 |
Activation function | ReLU |
Entropy regularization coefficient | 0.01 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
He, C.; Xue, S.; Wu, Z.; Zhao, Z.; Jiao, Z. Digital Model of Automatic Plate Turning for Plate Mills Based on Machine Vision and Reinforcement Learning Algorithm. Metals 2024, 14, 709. https://doi.org/10.3390/met14060709
He C, Xue S, Wu Z, Zhao Z, Jiao Z. Digital Model of Automatic Plate Turning for Plate Mills Based on Machine Vision and Reinforcement Learning Algorithm. Metals. 2024; 14(6):709. https://doi.org/10.3390/met14060709
Chicago/Turabian StyleHe, Chunyu, Song Xue, Zhiqiang Wu, Zhong Zhao, and Zhijie Jiao. 2024. "Digital Model of Automatic Plate Turning for Plate Mills Based on Machine Vision and Reinforcement Learning Algorithm" Metals 14, no. 6: 709. https://doi.org/10.3390/met14060709
APA StyleHe, C., Xue, S., Wu, Z., Zhao, Z., & Jiao, Z. (2024). Digital Model of Automatic Plate Turning for Plate Mills Based on Machine Vision and Reinforcement Learning Algorithm. Metals, 14(6), 709. https://doi.org/10.3390/met14060709