Profit-Efficient Elastic Allocation of Cloud Resources Using Two-Stage Adaptive Workload Prediction
Abstract
:1. Introduction
- A novel two-stage adaptive prediction method for Wikimedia workload prediction:
- -
- Introduces first-order gradient features and applies the K-means algorithm to effectively classify workload patterns into uphill and downhill trends, eliminating the need for manual labeling.
- -
- A binary classifier is used for trend categorization, with specialized LSTM networks assigned to each trend category to improve prediction accuracy.
- A queueing theory-based resource allocation algorithm for cloud computing environments:
- -
- Optimizes virtual machine provisioning by incorporating predicted workload patterns, VM rental costs, and task rejection penalties.
- -
- Maximizes service profit while ensuring system stability and QoS guarantees.
- Experimental validation of the proposed proactive resource allocation method:
- -
- Demonstrates improved workload prediction accuracy.
- -
- Identifies the optimal number of VMs required to maximize profit while maintaining QoS constraints.
2. Related Work
2.1. Workload Prediction in the Cloud
- (1)
- The task arrival workload only contains numerical information, and classifying the workload adaptively is the key problem.
- (2)
- Due to different workload categories, building sub-training data sets for training of the corresponding sub-models is another key problem.
2.2. Queueing Theory in Cloud Computing
3. System Architecture
- Cloud resource pool—This component is composed of VMs that run in the physical server cluster. Based on virtual technologies, the computing resources of a VM (e.g., CPU, memory, disks, and networking) can be configured on demand. The resource pool also can be considered as an IaaS layer, which provides online computing resources.
- Task scheduling and networking interface—This component provides load balancing for the tasks that arrive from the Internet. Some studies, such as [31,32,33], have researched task scheduling optimization policies. Unlike these studies, we did not focus on task scheduling in the cloud center. Instead, we used a common load-balancing policy (i.e., round-robin; RR) as a task scheduling policy, meaning that the tasks are distributed evenly across the VMs.
- Cloud management platform—This component provides the resource management service in the cloud center. The admission control service and the resource monitoring service collect the resource pool running information (e.g., resource utilization of physical servers and VMs, the health status of physical servers, and hardware configurations). This collected information is stored in the database and used by the resource scheduling service to identify the suitable target physical server for launching the VM. The admission control component authenticates cloud center administrators and manages VM lifecycle operations, specifically handling requests for VM provisioning and de-allocation.
- Workload monitoring—This component provides the workload information collection service. In general, the workload monitoring component can periodically collect the task arrival workload information through the load-balancing interface.
- Workload prediction—Based on historical information, this component predicts the future workload.
- Resource determination—Based on a specific optimization objective (e.g., cost or profit optimization), this component determines the suitable number of VMs according to the predicted future workload. When the resource determination result is obtained, the component sends to the VM control a command to launch or release the corresponding number of VMs.
4. Prediction Method
4.1. Introduction of Workload Prediction
4.2. Adaptive Two-Stage Multi-Neural Network Based on the LSTM Prediction Method
5. Cloud Resource Allocation Policy Based on Maximum Cloud Service Profit
5.1. Task Processing Modeling in the Cloud Service
5.2. Maximum Cloud Service Profit Resource Allocation Method
- (1)
- High task profit: Due to the typical high load of the Wikimedia service (the workload of the task arrival can be on the order of ), we first assume that the task profit is RMB 0.01. Thus, the task loss becomes the main element that affects the cloud service cost, rather than the rental cost of VM. To reduce the cloud service cost, we need to allocate enough VMs to reduce the cost of task loss. Thus, the cloud service cost decreases rapidly when the number of VMs increases. When the number of VMs becomes redundant, the cloud service cost increases slowly. The cloud service cost is a weakly convex function, as shown in Figure 14.
- (2)
- The low task profit: We next assume that the task profit is 0.0000001. Thus, the rental costs of VMs become the main element affecting the cloud service cost. In this case, the optimization resource allocation strategy uses the smallest number of VMs; that is, one VM. Thus, the cloud service cost is a monotonic increasing function, as shown in Figure 15.
- (3)
- The medium task profit: Compared to the previous two cases, we then assume that the task profit is 0.00001, which means that the task profit and the rental costs of VMs both affect the cloud service cost. In this case, when the number of VMs increases, the task loss decreases but the rental costs of VMs increase, and vice versa. Thus, Equation (24) becomes a strongly convex function, as shown in Figure 16.
Algorithm 1 MaxCSPR algorithm |
Input: The task arrival in the time slot; The cloud service cost vector ; The average task execution time ); The allowable process time of the task ); The allowable system load of VM ); The maximum number of VMs allowed ). |
|
6. Performance Evaluation
6.1. Experimental Environment Settings
- (1)
- Mean absolute percentage error (MAPE) [42]:
- (2)
- R-squared () [43]:If the value is close to 1, the prediction model fits better.
- (3)
- A lower RMSE indicates a better prediction result.
6.2. Prediction Performance Evaluation
6.3. Resource Allocation Performance Evaluation
6.4. MaxCSPR with Workload Prediction Evaluation
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Goel, P.K.; Gulati, S.; Singh, A.; Tyagi, A.; Komal, K.; Mahur, L.S. Energy-Efficient Block-Chain Solutions for Edge and Cloud Computing Infrastructures. In Proceedings of the 2024 2nd International Conference on Disruptive Technologies (ICDT), Greater Noida, India, 15–16 March 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 852–856. [Google Scholar] [CrossRef]
- Ismahene, N.W.; Souheila, B.; Nacereddine, Z. An Auto Scaling Energy Efficient Approach in Apache Hadoop. In Proceedings of the 2020 International Conference on Advanced Aspects of Software Engineering (ICAASE), Constantine, Algeria, 28–30 November 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–6. [Google Scholar] [CrossRef]
- Amiri, M.; Mohammad-Khanli, L. Survey on prediction models of applications for resources provisioning in cloud. J. Netw. Comput. Appl. 2017, 82, 93–113. [Google Scholar] [CrossRef]
- Khan, A.A.; Vidhyadhari, C.H.; Kumar, S. A review on fixed threshold based and adaptive threshold based auto-scaling techniques in cloud computing. MATEC Web Conf. 2024, 392, 01115. [Google Scholar] [CrossRef]
- Quattrocchi, G.; Incerto, E.; Pinciroli, R.; Trubiani, C.; Baresi, L. Autoscaling Solutions for Cloud Applications under Dynamic Workloads. IEEE Trans. Serv. Comput. 2024, 17, 804–820. [Google Scholar] [CrossRef]
- Hu, Y.; Deng, B.; Peng, F. Autoscaling prediction models for cloud resource provisioning. In Proceedings of the 2016 2nd IEEE International Conference on Computer and Communications (ICCC), Chengdu, China, 14–17 October 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1364–1369. [Google Scholar] [CrossRef]
- Taha, M.B.; Sanjalawe, Y.; Al-Daraiseh, A.; Fraihat, S.; Al-E’mari, S.R. Proactive Auto-Scaling for Service Function Chains in Cloud Computing based on Deep Learning. IEEE Access 2024, 12, 38575–38593. [Google Scholar] [CrossRef]
- Singh, S.T.; Tiwari, M.; Dhar, A.S. Machine Learning based Workload Prediction for Auto-scaling Cloud Applications. In Proceedings of the 2022 OPJU International Technology Conference on Emerging Technologies for Sustainable Development (OTCON), Raigarh, India, 8–10 February 2022; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6. [Google Scholar] [CrossRef]
- Ahamed, Z.; Khemakhem, M.; Eassa, F.; Alsolami, F.; Al-Ghamdi, A.S.A.M. Technical study of deep learning in cloud computing for accurate workload prediction. Electronics 2023, 12, 650. [Google Scholar] [CrossRef]
- Urdaneta, G.; Pierre, G.; Van Steen, M. Wikipedia workload analysis for decentralized hosting. Comput. Netw. 2009, 53, 1830–1845. [Google Scholar] [CrossRef]
- Yadav, A.; Kushwaha, S.; Gupta, J.; Saxena, D.; Singh, A.K. A Survey of the Workload Forecasting Methods in Cloud Computing. In Proceedings of the 3rd International Conference on Machine Learning, Advances in Computing, Renewable Energy and Communication (MARC), Ghaziabad, India, 10–11 December 2021; Springer Nature: Singapore, 2022; pp. 539–547. [Google Scholar] [CrossRef]
- Kumar, J.; Singh, A.K. Performance Assessment of Time Series Forecasting Models for Cloud Datacenter Networks’ Workload Prediction. Wirel. Pers. Commun. 2021, 116, 1949–1969. [Google Scholar] [CrossRef]
- Alidoost Alanagh, Y.; Firouzi, M.; Rasouli Kenari, A.; Shamsi, M. Introducing an Adaptive Model for Auto-Scaling Cloud Computing Based on Workload Classification. Concurr. Comput. Pract. Exp. 2023, 35, e7720. [Google Scholar] [CrossRef]
- Singh, P.; Gupta, P.; Jyoti, K. TASM: Technocrat ARIMA and SVR Model for Workload Prediction of Web Applications in Cloud. Cluster Comput. 2019, 22, 619–633. [Google Scholar] [CrossRef]
- Islam, S.; Keung, J.; Lee, K.; Liu, A. Empirical prediction models for adaptive resource provisioning in the cloud. Future Generat. Comput. Syst. 2012, 28, 155–162. [Google Scholar] [CrossRef]
- Kumar, J.; Singh, A.K. Workload Prediction in Cloud using Artificial Neural Network and Adaptive Differential Evolution. Future Generat. Comput. Syst. 2018, 81, 41–52. [Google Scholar] [CrossRef]
- Chen, Z.; Hu, J.; Min, G.; Zomaya, A.Y.; El-Ghazawi, T. Towards accurate prediction for high-dimensional and highly-variable cloud workloads with deep learning. IEEE Trans. Parallel Distrib. Syst. 2019, 31, 923–934. [Google Scholar] [CrossRef]
- Song, B.; Yu, Y.; Zhou, Y.; Wang, Z.; Du, S. Host load prediction with long short-term memory in cloud computing. J. Supercomput. 2018, 74, 6554–6568. [Google Scholar] [CrossRef]
- Yadav, M.P.; Pal, N.; Yadav, D.K. Workload prediction over cloud server using time series data. In Proceedings of the 11th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 28–29 January 2021; pp. 267–272. [Google Scholar] [CrossRef]
- Morichetta, A.; Pusztai, T.; Vij, D.; Casamayor Pujol, V.; Raith, P.; Xiong, Y. Predicting High-Level Service Level Objectives for Cloud Computing Using Neural Networks. In Proceedings of the 2023 IEEE 16th International Conference on Cloud Computing (CLOUD), Chicago, IL, USA, 2–8 July 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 24–34. [Google Scholar] [CrossRef]
- Lackinger, A.; Morichetta, A.; Dustdar, S. Time Series Predictions for Cloud Workloads: A Comprehensive Evaluation. In Proceedings of the 2024 IEEE International Conference on Service-Oriented System Engineering (SOSE), Shanghai, China, 15–18 July 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–8. [Google Scholar] [CrossRef]
- Cao, K.; Zhang, T.; Huang, J. Advanced Hybrid LSTM-Transformer Architecture for Real-Time Multi-Task Prediction in Engineering Systems. Sci. Rep. 2024, 14, 4890. [Google Scholar] [CrossRef] [PubMed]
- Ullah, Q.Z.; Hassan, S.; Khan, G.M. Adaptive resource utilization prediction system for infrastructure as a service cloud. Comput. Intell. Neurosci. 2017, 2017, 4873459. [Google Scholar] [CrossRef]
- Liu, C.; Liu, C.; Shang, Y.; Chen, S.; Cheng, B.; Chen, J. An adaptive prediction approach based on workload pattern discrimination in the cloud. J. Netw. Comput. Appl. 2017, 80, 35–44. [Google Scholar] [CrossRef]
- Gross, D.; Shortle, J.F.; Thompson, J.M.; Harris, C.M. Fundamentals of Queuing Theory, 4th ed.; Wiley: New York, NY, USA, 2008. [Google Scholar]
- Xia, Y.; Zhou, M.; Luo, X.; Zhu, Q.; Li, J.; Huang, Y. Stochastic modeling and quality evaluation of infrastructure-as-a-service clouds. IEEE Trans. Autom. Sci. Eng. 2015, 12, 162–170. [Google Scholar] [CrossRef]
- Kushchazli, A.; Safargalieva, A.; Kochetkova, I.; Gorshenin, A. Queuing model with customer class movement across server groups for analyzing virtual machine migration in cloud computing. Mathematics 2024, 12, 468. [Google Scholar] [CrossRef]
- Sinha, A.; Banerjee, P.; Roy, S.; Rathore, N.; Singh, N.P.; Uddin, M.; Abdelhaq, M.; Alsaqour, R. Improved Dynamic Johnson Sequencing Algorithm (DJS) in cloud computing environment for efficient resource scheduling for distributed overloading. J. Syst. Sci. Syst. Eng. 2024, 33, 391–424. [Google Scholar] [CrossRef]
- Wei, J.; Liang, X. Research on Task-Offloading Delay in the IoV Based on a Queuing Network. IEEE Access 2024, 12, 31324–31333. [Google Scholar] [CrossRef]
- Abbasi, M.; Mohammadi Pasand, E.; Khosravi, M.R. Workload Allocation in IoT-Fog-Cloud Architecture Using a Multi-Objective Genetic Algorithm. J. Grid Comput. 2020, 18, 43–56. [Google Scholar] [CrossRef]
- Tawfeek, M.A.; El-Sisi, A.; Keshk, A.E.; Torkey, F.A. Cloud task scheduling based on ant colony optimization. In Proceedings of the 2013 8th IEEE International Conference on Computer Engineering & Systems (ICCES), Cairo, Egypt, 26–28 November 2013; pp. 64–69. [Google Scholar] [CrossRef]
- Strumberger, I.; Bacanin, N.; Tuba, M.; Tuba, E. Resource Scheduling in Cloud Computing Based on a Hybridized Whale Optimization Algorithm. Appl. Sci. 2019, 9, 4893. [Google Scholar] [CrossRef]
- Zuo, L.; Shu, L.; Dong, S.; Zhu, C.; Hara, T. A multi-objective optimization scheduling method based on the ant colony algorithm in cloud computing. IEEE Access 2015, 3, 2687–2699. [Google Scholar] [CrossRef]
- Lee, Y.H.; Huang, K.C.; Wu, C.H.; Kuo, Y.H.; Lai, K.C. A Framework for Proactive Resource Provisioning in IaaS Clouds. Appl. Sci. 2017, 7, 777. [Google Scholar] [CrossRef]
- Bishop, C.; Christopher, M. Pattern Recognition and Machine Learning; Springer: New York, NY, USA, 2006. [Google Scholar]
- Calheiros, R.N.; Masoumi, E.; Ranjan, R.; Buyya, R. Workload prediction using ARIMA model and its impact on cloud applications’ QoS. IEEE Trans. Cloud Comput. 2015, 3, 449–458. [Google Scholar] [CrossRef]
- Avrachenkov, K.; Dreveton, M. Statistical Analysis of Networks; Now Publishers: Boston, MA, USA; Delft, The Netherlands, 2022. [Google Scholar]
- Frahm, K.M.; Shepelyansky, D.L. Poisson statistics of PageRank probabilities of Twitter and Wikipedia networks. Eur. Phys. J. B 2014, 87, 93. [Google Scholar] [CrossRef]
- Marijn, T.T.; Yana, V.; David, L.; Andreas, K. Modeling page-view dynamics on Wikipedia. arXiv 2012, arXiv:1212.5943. [Google Scholar] [CrossRef]
- Li, C.; Zheng, J.; Okamura, H.; Dohi, T. Performance Evaluation of a Cloud Datacenter Using CPU Utilization Data. Mathematics 2023, 11, 513. [Google Scholar] [CrossRef]
- Chafai, Z.; Nacer, H.; Lekadir, O.; Gharbi, N.; Ouchaou, L. A performance evaluation model for users’ satisfaction in federated clouds. Clust. Comput. 2024, 27, 4983–5004. [Google Scholar] [CrossRef]
- Jorgensen, M. Experience with the accuracy of software maintenance task effort prediction models. IEEE Trans. Softw. Eng. 1995, 21, 674–681. [Google Scholar] [CrossRef]
- Achelis, S.B. Technical Analysis from A to Z; McGraw Hill: New York, NY, USA, 2001. [Google Scholar]
- Salomon, D. A Guide to Data Compression Methods; Springer: New York, NY, USA, 2013. [Google Scholar]
- Diebold, F.X.; Mariano, R.S. Comparing predictive accuracy. J. Bus. Econ. Stat. 1995, 13, 253–263. [Google Scholar] [CrossRef]
- Arellano-Uson, J.; Magaña, E.; Morato, D.; Izal, M. Survey on Quality of Experience Evaluation for Cloud-Based Interactive Applications. Appl. Sci. 2024, 14, 1987. [Google Scholar] [CrossRef]
- Sankaran, L.; Subramanian, S.J. CloudSim exploration: A knowledge framework for cloud computing researchers. In Applied Soft Computing and Communication Networks: Proceedings of ACN 2020; Springer: Singapore, 2021; pp. 107–122. [Google Scholar] [CrossRef]
Metric | MAPE | RMSE | |
---|---|---|---|
ATSMNN-LSTM | 0.0276 | 3.7805 | 0.9522 |
ATSMNN | 0.0344 | 4.2957 | 0.9358 |
ARIMA | 0.0536 | 8.6441 | 0.7402 |
LR | 0.0382 | 4.8621 | 0.9178 |
SVR | 0.0794 | 7.8139 | 0.7877 |
LSTM | 0.0325 | 3.9242 | 0.9465 |
Transformer | 0.0317 | 4.2307 | 0.9378 |
LSTM–Transformer | 0.0364 | 4.4557 | 0.9278 |
NN | 0.0354 | 4.4512 | 0.9311 |
Comparison Model | p-Value | Result |
---|---|---|
ATSMNN | 8.764 × 10−7 | <0.05 |
ARIMA | <2.2204 × 10−16 | <0.05 |
LR | 2.723 × 10−9 | <0.05 |
SVR | <2.2204 × 10−16 | <0.05 |
LSTM | 8.845 × 10−6 | <0.05 |
Transformer | 4.204 × 10−12 | <0.05 |
LSTM–Transformer | 2.864 × 10−14 | <0.05 |
NN | 2.703 × 10−10 | <0.05 |
Metric | MAPE | RMSE | |
---|---|---|---|
ATSMNN-LSTM | 0.0276 | 0.7791 | 0.9356 |
ATSMNN | 0.0348 | 0.8809 | 0.9177 |
ARIMA | 0.0381 | 0.9661 | 0.9010 |
LR | 0.0533 | 1.5509 | 0.7448 |
SVR | 0.0793 | 1.4661 | 0.7719 |
LSTM | 0.0329 | 0.8213 | 0.9284 |
Transformer | 0.0332 | 0.8412 | 0.9223 |
LSTM-Transformer | 0.0361 | 0.9102 | 0.9121 |
NN | 0.0355 | 0.8971 | 0.9146 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, L.; Gao, X. Profit-Efficient Elastic Allocation of Cloud Resources Using Two-Stage Adaptive Workload Prediction. Appl. Sci. 2025, 15, 2347. https://doi.org/10.3390/app15052347
Li L, Gao X. Profit-Efficient Elastic Allocation of Cloud Resources Using Two-Stage Adaptive Workload Prediction. Applied Sciences. 2025; 15(5):2347. https://doi.org/10.3390/app15052347
Chicago/Turabian StyleLi, Lei, and Xue Gao. 2025. "Profit-Efficient Elastic Allocation of Cloud Resources Using Two-Stage Adaptive Workload Prediction" Applied Sciences 15, no. 5: 2347. https://doi.org/10.3390/app15052347
APA StyleLi, L., & Gao, X. (2025). Profit-Efficient Elastic Allocation of Cloud Resources Using Two-Stage Adaptive Workload Prediction. Applied Sciences, 15(5), 2347. https://doi.org/10.3390/app15052347