Workload-Aware Machine Learning for Microservice Scaling in Kubernetes
DOI:
https://doi.org/10.22399/ijcesen.4546Keywords:
Kubernetes autoscaling, Microservice scaling, Secure Workload-aware learning, Predictive resource management, Machine learning forecasting, Cloud-native systemsAbstract
Modern cloud platforms rely on Kubernetes as their main technology to manage microservices with fluctuating and unpredictable workloads, but current autoscaling solutions mainly respond to workload changes and find it challenging to sustain optimal service quality during sudden shifts in demand. The upsizing and downsizing of the resources to be done mostly depend on the threshold of resources and do not consider the actual workload thereby causing scaling, instability and wastage of resources which are not efficient. This paper addresses the gap in current autoscaling systems by examining their inability to forecast workload demands and proactively implement measures to prevent service performance issues. A key objective of this research is to create and test a machine learning framework that is aware of the work-load and therefore, the microservice scaling is going to be both proactive and stable in the Kubernetes environments. The proposed approach gathers various workload metrics, transforms them into structured features and ap-plies short-term prediction techniques to estimate future resource needs and potential service level issues. A policy-driven decision engine confirms these predictions and initiates scaling operations that are carried out through Kubernetes’s built-in mechanisms. A constant feedback loop makes it possible for the system to be in sync with the changing workload patterns over a period of time. The results of the experiment show that the proposed method is able to reduce the error in the prediction of the workload by more than forty percent and also improves the time taken for reaction to the scaling from almost ninety seconds to under thirty seconds. The service is made much more stable, as the scaling fluctuations are brought down by more than sixty
References
[1] Santos, J., Wauters, T., Volckaert, B.,and Turck, F. D. (2023). gymhpa: Efficient Auto-Scaling via Reinforcement Learning for Complex Microservice-based Applications in Kubernetes. NOMS 2023-2023 IEEE/IFIP Network Operations and Management Symposium. https://doi.org/10.1109/noms56928.2023.10154298
[2] Zarai, O., Mcharfi, Z., and El Asri, B. (2025). Intelligent Autoscaling Strategies for Cloud-Native Microservices: A Systematic Review. 2025 International Conference on Intelligent Systems: Theories and Applications (SITA), 1–9. https://doi.org/10.1109/sita67914.2025.11273589
[3] Mondal, S. K., Wu, X., Kabir, H. M. D., Dai, H. N., Ni, K., Yuan, H.,and Wang, T. (2023). Toward optimal load prediction and customizable autoscaling scheme for kubernetes. Mathematics, 11(12), 2675.
[4] Xu, M., Song, C., Ilager, S., Gill, S. S., Zhao, J., Ye, K., and Xu,C. (2022). CoScal: Multifaceted scaling of microservices with reinforcement learning. IEEE Transactions on Network and Service Management, 19(4), 3995-4009.
[5] Shi, T., Ma, H., Chen, G., and Hartmann, S. (2023). Auto-scaling containerized applications in geo-distributed clouds. IEEE Transactions on Services Computing, 16(6), 4261-4274.
[6] Rubak, A., and Taheri, J. (2023, December). Machine learning for predictive resource scaling of microservices on kubernetes platforms. In Proceedings of the IEEE/ACM 16th International Conference on Utility and Cloud Computing (pp. 1-8).
[7] Chouliaras, S., and Sotiriadis, S. (2023). An adaptive auto-scaling framework for cloud resource provisioning. Future Generation Computer Systems, 148, 173-183.
[8] Peng, Z., Tang, B., Xu, W., Yang, Q., Hussaini, E., Xiao, Y., and Li,H. (2023, December). Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment. In 2023 IEEE International Conferences on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData) and IEEE Congress on Cybermatics (Cybermatics) (pp. 608-615). IEEE.
[9] Wang, Z., Zhu, S., Li, J., Jiang, W., Ramakrishnan, K. K., Yan, M., Zhang, X., and Liu, A. X. (2024). DeepScaling: Autoscaling Microservices With Stable CPU Utilization for Large Scale Production Cloud Systems. IEEE/ACM Transactions on Networking, 32(5), 3961–3976. https://doi.org/10.1109/tnet.2024.3400953
[10] Alharthi, S., Alshamsi, A., Alseiari, A., & Alwarafy, A. (2024). Auto-scaling techniques in cloud computing: Issues and research directions. Sensors, 24(17), 5551.
[11] Razzaq, M. A., Mahar, J. A., Ahmad, M., Saher, N., Mehmood, A., and Choi, G. S. (2021). Hybrid auto-scaled service-cloud-based predictive workload modeling and analysis for smart campus system. IEEE Access, 9, 42081-42089.
[12] Toka, L., Dobreff, G., Fodor, B., and Sonkoly, B. (2021). Machine learning-based scaling management for kubernetes edge clusters. IEEE Transactions on Network and Service Management, 18(1), 958-972.
[13] Alidoost Alanagh, Y., Firouzi, M., Rasouli Kenari, A., & Shamsi, M. (2023). Introducing an adaptive model for auto‐scaling cloud computing based on workload classification. Concurrency and Computation: Practice and Experience, 35(22). Portico. https://doi.org/10.1002/cpe.7720
[14] Abdel Khaleq, A., and Ra, I. (2023). Intelligent microservices autoscaling module using reinforcement learning. Cluster computing, 26(5), 2789-2800.
[15] Incerto, E., Pizziol, R., and Tribastone, M. (2023). Opt: An Efficient Optimal Autoscaler for Microservice Applications. 2023 IEEE International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS), 67–76. https://doi.org/10.1109/acsos58161.2023.00024
[16] Merkouche, S., and Bouanaka, C. (2022, December). A Hybrid approach for containerized Microservices auto-scaling. In 2022 IEEE/ACS 19th International Conference on Computer Systems and Applications (AICCSA) (pp. 1-6). IEEE.
[17] Saboor, A., Hassan, M. F., Akbar, R., Shah, S. N. M., Hassan, F., Magsi,S. A., and Siddiqui, M. A. (2022). Containerized microservices orchestration and provisioning in cloud computing: A conceptual framework and future perspectives. Applied Sciences, 12(12), 5793.
[18] ZargarAzad, M., and Ashtiani, M. (2023). An auto-scaling approach for microservices in cloud computing environments. Journal of Grid Computing, 21(4), 73.
[19] Xiao, Z., and Hu, S. (2022, September). Dscaler: A horizontal autoscaler of microservice based on deep reinforcement learning. In 2022 23rd Asia-Pacific Network Operations and Management Symposium (APNOMS) (pp. 1-6). IEEE.
[20] Vu, D. D., Tran, M. N., and Kim, Y. (2022). Predictive hybrid autoscaling for containerized applications. IEEE Access, 10, 109768-109778.
[21] Ahmad, H., Treude, C., Wagner, M., and Szabo, C. (2024). Towards Resource-Efficient Reactive and Proactive Auto-Scaling for Microservice Architectures. https://doi.org/10.2139/ssrn.4918202
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Computational and Experimental Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.