Enhancing Fault Tolerance in Cloud Computing using Modified Deep Q-Network (M-DQN) for Optimal Load Balancing

Bikash Chandra Pattanaik; Bidush kumar Sahoo; Bibudhendu Pati; Arabinda Pradhan

doi:10.22399/ijcesen.601

Authors

Bikash Chandra Pattanaik BIJU PATNAIK UNIVERSITY OF TECHNOLOGY
Bidush kumar Sahoo Department of CSE, GIET University, Gunupur, Odisha, India
Bibudhendu Pati Department of CS, Ramadevi Women’s University, Bhubaneswar, Odisha, India, https://orcid.org/0000-0002-2544-5343
Arabinda Pradhan Department of CSE, Gandhi Institute for Education and Technology, Khurdha, Odisha, India https://orcid.org/0000-0002-3299-8990

DOI:

https://doi.org/10.22399/ijcesen.601

Keywords:

Cloud computing, Load balancing, Fault tolerance, DQN

Abstract

Due to popularity of cloud computing approach, excessive cloud user can send their request to cloud server for accessing their requirements. Servers are handling these incoming requests and allocate required resources to fulfill user demands. But in real scenario the numbers of servers are limited. Therefore, some servers are heavily loaded and some servers are in idle mode. This can result in a major fault tolerance issue that reduces system performance. To overcome this issue, this study presented an effective scheduling mechanism known as Modified Deep Q-Network (M-DQN). In this process the data centre controller performs appropriate actions on the environment in order to select a suitable virtual machine (VM) capable of optimizing different load balancing parameters. To get the desired outcome, a simulation is run using Google Colab with the TensorFlow environment, demonstrating the usefulness of the proposed scheduling technique. The experiment revealed that our suggested approach has a higher reward rate, reduces makespan but increases resource utilization and throughput when compared to the existing DQN algorithm. Simulation findings demonstrate that the M-DQN method works better in decreasing around 16% execution time and 10% makespan time, while it increases 8% resource utilization and 4% throughput value. Overall, it increases 18% reward value as compare with I-DQN and DQN algorithm.

References

Pradhan, A., Bisoy, S. K., and Mallick, P. K. (2020). Load balancing in cloud computing: survey. In book: Innovation in Electrical Power Engineering, Communication, and Computing Technology, Lecture Notes in Electrical Engineering, springer, pp 99-111.

Rehman, A. U., Aguiar, R. L., Barraca, J. P. (2022). Fault-Tolerance in the Scope of Cloud Computing. IEEE Access, 10;63422-63441.

Pradhan, A., Bisoy, S. K., Das, A. (2022). A survey on PSO based meta-heuristic scheduling mechanism in cloud computing environment. Journal of King Saud University –Computer and Information Sciences, 34(8);4888-4901 https://doi.org/10.1016/j.jksuci.2021.01.003

Pattnaik, B. C., Sahoo, B. K., Pradhan, A., Mishra, S. R., Tripathy, H. S., Agasti, P. (2024). Fault Tolerance Enhancement Through Load Balancing Optimization in Cloud Computing. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 172–180. https://doi.org/10.1016/j.jksuci.2018.01.003

Pradhan, A., Bisoy, S. K., Kautish, S., Jasser, M. B., Mohamed, A. W. (2022). Intelligent Decision-Making of Load Balancing Using Deep Reinforcement Learning and Parallel PSO in Cloud Environment. IEEE Access, 10;76939-76952.

Pradhan, A., Bisoy, S. K., and Sain, M. (2022). Action-Based Load Balancing Technique in Cloud Network Using Actor-Critic-Swarm Optimization, Wireless Communications and Mobile Computing, Wiley, Hindawi, 2022;6456242, pp 1-17.

Hatem, M. E., Rabie, A. R. (2019). Resource Scheduling for Offline Cloud Computing Using Deep Reinforcement Learning. International Journal of Computer Science and Network Security (IJCSNS), 19(4);54-60.

Tennakoon, D., Chowdhury, M., Luan, T. H. (2018). Cloud-based load balancing using double Q-learning for improved Quality of Service. Wireless Networks, https://doi.org/10.1007/s11276-018-1888-8.

Li, M., Zhang, J., Wan, J., Ren, Y., Zhou, L., Wu, B., Yang, R., Wang, j. (2019). Distributed machine learning load balancing strategy in cloud computing services. Wireless Networks. https://doi.org/10.1007/s11276-019-02042-2.

Noel, R. R., Mehra, R., Lama, P. (2019). Towards Self-Managing Cloud Storage with Reinforcement Learning. IEEE International Conference on Cloud Engineering (IC2E), pp 34-44.

Tassel, P.,Gebser, M.,Schekotihin, K. (2021). A Reinforcement Learning Environment for Job-Shop Scheduling.arXiv:2104.03760[cs. LG].

Lin, J., Cui, D., Peng, Z., Li, Q., He, J. (2020). A Two-Stage Framework for the Multi-User Multi-Data Center Job Scheduling and Resource Allocation. IEEE Access, 8;197863-74.

Che, H., Bai, Z., Zuo, R., Li, H. (2020). A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling. Hindawi Complexity, Wiley, 2020;3046769, pp 1-12. https://doi.org/10.1155/2020/3046769

Dong, T., Xue, f., Xiao, C., Li, J. (2020). Task scheduling based on deep reinforcement learning in a cloud manufacturing environment. Concurrency and Computation: Practice and Experience. Wiley, pp 1-12.

Pradhan, A., and Bisoy, S. K. (2022). Intelligent Action Performed Load Balancing Decision Made in Cloud Data center Based on Improved DQN Algorithm. 2022 International Conference on Emerging Smart Computing and Informatics (ESCI), pp. 1-6.

Peng, Z., Lin, J., Cui, D., Li, Q., He, J. (2020). A multi-objective trade-off framework for cloud resource scheduling based on the Deep Q-network algorithm. Cluster Computing. https://doi.org/10.1007/s10586-019-03042-9.

Pradhan A., Bisoy, S. K. (2022). A novel load balancing technique for cloud computing platform based on PSO. Journal of King Saud University –Computer and Information Sciences, 34(7);3988-3995. https://doi.org/10.1016/j.jksuci.2020.10.016

Enhancing Fault Tolerance in Cloud Computing using Modified Deep Q-Network (M-DQN) for Optimal Load Balancing

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Information

Keywords

Announcements

Current Issue