Adaptive AI Model Selection for Cloud Infrastructure Optimization: A Framework for Intelligent, Self-Regulating Computing Environments

Authors

  • Mallikarjuna Muchu

DOI:

https://doi.org/10.22399/ijcesen.4670

Keywords:

Adaptive AI Model Selection, Cloud Infrastructure Optimization, Event-Driven Automation, Multi-Cloud Resource Management, Infrastructure as Code

Abstract

Contemporary cloud computing ecosystems demand intelligent infrastructure management strategies that transcend traditional static provisioning models. The adaptive AI model selection framework presented herein addresses fundamental challenges in managing heterogeneous workloads across microservices architectures, data processing pipelines, IoT data streams, and AI inference engines through systematic integration of machine learning operations with event-driven automation mechanisms. The framework synthesizes cloud engineering principles, Infrastructure as Code methodologies, and intelligent model selection algorithms to enable real-time optimization based on telemetry analysis, historical workload patterns, and operational objectives. Through automated training pipelines, continuous evaluation protocols, and progressive deployment strategies with rollback capabilities, the framework facilitates a self-optimizing infrastructure that minimizes human intervention while maintaining service-level agreements and cost efficiency. The architecture comprises telemetry collection subsystems, versioned model repositories, intelligent selection engines implementing multi-criteria decision frameworks, and automated deployment orchestration utilizing canary patterns and circuit breakers. Event-driven automation enables real-time responsiveness through stream processing frameworks that evaluate optimization opportunities via windowed computations, complex event processing patterns, and stateful processing mechanisms. Enhancements to multi-cloud and hybrid environments support heterogeneous resource abstractions, cross-platform data movement limits, and vendor-specific operational behaviors by using cloud-agnostic abstraction layers and provider-specific adapters. The framework illustrates how smart and dynamic infrastructure operation can make organizations realize better resource utilization, better service reliability, and reduction of operational expenses in da distributed computing environment, as well as provision of regulatory compliance and data sovereignty requirements in a complex multi-cloud deployment.

References

[1] Tanner Luxner, "Cloud Computing Trends: Flexera 2023 State of the Cloud Report," Flexera 2023. [Online]. Available: https://www.flexera.com/blog/finops/cloud-computing-trends-flexera-2023-state-of-the-cloud-report/

[2] Anton Beloglazo, et al., "Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing," ScienceDirect, 2012. [Online]. Available: https://www.sciencedirect.com/science/article/abs/pii/S0167739X11000689

[3] Jayavardhana Gubbi, et al., "Internet of Things (IoT): A vision, architectural elements, and future directions," arxiv, 2012. [Online]. Available: https://arxiv.org/abs/1207.0203

[4] Hui Zhang, et al., "Intelligent Workload Factoring for a Hybrid Cloud Computing Model," IEEE, 2009. [Online]. Available: https://ieeexplore.ieee.org/document/5190708

[5] Babak Ravandi; Ioannis Papapanagiotou, "A Self-Learning Scheduling in Cloud Software Defined Block Storage," IEEE, 2017. [Online]. Available: https://ieeexplore.ieee.org/document/8030616

[6] Manish Kumar Abhishek, et al., "Framework to Deploy Containers using Kubernetes and CI/CD Pipeline," International Journal of Advanced Computer Science and Applications, 2022. [Online]. Available: https://thesai.org/Downloads/Volume13No4/Paper_60-Framework_to_Deploy_Containers_using_Kubernetes_and_CICD_Pipeline.pdf

[7] Pulkit A. Misra, et al., "Managing Tail Latency in Datacenter-Scale File Systems Under Production Constraints," ACM Digital Library, 2019. [Online]. Available: https://dl.acm.org/doi/10.1145/3302424.3303973

[8] Vamsi Krishna Reddy Munnangi, "Multi-Cloud and Hybrid Cloud Strategies for Enterprise API Architectures," ResearchGate, 2025. [Online]. Available: https://www.researchgate.net/publication/391630699_Multi-Cloud_and_Hybrid_Cloud_Strategies_for_Enterprise_API_Architectures

[9] Nicola Capodieci, et al., "Deadline-Based Scheduling for GPU with Preemption Support," IEEE, 2009 [Online]. Available: https://ieeexplore.ieee.org/document/8603197

[10] Omdia, “Global cloud infrastructure spending rose 21% in Q1 2025”, 2025. https://omdia.tech.informa.com/pr/2025/jun/global-cloud-infrastructure-spending-rose-21percent-in-q1-2025

Downloads

Published

2026-01-04

How to Cite

Mallikarjuna Muchu. (2026). Adaptive AI Model Selection for Cloud Infrastructure Optimization: A Framework for Intelligent, Self-Regulating Computing Environments. International Journal of Computational and Experimental Science and Engineering, 12(1). https://doi.org/10.22399/ijcesen.4670

Issue

Section

Research Article