Democratizing High-Performance Computing: How Virtualization and Workload Mobility Enable AI/ML Accessibility Across Organizations

Authors

  • Shruthi Karpur

DOI:

https://doi.org/10.22399/ijcesen.4980

Keywords:

GPU Virtualization, Workload Mobility, AI Democratization, Resource Optimization, Computational Accessibility

Abstract

The rapid increase in demand for GPU-accelerated compute for AI and machine learning workloads has outpaced the ability of many organizations to acquire, instrument, and manage dedicated pools of high-performance GPUs. Innovations in GPU virtualization and workload mobility have provided an alternate model of pooling, sharing, and migrating GPUs across heterogeneous infrastructure with strong performance isolation and quality of service characteristics with guaranteed performance bounds. The article proposes architectural and operational techniques to adapt virtualization-based GPU sharing and workload migration to enterprise data centers, edge and constrained installations, and air-gapped environments. Evaluation of production deployments reveals that, compared to legacy state-of-the-art systems, virtualization-based pooling allows sustained GPU utilization at higher rates while achieving almost native performance on compute-intensive workloads. Beyond their operational efficiencies, workload mobility and TCO reduction allow academic institutions, startups, and resource-constrained organizations to participate in AI workloads. The results show that virtualization and workload mobility are critical to democratizing access to accelerated computing and, at the same time, meeting the security, reliability, and performance needs of enterprise data science workflows.

References

[1] Steven Humphrey, "Semiconductor Industry Trends and the Future of Manufacturing," PTC, 2025. [Online]. Available: https://www.ptc.com/en/blogs/electronics-high-tech/semiconductor-industry-trends-and-challenges

[2] Emmett Fear, "GPU Cluster Management: Optimizing Multi-Node AI Infrastructure for Maximum Efficiency," RunPod, 2025. [Online]. Available: https://www.runpod.io/articles/guides/gpu-cluster-management-optimizing-multi-node-ai-infrastructure-for-maximum-efficiency

[3] Cheol-Ho Hong et al., "GPU Virtualization and Scheduling Methods: A Comprehensive Survey," ACM Computing Surveys (CSUR), 2017. [Online]. Available: https://dl.acm.org/doi/10.1145/3068281

[4] Ahmad Raeisi et al.," GOGH: Correlation-Guided Orchestration of GPUs in Heterogeneous Clusters," arXiv:2510.15652v1 [cs.DC], 2025. [Online]. Available: https://arxiv.org/html/2510.15652v1

[5] Xinyu Lian, et al., "Universal Checkpointing: A Flexible and Efficient Distributed Checkpointing System for Large-Scale DNN Training with Reconfigurable Parallelism," arXiv preprint arXiv:2406.18820, 2024. [Online]. Available: https://arxiv.org/abs/2406.18820

[6] Robert Chab, “Algorithmic Techniques for GPU Scheduling: A Comprehensive Survey," Algorithms, 2025. [Online]. Available: https://www.mdpi.com/1999-4893/18/7/385

[7] Tianyu Wang et al., "Improving GPU Multi-Tenancy Through Dynamic Multi-Instance GPU Reconfiguration," arXiv:2407.13126v1 [cs.DC], 2024. [Online]. Available: https://arxiv.org/pdf/2407.13126

[8] "NVIDIA RTX vWS: Sizing and GPU Selection Guide for Virtualized Workloads," NVIDIA Documentation, 2025. [Online]. Available: https://docs.nvidia.com/vgpu/sizing/virtual-workstation/latest/performance-analysis.html

[9] Carlos J. Costa et al., "The Democratization of Artificial Intelligence: Theoretical Framework," Appl. Sci., 2024. [Online]. Available: https://www.mdpi.com/2076-3417/14/18/8236

[10] Keegan Fonte, "The Intersection of AI and Emerging Markets: Opportunities and Challenges," Cornell SC Johnson College of Business, 2024. [Online]. Available: https://business.cornell.edu/hub/2024/08/13/intersection-ai-emerging-markets-opportunities-challenges/

Downloads

Published

2026-02-27

How to Cite

Shruthi Karpur. (2026). Democratizing High-Performance Computing: How Virtualization and Workload Mobility Enable AI/ML Accessibility Across Organizations. International Journal of Computational and Experimental Science and Engineering, 12(1). https://doi.org/10.22399/ijcesen.4980

Issue

Section

Research Article