Cloud-Native ETL Transformation in Healthcare: Performance Optimization and Cost Reduction Through AWS Infrastructure Modernization
DOI:
https://doi.org/10.22399/ijcesen.4514Keywords:
Cloud Computing, Healthcare Data Infrastructure, Etl Modernization, Gitops Automation, Aws ContainerizationAbstract
Healthcare enterprises are under constant pressure to deliver faster analytics on growing clinical and claims datasets while staying compliant and cost-efficient. This paper presents a cloud-native ETL modernization that I led for a regulated U.S. healthcare organization, migrating a legacy on-prem batch ETL stack into AWS using containerization and GitOps automation. The modern platform runs Informatica workloads inside Docker containers on Amazon ECS, orchestrated with AWS Step Functions, and lands curated outputs in Snowflake. All infrastructure and runtime configuration are defined as code using AWS CDK and version-controlled in Git with promotion pipelines in GitHub Actions.The program replaced a tightly coupled, manually operated batch system that required ~34 hours per major run and incurred fixed licensing/compute costs, with an elastic, auditable platform completing equivalent runs in ~2.5 hours (≈90% faster) and reducing per-run cost from roughly $183 to $18 (≈85% lower). Performance gains were validated via Step Functions execution histories, ECS/CloudWatch metrics, and Snowflake query logs; cost reductions were verified through AWS Cost Explorer and Snowflake billing usage. Beyond speed and cost, the modernization introduced repeatable deployment patterns, safer multi-environment promotions, and measurable improvements in reliability and compliance traceability. The resulting approach offers a practical, evidence-backed blueprint for healthcare data organizations modernizing ETL under strict regulatory constraints.
References
[1] Lipsa Das et al., "Data-Driven Healthcare Management, Analysis, and Future Trends," in 2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE), 23 July 2024. https://ieeexplore.ieee.org/document/10593217
[2] S. P. Ahuja and B. Moore, "A Survey of Cloud Computing and Social Networks," Network and Communication Technologies, vol. 2, no. 2, pp. 11-16, 2013. DOI: 10.5539/nct.v2n2p11 https://www.researchgate.net/publication/271060474_A_Survey_of_Cloud_Computing_and_Social_Networks
[3] Hicham Boudlal, et al., "Cloud Computing for Healthcare Services: Technology, Opportunities, and Challenges," in 2022 11th International Symposium on Signal, Image, Video and Communications (ISIVC), 22 June 2022. https://ieeexplore.ieee.org/document/9800212
[4] Florian Beetz and Simon Harrer, "GitOps: The Evolution of DevOps?," IEEE Software, 08 October 2021. https://ieeexplore.ieee.org/document/9565152
[5] K. Morris, "Infrastructure as Code: Managing Servers in the Cloud," O'Reilly Media, 2016. ISBN: 978-1491924358 /https://dl.ebooksworld.ir/books/Infrastructure.as.Code.2nd.Edition.Kief.Morris.OReilly.9781098114671.EBooksWorld.ir.pdf
[6] M. Kleppmann, "Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems," O'Reilly Media, 2017. ISBN: 978-1449373320
https://www.oreilly.com/library/view/designing-data-intensive-applications/9781491903063/
[7] Rafael Vaño, et al., "Cloud-native workload orchestration at the edge: A deployment review and future directions," Sensors, 23(4), 2215, 15 February 2023. https://www.mdpi.com/1424-8220/23/4/2215
[8] Bruno Nascimento, et al., "Availability, scalability, and security in the migration from container-based to cloud-native applications," Computers, 13(8), 192, 9 August 2024. https://www.mdpi.com/2073-431X/13/8/192
[9] C. Pahl and P. Jamshidi, "Microservices: A Systematic Mapping Study," in Proceedings of the 6th International Conference on Cloud Computing and Services Science (CLOSER), 2016, pp. 137-146. DOI: 10.5220/0005785501370146 https://www.researchgate.net/publication/302973857_Microservices_A_Systematic_Mapping_Study
[10] P. Zhang, J. K. Chiang, and A. Dey, "A Comprehensive Survey on Cloud Computing Research," in 2014 IEEE 3rd International Conference on Cloud Networking (CloudNet), 2014, pp. 381-386. DOI: 10.1109/CloudNet.2014.6969019 https://www.researchgate.net/publication/282802256_A_Comprehensive_Survey_on_Cloud_Computing
[11]Fabiano de Abreu Agrela Rodrigues, & Flávio Henrique dos Santos Nascimento. (2025). Neurobiology of perfectionism. International Journal of Sustainable Science and Technology, 3(1). https://doi.org/10.22399/ijsusat.6
[12]S. Jagan, B. Girirajan, Manisha Bhimrao Mane, R B, H. J., Mariam Anil, & M. Thillai Rani. (2025). Adaptive Quantum AI Models for Accelerating Deep Learning in Decentralized Cloud Architectures. International Journal of Computational and Experimental Science and Engineering, 11(3). https://doi.org/10.22399/ijcesen.2493
[13]Soyal, H., & Canpolat, M. (2025). Intersections of Ergonomics and Radiation Safety in Interventional Radiology. International Journal of Sustainable Science and Technology, 3(1). https://doi.org/10.22399/ijsusat.12
[14]Ankit, & Amritpal Singh. (2025). Optimized Architecture for Efficient VM Allocation and Migration in Cloud Environments. International Journal of Computational and Experimental Science and Engineering, 11(2). https://doi.org/10.22399/ijcesen.1466
[15]García, R. (2025). Optimization in the Geometric Design of Solar Collectors Using Generative AI Models (GANs). International Journal of Applied Sciences and Radiation Research , 2(1). https://doi.org/10.22399/ijasrar.32
[16]Vishwanath Pradeep Bodduluri. (2025). Social Media Addiction and Its Overlay with Mental Disorders: A Neurobiological Approach to the Brain Subregions Involved. International Journal of Sustainable Science and Technology, 3(1). https://doi.org/10.22399/ijsusat.3
[17]Ujjwal Raj. (2025). The Serverless Paradigm: Abstraction, Elasticity, and Event-Driven Computing in Modern Cloud Architectures. International Journal of Computational and Experimental Science and Engineering, 11(4). https://doi.org/10.22399/ijcesen.4088
[18]Harsha Patil, Vikas Mahandule, Rutuja Katale, & Shamal Ambalkar. (2025). Leveraging Machine Learning Analytics for Intelligent Transport System Optimization in Smart Cities. International Journal of Applied Sciences and Radiation Research , 2(1). https://doi.org/10.22399/ijasrar.38
[19]Jhansi Rani Ganapa, Poonam Joshi, T Amitha, Sandip Rahane, N. Ravinder, Jignesh Jani, … Chandreshkumar Vyas. (2025). Security and Privacy Challenges in Deep Learning Models Hosted on Cloud Platforms. International Journal of Computational and Experimental Science and Engineering, 11(3). https://doi.org/10.22399/ijcesen.3235
[20]Chui, K. T. (2025). Artificial Intelligence in Energy Sustainability: Predicting, Analyzing, and Optimizing Consumption Trends. International Journal of Sustainable Science and Technology, 3(1). https://doi.org/10.22399/ijsusat.1
[21]V. Ananthakrishna, & Chandra Shekhar Yadav. (2025). QP-ChainSZKP: A Quantum-Proof Blockchain Framework for Scalable and Secure Cloud Applications. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.718
[22]Madane, S., Kamble, V., & Chavan, G. (2025). Cyber Chain – Merging Blockchain with Cyber Security. International Journal of Applied Sciences and Radiation Research , 2(1). https://doi.org/10.22399/ijasrar.42
[23]Kumari, S. (2025). Machine Learning Applications in Cryptocurrency: Detection, Prediction, and Behavioral Analysis of Bitcoin Market and Scam Activities in the USA. International Journal of Sustainable Science and Technology, 3(1). https://doi.org/10.22399/ijsusat.8
[24]Olola, T. M., & Olatunde, T. I. (2025). Artificial Intelligence in Financial and Supply Chain Optimization: Predictive Analytics for Business Growth and Market Stability in The USA. International Journal of Applied Sciences and Radiation Research , 2(1). https://doi.org/10.22399/ijasrar.18
[25]Madhavi Mangalarapua. (2025). Evaluation of DNA damage and repair in Radiographers and Dental Surgeons using X-ray machines in Dental Clinics. International Journal of Natural-Applied Sciences and Engineering, 3(1). https://doi.org/10.22399/ijnasen.14
[26]Ibeh, C. V., & Adegbola, A. (2025). AI and Machine Learning for Sustainable Energy: Predictive Modelling, Optimization and Socioeconomic Impact In The USA. International Journal of Applied Sciences and Radiation Research , 2(1). https://doi.org/10.22399/ijasrar.19
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Computational and Experimental Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.