A Formal Model for Feature Store Architecture and Governance Sivaramakrishnan Vaidyanathan

Sivaramakrishnan Vaidyanathan

doi:10.22399/ijcesen.4555

Authors

Sivaramakrishnan Vaidyanathan

DOI:

https://doi.org/10.22399/ijcesen.4555

Keywords:

Feature Store Architecture, Training-Serving Skew, Machine Learning Operations, Feature Governance, Optimization Framework

Abstract

ML systems in production have to address many challenges while ensuring consistency between the features in the training and serving phases. Feature Stores have emerged as one of the key ML infrastructure components to bridge the training and serving gaps. There are tradeoffs between different types of FS, such as latency, consistency guarantees, costs, and operational complexity. Organizations often do not have formal governance frameworks for governing Machine Learning pipelines. One example of the issues that can arise from insufficient frameworks is Training-Serving Skew, whereby feature statistics differ between environments. This leads to challenges in ensuring regulatory compliance and the ability to trace the lineage of features for model auditability and reproducibility. This presents a two-part formal model that enables mathematical optimization and structured governance. The first half frames the FS selection process as a constrained optimisation problem so that the performance of dual-database architectures can be quantitatively compared to that of unified architectures based on business priorities. The second half introduces Versioned Feature Descriptors that are canonical metadata artifacts for the permanent storage of feature definitions, complete lineage from raw data to prediction outputs, and fully machine-enforceable compliance policies. The optimization framework models serving latency, consistency gap, capital expense, and operational complexity for dual-database systems (one for online and another for offline workloads) and for unified systems (which house both workloads). The governance model prevents training-serving skew through runtime validation, ensuring that features input to a deployed model come from the desired descriptor version. Privacy and retention requirements are enforced by formal policy predicates, with the review process showing improvements in operational cost, debugging, audit, and regulatory compliance efforts. The framework formalizes Feature Store architecture evaluation, transforming decision-making from heuristic-based to a systematic architecture evaluation approach based on quantitative analysis for scalable and compliant machine learning adoption.

References

[1] D. Sculley et al., "Hidden Technical Debt in Machine Learning Systems". Available: https://proceedings.neurips.cc/paper_files/paper/2015/file/86df7dcfd896fcaf2674f757a2463eba-Paper.pdf

[2] Neoklis Polyzotis et al., "Data Lifecycle Challenges in Production Machine Learning: A Survey," ACM SIGMOD Record, 2018. Available: https://dl.acm.org/doi/10.1145/3299887.3299891 DOI: https://doi.org/10.1145/3299887.3299891

[3] Alexander Ratner et al., "MLSys: The New Frontier of Machine Learning Systems," arXiv preprint arXiv:1904.03257, 2019. Available: https://arxiv.org/abs/1904.03257

[4] Matei Zaharia et al., "Accelerating the Machine Learning Lifecycle with MLflow," Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 2018. Available: https://people.eecs.berkeley.edu/~matei/papers/2018/ieee_mlflow.pdf

[5] Tianqi Chen and Carlos Guestrin, "XGBoost: A Scalable Tree Boosting System," arxiv>cs>arXiv:1603.02754, 2016. Available: https://arxiv.org/abs/1603.02754

[6] Sebastian Schelter, "On Challenges in Machine Learning Model Management," Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 2018. Available: http://sites.computer.org/debull/A18dec/p5.pdf

[7] Andrei Paleyes et al., "Challenges in Deploying Machine Learning: A Survey of Case Studies," ACM Computing Surveys, 2022. Available: https://dl.acm.org/doi/10.1145/3533378 DOI: https://doi.org/10.1145/3533378

[8] Saleema Amershi et al., "Software Engineering for Machine Learning: A Case Study," 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), 2019. Available: https://ieeexplore.ieee.org/document/8804457 DOI: https://doi.org/10.1109/ICSE-SEIP.2019.00042

[9] Christoph Molnar et al., "Interpretable Machine Learning -- A Brief History, State-of-the-Art and Challenges," arXiv preprint arXiv:2010.09337, 2020. Available: https://arxiv.org/abs/2010.09337

[10] R. Maclin and D. Opitz, "Popular Ensemble Methods: An Empirical Study," Journal Of Artificial Intelligence Research, 2011. Available: https://arxiv.org/abs/1106.0257

A Formal Model for Feature Store Architecture and Governance Sivaramakrishnan Vaidyanathan

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Information

Keywords

Announcements

Current Issue