Continuous-Learning Recommendation Engines: Fusing Deep Metric Embeddings with Contextual Bandits at Web Scale

Authors

  • Aditya Choudhary

DOI:

https://doi.org/10.22399/ijcesen.4945

Keywords:

Bandits, Embeddings, Governance, Personalization, Recommendations

Abstract

A novel recommendation system architecture integrates deep metric-learning embeddings with contextual-bandit exploration to address limitations of traditional recommenders. This hybrid design enables personalized content delivery while continuously adapting to evolving user preferences and contexts. The architecture captures semantic relationships between items through high-dimensional embeddings while systematically exploring new options through contextual bandits, creating a balanced approach to the exploitation-exploration dilemma. Implementation features include efficient feature pipeline orchestration, on-device inference capabilities, and robust counterfactual evaluation techniques. Both offline and online evaluations demonstrate significant improvements in click-through rates, cold-start adaptation speed, and recommendation diversity without sacrificing relevance. Responsible deployment patterns including shadow mode operation, fairness audits, and feedback loop dampening ensure the system functions ethically at web scale.

References

[1] Massimo Quadrana, et al., "Sequence-Aware Recommender Systems," arXiv, 2018. [Online]. Available: https://arxiv.org/pdf/1802.08452

[2] Lihong Li, et al., "A Contextual-Bandit Approach to Personalized News Article Recommendation," arXiv, 2012. [Online]. Available: https://arxiv.org/pdf/1003.0146

[3] James Davidson, et al., "The YouTube video recommendation system," RecSys '10: Proceedings of the fourth ACM conference on Recommender systems, 2010. [Online]. Available: https://dl.acm.org/doi/10.1145/1864708.1864770

[4] Maurizio Ferrari Dacrema, et al., "Are we really making much progress? A worrying analysis of recent neural recommendation approaches," RecSys '19: Proceedings of the 13th ACM Conference on Recommender Systems, 2019. [Online]. Available: https://dl.acm.org/doi/10.1145/3298689.3347058

[5] Hao Wang, et al., "Collaborative Deep Learning for Recommender Systems," KDD '15: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015. [Online]. Available: https://dl.acm.org/doi/10.1145/2783258.2783273

[6] Paul Covington, et al., "Deep Neural Networks for YouTube Recommendations," RecSys '16: Proceedings of the 10th ACM Conference on Recommender Systems, 2016. [Online]. Available: https://dl.acm.org/doi/10.1145/2959100.2959190

[7] Vito Bellini, et al.,, "Knowledge-aware Autoencoders for Explainable Recommender Systems," DLRS 2018: Proceedings of the 3rd Workshop on Deep Learning for Recommender Systems, 2018. [Online]. Available: https://dl.acm.org/doi/10.1145/3270323.3270327

[8] Shuai Zhang, et al., "Deep Learning based Recommender System: A Survey and New Perspectives," arXiv:1707.07435v7 [cs.IR] 10 Jul 2019. [Online]. Available: https://arxiv.org/pdf/1707.07435

[9] Claudio Gentile, et al., "Online Clustering of Bandits," arXiv, 2014. [Online]. Available: https://arxiv.org/pdf/1401.8257

[10] Elias Bareinboim, et al., "Bandits with Unobserved Confounders: A Causal Approach," in Advances in Neural Information Processing Systems 28, pp. 1410-1418, 2015. [Online]. Available: https://proceedings.neurips.cc/paper_files/paper/2015/file/795c7a7a5ec6b460ec00c5841019b9e9-Paper.pdf

[11] Carlos A. Gomez-Uribe and Neil Hunt, "The Netflix Recommender System: Algorithms, Business Value, and Innovation," ACM Transactions on Management Information Systems (TMIS), Volume 6, Issue 4, 2015. [Online]. Available: https://dl.acm.org/doi/10.1145/2843948

[12] Olivier Chapelle and Lihong Li, "An Empirical Evaluation of Thompson Sampling," in Advances in Neural Information Processing Systems 24 (NIPS), pp. 2249-2257, 2011. [Online]. Available: https://proceedings.neurips.cc/paper/2011/file/e53a0a2978c28872a4505bdb51db06dc-Paper.pdf

[13] Xiangyu Zhao, et al., "Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning," arXiv, 2018. [Online]. Available: https://arxiv.org/pdf/1802.06501

[14] Xinran He, et al., "Practical Lessons from Predicting Clicks on Ads at Facebook," Facebook Research, 2014. [Online]. Available: https://research.facebook.com/publications/practical-lessons-from-predicting-clicks-on-ads-at-facebook/

[15] Harald Steck, "Calibrated recommendations," RecSys '18: Proceedings of the 12th ACM Conference on Recommender Systems, 2018. [Online]. Available: https://dl.acm.org/doi/10.1145/3240323.3240372

[16] Xiangyu Zhao, et al., "DEAR: Deep Reinforcement Learning for Online Advertising Impression in Recommender Systems," arXiv, 2021. [Online]. Available: https://arxiv.org/pdf/1909.03602

[17] Yang Zhang, et al., "Causal Intervention for Leveraging Popularity Bias in Recommendation," SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. [Online]. Available: https://dl.acm.org/doi/10.1145/3404835.3462875

[18] Rishabh Mehrotra, et al., "Towards a Fair Marketplace: Counterfactual Evaluation of the trade-off between Relevance, Fairness & Satisfaction in Recommendation Systems," CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018. [Online]. Available: https://dl.acm.org/doi/10.1145/3269206.3272027

[19] Alex Beutel, et al., "Fairness in Recommendation Ranking through Pairwise Comparisons," arXiv, 2019. [Online]. Available: https://arxiv.org/pdf/1903.00780

[20] Robin Burke, et al., "Recommender Systems: An Overview," AI Magazine, vol. 32, no. 3, pp. 13-18, 2011. [Online]. Available: https://www.researchgate.net/publication/220604600_Recommender_Systems_An_Overview

Downloads

Published

2026-02-21

How to Cite

Aditya Choudhary. (2026). Continuous-Learning Recommendation Engines: Fusing Deep Metric Embeddings with Contextual Bandits at Web Scale. International Journal of Computational and Experimental Science and Engineering, 12(1). https://doi.org/10.22399/ijcesen.4945

Issue

Section

Research Article