A Machine Learning Approach to Early Detection and Malignancy Prediction in Breast Cancer
DOI:
https://doi.org/10.22399/ijcesen.516Keywords:
Machine Learning, Artificial Intelligence, Breast Cancer, AdaBoostAbstract
Breast cancer is the most common cancer among women, making early detection crucial for effective treatment. Traditional diagnostic methods often face limitations, leading to potential errors in diagnosis. This study explores the transformative potential of artificial intelligence (AI) and machine learning (ML) in breast cancer diagnosis, particularly through models like AdaBoost, SVM, Random Forest, and logistic regression. By analyzing key variables—such as age, tumor size, and menopausal status—this research aims to accurately differentiate between malignant and benign lesions.
The findings reveal that the AdaBoost model significantly outperforms others, achieving an impressive AUC of 93.60% and a precision rate of 95.65%. This indicates its exceptional ability to accurately classify cases, minimizing false positives and ensuring reliable detection of true positives. With an F1 score of 86.27%, AdaBoost effectively balances precision and recall, positioning it as a valuable tool in clinical settings.
Overall, this study underscores the importance of integrating AI-driven approaches in breast cancer diagnosis, enhancing accuracy and improving patient outcomes while reducing unnecessary invasive procedures. The promising results advocate for the adoption of these advanced techniques in healthcare, paving the way for more personalized and effective treatment strategies.
References
Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., & Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 68(6), 394-424. https://doi.org/10.3322/caac.21492
Duffy, M. J., Patnick, J., & Vaughan, T. L. (2017). Screening for breast cancer. British Medical Journal, 358, j4242. https://doi.org/10.1136/bmj.j4242
Lehmann, R., et al. (2020). AI in breast cancer: Review of current clinical applications and future perspectives. Journal of Clinical Medicine, 9(8), 2453. https://doi.org/10.3390/jcm9082453
Yala, A., et al. (2019). A deep learning model to triage breast cancer patients. Nature, 573, 170-174. https://doi.org/10.1038/s41586-019-1456-4
Gurcan, M. N., Boucher, G., Can, A., & Madabhushi, A. (2009). Histopathological image analysis: A review. IEEE Transactions on Biomedical Engineering, 56(2), 292-306. https://doi.org/10.1109/TBME.2009.2014043
McKinney, S. M., et al. (2020). International evaluation of an AI system for breast cancer screening. Nature, 577, 89-94. https://doi.org/10.1038/s41586-019-1799-6
Tavakoli, S., et al. (2021). The impact of artificial intelligence on breast cancer diagnosis and treatment: A comprehensive review. Cancer Control, 28(1), 10732748211009885. https://doi.org/10.1177/10732748211009885
Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119-139. https://doi.org/10.1006/jcss.1997.1504
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273-297. https://doi.org/10.1007/BF00994018
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324
Hosmer, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (3rd ed.). Wiley.
https://doi.org/10.1007/s10916-019-1272-1
Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017) (pp. 4765-4774). Curran Associates Inc.
Zuo, D., Yang, L., Jin, Y., et al. (2023). Machine learning-based models for the prediction of breast cancer recurrence risk. BMC Medical Informatics and Decision Making, 23(276). https://doi.org/10.1186/s12911-023-02377-z
Cai, Y., Zhaoxiong, Y., Zhu, W., & Wang, H. (2024). Association between sleep duration, depression and breast cancer in the United States: A national health and nutrition examination survey analysis 2009–2018. Annals of Medicine, 56(1). https://doi.org/10.1080/07853890.2024.2314235
Montazeri, M., Montazeri, M., Montazeri, M., & Beigzadeh, A. (2016). Machine learning models in breast cancer survival prediction. Technology and Health Care, 24(1), 31-42. https://doi.org/10.3233/THC-151064
Zhou, S., Hu, C., Wei, S., & Yan, X. (2024). Breast cancer prediction based on multiple machine learning algorithms. Technology in Cancer Research & Treatment, 23. https://doi.org/10.1177/15330338241234791
Ramakrishna, M. T., Venkatesan, V. K., Izonin, I., Havryliuk, M., & Bhat, C. R. (2023). Homogeneous Adaboost ensemble machine learning algorithms with reduced entropy on balanced data. Entropy (Basel), 25(2), 245. https://doi.org/10.3390/e25020245
He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263-1284. https://doi.org/10.1109/TKDE.2008.239
Hsu, C., & Lin, C. (2002). A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks, 13(2), 415-425. https://doi.org/10.1109/72.991427
Menard, S. (2002). Applied logistic regression analysis (2nd ed.). Sage Publications.
Smith, R. A., Andrews, K. S., Brooks, D., Fedewa, S. A., Manassaram-Baptiste, D., Saslow, D., & Wender, R. C. (2019). Cancer screening in the United States, 2019: A review of current American Cancer Society guidelines and current issues in cancer screening. CA: A Cancer Journal for Clinicians, 69(3), 184-210. https://doi.org/10.3322/caac.21557
Harris, L., Fritsche, H., Mennel, R., Norton, L., Ravdin, P., Taube, S., & Winchester, D. (2016). American Society of Clinical Oncology 2007 update of recommendations for the use of tumor markers in breast cancer. Journal of Clinical Oncology, 25(33), 5287-5312. https://doi.org/10.1200/JCO.2007.14.2364
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 International Journal of Computational and Experimental Science and Engineering
This work is licensed under a Creative Commons Attribution 4.0 International License.