A Machine Learning Approach to Early Detection and Malignancy Prediction in Breast Cancer

Authors

  • Tuğçe ÖZNACAR Ankara Medipol Üniversitesi
  • Neyhan ERGENE

DOI:

https://doi.org/10.22399/ijcesen.516

Keywords:

Machine Learning, Artificial Intelligence, Breast Cancer, AdaBoost

Abstract

Breast cancer is the most common cancer among women, making early detection crucial for effective treatment. Traditional diagnostic methods often face limitations, leading to potential errors in diagnosis. This study explores the transformative potential of artificial intelligence (AI) and machine learning (ML) in breast cancer diagnosis, particularly through models like AdaBoost, SVM, Random Forest, and logistic regression. By analyzing key variables—such as age, tumor size, and menopausal status—this research aims to accurately differentiate between malignant and benign lesions.

The findings reveal that the AdaBoost model significantly outperforms others, achieving an impressive AUC of 93.60% and a precision rate of 95.65%. This indicates its exceptional ability to accurately classify cases, minimizing false positives and ensuring reliable detection of true positives. With an F1 score of 86.27%, AdaBoost effectively balances precision and recall, positioning it as a valuable tool in clinical settings.

Overall, this study underscores the importance of integrating AI-driven approaches in breast cancer diagnosis, enhancing accuracy and improving patient outcomes while reducing unnecessary invasive procedures. The promising results advocate for the adoption of these advanced techniques in healthcare, paving the way for more personalized and effective treatment strategies.

References

Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., & Jemal, A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 68(6), 394-424. https://doi.org/10.3322/caac.21492

Duffy, M. J., Patnick, J., & Vaughan, T. L. (2017). Screening for breast cancer. British Medical Journal, 358, j4242. https://doi.org/10.1136/bmj.j4242

Lehmann, R., et al. (2020). AI in breast cancer: Review of current clinical applications and future perspectives. Journal of Clinical Medicine, 9(8), 2453. https://doi.org/10.3390/jcm9082453

Yala, A., et al. (2019). A deep learning model to triage breast cancer patients. Nature, 573, 170-174. https://doi.org/10.1038/s41586-019-1456-4

Gurcan, M. N., Boucher, G., Can, A., & Madabhushi, A. (2009). Histopathological image analysis: A review. IEEE Transactions on Biomedical Engineering, 56(2), 292-306. https://doi.org/10.1109/TBME.2009.2014043

McKinney, S. M., et al. (2020). International evaluation of an AI system for breast cancer screening. Nature, 577, 89-94. https://doi.org/10.1038/s41586-019-1799-6

Tavakoli, S., et al. (2021). The impact of artificial intelligence on breast cancer diagnosis and treatment: A comprehensive review. Cancer Control, 28(1), 10732748211009885. https://doi.org/10.1177/10732748211009885

Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119-139. https://doi.org/10.1006/jcss.1997.1504

Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273-297. https://doi.org/10.1007/BF00994018

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. https://doi.org/10.1023/A:1010933404324

Hosmer, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (3rd ed.). Wiley.

https://doi.org/10.1007/s10916-019-1272-1

Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS 2017) (pp. 4765-4774). Curran Associates Inc.

Zuo, D., Yang, L., Jin, Y., et al. (2023). Machine learning-based models for the prediction of breast cancer recurrence risk. BMC Medical Informatics and Decision Making, 23(276). https://doi.org/10.1186/s12911-023-02377-z

Cai, Y., Zhaoxiong, Y., Zhu, W., & Wang, H. (2024). Association between sleep duration, depression and breast cancer in the United States: A national health and nutrition examination survey analysis 2009–2018. Annals of Medicine, 56(1). https://doi.org/10.1080/07853890.2024.2314235

Montazeri, M., Montazeri, M., Montazeri, M., & Beigzadeh, A. (2016). Machine learning models in breast cancer survival prediction. Technology and Health Care, 24(1), 31-42. https://doi.org/10.3233/THC-151064

Zhou, S., Hu, C., Wei, S., & Yan, X. (2024). Breast cancer prediction based on multiple machine learning algorithms. Technology in Cancer Research & Treatment, 23. https://doi.org/10.1177/15330338241234791

Ramakrishna, M. T., Venkatesan, V. K., Izonin, I., Havryliuk, M., & Bhat, C. R. (2023). Homogeneous Adaboost ensemble machine learning algorithms with reduced entropy on balanced data. Entropy (Basel), 25(2), 245. https://doi.org/10.3390/e25020245

He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263-1284. https://doi.org/10.1109/TKDE.2008.239

Hsu, C., & Lin, C. (2002). A comparison of methods for multiclass support vector machines. IEEE Transactions on Neural Networks, 13(2), 415-425. https://doi.org/10.1109/72.991427

Menard, S. (2002). Applied logistic regression analysis (2nd ed.). Sage Publications.

Smith, R. A., Andrews, K. S., Brooks, D., Fedewa, S. A., Manassaram-Baptiste, D., Saslow, D., & Wender, R. C. (2019). Cancer screening in the United States, 2019: A review of current American Cancer Society guidelines and current issues in cancer screening. CA: A Cancer Journal for Clinicians, 69(3), 184-210. https://doi.org/10.3322/caac.21557

Harris, L., Fritsche, H., Mennel, R., Norton, L., Ravdin, P., Taube, S., & Winchester, D. (2016). American Society of Clinical Oncology 2007 update of recommendations for the use of tumor markers in breast cancer. Journal of Clinical Oncology, 25(33), 5287-5312. https://doi.org/10.1200/JCO.2007.14.2364

Downloads

Published

2024-11-08

How to Cite

ÖZNACAR, T., & ERGENE, N. (2024). A Machine Learning Approach to Early Detection and Malignancy Prediction in Breast Cancer. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.516

Issue

Section

Research Article