A Systematic Comparative Study on the use of Machine Learning Techniques to Predict Lung Cancer and its Metastasis to the Liver: LCLM-Predictor Model
DOI:
https://doi.org/10.22399/ijcesen.788Keywords:
Lung Cancer, Liver Metastasis, Machine Learning, Decision Tree Classifiers, Predictive ModelingAbstract
Lung cancer is one of the major causes of cancer deaths with thousands of affected patients who have developed liver metastasis, complicating the treatment and further prognosis. Early predictions of lung cancer and metastasis may greatly improve patient outcomes since clinical interventions will be instituted in time. This paper compares the performance of different machine learning models including Decision Tree Classifiers, Logistic Regression, Naïve Bayes, K-Nearest Neighbors, Support Vector Machines and Gaussian Mixture Models toward the best set of techniques for prediction. The applied dataset includes various clinical features, such as respiratory symptoms and biochemical markers, for the development of stronger predictive performance. The models were cross-validated using testing and validation techniques aimed at generalizing the whole model with reliability in generating both train and test data. The results of the generated models are gauged using metrics of accuracy, precision, recall, F1-score, and area under ROC curve. Results obtained have revealed that the Decision Tree and KNN models also showed stronger predictive accuracy and strong classification performance, especially in early-stage lung cancer and liver metastasis. The present study is a comparison of the Decision Tree and KNN models, which hence denotes the potential of these models in clinical decision-making and suggests application to the development of diagnostic tools for the early detection of cancer. This provides a very useful guide that is applicable in the use of machine learning in oncology and helps pave the way to future research which would be focused on model optimization and integration into healthcare systems that would produce better management of patients and better survival rates.
References
World Health Organization. WHO Cancer Report, 2020.
American Cancer Society. Lung Cancer Statistics, 2022.
Smith, J., et al. (2020). Challenges in Early Diagnosis of Lung Cancer. Journal of Thoracic Oncology, 15(6),1058-1066.
Chen, X., et al. (2021). AI in Lung Cancer: A Review. Artificial Intelligence in Medicine, 65,21-30.
Wang, L., et al. (2022). Deep Learning Models for Lung Cancer Detection from Radiology Images. IEEE Transactions on Medical Imaging, 38(3);1114-1125.
Tang, R., et al. (2021). Predicting Liver Metastasis in Colorectal Cancer Patients Using Machine Learning Models. Journal of Oncology Research, 47(2);295-302.
Yadav, S. (2020). Interpretability in Machine Learning Models. ACM Computing Surveys, 53(4);1-14. ttps://doi.org/10.1145/3313831.337621
S. M. Al-Waeli, A. Al-Dhafeeri, & M. A. Al-Amri, (2019). Early Prediction of Lung Cancer Using Machine Learning Techniques: A Review, Journal of Medical Imaging and Health Informatics.
M. A. Islam, M. S. Islam, & M. R. Islam, (2020) A Hybrid Machine Learning Approach for Early Diagnosis of Lung Cancer, IEEE Journal of Biomedical and Health Informatics, 2020.
Y. Liu, X. Liu, & Z. Sun, (2020). Predicting Liver Metastasis in Lung Cancer Patients Using a Random Forest Classifier, Journal of Thoracic Oncology.
C. Anderson & E. Thompson, (2020). The Role of Decision Trees in Clinical Diagnostics, Journal of Clinical Medicine.
Yang, C.C. (2022).Explainable Artificial Intelligence for Predictive modeling in Healthcare. J Healthc Inform Res 6,228–239. https://doi.org/10.1007/s41666-022-00114-1
Qaddoumi, A.I., Evans, W.I. & Wilson, M.W. (2025) A case of cutaneous melanoma metastatic to the ciliary body and choroid with complete regression via systemic dual checkpoint inhibitor therapy. BMC Ophthalmol 25, 12). https://doi.org/10.1186/s12886-025-03847-w
Dalwinder Singh, Birmohan Singh (2020) Investigating the impact of data normalization on classification performance Applied Soft Computing 97, 105524. https://doi.org/10.1016/j.asoc.2019.105524
Cao, X., Zheng, S., Zhang, J. et al. (2025) A hybrid CNN-Bi-LSTM model with feature fusion for accurate epilepsy seizure detection. BMC Med Inform Decis Mak 25, 6. https://doi.org/10.1186/s12911-024-02845-0
Zhentian Guo et al. Machine learning for predicting liver and/or lung metastasis in colorectal cancer: A retrospective study based on the SEER database. European Journal of Surgical Oncology 50(7),108362 https://doi.org/10.1016/j.ejso.2024.108362
Dimitris Bertsimas, (2020) Machine Learning in Oncology: Methods, Applications, and Challenges JCO Clinical Cancer Informatics 4 https://doi.org/10.1200/CCI.20.000
Lei Wang, et al. (2025). Classifying 2-year recurrence in patients with dlbcl using clinical variables with imbalanced data and machine learning methods Computer Methods and Programs in Biomedicine 196(C) https://doi.org/10.1016/j.cmpb.2020.10556
Vijayadeep GUMMADI, & Naga Malleswara Rao NALLAMOTHU. (2025). Optimizing 3D Brain Tumor Detection with Hybrid Mean Clustering and Ensemble Classifiers. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.719
K.S. Praveenkumar, & R. Gunasundari. (2025). Optimizing Type II Diabetes Prediction Through Hybrid Big Data Analytics and H-SMOTE Tree Methodology. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.727
TOPRAK, A. (2024). Determination of Colorectal Cancer and Lung Cancer Related LncRNAs based on Deep Autoencoder and Deep Neural Network. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.636
P., A. M., & R. GUNASUNDARI. (2024). An Interpretable PyCaret Approach for Alzheimer’s Disease Prediction. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.655
ÖZNACAR, T., & Zeynep Tuğçe SERTKAYA. (2024). Heart Failure Prediction: A Comparative Study of SHAP, LIME, and ICE in Machine Learning Models. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.589
SHARMA, M., & BENIWAL, S. (2024). Feature Extraction Using Hybrid Approach of VGG19 and GLCM For Optimized Brain Tumor Classification. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.714
Rama Lakshmi BOYAPATI, & Radhika YALAVARTHI. (2024). RESNET-53 for Extraction of Alzheimer’s Features Using Enhanced Learning Models. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.519
Agnihotri, A., & Kohli, N. (2024). A novel lightweight deep learning model based on SqueezeNet architecture for viral lung disease classification in X-ray and CT images. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.425
ÖZNACAR, T., & ERGENE, N. (2024). A Machine Learning Approach to Early Detection and Malignancy Prediction in Breast Cancer. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.516
Nuthakki, praveena, & Pavankumar T. (2024). Comparative Assessment of Machine Learning Algorithms for Effective Diabetes Prediction and Care. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.606
GÜRAKSIN, G. E., & UĞUZ, H. (2018). Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements. International Journal of Computational and Experimental Science and Engineering, 4(1), 1–5. Retrieved from https://ijcesen.com/index.php/ijcesen/article/view/55
YAKUT , Önder. (2023). Diabetes Prediction Using Colab Notebook Based Machine Learning Methods. International Journal of Computational and Experimental Science and Engineering, 9(1), 36–41. Retrieved from https://ijcesen.com/index.php/ijcesen/article/view/187
Ponugoti Kalpana, L. Smitha, Dasari Madhavi, Shaik Abdul Nabi, G. Kalpana, & Kodati , S. (2024). A Smart Irrigation System Using the IoT and Advanced Machine Learning Model: A Systematic Literature Review. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.526
LAVUDIYA, N. S., & C.V.P.R Prasad. (2024). Enhancing Ophthalmological Diagnoses: An Adaptive Ensemble Learning Approach Using Fundus and OCT Imaging. International Journal of Computational and Experimental Science and Engineering, 10(4). https://doi.org/10.22399/ijcesen.678
Johnsymol Joy, & Mercy Paul Selvan. (2025). An efficient hybrid Deep Learning-Machine Learning method for diagnosing neurodegenerative disorders. International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.701
AYKAT, Şükrü, & SENAN, S. (2023). Using Machine Learning to Detect Different Eye Diseases from OCT Images. International Journal of Computational and Experimental Science and Engineering, 9(2), 62–67. Retrieved from https://ijcesen.com/index.php/ijcesen/article/view/191
Anakal, S., K. Krishna Prasad, Chandrashekhar Uppin, & M. Dileep Kumar. (2025). Diagnosis, visualisation and analysis of COVID-19 using Machine learning . International Journal of Computational and Experimental Science and Engineering, 11(1). https://doi.org/10.22399/ijcesen.826
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 International Journal of Computational and Experimental Science and Engineering
This work is licensed under a Creative Commons Attribution 4.0 International License.