Performance Evaluation Of Data Mining Classifiers For Predicting Recurrence And Survivability Of Breast Cancer Patients

Authors

  • A. A. Nurudeen Department of Mathematics and Statistics, Kaduna Polytechnic, Kaduna. Author
  • U. M. Umar Department of Statistics, Usmanu Danfodiyo University, Sokoto, Sokoto. Author
  • B.K. Asare Department of Statistics, Usmanu Danfodiyo University, Sokoto, Sokoto. Author
  • B. Abdulkarim Department of Computer Science, Usmanu Danfodiyo University, Sokoto, Sokoto. Author

DOI:

https://doi.org/10.60787/tnamp.v21.474

Keywords:

Breast cancer , Machine learning, Ensemble classifier, Recurrence, Survivability, Performance

Abstract

Worldwide, breast cancer is currently the most common cancer, accounting for 12.5% of all new annual cancer cases and it is one of the leading causes of cancer-related death in women second only to lung cancer. Incorporating machine learning (ML) classifiers into predicting the recurrence and survival of patients with breast cancer has emerged as a promising approach to enhance performance metrics. This study analyzed dataset on breast cancer obtained from clinical studies- Barau Dikko Teaching Hospital, Kaduna and when the performance of employed  Conventional ML classifiers- Artificial Neural Network (ANN), Support Vector Machine (SVM), K-Nearest Neighbor (KNN) and Proposed ensemble learning classifier (ANN-KNN) was evaluated, it was observed that both Conventional ML and Proposed ensemble learning classifiers could predict recurrence of breast cancer and survivability of breast cancer patients. However, the performance of these conventional ML classifiers and the proposed ensemble learning classifier were compared. The results showed that the proposed ensemble learning classifier outperformed ML classifiers with 97% and 91.04% accuracy on recurrence and survival prediction of breast cancer respectively. It remains the best classifier in predicting breast cancer patients' recurrence and survivability, followed by the ANN classifier with accuracy of 90.5% and 81.93% respectively on recurrence and survivability prediction of breast cancer patients. The findings demonstrate that ensemble learning can enhance the performance of weak classifiers like SVM and KNN. Further extensive evaluation of other ML classifiers like decision tree and random forest can be performed using some combinations that can predict the recurrence and survivability of breast cancer patients with greater accuracy. 

         Views | Downloads: 14 / 11

Downloads

Download data is not yet available.

References

Breast Cancer.org (2023). Retrieved online on May 15, 2023.

Ganggayah M.D., Taib N. A., Har Y. C., Lio P. and Dhillon S.K. (2019). Predicting factors for Survival of breast cancer patients using machine learning techniques. BMC Medical Informatics and Decision making, http://doi.org/10.1186/s12911-019-0801-4 19:48.

abbar, Meerja Akhil. (2021). Breast cancer data classification using ensemble machine learning, Engineering and Applied Science Research 2021;48(1):65-72.

WHO (2021). Bulletin on Breast Cancer. Retrieved on July 12, 2021.

Global Cancer Observatory (2020). Information on Cancer. Retrieved online on August 15, 2020.

Musa A. A., Aliyu U.M., (2020). Application of Machine Learning Techniques in Predicting Breast Cancer Metastases Using Decision Tree Algorithm, in Sokoto Northwestern Nigeria, Journal of Data Mining Genomics Proteomics. 11:220. DOI: 10.35248/2153- 0602.20.11.220.

Jason Brownlee (2017). Master Machine Learning Algorithms ‘Discover how they work and implement them from scratch’ edition, V1.12, Jason Brownlee publications, USA.

Rivera, D. L., Narvaez, R. A. (2023). Application of Machine Learning Approaches in CancerPrediction. Canadian Journal of Nursing Informatics, 18(2). https://cjni.net/jo urn al/ ?p=11581.

Gupta S.R. (2022). Prediction time of breast cancer tumor recurrence using Machine Learning. Cancer Treatment and Research Communications. 32. 100602. 10.1016/j.ctarc.2022.100602.

Izci, Hava & Macq, Gilles & Tambuyzer, Tim & De Schutter, Harlinde & Wildiers, Hans & Duhoux, Francois & Azambuja, Evandro & Taylor, Donatienne & Staelens, Gracienne & Orye, Guy & Hlavata, Zuzana & Hellemans, Helga & Rop, Carine & Neven, Patrick & Verdoodt, Freija. (2023). Machine Learning Algorithm to Estimate Distant Breast Cancer Recurrence at the Population Level with Administrative Data. Clinical Epidemiology. Volume 15. 559-568. 10. /CLEP.S400071. 2147.

Pechprasarn, S. & Wattanapermpool, O. & Warunlawan, M. & Homsud, P. & Akarajarasroj, T. (2023). Identification of Important Factors in the Diagnosis of Breast Cancer Cells Using Machine Learning Models and Principal Component Analysis. Journal of Current Science and Technology.13. 642-656.10.59796/ jcst. V13N3.2023.700.

Mosayebi A, Mojaradi B, Bonyadi Naeini A, Khodadad Hosseini SH. (2020). Modeling and comparing data mining algorithms for prediction of recurrence of breast cancer. PLoS One. Oct 15;15(10): e0237658. doi: 10.1371/journal.pone.0237658. PMID: 33057328; PMCID: PMC7561198.

Shikha Agrawal and Jitendra Agrawal (2015). Neural Network Techniques for Cancer Prediction: A Survey, Procedia Computer Science, Volume 60, Pages 769-774, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2015.08.234.

Haque MN, Tazin T, Khan MM, Faisal S, Ibraheem SM, Algethami H, Almalki FA. Predicting Characteristics Associated with Breast Cancer Survival Using Multiple Machine Learning Approaches. (2022). Comput. Math. Methods Med. doi: 10.1155/2022/1249692. PMID: 35509861; PMCID: PMC9060999.

Carlos D. C. L., Deevyankar A., Isabel D T., and Lourdes M. R. (2023). Automatic detection of Breast Cancer by using Ensemble Learning, DOI: https://doi.org/10.21203/rs.3.rs-2934498/v1.

Ahmed L.G., Eshiaghy A.T., Poorebrahim A. Ebrahimi M. and Razavi A.R. (2013). Using three Machine learning techniques for predicting breast cancer recurrence, Journal of Health and Medical Informatics, 4: 124. Doi:10.4172/2157-7420.100124.

Kalafi EY, Nor NAM, Taib NA, Ganggayah MD, Town C, Dhillon SK. (2019). Machine Learning and Deep Learning Approaches in Breast Cancer Survival Prediction Using Clinical Data. Folia Biol (Praha). 2019;65(5-6):212-220. PMID: 32362304.

Khaoula Chtouki, Maryem Rhanouni, Mounia Mikram, Siham Yousfi and Kamelia Amazian(2023). Supervised Machine Learning for Breast Cancer Risk Factors Analysis and Survival Prediction. Journal of arXiv:2304.07299v1 [cs.LG] 13.

Li J, Zhou Z, Dong J, Fu Y, Li Y, Luan Z, Peng X. (2021). Predicting breast cancer 5-year survival using machine learning: A systematic review. PLoS One. 2021 Apr 16;16(4):e0250370. doi: 10.1371/journal.pone.0250370. PMID: 33861809; PMCID: PMC8051758.

Maria Nisheva, Iliyan Mihaylov, and Dimitar Vassilev (2019). Application of machine Learning models for survival prognosis in breast cancer studies, 93; doi:10.3390/info10030093. www.mdpi.com/journal/information10.

Hugo Aerts J.W.L., Yiwen Xu, Ahmed Hosny, Roman Zeleznik, Chintan Parmar, Thibaud Coroller, Idalid Franco, Raymond H. Mak, (2019). Deep Learning Predicts Lung Cancer Treatment Response from Serial Medical Imaging. American Association for Cancer, Research Clin Cancer Res 25:3266–75 doi: 10.1158/1078-0432.CCR-18-2495.

Dawngliani M.S, Chandrasekaran, N. & Samuel Lalmuanawma (2019). A Comparative Study between Data Mining Classification and Ensemble Techniques for Predicting Survivability of Breast Cancer Patients. International Journal of Computer Scienceand Mobile Computing, Vol.8 Issue.9, September- 2019, pg. 01-10.

Dawngliani, M.S., Chandrasekaran, N, Lalmuanawma, Samuel & Thangkhanhau, H (2020). Prediction of Breast Cancer Recurrence Using Ensemble Machine Learning Classifiers.10.1007/978-3-030-46828-6_20.

Dawngliani, M.S, Chandrasekaran, N., Lalmawipuii, R. & Thangkhanhau, H. (2021). Breast Cancer Recurrence Prediction Model Using Voting Technique. 10.1007/978-3-030-49795-8_2.

Anwar Hina, Qamar Usman, Muzaffar Qureshi Abdul Wahab, (2014). "Global Optimization Ensemble Model for Classification Methods", The Scientific World Journal, vol. 2014, Article ID 313164, 9 pages https://doi.org/10.1155/2014/313164.

Shahzad, R. K., and Lavesson, N. (2013). Comparative analysis of voting schemes for ensemble-based malware detection. Journal of Wireless Mobile Networks, Ubiquitous Computing, and Dependable Applications, 4(1), 98-117.

Prusa, J., Khoshgoftaar, T. M., and Dittman, D. J. (2015). Using ensemble learners to improve classifier performance on tweet sentiment data. In 2015 IEEE international conference on information reuse and integration (pp. 252-257). IEEE.

WHO (2020). Bulletin on Breast Cancer. Retrieved on September, 2024.

Wang. H., LI, Khan, S.A., and Lou, Y. (2020). Prediction of breast cancer distant recurrence using natural language preprocessing and knowledge-guided convolutional neural network, Artif. Intell. Med. 110,101977.

Pilaftis A. and Rubio J. (2015). The Higgs machine learning challenge, Journal of Physics: Conference Series, Vol. 664, no. 7, article 072015.

Maabreh R.S.A., Malik B.A. and Ahmed S.A. (2021). Machine learning algorithms for prediction of survival curves in breast cancer patients. Journal of Applied Bionics and Biomechanics, Vol. 2021, Article ID 9338091, http://doi.org/10.1155/2021.9338091

Downloads

Published

2025-03-03

Issue

Section

Articles

How to Cite

Performance Evaluation Of Data Mining Classifiers For Predicting Recurrence And Survivability Of Breast Cancer Patients. (2025). The Transactions of the Nigerian Association of Mathematical Physics, 21, 65-80. https://doi.org/10.60787/tnamp.v21.474

Share

Similar Articles

11-20 of 21

You may also start an advanced similarity search for this article.