JAIT 2023 Vol.14(2): 363-372
doi: 10.12720/jait.14.2.363-372

Breast Cancer Classification Using an Extreme Gradient Boosting Model with F-Score Feature Selection Technique

Tina Elizabeth Mathew
Government College Kariavattom, Thiruvananthapuram, Kerala, India
Email: tinamathew04@gmail.com

Manuscript received July 1, 2022; revised October 21, 2022; accepted December 14, 2022; published April 26, 2023.

Abstract—Breast cancer is considered the most problematic of the cancers affecting women. With high incidence and mortality rates, it is ranked as the most significant health hazard for women globally. Early detection of the disease is key to patient survival. Several medical techniques, including mammography, magnetic resonance imaging, and thermography, are available to detect the disease. However, these techniques can cause considerable stress and pain to the patient, and some rely on harmful radiation. Other categories of techniques can therefore be used for early detection, and machine-learning-assisted detection and classification is one such alternative. This paper proposes a hyperparameter-optimized extreme gradient boosting (XGBoost) model combined with F-Score feature selection, and uses it to classify breast tumors as malignant or benign on the Wisconsin Breast Cancer dataset. Feature importance is investigated using the F-Score, which is used to select the features most relevant to the target variable; classification is then performed on the selected features. Experiments were run with different training-testing partitions, and the proposed XGBoost and F-Score model performed best on the 80-20 partition, with an accuracy of 99.27%.
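
The abstract does not state which definition of F-Score is used. As a rough illustration only, the sketch below ranks features with one common definition, the F-score of Chen and Lin for binary labels: between-class separation of the feature means divided by the sum of the within-class sample variances. This choice of definition is an assumption and may differ from the paper's (XGBoost's own importance plots, for instance, label split counts as "F score"). The function names, feature names, and toy data are hypothetical and are not taken from the Wisconsin Breast Cancer dataset.

```python
from statistics import mean, variance


def f_score(values, labels):
    """Chen-Lin style F-score of one feature for binary labels (1 or 0).

    Assumption: this is one common definition of "F-score" for feature
    selection; the paper may use a different one.
    """
    pos = [v for v, y in zip(values, labels) if y == 1]
    neg = [v for v, y in zip(values, labels) if y == 0]
    overall = mean(values)
    # Between-class term: how far each class mean lies from the overall mean.
    between = (mean(pos) - overall) ** 2 + (mean(neg) - overall) ** 2
    # Within-class term: sum of sample variances (n - 1 denominator).
    within = variance(pos) + variance(neg)
    if within == 0:
        return float("inf") if between > 0 else 0.0
    return between / within


def rank_features(rows, labels, names):
    """Return (name, score) pairs sorted from most to least discriminative."""
    columns = list(zip(*rows))
    scores = {name: f_score(col, labels) for name, col in zip(names, columns)}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: two features, six samples, three per class.
rows = [
    (1.0, 5.0),
    (2.0, 3.0),
    (3.0, 4.0),
    (7.0, 4.0),
    (8.0, 5.0),
    (9.0, 3.0),
]
labels = [0, 0, 0, 1, 1, 1]
names = ["feature_a", "feature_b"]

ranking = rank_features(rows, labels, names)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

In a pipeline like the one described, the top-ranked features would be kept and passed to the classifier. How many to keep is a tuning decision that this sketch does not address.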
 
Keywords—breast cancer, classification, extreme gradient boost, feature importance, F-score

Cite: Tina Elizabeth Mathew, "Breast Cancer Classification Using an Extreme Gradient Boosting Model with F-Score Feature Selection Technique," Journal of Advances in Information Technology, Vol. 14, No. 2, pp. 363-372, 2023.

Copyright © 2023 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.