JAIT 2023 Vol.14(6): 1410-1424
doi: 10.12720/jait.14.6.1410-1424

Enhancing Prediction Accuracy in Gastric Cancer Using High-Confidence Machine Learning Models for Class Imbalance

Danish Jamil 1,2,*, Sellappan Palaniappan 1, Muhammad Naseem 2, and Asiah Lokman 1
1. Department of Information Technology, Malaysia University of Science and Technology, Kuala Lumpur, Malaysia; Email: sell@must.edu.my (S.P.), asiah@must.edu.my (A.L.)
2. Department of Software Engineering, Sir Syed University of Engineering and Technology, Karachi, Pakistan
*Correspondence: danish.jamil@phd.must.edu.my, djamil@ssuet.edu.pk (D.J.)

Manuscript received May 23, 2023; revised June 25, 2023; accepted July 7, 2023; published December 19, 2023.

Abstract—Gastric Cancer (GC) diagnosis and prognosis present significant challenges in clinical practice. To address the low prediction accuracy that results from imbalanced positive and negative GC cases, this study proposes a medical Decision Support System (DSS) based on supervised Machine Learning (ML) methods. Four ML models, including Naïve Bayes (NB), Logistic Regression (LR), and Multilayer Perceptron (MLP), were employed. The impact of data imbalance on GC prediction was assessed through two procedures. Among the ML models, MLP demonstrated the best performance in weighted GC prediction, achieving a sensitivity of 0.930 and a Positive Predictive Value (PPV) of 0.932 for balanced predictions, and a sensitivity of 0.918 and a PPV of 0.908 for unbalanced predictions. The NB model showed promise in handling the data imbalance issue, achieving a sensitivity of 0.722 and a PPV of 0.420 on the unbalanced dataset. Additionally, a DSS was developed specifically for the NB and LR models to improve prediction accuracy. The proposed method significantly improved the sensitivity of positive GC case prediction, with the NB model achieving a sensitivity of 0.936 and the LR model achieving a sensitivity of 0.8306. These improvements enhance the reliability and efficiency of GC diagnostics, offering valuable decision support in healthcare. This research provides insights into addressing class imbalance in GC likelihood prediction and has potential implications for clinical practice.
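
The two metrics reported in the abstract, sensitivity and PPV, can be illustrated with a minimal Python sketch. This is not the authors' code: the function name `sensitivity_and_ppv` and the toy labels (10 positive and 90 negative cases) are hypothetical and chosen only to show how a classifier on imbalanced data can reach a reasonable sensitivity while its PPV stays low.

```python
def sensitivity_and_ppv(y_true, y_pred):
    """Return (sensitivity, PPV) for binary labels, where 1 marks a positive case."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    # Sensitivity = TP / (TP + FN); PPV = TP / (TP + FP).
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    return sensitivity, ppv


# Hypothetical imbalanced sample: 10 positives followed by 90 negatives.
y_true = [1] * 10 + [0] * 90
# Hypothetical predictions: 8 positives found, 2 missed, 12 false alarms.
y_pred = [1] * 8 + [0] * 2 + [1] * 12 + [0] * 78

sens, ppv = sensitivity_and_ppv(y_true, y_pred)
print(f"sensitivity={sens:.3f}, PPV={ppv:.3f}")
```

In this toy sample, the 12 false alarms come from only 90 negatives, yet they outnumber the 8 true positives, so PPV falls well below sensitivity. This is the kind of gap that class imbalance tends to produce.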
 
Keywords—class imbalance, gastric cancer, decision support system, machine learning, prediction accuracy, naive bayes, logistic regression, medical diagnostics, positive predictive value

Cite: Danish Jamil, Sellappan Palaniappan, Muhammad Naseem, and Asiah Lokman, "Enhancing Prediction Accuracy in Gastric Cancer Using High-Confidence Machine Learning Models for Class Imbalance," Journal of Advances in Information Technology, Vol. 14, No. 6, pp. 1410-1424, 2023.

Copyright © 2023 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.