Home > Published Issues > 2026 > Volume 17, No. 1, 2026 >
JAIT 2026 Vol.17(1): 55-64
doi: 10.12720/jait.17.1.55-64

PolyVision: Optimising Retinal Disease Detection Through Collaborative Neural Networks

Sultan Ahmad 1,*, Swathi Kalam 2, Eali Stephen Neal Joshua 3, Hikmat A. M. Abdeljaber 4, and Hessa Alfraihi 5
1. Department of Computer Science, College of Computer Engineering and Sciences,
Prince Sattam Bin Abdulaziz University, Alkharj, Saudi Arabia 2. Department of Computer Science and Engineering, Vignan’s Institute of Information Technology, Visakhapatnam, India
3. Department of Computer Science and Engineering, GST, GITAM (Deemed to be University), India
4. Department of Computer Science, Faculty of Information Technology, Applied Science Private University, Amman, Jordan
5. Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
Email: s.alisher@psau.edu.sa (S.A.); swathi.kalam@gmail.com (S. K.); seali@gitam.edu (E.S.N.J.); h_abdeljaber@asu.edu.jo (H.A.M.A.); haalfraihi@pnu.edu.sa (H.A.)
*Corresponding author

Manuscript received July 3, 2025; revised September 13, 2025; accepted September 15, 2025; published January 14, 2026.

Abstract—Diabetic Retinopathy (DR) remains one of the leading causes of preventable blindness worldwide, and early, reliable diagnosis is essential for reducing vision loss. Deep learning has shown promise in this domain, but single models often suffer from limited generalizability, sensitivity–specificity imbalance, and high computational demand. To address these challenges, we present PolyVision, a modular ensemble framework designed for robust and equitable DR screening. PolyVision integrates three complementary backbones—ResNet50, EfficientNet-B2, and Vision Transformer—each capturing different levels of spatial and contextual retinal features. Their predictions are combined through a dual fusion mechanism based on mean and maximum voting, which balances diagnostic sensitivity and specificity while minimizing variance across models. To further enhance robustness, the models are trained with diverse augmentation strategies, and hyperparameters are tuned for optimal performance. Evaluated on ultra-widefield fundus images, PolyVision achieved an AUC-ROC of 0.953, an AUPRC of 0.975, and an inference latency of 110 ms per image, demonstrating both high diagnostic accuracy and clinical efficiency. Beyond accuracy, the framework incorporates fairness evaluation across imaging subgroups, supporting more equitable diagnostic outcomes. Its lightweight design also facilitates deployment in resource-constrained clinical settings without compromising reliability. These results highlight the potential of ensemble learning to provide scalable, accurate, and fair DR screening. However, additional validation on multi-institutional datasets and real-world clinical environments remains necessary before broad clinical adoption.
 
Keywords—diabetic retinopathy, convolutional neural networks, vision transformer, deep learning, model fusion, fairness in AI, medical image analysis

Cite: Sultan Ahmad, Swathi Kalam, Eali Stephen Neal Joshua, Hikmat A. M. Abdeljaber, and Hessa Alfraihi, "PolyVision: Optimising Retinal Disease Detection Through Collaborative Neural Networks," Journal of Advances in Information Technology, Vol. 17, No. 1, pp. 55-64, 2026. doi: 10.12720/jait.17.1.55-64

Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions