Home > Published Issues > 2026 > Volume 17, No. 2, 2026 >
JAIT 2026 Vol.17(2): 222-238
doi: 10.12720/jait.17.2.222-238

Improved Minimum Sum-of-Squares Clustering (MSSC) Algorithm for Educational Big Data Recommendation Systems

Wang Yun 1,*, Bekarystankyzy Akbayan 2,*, and Kassenkhan Aray 1
1. Institute of Automation and Information Technologies, Satbayev University, Almaty, Kazakhstan
2. School of Digital Technologies, Narxoz University, Almaty, Kazakhstan
Email: Wang.Yun@stud.satbayev.university (W.Y.); Akbayan.b@gmail.com (B.A.); a.kassenkhan@satbayev.university (K.A.)
*Corresponding author

Manuscript received May 29, 2025; revised July 22, 2025; accepted August 12, 2025; published February 5, 2026.

Abstract—This paper addresses the clustering challenges within big data for educational recommendation systems. Traditional Minimum Sum-of-Squares Clustering (MSSC) algorithms face critical limitations when applied to this domain, creating three distinct research gaps that this study aims to resolve: a persistent efficiency gap that hinders real-time application, a crucial interpretability gap that impedes pedagogical trust and adoption, and a significant stability gap caused by the dynamic and noisy nature of learner data. To bridge these gaps, we propose an improved MSSC framework integrating a distributed computing architecture, an Locality-Sensitive Hashing (LSH)-based dimensionality reduction stage, a hybrid Fuzzy C-Means (FCM)-based initialization strategy, and an adaptive feature weighting mechanism. Experimental results demonstrate significant, quantifiable improvements. The optimized algorithm reduces runtime by 37.2% and memory usage by 42.3%. It achieves a clustering accuracy of 92.3% and improves the Silhouette Coefficient by 15.8%. Furthermore, the algorithm demonstrates strong robustness against data loss and noise. These findings confirm that our proposed algorithm provides a more efficient, interpretable, and stable solution, offering reliable technical support for precise, personalized learning recommendations.
 
Keywords—educational recommendation system, minimum sum of squares clustering, big data processing, clustering algorithm, personalized learning

Cite: Wang Yun, Bekarystankyzy Akbayan, and Kassenkhan Aray, "Improved Minimum Sum-of-Squares Clustering (MSSC) Algorithm for Educational Big Data Recommendation Systems," Journal of Advances in Information Technology, Vol. 17, No. 2, pp. 222-238, 2026. doi: 10.12720/jait.17.2.222-238

Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions