Home > Published Issues > 2025 > Volume 16, No. 8, 2025 >
JAIT 2025 Vol.16(8): 1072-1082
doi: 10.12720/jait.16.8.1072-1082

A Lightweight Real-Time CCTV Surveillance Framework for the Education Sector Using Machine Learning

Hassan Ali 1,*, Abid Mehmood 2,*, Naeem Ahmed 3, Muhammad Saeed 3, and Ahmad Ijaz 4
1. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China
2. The Beacon College of Computer & Cyber Sciences, Dakota State University Madison, SD 57042, USA
3. School of Software, Nanjing University of Information Science and Technology, Nanjing, China
4. School of Computer Science, Nanjing University of Information Science and Technology, Nanjing, China
Email: ali.hassan@nuaa.edu.cn (H.A.); abid.mehmood@dsu.edu (A.M.); Naeem.uoh@gmail.com (N.A.); Saeed.uet17@gmail.com (M.S.); officialahmed000@gmail.com (A.I.)
*Corresponding author

Manuscript received February 12, 2025 ; revised April 8, 2025; accepted April 15, 2025; published August 8, 2025.

Abstract—Most Educational institutions still rely on Traditional Closed-Circuit Television (CCTV) systems with manual monitoring, which limits their effectiveness in real-time surveillance. Furthermore, deploying advanced surveillance technologies in environments such as universities, colleges, and schools is challenging due to high computational requirements, cost inefficiency, and scalability constraints. To address these limitations, this study proposes a lightweight surveillance framework that uses FaceNet for face recognition, Multi-task Cascaded Convolutional Neural Networks (MTCNN) for face localization during training, and Haar Cascade for real-time face localization during deployment. The system processes CCTV footage by converting videos into frames, detecting faces using the Haar Cascade algorithm, and generating facial embeddings with FaceNet. These embeddings are then classified using Machine Learning (ML) and Deep Learning classifiers (DL) to identify individuals as either authorized or unauthorized. Evaluations conducted on a custom dataset of 20 students around 2000 images created explicitly for university scenarios demonstrated a recognition accuracy of 97.1% using Support Vector Machine (SVM), with the lowest inference time, and included a user-friendly Graphical User Interface (GUI) developed with Flask for real-time monitoring. This approach offers a scalable, computationally efficient, cost-effective, and practical solution for enhancing campus security by effectively identifying unauthorized individuals.
 
Keywords—deep learning, Closed-Circuit Television (CCTV) surveillance, education sector security, university campus security, FaceNet, lightweight security system

Cite: Hassan Ali, Abid Mehmood, Naeem Ahmed, Muhammad Saeed, and Ahmad Ijaz, "A Lightweight Real-Time CCTV Surveillance Framework for the Education Sector Using Machine Learning," Journal of Advances in Information Technology, Vol. 16, No. 8, pp. 1072-1082, 2025. doi: 10.12720/jait.16.8.1072-1082

Copyright © 2025 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions