Home > Published Issues > 2026 > Volume 17, No. 1, 2026 >
JAIT 2026 Vol.17(1): 153-170
doi: 10.12720/jait.17.1.153-170

Novel Human Object Detection Method Based on YOLO Deep Learning Optimization Strategy

Ting Zhang 1, Shuqi Niu 2, Lu Chen 2, Chenhui Dou 2, Tao Liu 2,*, and Degan Zhang 2,*
1. School of Sports Economics and Management, Tianjin University of Sport, Tianjin 301617, China
2. Tianjin Key Lab of Intelligent Computing and Novel software Technology, Tianjin University of Technology, Tianjin 300384, China
Email: 2285246377@qq.com (T.Z.); 2840082717@qq.com (S.N.); 1287725598@qq.com (L.C.); 3209134406@qq.com (C.D.); 44128592@qq.com (T.L.); 2310674826@126.com (D.Z.)
*Corresponding author

Manuscript received August 20, 2025; revised October 16, 2025; accepted October 27, 2025; published January 20, 2026.

Abstract—Along with the development of volleyball match video analysis, this paper introduces a novel human target detection method based on an optimized You Only Look Once (YOLO) deep learning strategy. Firstly, shot segmentation (identifying scene boundaries) is performed on the volleyball videos to extract key frames. Then, using semantic annotation techniques (classifying segments to filter non-game content), the videos are described as sequences of shots composed of long shots, medium shots, close-ups, replays, and off-court shots. Secondly, to address the issue of slow speed in target detection algorithms, the backbone network of YOLOv8s is optimized by implementing lightweighting through the GhostNet network and enhancing semantic information with the Convolutional Block Attention Module (CBAM) module to improve model accuracy. Experimental results on the PascalVisual Object Classes (PASCAL VOC), Common Objects in Context (COCO), and the Volleyball datasets, as well as real volleyball match videos, demonstrate that the proposed algorithm achieves a 1.5% higher mAP@50 with 64.2% fewer computational load Giga Floating- point Operations Per Second (GFLOPs) compared to the baseline YOLOv8s, achieving an optimal balance between accuracy and efficiency, making it suitable for real-time tactical analysis and automated player performance statistics in coaching and broadcasting.
 
Keywords—semantic annotation, object detection, deep learning, lightweight network, volleyball video

Cite: Ting Zhang, Shuqi Niu, Lu Chen, Chenhui Dou, Tao Liu, and Degan Zhang, "Novel Human Object Detection Method Based on YOLO Deep Learning Optimization Strategy," Journal of Advances in Information Technology, Vol. 17, No. 1, pp. 153-170, 2026. doi: 10.12720/jait.17.1.153-170

Copyright © 2026 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Article Metrics in Dimensions