Ship detection and tracking based on an improved YOLO11n and ByteTrack framework
DOI:
https://doi.org/10.54939/1859-1043.j.mst.CSCE9.2025.111-122Keywords:
Ship detection; Ship tracking; YOLO11n; SEAM; CCFM; ByteTrackAbstract
Ship detection and tracking are critical components of intelligent maritime and coastal surveillance systems. This study introduces a robust method for multi-class vessel detection and real-time tracking. The approach improves the YOLO11n model by incorporating the Separated and Enhancement Attention Module (SEAM) and the Cross-scale Channel Fusion Module (CCFM) into the neck, which enhances multi-scale feature aggregation and edge attention. Experimental results demonstrate an increase in mAP50 from 94.26% to 94.84% and in mAP50–95 from 69.59% to 70.40% on a custom ship dataset. The model size increases only slightly, while real-time inference speed remains at 25 FPS on the Jetson Xavier NX. A custom dataset comprising over 7,600 manually labeled images was developed, covering six vessel categories: cargo ship, passenger ship, military ship, sailboat, fishing boat, and patrol ship. Images were sourced from various public repositories to ensure diversity in vessel size, viewing angle, background, and lighting conditions. The complete system integrates the enhanced YOLO11n with the ByteTrack algorithm for real-time vessel detection and tracking. Experimental results confirm the system's feasibility for practical maritime surveillance applications.
References
[1]. Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.; Berg, A., “SSD: Single Shot MultiBox Detector”, In Proceedings of the European Conference on Computer Vision (ECCV), pp. 21–37, (2016).
[2]. H. V. Anh, “Tên sách”, NXB Khoa học Kỹ thuật, tr. 11–23, (2005).
[3]. Girshick, R.; Donahue, J.; Darrell, T.; Malik, J., “Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation”, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580–587, (2014).
[4]. Ren, S.; He, K.; Girshick, R.; Sun, J., “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 6, pp. 1137–1149, (2017).
[5]. He, K.; Gkioxari, G.; Dollár, P.; Girshick, R., “Mask R-CNN”, In Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 2980–2988, (2017).
[6]. Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J., “YOLOX: Exceeding YOLO Series in 2021”, arXiv preprint arXiv:2107.08430, (2021).
[7]. Redmon, J.; Farhadi, A., “YOLO9000: Better, Faster, Stronger”, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6517–6525, (2017).
[8]. Redmon, J.; Farhadi, A., “YOLOv3: An Incremental Improvement”, arXiv preprint arXiv:1804.02767, (2018).
[9]. Bochkovskiy, A.; Wang, C.-Y.; Liao, H.-Y. M., “YOLOv4: Optimal Speed and Accuracy of Object Detection”, arXiv preprint arXiv:2004.10934, (2020).
[10]. Meituan, “YOLOv6: A High-Performance Object Detection Model”, GitHub Repository, (2022).
[11]. Wong, R.; Chien, Y., “YOLOv7: A Scalable and Efficient Object Detection Model”, GitHub Repository, (2022).
[12]. Zhang, Y.; Sun, P.; Jiang, Y.; Yu, D.; Weng, F.; Yuan, Z.; Luo, P.; Liu, W.; Wang, X., “ByteTrack: Multi-Object Tracking by Associating Every Detection Box”, In Proceedings of the European Conference on Computer Vision (ECCV), pp. 1–21, (2022).
[13]. Shao, Z.; Wang, L.; Wang, Z.; Du, W.; Wu, W., “SeaShips: A Large-Scale Precisely Annotated Dataset for Ship Detection”, IEEE Transactions on Multimedia, Vol. 20, No. 10, pp. 2593–2604, (2018).
[14]. Liu, J et al., “Vehicle Flow Detection and Tracking Based on an Improved YOLOv8n and ByteTrack Framework”, World Electric Vehicle Journal, Vol. 16, No. 1, Article 13, (2024).
[15]. Wang, J.; Zhang, Y.; Li, H.; Zhou, X., “An Efficient Ship-Detection Algorithm Based on the Improved YOLOv5”, Electronics, Vol. 12, No. 17, Article 3600, (2023).
[16]. Wang, L.; Liu, Y.; Zhang, H.; Chen, Z., “Ship Detection Algorithm Based on YOLOv5 Network Improved with Lightweight Convolution and Attention Mechanism”, Algorithms, Vol. 16, No. 12, Article 534, (2023).
[17]. Gao, Y.; Li, Z.; Wang, X.; Sun, Y., “A Lightweight Anti-Unmanned Aerial Vehicle Detection Method Based on Improved YOLOv11”, Drones, Vol. 9, No. 1, Article 11, (2024).
[18]. Huang, J.; Li, Y.; Wang, Q.; Chen, X., “LW-YOLO11: A Lightweight Arbitrary-Oriented Ship Detection Method Based on Improved YOLO11”, Sensors, Vol. 25, No. 1, Article 65, (2024).