ASSD-YOLO: a small object detection method based on improved YOLOv7 for airport surface surveillance

Zhou, Wentao; Cai, Chengtao; Zheng, Liying; Li, Chenming; Zeng, Daohui

doi:10.1007/s11042-023-17628-4

ASSD-YOLO: a small object detection method based on improved YOLOv7 for airport surface surveillance

Published: 18 November 2023

Volume 83, pages 55527–55548, (2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Wentao Zhou¹,
Chengtao Cai^1,2,3^na1,
Liying Zheng ORCID: orcid.org/0000-0002-1596-0309⁴^na1,
Chenming Li¹ &
…
Daohui Zeng¹

540 Accesses
2 Citations
Explore all metrics

Abstract

The airport surface surveillance system is essential in ensuring airport safety and maximizing the efficient utilization of airport resources. Current airport detection algorithms suffer from few and lack relevant airport target data. To solve these issues, this paper establishes two airport datasets named ASS-Dataset, including the surveillance dataset and the panoramic surveillance dataset. Compared to other aircraft datasets, our datasets are collected from authentic airport surface surveillance systems. According to observation, most objects are small in datasets. This paper proposes a small object detection method ASSD-YOLO based on improved YOLOv7. First, the designed f-efficient attention module is added to the backbone network to improve the accuracy of the algorithm. Second, the transformer encoder network is incorporated into the backbone network to increase feature extraction. Finally, the small target detection layer is added to the head network to improve the ability to extract small targets. The model of the mean average precision is 93.5\(\%\) in the surveillance dataset. In the panoramic surveillance dataset, the ASSD-YOLO achieves 10.8\(\%\) and 21.4\(\%\) higher average precision for the airplane and truck than YOLOv7. g. Comparing the method proposed in this paper to the original YOLOv7, the performance improvement for the mAP is 4.6\(\%\) when using the RSOD open dataset. ASS-Dataset is available at https://github.com/rookie257/rookie257.github.io. Code is available at https://github.com/rookie257/small_detection.github.io.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

Article 08 August 2022

YOLO-based Object Detection Models: A Review and its Applications

Article 14 March 2024

Data Availability

Data is available at https://github.com/rookie257/rookie257.github.io.

Code Availability

Code is available at https://github.com/rookie257/small_detection.github.io.

References

Wilke S, Majumdar A, Ochieng WY (2015) The impact of airport characteristics on airport surface accidents and incidents. J Safety Res 53:63–75
Article Google Scholar
Wang Y, Li MZ, Gopalakrishnan K, Liu T (2022) Timescales of delay propagation in airport networks. Transportation Research Part E: Logistics and Transportation Review 161:102687
Article Google Scholar
Morris R, Pasareanu CS, Luckow KS, Malik W, Ma H, Kumar TS, Koenig S (2016) Planning, scheduling and monitoring for airport surface operations. In: AAAI workshop: planning for hybrid systems, pp 608–614
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Redmon J, Farchadi A (2017) Yolo9000: better, faster, stronger 2017 IEEE conference on computer vision and pattern recognition (CVPR). Go to reference in article
Redmon J, Farhadi A (2018) Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767
Bochkovskiy A, Wang C-Y, Liao H-YM (2020) Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934
Li C, Li L, Jiang H, Weng K, Geng Y, Li L, Ke Z, Li Q, Cheng M, Nie W et al (2022) Yolov6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976
Wang C-Y, Bochkovskiy A, Liao H-YM (2023) Yolov7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7464–7475
Bie M, Liu Y, Li G, Hong J, Li J (2023) Real-time vehicle detection algorithm based on a lightweight you-only-look-once (yolov5n-l) approach. Expert Syst Appl 213:119108
Article Google Scholar
Shao Y, Zhang X, Chu H, Zhang X, Zhang D, Rao Y (2022) Air-yolov3: aerial infrared pedestrian detection via an improved yolov3 with network pruning. Appl Sci 12(7):3627
Article Google Scholar
Tang Y, Zhou H, Wang H, Zhang Y (2023) Fruit detection and positioning technology for a camellia oleifera c. abel orchard based on improved yolov4-tiny model and binocular stereo vision. Expert systems with applications 211:118573
Article Google Scholar
Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: common objects in context. In: Computer vision–ECCV 2014: 13th European conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer, pp 740–755
Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vision 88:303–338
Article Google Scholar
Liu M, Wang X, Zhou A, Fu X, Ma Y, Piao C (2020) Uav-yolo: small object detection on unmanned aerial vehicle perspective. Sensors 20(8):2238
Article Google Scholar
Benjumea A, Teeti I, Cuzzolin F, Bradley A (2021) Yolo-z: improving small object detection in yolov5 for autonomous vehicles. arXiv preprint arXiv:2112.11798
Wang X, Zhao Q, Jiang P, Zheng Y, Yuan L, Yuan P (2022) Lds-yolo: a lightweight small object detection method for dead trees from shelter forest. Comput Electron Agric 198:107035
Article Google Scholar
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 1440–1448
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Advances in neural information processing systems 28
He K, Gkioxari G, Dollár P, Girshick R (2017) Mask r-cnn. In: Proceedings of the IEEE international conference on computer vision, pp 2961–2969
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: single shot multibox detector. In: Computer vision–ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer, pp 21–37
Lin T-Y, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
Law H, Deng J (2018) Cornernet: detecting objects as paired keypoints. In: Proceedings of the European conference on computer vision (ECCV), pp 734–750
Zhang J, Huo Y-B, Yang J-L, Wang X-Z, Yan B-Y, Du X-H, Hao R-Q, Yang F, Liu J-X, Liu L et al (2022) Automatic counting of retinal ganglion cells in the entire mouse retina based on improved yolov5. Zool Res 43(5):738
Article Google Scholar
Kim M, Jeong J, Kim S (2021) Ecap-yolo: efficient channel attention pyramid yolo for small object detection in aerial image. Remote Sensing 13(23):4851
Article Google Scholar
Junos MH, Mohd Khairuddin AS, Thannirmalai S, Dahari M (2021) Automatic detection of oil palm fruits from uav images using an improved yolo model. The visual computer, 1–15
Lim J-S, Astrid M, Yoon H-J, Lee S-I (2021) Small object detection using context and attention. In: 2021 International conference on artificial intelligence in information and communication (ICAIIC). IEEE, pp 181–186
Cheng G, Yuan X, Yao X, Yan K, Zeng Q, Xie X, Han J (2023) Towards large-scale small object detection: survey and benchmarks. IEEE Transactions on Pattern Analysis and Machine Intelligence
Mahaur B, Mishra K (2023) Small-object detection based on yolov5 in autonomous driving systems. Pattern Recogn Lett 168:115–122
Article Google Scholar
Yuan X, Cheng G, Yan K, Zeng Q, Han J (2023) Small object detection via coarse-to-fine proposal generation and imitation learning. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 6317–6327
Chen C, Liu M-Y, Tuzel O, Xiao J (2017) R-cnn for small object detection. In: Computer vision–ACCV 2016: 13th Asian conference on computer vision, Taipei, Taiwan, November 20-24, 2016, Revised selected papers, Part V 13. Springer, pp 214–230
Kumar A (2023) Seat-yolo: a squeeze-excite and spatial attentive you only look once architecture for shadow detection. Optik, 170513
Kumar A, Kalia A, Verma K, Sharma A, Kaushal M (2021) Scaling up face masks detection with yolo on a novel dataset. Optik 239:166744
Article Google Scholar
Li C, Cai C (2023) A calibration and real-time object matching method for heterogeneous multi-camera system. IEEE Trans Instrum Meas 72:1–12
Google Scholar
Woo S, Park J, Lee J-Y, Kweon IS (2018) Cbam: convolutional block attention module. In: Proceedings of the European conference on computer vision (ECCV), pp 3–19
Wang Q, Wu B, Zhu P, Li P, Zuo W, Hu Q (2020) Eca-net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11534–11542
Qiu S, Xu X, Cai B (2018) Frelu: flexible rectified linear units for improving convolutional neural networks. In: 2018 24th International conference on pattern recognition (icpr). IEEE, pp 1223–1228
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Advances in neural information processing systems 30
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929
Liu Q, Zhang Y, Yang G (2023) Small unopened cotton boll counting by detection with mrf-yolo in the wild. Comput Electron Agric 204:107576
Article Google Scholar
Chen Z, Liu C, Filaretov V, Yukhimets D (2023) Multi-scale ship detection algorithm based on yolov7 for complex scene sar images. Remote Sensing 15(8):2071
Article Google Scholar
Long Y, Gong Y, Xiao Z, Liu Q (2017) Accurate object localization in remote sensing images based on convolutional neural networks. IEEE Trans Geosci Remote Sens 55(5):2486–2498
Article Google Scholar

Download references

Acknowledgements

This work is supported by the National Key R &D Program of China(No.2021YFF0603904) and the Key Projects of Heilongjiang Provincial Natural Science Foundation(No.ZD2022F001).

Author information

Chengtao Cai and Liying Zheng contributed equally to this work.

Authors and Affiliations

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China
Wentao Zhou, Chengtao Cai, Chenming Li & Daohui Zeng
Heilongjiang Provincial Key Laboratory of Environment Intelligent Perception, Harbin Engineering University, Harbin, 150001, China
Chengtao Cai
Key laboratory of Intelligent Technology and Application of Marine Equipment (Harbin Engineering University), Harbin Engineering University, Harbin, 150001, China
Chengtao Cai
College of Computer Science and Engineering, Harbin Engineering University, Harbin, 150001, China
Liying Zheng

Authors

Wentao Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Chengtao Cai
View author publications
You can also search for this author in PubMed Google Scholar
Liying Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Chenming Li
View author publications
You can also search for this author in PubMed Google Scholar
Daohui Zeng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, Wentao Zhou, Chengtao Cai and Liying Zheng; data curation, Liying Zheng; formal analysis, Wentao Zhou and Chengtao Cai; investigation, Wentao Zhou and Liying Zheng; methodology, Wentao Zhou and Chengtao Cai; project administration, Liying Zheng and Chenming Li; writings original draft, Wentao Zhou and Daohui Zeng; writing, reviewing and editing, Chenming Li. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Chengtao Cai or Liying Zheng.

Ethics declarations

Competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix A: Section title of first appendix

An appendix contains supplementary information that is not an essential part of the text itself but which may be helpful in providing a more comprehensive understanding of the research problem or it is information that is too cumbersome to be included in the body of the paper.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Zhou, W., Cai, C., Zheng, L. et al. ASSD-YOLO: a small object detection method based on improved YOLOv7 for airport surface surveillance. Multimed Tools Appl 83, 55527–55548 (2024). https://doi.org/10.1007/s11042-023-17628-4

Download citation

Received: 08 August 2023
Revised: 16 October 2023
Accepted: 25 October 2023
Published: 18 November 2023
Issue Date: May 2024
DOI: https://doi.org/10.1007/s11042-023-17628-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

ASSD-YOLO: a small object detection method based on improved YOLOv7 for airport surface surveillance

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

YOLO-based Object Detection Models: A Review and its Applications

Data Availability

Code Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interest

Additional information

Publisher's Note

Appendix A: Section title of first appendix

Rights and permissions

About this article

Cite this article

Keywords

Navigation

ASSD-YOLO: a small object detection method based on improved YOLOv7 for airport surface surveillance

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

YOLO-based Object Detection Models: A Review and its Applications

Data Availability

Code Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interest

Additional information

Publisher's Note

Appendix A: Section title of first appendix

Appendix A: Section title of first appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation