TF-BiFPN Improves YOLOv5: Enhancing Small-Scale Multi-Class Drone Detection in Dark

Maham Misbah, Farooq Alam Orakazi, Laiba Tanveer, Zeeshan Kaleem, Chau Yuen*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Detecting drones at night with complex backgrounds poses a considerable challenge. To address this issue, we propose an enhanced variant of the YOLOv5 model, termed as the Tiny Feature and Bidirectional Feature Pyramid Network (TF-BiFPN). This method incorporates Efficient Residual Bottleneck (ERB) and Efficient Multi-Receptive Pooling (EMRP) layers along with Convolutional Block Attention Module (CBAM). Utilizing ERB, the network achieves enhanced feature extraction through residual connections, whereas the EMRP layers incorporates multiple receptive fields, enabling the model to better understand and process varied spatial hierarchies within the data. Bi-FPN is added within the model's head to enhance feature representation and capture multiple features at various scales. To optimize model efficiency, cross-convolution replaces simple convolution, leading to a notable reduction in parameters. Furthermore, auto-anchor and auto-batch mechanisms are introduced to ensure efficient GPU utilization. Experimental evaluations conducted on a custom multi-class dataset illustrate a significant enhancement over baseline and state-of-the-art algorithms.

Original languageEnglish
JournalIEEE Transactions on Aerospace and Electronic Systems
DOIs
StateAccepted/In press - 2024

Bibliographical note

Publisher Copyright:
© 1965-2011 IEEE.

Keywords

  • Attention modules
  • Drone detection
  • Feature extraction
  • Infrared
  • YOLOv5

ASJC Scopus subject areas

  • Aerospace Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'TF-BiFPN Improves YOLOv5: Enhancing Small-Scale Multi-Class Drone Detection in Dark'. Together they form a unique fingerprint.

Cite this