Abstract
Detecting drones at night with complex backgrounds poses a considerable challenge. To address this issue, we propose an enhanced variant of the YOLOv5 model, termed as the Tiny Feature and Bidirectional Feature Pyramid Network (TF-BiFPN). This method incorporates Efficient Residual Bottleneck (ERB) and Efficient Multi-Receptive Pooling (EMRP) layers along with Convolutional Block Attention Module (CBAM). Utilizing ERB, the network achieves enhanced feature extraction through residual connections, whereas the EMRP layers incorporates multiple receptive fields, enabling the model to better understand and process varied spatial hierarchies within the data. Bi-FPN is added within the model's head to enhance feature representation and capture multiple features at various scales. To optimize model efficiency, cross-convolution replaces simple convolution, leading to a notable reduction in parameters. Furthermore, auto-anchor and auto-batch mechanisms are introduced to ensure efficient GPU utilization. Experimental evaluations conducted on a custom multi-class dataset illustrate a significant enhancement over baseline and state-of-the-art algorithms.
Original language | English |
---|---|
Journal | IEEE Transactions on Aerospace and Electronic Systems |
DOIs | |
State | Accepted/In press - 2024 |
Bibliographical note
Publisher Copyright:© 1965-2011 IEEE.
Keywords
- Attention modules
- Drone detection
- Feature extraction
- Infrared
- YOLOv5
ASJC Scopus subject areas
- Aerospace Engineering
- Electrical and Electronic Engineering