Reinforcement learning for end-to-end UAV slung-load navigation and obstacle avoidance

Mohammed Basheer Mohiuddin*, Igor Boiko, Vu Phi Tran*, Matthew Garratt, Ayman Abdallah, Yahya Zweiri

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

This study introduces an end-to-end Reinforcement Learning (RL) approach for controlling Unmanned Aerial Vehicles (UAVs) with slung loads, addressing both navigation and obstacle avoidance in real-world environments. Unlike traditional methods that rely on separate flight controllers, path planners, and obstacle avoidance systems, our unified RL strategy seamlessly integrates these components, reducing both computational and design complexities while maintaining synchronous operation and optimal goal-tracking performance without the need for pre-training in various scenarios. Additionally, the study explores a reduced observation space model, referred to as CompactRL-8, which utilizes only eight observations and excludes noisy load swing rate measurements. This approach differs from most full-state observation RL methods, which typically include these rates. CompactRL-8 outperforms the full ten-observation model, demonstrating a 58.79% increase in speed and a ten-fold improvement in obstacle clearance. Our method also surpasses the state-of-the-art adaptive control methods, showing an 8% enhancement in path efficiency and a four-fold increase in load swing stability. Utilizing a detailed system model, we achieve successful Sim2Real transfer without time-consuming re-tuning, confirming the method’s practical applicability. This research advances RL-based UAV slung-load system control, fostering the development of more efficient and reliable autonomous aerial systems for applications like urban load transport. A video demonstration of the experiments can be found at https://youtu.be/GtGHhOCmy3M.

Original languageEnglish
Article number34621
JournalScientific Reports
Volume15
Issue number1
DOIs
StatePublished - Dec 2025

Bibliographical note

Publisher Copyright:
© The Author(s) 2025.

ASJC Scopus subject areas

  • General

Fingerprint

Dive into the research topics of 'Reinforcement learning for end-to-end UAV slung-load navigation and obstacle avoidance'. Together they form a unique fingerprint.

Cite this