UAV Visual Path Planning Using Large Language Models

Research output: Contribution to journalConference articlepeer-review

3 Scopus citations

Abstract

Unmanned Aerial Vehicles (UAVs) heavily rely on Global Positioning Systems (GPS) for navigation, limiting their functionality in indoor GPS-denied environments. This paper investigates the application of Large Language Models (LLMs) for visual path planning in such scenarios. This work proposed a new LLM-based approach for understanding visual data captured by the UAV's camera. By analyzing this data in terms of the positions of the detected persons and depth information, the fine-tuned LLM would generate safe and efficient flight paths. To validate the proposed approach, we have created an indoor virtual navigation environment for the entrance of our center (JRC-AI) with 3 standing persons and 2 randomly moving. Guided by LLMs, the mission of UAVs is to reach the target goals that result in the minimum collisions. The reported results clearly showed that the proposed LLMs achieved better results than the standard deep reinforcement learning DQN model in both the average number of collisions as well as the traveled distance toward the goal point.

Original languageEnglish
Pages (from-to)339-345
Number of pages7
JournalTransportation Research Procedia
Volume84
DOIs
StatePublished - 2025
Event1st Internation Conference on Smart Mobility and Logistics Ecosystems, SMiLE 2024 - Dhahran, Saudi Arabia
Duration: 17 Sep 202419 Sep 2024

Bibliographical note

Publisher Copyright:
© 2024 The Authors. Published by ELSEVIER B.V.

Keywords

  • GPS-denied environments
  • LLMs
  • Path Planning
  • UAV
  • Visual navigation

ASJC Scopus subject areas

  • Transportation

Fingerprint

Dive into the research topics of 'UAV Visual Path Planning Using Large Language Models'. Together they form a unique fingerprint.

Cite this