A Deep Reinforcement Learning Motion Control Strategy of a Multi-rotor UAV for Payload Transportation with Minimum Swing

Fotis Panetsos, George C. Karras, Kostas J. Kyriakopoulos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper addresses the problem of controlling a multirotor UAV with a cable-suspended load. In order to ensure the safe transportation of the load, the swinging motion, induced by the strongly coupled dynamics, has to be minimized. Specifically, using the Twin Delayed Deep Deterministic Policy Gradient (TD3) Reinforcement Learning algorithm, a policy Neural Network is trained in a model-free manner which navigates the vehicle to the desired waypoints while, simultaneously, compensating for the load oscillations. The learned policy network is incorporated into the cascaded control architecture of the autopilot by replacing the common PID position controller and, thus, communicating directly with the inner attitude one. The performance of the proposed policy is demonstrated through a comparative simulation and experimental study while using an octorotor UAV.

Original languageEnglish (US)
Title of host publication2022 30th Mediterranean Conference on Control and Automation, MED 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages368-374
Number of pages7
ISBN (Electronic)9781665406734
DOIs
StatePublished - 2022
Event30th Mediterranean Conference on Control and Automation, MED 2022 - Athens, Greece
Duration: Jun 28 2022Jul 1 2022

Publication series

Name2022 30th Mediterranean Conference on Control and Automation, MED 2022

Conference

Conference30th Mediterranean Conference on Control and Automation, MED 2022
Country/TerritoryGreece
CityAthens
Period6/28/227/1/22

ASJC Scopus subject areas

  • Artificial Intelligence
  • Control and Optimization

Fingerprint

Dive into the research topics of 'A Deep Reinforcement Learning Motion Control Strategy of a Multi-rotor UAV for Payload Transportation with Minimum Swing'. Together they form a unique fingerprint.

Cite this