TY - GEN
T1 - Exploring Deep Reinforcement Learning for Robust Target Tracking Using Micro Aerial Vehicles
AU - Dionigi, Alberto
AU - Leomanni, Mirko
AU - Saviolo, Alessandro
AU - Loianno, Giuseppe
AU - Costante, Gabriele
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The capability to autonomously track a non-cooperative target is a key technological requirement for micro aerial vehicles. In this paper, we propose an output feedback control scheme based on deep reinforcement learning for controlling a micro aerial vehicle to persistently track a flying target while maintaining visual contact. The proposed method leverages relative position data for control, relaxing the assumption of having access to full state information which is typical of related approaches in literature. Moreover, we exploit classical robustness indicators in the learning process through domain randomization to increase the robustness of the learned policy. Experimental results validate the proposed approach for target tracking, demonstrating high performance and robustness with respect to mass mismatches and control delays. The resulting nonlinear controller significantly outperforms a standard model-based design in numerous off-nominal scenarios.
AB - The capability to autonomously track a non-cooperative target is a key technological requirement for micro aerial vehicles. In this paper, we propose an output feedback control scheme based on deep reinforcement learning for controlling a micro aerial vehicle to persistently track a flying target while maintaining visual contact. The proposed method leverages relative position data for control, relaxing the assumption of having access to full state information which is typical of related approaches in literature. Moreover, we exploit classical robustness indicators in the learning process through domain randomization to increase the robustness of the learned policy. Experimental results validate the proposed approach for target tracking, demonstrating high performance and robustness with respect to mass mismatches and control delays. The resulting nonlinear controller significantly outperforms a standard model-based design in numerous off-nominal scenarios.
UR - http://www.scopus.com/inward/record.url?scp=85185832551&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85185832551&partnerID=8YFLogxK
U2 - 10.1109/ICAR58858.2023.10407017
DO - 10.1109/ICAR58858.2023.10407017
M3 - Conference contribution
AN - SCOPUS:85185832551
T3 - 2023 21st International Conference on Advanced Robotics, ICAR 2023
SP - 506
EP - 513
BT - 2023 21st International Conference on Advanced Robotics, ICAR 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 21st International Conference on Advanced Robotics, ICAR 2023
Y2 - 5 December 2023 through 8 December 2023
ER -