TY - GEN
T1 - Zero-Shot Autonomous Vehicle Policy Transfer
T2 - 16th IEEE International Conference on Control and Automation, ICCA 2020
AU - Chalaki, Behdad
AU - Beaver, Logan E.
AU - Remer, Ben
AU - Jang, Kathy
AU - Vinitsky, Eugene
AU - Bayen, Alexandre M.
AU - Malikopoulos, Andreas A.
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/10/9
Y1 - 2020/10/9
N2 - In this article, we demonstrate a zero-shot transfer of an autonomous driving policy from simulation to University of Delaware's scaled smart city with adversarial multi-agent reinforcement learning, in which an adversary attempts to decrease the net reward by perturbing both the inputs and outputs of the autonomous vehicles during training. We train the autonomous vehicles to coordinate with each other while crossing a roundabout in the presence of an adversary in simulation. The adversarial policy successfully reproduces the simulated behavior and incidentally outperforms, in terms of travel time, both a human-driving baseline and adversary-free trained policies. Finally, we demonstrate that the addition of adversarial training considerably improves the performance of the policies after transfer to the real world compared to Gaussian noise injection.
AB - In this article, we demonstrate a zero-shot transfer of an autonomous driving policy from simulation to University of Delaware's scaled smart city with adversarial multi-agent reinforcement learning, in which an adversary attempts to decrease the net reward by perturbing both the inputs and outputs of the autonomous vehicles during training. We train the autonomous vehicles to coordinate with each other while crossing a roundabout in the presence of an adversary in simulation. The adversarial policy successfully reproduces the simulated behavior and incidentally outperforms, in terms of travel time, both a human-driving baseline and adversary-free trained policies. Finally, we demonstrate that the addition of adversarial training considerably improves the performance of the policies after transfer to the real world compared to Gaussian noise injection.
UR - http://www.scopus.com/inward/record.url?scp=85098054386&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85098054386&partnerID=8YFLogxK
U2 - 10.1109/ICCA51439.2020.9264552
DO - 10.1109/ICCA51439.2020.9264552
M3 - Conference contribution
AN - SCOPUS:85098054386
T3 - IEEE International Conference on Control and Automation, ICCA
SP - 35
EP - 40
BT - 2020 IEEE 16th International Conference on Control and Automation, ICCA 2020
PB - IEEE Computer Society
Y2 - 9 October 2020 through 11 October 2020
ER -