TY - GEN
T1 - Lagrangian Control through Deep-RL
T2 - 21st IEEE International Conference on Intelligent Transportation Systems, ITSC 2018
AU - Vinitsky, Eugene
AU - Parvate, Kanaad
AU - Kreidieh, Aboudy
AU - Wu, Cathy
AU - Bayen, Alexandre
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/12/7
Y1 - 2018/12/7
N2 - Using deep reinforcement learning, we derive novel control policies for autonomous vehicles to improve the throughput of a bottleneck modeled after the San Francisco-Oakland Bay Bridge. Using Flow, a new library for applying deep reinforcement learning to traffic micro-simulators, we consider the problem of improving the throughput of a traffic benchmark: a two-stage bottleneck where four lanes reduce to two and then reduce to one. We first characterize the inflow-outflow curve of this bottleneck without any control. We introduce an inflow of autonomous vehicles with the intent of improving the congestion through Lagrangian control. To handle the varying number of autonomous vehicles in the system we derive a per-lane variable speed limits parametrization of the controller. We demonstrate that a 10% penetration rate of controlled autonomous vehicles can improve the throughput of the bottleneck by 200 vehicles per hour: a 25% improvement at high inflows. Finally, we compare the performance of our control policies to feedback ramp metering and show that the AV controller provides comparable performance to ramp metering without the need to build new ramp metering infrastructure. Illustrative videos of the results can be found at https://sites.google.com/view/itsc-lagrangian-avs/home and code and tutorials can be found at https://github.com/flow-project/flow.
AB - Using deep reinforcement learning, we derive novel control policies for autonomous vehicles to improve the throughput of a bottleneck modeled after the San Francisco-Oakland Bay Bridge. Using Flow, a new library for applying deep reinforcement learning to traffic micro-simulators, we consider the problem of improving the throughput of a traffic benchmark: a two-stage bottleneck where four lanes reduce to two and then reduce to one. We first characterize the inflow-outflow curve of this bottleneck without any control. We introduce an inflow of autonomous vehicles with the intent of improving the congestion through Lagrangian control. To handle the varying number of autonomous vehicles in the system we derive a per-lane variable speed limits parametrization of the controller. We demonstrate that a 10% penetration rate of controlled autonomous vehicles can improve the throughput of the bottleneck by 200 vehicles per hour: a 25% improvement at high inflows. Finally, we compare the performance of our control policies to feedback ramp metering and show that the AV controller provides comparable performance to ramp metering without the need to build new ramp metering infrastructure. Illustrative videos of the results can be found at https://sites.google.com/view/itsc-lagrangian-avs/home and code and tutorials can be found at https://github.com/flow-project/flow.
UR - http://www.scopus.com/inward/record.url?scp=85060474880&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85060474880&partnerID=8YFLogxK
U2 - 10.1109/ITSC.2018.8569615
DO - 10.1109/ITSC.2018.8569615
M3 - Conference contribution
AN - SCOPUS:85060474880
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 759
EP - 765
BT - 2018 IEEE Intelligent Transportation Systems Conference, ITSC 2018
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 4 November 2018 through 7 November 2018
ER -