TY - JOUR
T1 - Reinforcement-Learning-Based Cooperative Adaptive Cruise Control of Buses in the Lincoln Tunnel Corridor with Time-Varying Topology
AU - Gao, Weinan
AU - Gao, Jingqin
AU - Ozbay, Kaan
AU - Jiang, Zhong Ping
N1 - Funding Information:
Manuscript received June 26, 2018; revised November 20, 2018; accepted January 1, 2019. Date of publication February 20, 2019; date of current version October 2, 2019. This work was supported in part by the New York University’s Connected Cities for Smart Mobility towards Accessible and Resilient Transportation Center (C2SMART) and in part by the U.S. National Science Foundation under Grant ECCS-1501044. The Associate Editor for this paper was B. Morris. (Corresponding author: Weinan Gao.) W. Gao is with the Department of Electrical and Computer Engineering, Allen E. Paulson College of Engineering and Computing, Georgia Southern University, Statesboro, GA 30460 USA (e-mail: wgao@georgiasouthern.edu).
Publisher Copyright:
© 2000-2011 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - The exclusive bus lane (XBL) is one of the most popular bus transit systems in the U.S. The Lincoln Tunnel utilizes an XBL through the tunnel in the AM peak period. This paper proposes a novel data-driven cooperative adaptive cruise control (CACC) algorithm that aims to minimize a cost function for connected and autonomous buses along the XBL. Different from existing model-based CACC algorithms, the proposed approach employs the idea of reinforcement learning, which does not rely on accurate knowledge of bus dynamics. Considering a time-varying topology, where each autonomous vehicle can only receive information from preceding vehicles that are within its communication range, a distributed controller is learned real-time by online headway, velocity, and acceleration data collected from the system trajectories. The convergence of the proposed algorithm and the stability of the closed-loop system are rigorously analyzed. The effectiveness of the proposed approach is demonstrated using a well-calibrated Paramics microscopic traffic simulation model of the XBL corridor. The simulation results show that the travel time in the autonomous version of the XBL are close to the present day travel time even when the bus volume is increased by 30%.
AB - The exclusive bus lane (XBL) is one of the most popular bus transit systems in the U.S. The Lincoln Tunnel utilizes an XBL through the tunnel in the AM peak period. This paper proposes a novel data-driven cooperative adaptive cruise control (CACC) algorithm that aims to minimize a cost function for connected and autonomous buses along the XBL. Different from existing model-based CACC algorithms, the proposed approach employs the idea of reinforcement learning, which does not rely on accurate knowledge of bus dynamics. Considering a time-varying topology, where each autonomous vehicle can only receive information from preceding vehicles that are within its communication range, a distributed controller is learned real-time by online headway, velocity, and acceleration data collected from the system trajectories. The convergence of the proposed algorithm and the stability of the closed-loop system are rigorously analyzed. The effectiveness of the proposed approach is demonstrated using a well-calibrated Paramics microscopic traffic simulation model of the XBL corridor. The simulation results show that the travel time in the autonomous version of the XBL are close to the present day travel time even when the bus volume is increased by 30%.
KW - Reinforcement learning
KW - connected and autonomous vehicles
KW - cooperative adaptive cruise control
KW - time-varying topology
UR - http://www.scopus.com/inward/record.url?scp=85065871785&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85065871785&partnerID=8YFLogxK
U2 - 10.1109/TITS.2019.2895285
DO - 10.1109/TITS.2019.2895285
M3 - Article
AN - SCOPUS:85065871785
SN - 1524-9050
VL - 20
SP - 3796
EP - 3805
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 10
M1 - 8645803
ER -