Abstract
Stochastic learning automata (SLA) theory is used to model the learning behavior of commuters in the combined departure time route choice (CDTRC) problem. The SLA model uses a multiaction linear reward-ε-penalty reinforcement scheme to represent how travelers learn from their past departure time and route choices. A traffic simulation was developed to test the model and to examine whether drivers learn the best CDTRC option and whether the network reaches user equilibrium in the long run. Results indicate that the developed SLA model accurately portrays the learning behavior of drivers, while the network satisfies user equilibrium conditions.
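For readers unfamiliar with the scheme named above, the following is a minimal sketch of the textbook multiaction linear reward-penalty update, in which a small penalty rate relative to the reward rate gives the reward-ε-penalty variant. It is not taken from the paper: the function name `update_probabilities`, the parameter names `a` and `b`, their default values, and the idea of indexing each CDTRC option (a departure time and route pair) as one action are all assumptions made for illustration.

```python
def update_probabilities(probs, chosen, rewarded, a=0.1, b=0.001):
    """One step of a multiaction linear reward-penalty update (illustrative).

    probs    -- current choice probabilities, one per action (sums to 1)
    chosen   -- index of the action selected this step
    rewarded -- True if the environment response was favorable
    a, b     -- assumed reward and penalty rates; b much smaller than a
                corresponds to the reward-epsilon-penalty case
    """
    r = len(probs)
    if r < 2:
        raise ValueError("at least two actions are required")
    updated = []
    for j, p in enumerate(probs):
        if rewarded:
            # Favorable response: move probability mass toward the chosen action.
            updated.append(p + a * (1.0 - p) if j == chosen else (1.0 - a) * p)
        else:
            # Unfavorable response: shift a small share of mass to the other actions.
            updated.append(
                (1.0 - b) * p if j == chosen else b / (r - 1) + (1.0 - b) * p
            )
    return updated


# Hypothetical example: four CDTRC options, initially equally likely.
probs = [0.25, 0.25, 0.25, 0.25]
after_reward = update_probabilities(probs, chosen=0, rewarded=True)
after_penalty = update_probabilities(probs, chosen=0, rewarded=False, b=0.01)
```

Both branches keep the probabilities summing to one, so the vector can be reused directly as the traveler's choice distribution on the next simulated day. How a trip is judged favorable or unfavorable is not specified in the abstract and is left out of this sketch.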
Original language | English (US) |
---|---|
Pages (from-to) | 154-162 |
Number of pages | 9 |
Journal | Transportation Research Record |
Issue number | 1807 |
State | Published - 2002 |
ASJC Scopus subject areas
- Civil and Structural Engineering
- Mechanical Engineering