Transfer Reinforcement Learning for Autonomous Driving: From WiseMove to WiseSim

Title Transfer Reinforcement Learning for Autonomous Driving: From WiseMove to WiseSim
Author
Abstract

Reinforcement learning (RL) is an attractive way to implement high-level decision-making policies for autonomous driving, but learning directly from a real vehicle or a high-fidelity simulator is variously infeasible. We therefore consider the problem of transfer reinforcement learning and study how a policy learned in a simple environment using\ WiseMove\ can be transferred to our high-fidelity simulator, WiseMove.\ WiseMove\ is a framework to study safety and other aspects of RL for autonomous driving. WiseMoveaccurately reproduces the dynamics and software stack of our real vehicle.\ 

We find that the accurately modelled perception errors in WiseMove\ contribute the most to the transfer problem. These errors, when even naively modelled in\ WiseMove, provide an RL policy that performs better in WiseMove\ than a hand-crafted rule-based policy. Applying domain randomization to the environment in\ WiseMove\ yields an even better policy. The final RL policy reduces the failures due to perception errors from 10\% to 2.75\%. We also observe that the RL policy has significantly less reliance on velocity compared to the rule-based policy, having learned that its measurement is unreliable.

Year of Publication
2021
Journal
ACM Transactions on Modeling and Computer Simulation
Volume
31
Number of Pages
Article No. 15, pp 1~26
Date Published
07/2021
ISSN Number
1049-3301
DOI
10.1145/3449356
Download citation