backends/kerasrl_learner.py · a90b4bc578993e66ca2be8af4ec684414168b3d0 · wise-lab / wise-move

Add and train more low-level policies, train a high-level policy. · a90b4bc5

Jae Young Lee authored Jan 22, 2019

The high-level policy was trained without changelane maneuver but with immediatestop maneuver. Two problems remain: 1) the agent chooses changelane maneuver too frequently; 2) before the stop region, immediatestop maneuver works but was not chosen property after 2.5m the high-level policy training...

a90b4bc5