    Add and train more low-level policies, train a high-level policy. · a90b4bc5
    Jae Young Lee authored
    The high-level policy was trained without the changelane maneuver but with the immediatestop maneuver. Two problems remain: 1) the agent chooses the changelane maneuver too frequently; 2) before the stop region, the immediatestop maneuver works but was not chosen properly after 2.5m the high-level policy training...