• Jae Young Lee's avatar
    Add and train more low-level policies, train a high-level policy. · a90b4bc5
    Jae Young Lee authored
    The high-level policy was trained without changelane maneuver but with immediatestop maneuver. Two problems remain: 1) the agent chooses changelane maneuver too frequently; 2) before the stop region, immediatestop maneuver works but was not chosen property after 2.5m the high-level policy training...
    a90b4bc5
Name
Last commit
Last update
..
simple_intersection Loading commit data...
__init__.py Loading commit data...
env_base.py Loading commit data...
road_env.py Loading commit data...