Commits · d82fe91cccd3efdb4a31d9450fdacfd4ad91f3c9 · wise-lab / wise-move

Feb 06, 2019
- Refactoring (change the module name "model_checker" to "verifier") · d82fe91c
  Jae Young Lee authored 6 years ago
  
  Also changed the number of hidden layers from 6 to 3.
  d82fe91c
- Improved Wait (and, slightly, the other maneuvers). · f443e382
  Jae Young Lee authored 6 years ago
  
  f443e382
Feb 05, 2019
- Merge branch 'retraining_wait_maneuver' into 'master' · 558f2efd
  Jae Young Lee authored 6 years ago
  
  Retraining wait maneuver
  See merge request !4
  558f2efd
- Successful high-level policy (6 hidden layers, 1m training). · 074d2f49
  Jae Young Lee authored 6 years ago
  
  - Also added ManualWait class.
  074d2f49
Feb 04, 2019
- High-level policy trained for 1m steps with 3 hidden layers. · 7a4ea75b
  Jae Young Lee authored 6 years ago
  
  7a4ea75b
Feb 01, 2019
- Merge branch 'master' into retraining_wait_maneuver · 77d61b11
  Jae Young Lee authored 6 years ago
  
  77d61b11
- Merge branch 'Further_improve_Follow_and_KeepLane' into 'master' · 062ad4ff
  Aravind Balakrishnan authored 6 years ago
  
  More improve follow and keep lane
  See merge request !3
  062ad4ff
- Further improve Follow and make KeepLane default (available any time). · 18cec392
  Jae Young Lee authored 6 years ago
  
  18cec392
Jan 31, 2019
- Further improve Follow and make KeepLane default (available any time). · ee257c39
  Jae Young Lee authored 6 years ago
  
  ee257c39
- A TODO added to controller_base.py. · 1f913dc3
  Jae Young Lee authored 6 years ago
  
  1f913dc3
Jan 30, 2019
- Auto stash before merge of "improving_and_refactoring" and "master" · c3e0e738
  Jae Young Lee authored 6 years ago
  
  c3e0e738
- Merge branch 'master' into improving_and_refactoring · c0035ce5
  Jae Young Lee authored 6 years ago
  
  c0035ce5
- Merge branch 'Improve_Follow' into 'master' · 895596a8
  Ashish Gaurav authored 6 years ago
  
  Improve follow
  See merge request !2
  895596a8
- Merge branch 'master' into 'Improve_Follow' · c098c0f4
  Jae Young Lee authored 6 years ago
  
  # Conflicts:
  #   backends/kerasrl_learner.py
  #   env/simple_intersection/simple_intersection_env.py
  #   options/simple_intersection/maneuvers.py
  c098c0f4
- Merge branch 'Improve_Follow' into improving_and_refactoring · 35249ff6
  Jae Young Lee authored 6 years ago
  
  35249ff6
- Some bug fixes; improve Follow and the high-level policy (88% success rate) · a0aa5b23
  Jae Young Lee authored 6 years ago
  
  a0aa5b23
- Merge branch 'improve_and_bugfix_low_and_high_level_training' into 'master' · ea8874a8
  Jae Young Lee authored 6 years ago
  
  Improve and bugfix low and high level training
  See merge request !1
  ea8874a8
- Fix the bug of sometimes having -100000 or 10000 rewards. · 6da696a2
  Jae Young Lee authored 6 years ago
  
  6da696a2
Jan 29, 2019
- Fix the bug of sometimes having -100000 or 10000 rewards. · 7dc47bce
  Jae Young Lee authored 6 years ago
  
  7dc47bce
Jan 24, 2019
- Improve and Bug-fix DQNLearner and environments. · 4a9327bd
  Jae Young Lee authored 6 years ago
  
  - Added RestrictedEpsGreedyPolicy and RestrictedGreedyPolicy and use them as policy and test_policy in DQNLearner. Now, the agent never chooses the action corresponding to -inf Q-value if there is at least one action with finite Q-value (if not, it chooses any action randomly, which is necessary for compatibility with keras-rl; see the comments in select_action).
  - Now, generate_scenario in SimpleIntersectionEnv generates veh_ahead_scenario even when randomize_special_scenario = 1.
  - In EpisodicEnvBase, the terminal reward is by default determined by the minimum one.
  - Small change of initiation_condition of EpisodicEnvBase (simplified).
  4a9327bd
- Bugfix in EpisodicEnvBase (related to initiation_condition of maneuvers) · d0b74b00
  Jae Young Lee authored 6 years ago
  
  d0b74b00
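
The action restriction described in commit 4a9327bd (never choose an action whose Q-value is -inf while at least one other action is available, and fall back to a random action otherwise) can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the repository's code: the function names `restricted_greedy` and `restricted_eps_greedy` and the `eps` parameter are hypothetical, and the sketch deliberately does not reproduce the keras-rl policy interface or the `select_action` method mentioned in the commit message.

```python
import math
import random


def _allowed_actions(q_values):
    """Indices of actions whose Q-value is not -inf."""
    return [a for a, q in enumerate(q_values) if q != -math.inf]


def restricted_greedy(q_values):
    """Greedy choice that skips actions with a -inf Q-value (hypothetical helper)."""
    allowed = _allowed_actions(q_values)
    if not allowed:
        # Every action is masked out: pick any action uniformly at random.
        return random.randrange(len(q_values))
    return max(allowed, key=lambda a: q_values[a])


def restricted_eps_greedy(q_values, eps=0.1):
    """Epsilon-greedy choice restricted to non-masked actions (hypothetical helper)."""
    allowed = _allowed_actions(q_values)
    if not allowed:
        # Same fallback as the greedy variant.
        return random.randrange(len(q_values))
    if random.random() < eps:
        # Explore, but only among the allowed actions.
        return random.choice(allowed)
    return max(allowed, key=lambda a: q_values[a])


if __name__ == "__main__":
    q = [-math.inf, 1.0, 3.0, -math.inf]
    print(restricted_greedy(q))               # greedy pick among actions 1 and 2
    print(restricted_eps_greedy(q, eps=0.5))  # never returns action 0 or 3
```

In this sketch the mask is encoded directly in the Q-values, so exploration and exploitation share the same filter and a masked action can only be returned in the all-masked fallback case.
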

Jan 22, 2019
- Add and train more low-level policies, train a high-level policy. · a90b4bc5
  Jae Young Lee authored 6 years ago
  
  The high-level policy was trained without the changelane maneuver but with the immediatestop maneuver. Two problems remain: 1) the agent chooses the changelane maneuver too frequently; 2) before the stop region, the immediatestop maneuver works but was not chosen properly after 2.5m of high-level policy training...
  a90b4bc5

Jan 17, 2019
- Successful low-level policies training except Wait maneuver. · 70ad9bf5
  Jae Young Lee authored 6 years ago
  
  Each low-level policy was retrained with better LTL conditions and rewards, some parts of which are also designed to encourage exploration (to prevent the vehicle from being stopped all the time).
  70ad9bf5

Nov 19, 2018
- Merge branch 'formatting' into 'master' · f2171d2c
  Aravind Balakrishnan authored 6 years ago
  
  Formatting
  See merge request !3
  f2171d2c
- update gitignore, revert vm pyglet fix · 7abc600f
  Ashish Gaurav authored 6 years ago
  
  7abc600f
- run docformatter · f428cd41
  Ashish Gaurav authored 6 years ago
  
  f428cd41
- format using yapf · 72e44d55
  Ashish Gaurav authored 6 years ago
  
  72e44d55
Nov 18, 2018
- Update utilities.py to change screen config variables · e1fdb162
  Ashish Gaurav authored 6 years ago
  
  e1fdb162
- Merge branch 'final_test' into 'master' · e58a1636
  Ashish Gaurav authored 6 years ago
  
  Final test
  See merge request !2
  e58a1636
- 20 for high level cost weights · 418be8f0
  Unknown authored 6 years ago
  
  418be8f0
- changed to 20 test for high level · 3d75ed5b
  Unknown authored 6 years ago
  
  3d75ed5b
- README.txt -> README.md · d5cda2c5
  Jae Young Lee authored 6 years ago
  
  d5cda2c5
- Revert a change in KeepLane. · fbb9b168
  Jae Young Lee authored 6 years ago
  
  fbb9b168
- Merge remote-tracking branch 'origin/final_test' into final_test · 228db9b4
  Jae Young Lee authored 6 years ago
  
  228db9b4
- Improve Wait and KeepLane, minor changes... · 5c3b366b
  Jae Young Lee authored 6 years ago
  
  5c3b366b
- Added note for low level training · a8bc838a
  Unknown authored 6 years ago
  
  a8bc838a
- Updated readme and running · 382065ac
  Unknown authored 6 years ago
  
  382065ac
- Change pip to pip3. · d1dbbd44
  Jae Young Lee authored 6 years ago
  
  d1dbbd44
- Change LTL_test (independent module) · da99b1ff
  Jae Young Lee authored 6 years ago
  
  da99b1ff
- Revert some changes in simple_intersection_env and maneuver_base. · 2cedd22c
  Jae Young Lee authored 6 years ago
  
  2cedd22c