Commits · 878581c72db48eb09e8ad71ac19956b122b65a4c · wise-lab / wise-move

Apr 21, 2020
- WiseMove Release (TOMAC-init). · 878581c7
  Jaeyoung Lee authored 5 years ago
  
  878581c7
Feb 06, 2019

MCTS Fixes · cb171a91

Ashish Gaurav authored 6 years ago

* Reimplemented UCT MCTS
* Fixed softmax
* Merged multiple branches into this branch, all of which should now be in master
* Added reuse of tree functionality
* Added the ability to expand nodes based on q values rather than at random
* Refactored everything, deleted non necessary MCTS classes and files, and mcts.py can evaluate newer MCTS

cb171a91

Jan 30, 2019
- Fix the bug of having -100000 or 10000 rewards sometime. · 6da696a2
  Jae Young Lee authored 6 years ago
  
  6da696a2
Jan 29, 2019
- Fix the bug of having -100000 or 10000 rewards sometime. · 7dc47bce
  Jae Young Lee authored 6 years ago
  
  7dc47bce
Jan 22, 2019

Add and train more low-level policies, train a high-level policy. · a90b4bc5

Jae Young Lee authored 6 years ago

The high-level policy was trained without changelane maneuver but with immediatestop maneuver. Two problems remain: 1) the agent chooses changelane maneuver too frequently; 2) before the stop region, immediatestop maneuver works but was not chosen property after 2.5m the high-level policy training...

a90b4bc5

Nov 19, 2018
- run docformatter · f428cd41
  Ashish Gaurav authored 6 years ago
  
  f428cd41
- format using yapf · 72e44d55
  Ashish Gaurav authored 6 years ago
  
  72e44d55
Nov 18, 2018
- 20 for high level cost weifghts · 418be8f0
  Unknown authored 6 years ago
  
  418be8f0
Nov 17, 2018
- version 1.0.0 · 125ba1e0
  Aravind Bk authored 6 years ago
  
  125ba1e0