Skip to content
Snippets Groups Projects
  1. Apr 21, 2020
  2. Feb 06, 2019
    • Ashish Gaurav's avatar
      MCTS Fixes · cb171a91
      Ashish Gaurav authored
      * Reimplemented UCT MCTS
      * Fixed softmax
      * Merged multiple branches into this branch, all of which should now be in master
      * Added reuse of tree functionality
      * Added the ability to expand nodes based on q values rather than at random
      * Refactored everything, deleted non necessary MCTS classes and files, and mcts.py can evaluate newer MCTS
      cb171a91
  3. Jan 30, 2019
  4. Jan 29, 2019
  5. Jan 22, 2019
    • Jae Young Lee's avatar
      Add and train more low-level policies, train a high-level policy. · a90b4bc5
      Jae Young Lee authored
      The high-level policy was trained without changelane maneuver but with immediatestop maneuver. Two problems remain: 1) the agent chooses changelane maneuver too frequently; 2) before the stop region, immediatestop maneuver works but was not chosen property after 2.5m the high-level policy training...
      a90b4bc5
  6. Nov 19, 2018
  7. Nov 18, 2018
  8. Nov 17, 2018
Loading