high_level_policy_main.html 5.4 KB
 Aravind Bk committed Nov 17, 2018 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53  high_level_policy_main module — WiseMove documentation

high_level_policy_main module

high_level_policy_main.evaluate_high_level_policy(nb_episodes_for_test=100, nb_trials=10, trained_agent_file='highlevel_weights.h5f', pretrained=False, visualize=False)
high_level_policy_main.find_good_high_level_policy(nb_steps=25000, load_weights=False, nb_episodes_for_test=100, visualize=False, tensorboard=False, save_path='./highlevel_weights.h5f')
high_level_policy_main.high_level_policy_testing(nb_episodes_for_test=100, trained_agent_file='highlevel_weights.h5f', pretrained=False, visualize=True)
 Aravind Balakrishnan committed Feb 08, 2019 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 high_level_policy_main.high_level_policy_training(nb_steps=25000, load_weights=False, training=True, testing=True, nb_episodes_for_test=20, max_nb_steps=100, visualize=False, tensorboard=False, save_path='highlevel_weights.h5f')

Do RL of the high-level policy and test it.

Parameters:
• nb_steps – the number of steps to perform RL
• load_weights – True if the pre-learned NN weights are loaded (for initializations of NNs)
• training – True to enable training
• testing – True to enable testing
• nb_episodes_for_test – the number of episodes for testing
 Aravind Bk committed Nov 17, 2018 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90