From d5cda2c5f42ffd914e1f04975091ad069e2708ea Mon Sep 17 00:00:00 2001 From: Jaeyoung Lee <jaeyoung.lee@uwaterloo.ca> Date: Sun, 18 Nov 2018 03:18:50 -0500 Subject: [PATCH] README.txt -> README.md --- README.txt => README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) rename README.txt => README.md (92%) diff --git a/README.txt b/README.md similarity index 92% rename from README.txt rename to README.md index acc4887..8afb6f2 100644 --- a/README.txt +++ b/README.md @@ -38,7 +38,7 @@ These are the minimum steps required to replicate the results for simple_interse * To test one of these trained low-level policies, for example wait: `python3 low_level_policy_main.py --option=wait --test --saved_policy_in_root` * Available maneuvers are: wait, changelane, stop, keeplane, follow * These results are visually evaluated. - * Note: This training has a high variance due to the continuous action space, especially for stop and keeplane maneuvers. It may help to train for 0.2 million steps than the default 0.1 million by adding argument '--nb_steps=200000' while training. + * Note: This training has a high variance issue due to the continuous action space, especially for stop and keeplane maneuvers. It may help to train for 0.2 million steps than the default 0.1 million by adding argument '--nb_steps=200000' while training. * High-level policy: * To train high-level policy from scratch using the given low-level policies: `python3 high_level_policy_main.py --train` * To evaluate this trained high-level policy: `python3 high_level_policy_main.py --evaluate --saved_policy_in_root`. @@ -59,4 +59,4 @@ have modified. 1. `yapf -i YOUR_MODIFIED_FILE.py` 2. `docformatter --in-place YOUR_MODIFIED_FILE.py` -`yapf` formats the code and `docformatter` formats the docstrings. \ No newline at end of file +`yapf` formats the code and `docformatter` formats the docstrings. -- GitLab