ppo-LunarLander-v2 / README.md

Commit History

First implementation. mean_reward=235.67 +/- 43.70256071321255
0b88b27

eelang commited on