Reinforce Agent playing CartPole-v1 This is a trained model of a Reinforce agent playing CartPole-v1.
Evaluation results
- mean_reward on CartPole-v1self-reported500.00 +/- 0.00
Reinforce Agent playing CartPole-v1 This is a trained model of a Reinforce agent playing CartPole-v1.