Doubt in the PPO.py #409
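You don't have to wait until the episode terminates. (What if your use case has episodes that run for 400k steps?) Please read the docs (https://docs.cleanrl.dev/rl-algorithms/ppo/#overview); this is related to PPO's vectorized architecture. Each rollout collects a fixed number of steps from every environment, and an environment that finishes an episode partway through is simply reset and keeps stepping.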
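To make the idea concrete, here is a minimal sketch of fixed-length rollout collection across several environments. It is not CleanRL's actual code: `ToyEnv`, `collect_rollout`, and the constant `policy` are hypothetical names made up for illustration, and the toy environment just ends an episode after a fixed number of steps.

```python
class ToyEnv:
    """Hypothetical stand-in environment: an episode ends after `episode_len` steps."""

    def __init__(self, episode_len):
        self.episode_len = episode_len
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t  # the observation is just the step counter

    def step(self, action):
        self.t += 1
        done = self.t >= self.episode_len
        return self.t, 1.0, done  # observation, reward, done


def collect_rollout(envs, obs, num_steps, policy):
    """Collect exactly `num_steps` transitions per env, resetting finished envs in place."""
    obs_buf, reward_buf, done_buf = [], [], []
    for _ in range(num_steps):
        step_obs, step_rewards, step_dones = [], [], []
        for i, env in enumerate(envs):
            step_obs.append(obs[i])
            next_obs, reward, done = env.step(policy(obs[i]))
            step_rewards.append(reward)
            step_dones.append(done)
            # An episode ending does NOT end the rollout: reset and keep going.
            obs[i] = env.reset() if done else next_obs
        obs_buf.append(step_obs)
        reward_buf.append(step_rewards)
        done_buf.append(step_dones)
    return obs_buf, reward_buf, done_buf, obs


# Two envs with different episode lengths, one rollout of 8 steps each.
envs = [ToyEnv(episode_len=3), ToyEnv(episode_len=5)]
obs = [env.reset() for env in envs]
obs_buf, reward_buf, done_buf, obs = collect_rollout(
    envs, obs, num_steps=8, policy=lambda o: 0
)
```

The rollout length stays fixed no matter where episodes end. The recorded `done` flags mark the episode boundaries, so a later return or advantage computation can avoid bootstrapping across them. The returned `obs` is where the next rollout picks up, mid-episode if need be.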
Shouldn't we stop the rollout collection once the episode has terminated?