On GAE calculation math #593
michael-lutz
started this conversation in
General
Replies: 2 comments
-
|
the only logical explaination i could come up with was :
correct me if I'm wrong? |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
The extra 0.5 factor indeed just scales the value loss. I've futzed with a I'm not really sure about the deviation in the GAE implementation, but using |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
In
losses.py, I noticed that the code includes the following step before returning value targets and advantages:From what I understand,
compute_vs_minus_v_xsshould return the standard GAE result. Why do we perform an additional TD computation at the end?Second, I was hoping to ask why the value loss includes an extra
0.5term:Both these decisions seem non-standard. Did you find they improved performance empirically?
Beta Was this translation helpful? Give feedback.
All reactions