TensorFlow implementation of TRPO (Trust Region Policy Optimization) with GAE (Generalized Advantage Estimator) on MuJoCo
forked from yjhong89/TRPO-GAE
ddlau/TRPO-GAE
About
Trust Region Policy Optimization with Generalized Advantage Estimator
Languages
- Python 100.0%