Replies: 8 comments
-
Can you share the learning curve and a video of the resulting policy? Also, steps to reproduce it, in detail (pip installs, apt-get installs etc) |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Do you have a video of the resulting policy? I am not familiar with Openai td3. How do you exactly install it? What version of Tensorflow, gym etc? |
Beta Was this translation helpful? Give feedback.
-
Hi, TensorFlow version is 1.14.0, gym version:0.14.0, I install it just flow its official instruction on its website openai spinningup. I tried to run the resulting policy in humanoid env, and the policy failed. |
Beta Was this translation helpful? Give feedback.
-
By the way, how to save the simulation to a video? I can run it on my computer(for humanoid env, the policy seems failed), but don't know how to save it to a video. I tried to use gym.wrappers.Monitor to save it but occurred this error "TypeError: 'module' object is not callable" . |
Beta Was this translation helpful? Give feedback.
-
I'm having the same issue. In my case, I'm instantiating the environment twice. The first environment seems to be behaving normally. But the second environment is misbehaving in exactly the above manner. |
Beta Was this translation helpful? Give feedback.
-
I haven't trained humanoid myself. Did you check out https://github.com/hill-a/stable-baselines/releases |
Beta Was this translation helpful? Give feedback.
-
The issue is real and it is in pybullet-gym implementation. |
Beta Was this translation helpful? Give feedback.
-
Hi, I use the Openai Spinningup td3 algorithm to test pybullet's HumanoidBulletEnv-v0 environment, but got the test score around 1600 even from the beginning which is not reasonable (td3 should not work in this benchmark), why is it like this? Does anyone have similar results? Thank you.
Beta Was this translation helpful? Give feedback.
All reactions