pybullet HumanoidBulletEnv-v0 score is not reasonable #3638

zhan0903 · 2019-08-04T11:03:39Z

Hi, I used the OpenAI Spinning Up TD3 algorithm to test pybullet's HumanoidBulletEnv-v0 environment, but the test score is around 1600 right from the beginning, which is not reasonable (TD3 should not work on this benchmark). Why does this happen? Has anyone seen similar results? Thank you.

erwincoumans (Maintainer) · 2019-08-04T14:39:29Z

Can you share the learning curve and a video of the resulting policy? Also, please share detailed steps to reproduce it (pip installs, apt-get installs, etc.).

zhan0903 (Author) · 2019-08-05T01:26:49Z

> Can you share the learning curve and a video of the resulting policy? Also, steps to reproduce it, in detail (pip installs, apt-get installs etc)

Hi, I installed pybullet with pip install pybullet and used OpenAI TD3's default settings for the test. Below is the test result (averaged over ten test runs) for seed 4.

Every seed (0-4) gives similar test results, with no significant change during the whole training process (the test results stay around 1600).
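One way to sanity-check a score like this is to run a random-action policy through the same evaluation loop and compare the averages. The sketch below assumes the classic gym interface (`reset()` returns an observation, `step()` returns a 4-tuple). `ToyEnv`, `average_return`, and both policies are hypothetical stand-ins, so the printed numbers say nothing about HumanoidBulletEnv-v0 itself.

```python
import random


class ToyEnv:
    """Hypothetical stand-in for a gym-style env (not HumanoidBulletEnv-v0)."""

    def __init__(self, horizon=20):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return 0.0

    def step(self, action):
        self.t += 1
        reward = 1.0 - abs(action)  # highest reward when the action is 0
        done = self.t >= self.horizon
        return float(self.t), reward, done, {}


def average_return(env, policy, episodes=10):
    """Mean undiscounted return of `policy` over a number of test episodes."""
    total = 0.0
    for _ in range(episodes):
        obs, done = env.reset(), False
        while not done:
            obs, reward, done, _ = env.step(policy(obs))
            total += reward
    return total / episodes


rng = random.Random(0)


def random_policy(obs):
    """Baseline: ignore the observation and act uniformly at random."""
    return rng.uniform(-1.0, 1.0)


def trained_policy(obs):
    """Placeholder for a learned policy; here it always picks the best action."""
    return 0.0


baseline = average_return(ToyEnv(), random_policy)
score = average_return(ToyEnv(), trained_policy)
print(f"random baseline: {baseline:.1f}, policy: {score:.1f}")
```

If a random policy already scores close to the trained one from the first epoch, the environment's reward or termination logic may be a more likely suspect than the learning algorithm.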

erwincoumans (Maintainer) · 2019-08-05T02:21:21Z

Do you have a video of the resulting policy?

I am not familiar with OpenAI TD3. How exactly did you install it? Which versions of TensorFlow, gym, etc.?

zhan0903 (Author) · 2019-08-05T06:28:03Z

> Do you have a video of the resulting policy?
>
> I am not familiar with Openai td3. How do you exactly install it? What version of Tensorflow, gym etc?

Hi, the TensorFlow version is 1.14.0 and the gym version is 0.14.0. I installed it by following the official instructions on the OpenAI Spinning Up website. I tried running the resulting policy in the humanoid env, and the policy failed.
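For a reproducible report, the installed versions can be collected in one go rather than typed by hand. A minimal sketch using only the standard library (Python 3.8+); `report_versions` is a hypothetical helper name, and the package list is an assumption about what is relevant here:

```python
from importlib import metadata  # available in the standard library since Python 3.8


def report_versions(packages):
    """Map each distribution name to its installed version, or 'not installed'."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = "not installed"
    return versions


# Package names assumed relevant to this thread; adjust as needed.
for name, version in report_versions(["pybullet", "gym", "tensorflow"]).items():
    print(f"{name}=={version}")
```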

zhan0903 (Author) · 2019-08-06T00:19:33Z

> Do you have a video of the resulting policy?
>
> I am not familiar with Openai td3. How do you exactly install it? What version of Tensorflow, gym etc?

By the way, how can I save the simulation to a video? I can run it on my computer (for the humanoid env, the policy seems to fail), but I don't know how to save it to a video. I tried to use gym.wrappers.Monitor to save it, but got this error: "TypeError: 'module' object is not callable".
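This error message usually means that a module object was called where a class or function was intended. The thread does not show the failing line, so the cause here is a guess: one possibility is calling the lowercase `gym.wrappers.monitor` module instead of the `Monitor` class. The sketch below reproduces the message with a standard-library module as a stand-in, then outlines an untested recording function. `describe_call` and `record_random_episode` are hypothetical names, and the recording part assumes a gym release around 0.14 (where `gym.wrappers.Monitor` exists), `pybullet_envs`, and ffmpeg on the system.

```python
import datetime as some_module  # stand-in for any module, e.g. a wrapper submodule


def describe_call(obj):
    """Call `obj` with no arguments and report either 'ok' or the TypeError text."""
    try:
        obj()
        return "ok"
    except TypeError as exc:
        return str(exc)


# Calling the module itself reproduces the message quoted in the thread.
module_result = describe_call(some_module)
# Calling a callable defined inside the module works as expected.
class_result = describe_call(some_module.datetime.now)
print(module_result)
print(class_result)


def record_random_episode(env_id="HumanoidBulletEnv-v0", video_dir="./videos"):
    """Untested sketch: record one random-action episode to `video_dir`."""
    import gym
    import pybullet_envs  # noqa: F401  (importing registers the Bullet envs)
    from gym.wrappers import Monitor  # the class, not the `monitor` module

    env = Monitor(gym.make(env_id), video_dir, force=True)
    obs, done = env.reset(), False
    while not done:
        obs, reward, done, info = env.step(env.action_space.sample())
    env.close()
```

Replacing `env.action_space.sample()` with the trained policy's action would record the policy instead of random motion.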

brokenBrain · 2019-12-07T22:08:11Z

I'm having the same issue. In my case, I'm instantiating the environment twice. The first environment seems to behave normally, but the second one misbehaves in exactly the manner described above.
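A quick way to check whether a second instance really behaves differently is to drive two freshly created environments with the same action sequence and compare rewards step by step. A minimal sketch, assuming the classic gym interface and an environment that is deterministic for fixed actions (a physics simulator may need explicit seeding first). `CounterEnv` and `first_divergence` are hypothetical names, and the toy env only stands in for the real one:

```python
class CounterEnv:
    """Hypothetical deterministic gym-style env used only for illustration."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = float(action) * self.t
        return self.t, reward, self.t >= self.horizon, {}


def first_divergence(make_env, actions, tol=1e-9):
    """Return the first step index where two instances disagree, else None."""
    env_a, env_b = make_env(), make_env()
    env_a.reset()
    env_b.reset()
    for i, action in enumerate(actions):
        _, reward_a, done_a, _ = env_a.step(action)
        _, reward_b, done_b, _ = env_b.step(action)
        if abs(reward_a - reward_b) > tol or done_a != done_b:
            return i
        if done_a:
            break
    return None


result = first_divergence(CounterEnv, [0.5, -0.2, 0.1, 0.9, 0.3])
print("first diverging step:", result)
```

With a real environment, `make_env` would be something like `lambda: gym.make(...)`; a non-`None` result points at state shared between instances rather than at the learning algorithm.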

erwincoumans (Maintainer) · 2019-12-18T23:04:45Z

I haven't trained the humanoid myself. Did you check out https://github.com/hill-a/stable-baselines/releases
and https://github.com/araffin/rl-baselines-zoo?
It has a trained policy using ppo2 and td3.

forcecore · 2021-01-07T12:15:45Z

The issue is real, and it is in the pybullet-gym implementation.
The fix is here: benelot/pybullet-gym#61
