removed calls to reset from init #2394

ahmedo42 · 2021-09-05T11:22:29Z

fixes #2242

jkterry1 · 2021-09-05T14:49:58Z

@ahmedo42 can you please make tests pass?

ahmedo42 · 2021-09-05T16:15:47Z

test_record_video_using_default_trigger is flaky , I'll look into it

jkterry1 · 2021-09-06T14:39:58Z

@ahmedo42 thanks a ton. Just to confirm I'm not going crazy here, this shouldn't require a version bump right?

ahmedo42 · 2021-09-06T15:10:53Z

@jkterry1 it shouldn't since it's not a bug fix and behavior is the same ( I think )

vwxyzjn

Thanks for the fix and sorry for the flaky test. If this does not fix I think going forward I will create a dummy end that only outputs black rgb pixels for deterministic test

gym/wrappers/test_record_video.py

RedTachyon · 2021-09-06T15:32:49Z

The behavior is not exactly the same. The big difference is that every environment will need to be reset'd before using it.

With envs made by gym.make, typically TimeLimit seems to enforce this.

If you do e.g. env = BipedalWalker(), you can instantly call env.step and it will work normally.

With this PR, my guess is that it will crash in an unpredictable way (some variable or field will be undefined)

RedTachyon · 2021-09-06T15:39:00Z

Specific example:

import gym
from gym.envs.box2d import BipedalWalker
env = BipedalWalker()
env.step(env.action_space.sample())

Output with this PR:

AttributeError                            Traceback (most recent call last)
<ipython-input-7-38fa47b0972b> in <module>()
----> 1 env.step(env.action_space.sample())

/usr/local/lib/python3.7/dist-packages/gym/envs/box2d/bipedal_walker.py in step(self, action)
    410             self.joints[3].motorSpeed = float(SPEED_KNEE * np.clip(action[3], -1, 1))
    411         else:
--> 412             self.joints[0].motorSpeed = float(SPEED_HIP * np.sign(action[0]))
    413             self.joints[0].maxMotorTorque = float(
    414                 MOTORS_TORQUE * np.clip(np.abs(action[0]), 0, 1)

AttributeError: 'BipedalWalker' object has no attribute 'joints'

Output with gym==0.19.0:

(array([ 0.0127883 , -0.00783346,  0.00696973,  0.02636976, -0.27169684,
        -0.24278045,  1.46274191,  0.85486007,  1.        , -0.37531292,
        -0.69931543,  1.70809513,  0.99102688,  1.        ,  0.45706901,
         0.46225971,  0.47843772,  0.50760233,  0.55379778,  0.62467676,
         0.73529869,  0.9186005 ,  1.        ,  1.        ]),
 -0.06912166370948038,
 False,
 {})

(that is, normal output)

ahmedo42 · 2021-09-06T15:48:17Z

@RedTachyon thanks for the catch , I think that you shouldn't be able to call env.step() directly without calling reset() first ( for all envs)

RedTachyon · 2021-09-06T16:12:53Z

I lean towards not enforcing this in the general case. If a specific custom environment doesn't need resetting, why should that be required? I imagine this could be relevant in some lifelong learning-like scenarios. Not a big deal, but an additional pain with no real benefit.

As for the existing included environments, manual checks could be added in their respective classes I suppose, at the very least to have a sane error message.

But ultimately I don't really think the resets in the init are a big deal, they can stay imo.

ahmedo42 · 2021-09-06T16:29:49Z

I see your point , the whole idea was to be consistent , some envs have reset in init but some don't which can be confusing , at least for me when I read the envs documentation

jkterry1 · 2021-09-06T18:42:41Z

"I think that you shouldn't be able to call env.step() directly without calling reset() first ( for all envs)"

I agree with ahmedo42 on this for a variety of reasons. We should also specifically test for this in the API compliance tests, @ahmedo42 could you please add that real quick?

jkterry1 · 2021-09-06T18:43:40Z

@ahmedo42 the environments should also issue a warning if you call step before reset that all environments inherit from the base class, like what PettingZoo does

RedTachyon · 2021-09-06T18:56:35Z

Wouldn't this require adding verification logic to the currently completely abstract Env class? I think it's much better to just add it to the existing environments (and actually making it an exception, and not just a warning), but keep the base class as "clean" as possible.

jkterry1 · 2021-09-06T21:25:57Z

It should be an exception not a warning yes, sorry. I don't understand your other concerns?

vwxyzjn · 2021-09-06T21:27:04Z

Per fixes in #2401, could you remove the changes in the test_record_video.py?

RedTachyon · 2021-09-06T21:47:51Z

My concern is about how the no-step-before-reset rule would be enforced. Currently it's in the TimeLimit wrapper. The only way I see to automatically enforce it in anything inheriting from gym.Env would be redefining the base reset function, which I think would be a pretty major change.

Unless I'm just misunderstanding and the suggestion is to add this check to the specific environments already within gym, which I think is fine.

ahmedo42 · 2021-09-07T15:18:55Z

A solution that mimics pettingzoo would be like this :

def get_env():
    env = BipedalWalker()
    env = TimeLimit(env)
    return env

which will be implemented in every environment

from the user's side

import gym
from gym.envs.box2d import bipedal_walker
env = bipedal_walker.get_env()

Any cleaner solution is welcome , note that using gym.make() already throws the appropriate exception.

…emove-reset

benblack769 · 2021-09-11T21:56:26Z

I belive that you can automatically wrap environments during environment make. For example, how TimeLimit is wrapped https://github.com/openai/gym/blob/master/gym/envs/registration.py#L109.

For environments which don't have a time limit, you can create a wrapper (perhaps EnforceCallOrder) specifically to enforce that reset() is called before step(), and wrap the environment similar to how the TimeLimit environment is wrapped.

Of course, there should be some way to disable this wrapping via the env spec, but it should probably be enabled by default.

benblack769 · 2021-09-12T21:48:18Z

gym/envs/registration.py

+            from gym.wrappers.time_limit import TimeLimit
+
+            env = TimeLimit(env, max_episode_steps=env.spec.max_episode_steps)
+        else:


There should be some way to disable wrapping the environment with the OrderEnforcing wrapper via the env.spec.

benblack769 · 2021-09-12T21:53:58Z

Looks good, just one comment about allowing this to be disabled if someone really doesn't want their environment to be wrapped for some reason.

benblack769 · 2021-09-13T14:13:02Z

I was thinking that the argument would be another optional argument to the EnvSpec class, similar to max_episode_steps, rather than the kwargs, which are really meant to be passed to the environment directly.

ahmedo42 · 2021-09-13T17:38:31Z

@benblack769 This was what I thought of initially but can't figure out a clean way to do it
what I want to do ideally is :

env = gym.make("someEnv",order_enforce=False) 
# but this is considered a kwarg for the env

Is there a way to override the args of EnvSpec from gym.make() such as max_episode_steps , nondetermenistic etc..?

benblack769 · 2021-09-13T19:50:44Z

What I was thinking is that the environment user would not be thinking about this choice that much, this is a choice that the environment maintainer would be making, primarily. In particular, I am worried about the case where wrapping the environment by default would break a lot of downstream users code, so the environment maintainer could keep the current behavior by doing something like

register(
    id="CustomEnv-v1",
    entry_point="...",
    order_enforce=False,
)

benblack769 · 2021-09-16T14:16:11Z

Thanks, I think this looks much better.

removed all calls to reset

76e289d

passing tests

4c3028e

fix off-by-one error

6362721

ahmedo42 mentioned this pull request Sep 6, 2021

Changed .shape to be a property #2397

Merged

vwxyzjn approved these changes Sep 6, 2021

View reviewed changes

gym/wrappers/test_record_video.py Outdated Show resolved Hide resolved

revert

05321e9

vwxyzjn mentioned this pull request Sep 6, 2021

Fix bad test cases with RecordVideo #2401

Merged

ahmedo42 added 2 commits September 7, 2021 21:40

merge master into branch

6aef02a

Merge branch 'remove-reset' of https://github.com/ahmedo42/gym into r…

9ecbdb0

…emove-reset

ahmedo42 added 3 commits September 12, 2021 12:19

add OrderEnforcing Wrapper

8ffaa9f

Merge branch 'master' into remove-reset

6a28b1b

add orderenforcing to the docs

3691824

benblack769 reviewed Sep 12, 2021

View reviewed changes

add option for disabling

c561aeb

add argument to EnvSpec

067d296

jkterry1 merged commit 2754d97 into openai:master Sep 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

removed calls to reset from init #2394

removed calls to reset from init #2394

ahmedo42 commented Sep 5, 2021

jkterry1 commented Sep 5, 2021

ahmedo42 commented Sep 5, 2021

jkterry1 commented Sep 6, 2021

ahmedo42 commented Sep 6, 2021

vwxyzjn left a comment

RedTachyon commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

ahmedo42 commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

ahmedo42 commented Sep 6, 2021

jkterry1 commented Sep 6, 2021

jkterry1 commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

jkterry1 commented Sep 6, 2021

vwxyzjn commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

ahmedo42 commented Sep 7, 2021

benblack769 commented Sep 11, 2021 •

edited

Loading

benblack769 Sep 12, 2021

benblack769 commented Sep 12, 2021

benblack769 commented Sep 13, 2021

ahmedo42 commented Sep 13, 2021

benblack769 commented Sep 13, 2021 •

edited

Loading

benblack769 commented Sep 16, 2021

removed calls to reset from init #2394

removed calls to reset from init #2394

Conversation

ahmedo42 commented Sep 5, 2021

jkterry1 commented Sep 5, 2021

ahmedo42 commented Sep 5, 2021

jkterry1 commented Sep 6, 2021

ahmedo42 commented Sep 6, 2021

vwxyzjn left a comment

Choose a reason for hiding this comment

RedTachyon commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

ahmedo42 commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

ahmedo42 commented Sep 6, 2021

jkterry1 commented Sep 6, 2021

jkterry1 commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

jkterry1 commented Sep 6, 2021

vwxyzjn commented Sep 6, 2021

RedTachyon commented Sep 6, 2021

ahmedo42 commented Sep 7, 2021

benblack769 commented Sep 11, 2021 • edited Loading

benblack769 Sep 12, 2021

Choose a reason for hiding this comment

benblack769 commented Sep 12, 2021

benblack769 commented Sep 13, 2021

ahmedo42 commented Sep 13, 2021

benblack769 commented Sep 13, 2021 • edited Loading

benblack769 commented Sep 16, 2021

benblack769 commented Sep 11, 2021 •

edited

Loading

benblack769 commented Sep 13, 2021 •

edited

Loading