VecGymEnvTransform

class torchrl.envs.transforms.VecGymEnvTransform(final_name='final')[source]

A transform for GymWrapper subclasses that handles the auto-reset in a consistent way.

Gym, gymnasium and SB3 provide vectorized (read, parallel or batched) environments that are automatically reset. When this occurs, the actual observation resulting from the action is saved within a key in the info. The class torchrl.envs.libs.gym.terminal_obs_reader reads that observation and stores it in a "final" key within the output tensordict. In turn, this transform reads that final data, swaps it with the observation written in its place that results from the actual reset, and saves the reset output in a private container. The resulting data truly reflects the output of the step.

This class works from gym 0.13 till the most recent gymnasium version.

Note

Gym versions < 0.22 did not return the final observations. For these, we simply fill the next observations with NaN (because it is lost) and do the swap at the next step.

Then, when calling env.reset, the saved data is written back where it belongs (and the reset is a no-op).

This transform is automatically appended to the gym env whenever the wrapper is created with an async env.

Parameters:: final_name (str, optional) – the name of the final observation in the dict. Defaults to “final”.

Note

In general, this class should not be handled directly. It is created whenever a vectorized environment is placed within a GymWrapper.

forward(tensordict: TensorDictBase) → TensorDictBase[source]: Reads the input tensordict, and for the selected keys, applies the transform.

transform_observation_spec(observation_spec: TensorSpec) → TensorSpec[source]

Transforms the observation spec such that the resulting spec matches transform mapping.

Parameters:: observation_spec (TensorSpec) – spec before the transform
Returns:: expected spec after the transform

VecGymEnvTransform

Docs

Tutorials

Resources