[BugFix] Fix behavior or partial, nested dones in PEnv and TEnv #2959

vmoens · 2025-05-16T08:00:59Z

Stack from ghstack (oldest at bottom):

-> [BugFix] Fix behavior or partial, nested dones in PEnv and TEnv #2959

[ghstack-poisoned]

ghstack-source-id: c027ee5 Pull-Request-resolved: #2959

pytorch-bot · 2025-05-16T08:01:09Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2959

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 6 Pending, 2 Unrelated Failures

As of commit 9185c7a with merge base 36f34da:

NEW FAILURES - The following jobs have failed:

Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
Process completed with exit code 1.
Habitat Tests on Linux / tests (3.9, 12.8) / linux-job (gh)
RuntimeError: Command docker exec -t fa826afc96321a2eec2a2a4866c71272520a64af7f2887599d91088a8797a399 /exec failed with exit code 1
Libs Tests on Linux / unittests-jumanji (3.10, 12.8) / linux-job (gh)
RuntimeError: Command docker exec -t 97c691eda2fed4677515c585cdda67fa3cba238f982235753397e8798ff49d38 /exec failed with exit code 245
Unit-tests on Linux / tests-optdeps (3.11, 12.8) / linux-job (gh)
test/test_transforms.py::TestVecNormV2::test_vecnorm_parallel_auto[5]

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Libs Tests on Linux / unittests-gym (3.9, 12.8) / linux-job (gh) (trunk failure)
test/test_libs.py::TestGym::test_gym_fake_td[True-False-3-HalfCheetah-v2]
Unit-tests on Windows / unittests-cpu (3.10, windows.4xlarge, cpu) / windows-job (gh) (trunk failure)
test/test_transforms.py::TestTimer::test_transform_env

This comment was automatically generated by Dr. CI and updates every 15 minutes.

thomasbbrunner

Nice! Thanks for the quick fix.

thomasbbrunner · 2025-05-16T08:21:15Z

test/test_env.py

+        if done_at_root:
+            assert parallel_env._simple_done
+            assert transformed_env._simple_done
+            # We expect each env to have reached a done state once.
+            assert parallel_td["next", "done"].sum().item() == 2
+            # We expect env[0] to have been reset and executed 2 steps.
+            # We expect env[1] to have just been reset (0 steps).
+            assert parallel_env._counter() == [2, 0]
+            assert parallel_td["next", "done"].sum().item() == 2
+
+            # We expect each env to have reached a done state once.
+            assert transformed_td["next", "done"].sum().item() == 2
+            assert_allclose_td(transformed_td, parallel_td, intersection=True)
+            # We expect env[0] to have been reset and executed 2 steps.
+            # We expect env[1] to have just been reset (0 steps).
+            # We only expect env[0] to have reached a done state.
+        else:
+            assert not parallel_env._simple_done
+            assert not transformed_env._simple_done
+            assert ("next", "done") not in parallel_td
+            assert ("next", "done") not in transformed_td
+            assert parallel_td["next", "agent_1", "done"].sum().item() == 2
+            assert parallel_env._counter() == [2, 0]
+            assert parallel_td["next", "agent_1", "done"].sum().item() == 2
+            assert transformed_td["next", "agent_1", "done"].sum().item() == 2
+            assert_allclose_td(transformed_td, parallel_td, intersection=True)
+
+        assert transformed_env._counter() == [2, 0]


I think there's a mismatch in the asserts and comments and some asserts are duplicated. Maybe this helps:

Suggested change

        # We expect env[0] to have been reset and executed 2 steps.
        # We expect env[1] to have just been reset (0 steps).
        assert parallel_env._counter() == [2, 0]
        assert transformed_env._counter() == [2, 0]
        if done_at_root:
            assert parallel_env._simple_done
            assert transformed_env._simple_done
            # We expect each env to have reached a done state once.
            assert parallel_td["next", "done"].sum().item() == 2
            assert transformed_td["next", "done"].sum().item() == 2
            assert_allclose_td(transformed_td, parallel_td, intersection=True)
        else:
            assert not parallel_env._simple_done
            assert not transformed_env._simple_done
            assert ("next", "done") not in parallel_td
            assert ("next", "done") not in transformed_td
            assert parallel_td["next", "agent_1", "done"].sum().item() == 2
            assert transformed_td["next", "agent_1", "done"].sum().item() == 2
            assert_allclose_td(transformed_td, parallel_td, intersection=True)

Update

ded861c

vmoens pushed a commit that referenced this pull request May 16, 2025

[BugFix] Fix behavior or partial, nested dones in PEnv and TEnv

5eb0e47

ghstack-source-id: c027ee5 Pull-Request-resolved: #2959

facebook-github-bot added the CLA Signed label May 16, 2025

vmoens added the Environments label May 16, 2025

Update

699904d

vmoens pushed a commit that referenced this pull request May 16, 2025

[BugFix] Fix behavior or partial, nested dones in PEnv and TEnv

4b36c70

ghstack-source-id: 04009f5 Pull-Request-resolved: #2959

thomasbbrunner reviewed May 16, 2025

thomasbbrunner mentioned this pull request May 16, 2025

[BUG] Transforming a ParallelEnv causes all sub-envs to be reset when one of them is done in a multi-agent setting #2958

Closed

3 tasks
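
For readers who want to see the intended behavior in isolation, here is a minimal pure-Python sketch of a partial reset: only the sub-env that reports done is reset, whether the done flag sits at the root or is nested under an agent group. This is not TorchRL's implementation. `CountingEnv`, `is_done`, `rollout`, and `run_demo` are hypothetical names, and the step limits (3 and 5) and rollout length (5) are assumptions chosen so that the final counters match the `[2, 0]` asserted in the test.

```python
from typing import Dict, List, Tuple, Union

StepOutput = Dict[str, Union[bool, Dict[str, bool]]]


class CountingEnv:
    """Hypothetical toy env: counts steps and reports done at `max_steps`."""

    def __init__(self, max_steps: int, done_at_root: bool) -> None:
        self.max_steps = max_steps
        self.done_at_root = done_at_root
        self.count = 0

    def reset(self) -> None:
        self.count = 0

    def step(self) -> StepOutput:
        self.count += 1
        done = self.count >= self.max_steps
        if self.done_at_root:
            return {"done": done}
        # Nested case: the done flag lives under an agent group.
        return {"agent_1": {"done": done}}


def is_done(out: dict) -> bool:
    """Return True if any `done` flag is set, at the root or nested."""
    return any(
        is_done(v) if isinstance(v, dict) else (k == "done" and v)
        for k, v in out.items()
    )


def rollout(envs: List[CountingEnv], num_steps: int) -> Tuple[int, List[int]]:
    """Step all envs together, resetting only the ones that are done."""
    total_dones = 0
    for _ in range(num_steps):
        outs = [env.step() for env in envs]
        for env, out in zip(envs, outs):
            if is_done(out):
                total_dones += 1
                env.reset()  # partial reset: the other envs keep running
    return total_dones, [env.count for env in envs]


def run_demo(done_at_root: bool) -> Tuple[int, List[int]]:
    envs = [CountingEnv(3, done_at_root), CountingEnv(5, done_at_root)]
    return rollout(envs, num_steps=5)
```

Under these assumptions, `run_demo(True)` and `run_demo(False)` both yield two done events and final counters of `[2, 0]`: env[0] finishes at step 3, is reset, and runs 2 more steps, while env[1] finishes at step 5 and has just been reset. A wrapper that reset every sub-env whenever any one of them was done would instead leave both counters equal.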

Update

9185c7a

vmoens pushed a commit that referenced this pull request May 16, 2025

[BugFix] Fix behavior or partial, nested dones in PEnv and TEnv

096078d

ghstack-source-id: e36d1c8 Pull-Request-resolved: #2959

vmoens merged commit 9185c7a into gh/vmoens/143/base May 16, 2025
91 of 101 checks passed

vmoens pushed a commit that referenced this pull request May 16, 2025

[BugFix] Fix behavior or partial, nested dones in PEnv and TEnv

6ae8d43

ghstack-source-id: e36d1c8 Pull-Request-resolved: #2959

vmoens deleted the gh/vmoens/143/head branch May 16, 2025 10:02

vmoens pushed a commit that referenced this pull request May 16, 2025

[BugFix] Fix behavior or partial, nested dones in PEnv and TEnv

c6ca5af

ghstack-source-id: e36d1c8 Pull-Request-resolved: #2959 (cherry picked from commit 6ae8d43)
