Adds functionality to populate torch generator using torch.thread_safe_generator #9371
divyanshk wants to merge 5 commits into pytorch:main
Conversation
🔗 Helpful Links 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/9371
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 1 Pending as of commit a6bff65 with merge base 48956e0.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed 1e0225e to b15da1c
NicolasHug left a comment
Thanks for the PR @divyanshk . I think the changes look reasonable.
One thing I'm wondering is how does this affect the multiprocess-based dataloaders? Currently, since TV is using the global torch RNG, that global generator will be seeded by torch using a different seed for each process/worker. This is the correct behavior since we want each worker to have a different RNG.
Is that behavior preserved now that we're using torch.thread_safe_generator()?
It'd be good to have tests ensuring that's the case (both for the multiprocess and multithreaded cases).
Force-pushed b15da1c to 18bdef3
The multiprocessing case remains unchanged because torch.thread_safe_generator returns None in the multiprocessing use case, so for MP there is no change: the torch.rand functions previously received None for the generator arg, and they still do. I also added a test case confirming the expected behavior for multiprocessing.
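The fallback described above can be sketched as follows. Note this is a hypothetical stand-in written for illustration, not the real torch API from the companion PR: a thread-based worker would store its generator in thread-local storage, while everywhere else the lookup returns None, so torch.rand falls back to the global RNG, which is exactly the unchanged multiprocessing path.

```python
import threading

import torch

# Hypothetical stand-in for the lookup pattern described above; the real
# torch.thread_safe_generator only exists with the companion PyTorch PR.
_worker_state = threading.local()


def thread_safe_generator():
    # Returns the worker's generator, or None outside a thread worker,
    # in which case torch.rand falls back to the global generator.
    return getattr(_worker_state, "generator", None)


def random_draw():
    # Transforms pass the (possibly None) generator straight through.
    return torch.rand(1, generator=thread_safe_generator())
```

In the main process (or a multiprocessing worker) the lookup yields None, so behavior is identical to calling `torch.rand(1)` directly.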
Force-pushed c416fad to e7da958
Force-pushed e7da958 to 6277f11
transforms.RandomPerspective(p=1.0),
transforms.RandomErasing(p=1.0),
transforms.ScaleJitter(target_size=(24, 24)),
]
We have a few more random transforms in TV that we'll also want to update and test. I think the list you'll find in https://github.com/pytorch/vision/pull/7848/changes should have the proper coverage (but claude should be able to find all the relevant ones)
I was missing a lot of random transforms. Updated the PR to cover the transforms in these files in torchvision/transforms/v2:
_augment.py
_auto_augment.py
_color.py
_container.py
_geometry.py
_misc.py
_transform.py
This matches the files touched in your PR above.
RandomHorizontalFlip and RandomVerticalFlip were noisy for the test I want (i.e. that two batches are different for differently seeded workers), because the flipped outputs can be the same for different seeds. So I added another reproducibility test which covers the flips and the other transforms: it checks that two transform outputs are identical for the same torch.thread_safe_generator value. This should add extra coverage.
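The reproducibility check described above can be sketched like this. It uses plain torch ops as a stand-in for a torchvision transform, since the `generator=` plumbing for the transforms is exactly what this PR adds; the flip stand-in is an assumption for illustration.

```python
import torch


def apply_random(img, generator):
    # Stand-in for a random transform: flip the image when the draw < 0.5.
    # A real v2 transform would consume the generator the same way.
    if torch.rand(1, generator=generator).item() < 0.5:
        return torch.flip(img, dims=[-1])
    return img


img = torch.arange(12.0).reshape(1, 3, 4)
g0 = torch.Generator().manual_seed(0)
g1 = torch.Generator().manual_seed(0)
# Same seed => same random decision => identical outputs, even when the
# individual transform (like a flip) could coincide across different seeds.
out0 = apply_random(img, g0)
out1 = apply_random(img, g1)
```

This is why a same-seed equality check is a more robust signal for flips than a different-seed inequality check.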
assert not torch.equal(batch0, batch1)

@pytest.mark.parametrize("transform", TRANSFORMS, ids=lambda t: type(t).__name__)
def test_thread_worker_uses_thread_local_generator(self, transform):
For this multi-threading test, is there a way to test the actual multi-threaded behavior, without the mocking? I.e. ideally I'd like to test the public-facing APIs when a user requests multi-threaded workers from the DataLoader. I'm not sure what the public entry point is, though?
I agree. I am using the mocks only because the thread-based workers haven't landed yet, and I want to land this PR first. Once they're in, I can update these tests. With respect to the transforms, we are not simplifying anything, though: with the mocking, each transform function gets a different generator, just as it would through the dataloading workers.
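The mocking approach described here can be sketched as below. Since torch.thread_safe_generator only exists with the companion PyTorch PR, the patch uses `create=True`; each simulated "worker" sees its own seeded generator, and differently seeded workers should draw different batches.

```python
from unittest import mock

import torch


def draw_with(worker_seed):
    # Each simulated worker gets its own seeded generator, mimicking what
    # the thread-based DataLoader workers would do once they land.
    gen = torch.Generator().manual_seed(worker_seed)
    # create=True: torch.thread_safe_generator is assumed here and does not
    # exist in released torch, so the patch must create the attribute.
    with mock.patch("torch.thread_safe_generator", return_value=gen, create=True):
        return torch.rand(8, generator=torch.thread_safe_generator())


batch0 = draw_with(0)
batch1 = draw_with(1)
```

Differently seeded generators produce different draws, which is the property the multi-threading divergence test asserts.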
Added thread-safe random number generation to all V2 torchvision random transforms to prevent race conditions when using DataLoader with thread-based workers (worker_method='thread').
This is based on torch.thread_safe_generator, which returns a dataloader thread-worker-specific RNG, or None otherwise.
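A defensive way for a transform to consume this, sketched under the assumption that torch.thread_safe_generator only exists with the companion PyTorch PR (the getattr guard keeps the sketch runnable on released torch, where it falls back to None and thus the global RNG):

```python
import torch


def _get_generator():
    # getattr guard: torch.thread_safe_generator only exists with the
    # companion PyTorch PR; on released torch this returns None and
    # torch.rand falls back to the global RNG, preserving old behavior.
    fn = getattr(torch, "thread_safe_generator", None)
    return fn() if fn is not None else None


# A random transform would draw its parameters like this; passing
# generator=None is equivalent to using the global generator.
p_draw = torch.rand(1, generator=_get_generator())
```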