
Add non-TS'able _resize_image_and_masks variant with less tensor ops #7592

Merged
3 commits merged into main on May 20, 2023

Conversation

@ezyang (Contributor) commented May 16, 2023:

We did some horrible things to _resize_image_and_masks to make it TorchScriptable, and those horrible things cause weird divergences when you send the float computation to a real compiler that is willing to apply fastmath optimizations to floating point; see pytorch/pytorch#93598.

This PR adds a non-TS-goopified version of the operator which doesn't have this problem, since it does the size computation the "normal way" (and consequently doesn't get fastmath'ified).

Signed-off-by: Edward Z. Yang ezyang@meta.com
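
For concreteness, here is a minimal sketch (not the exact diff in this PR) of the two styles of size computation; the function names and parameters are illustrative:

    import torch

    def scale_factor_ts_style(image: torch.Tensor, self_min_size: float, self_max_size: float) -> float:
        # TorchScript-era style: route scalar size math through tensor ops.
        # Once this lands in a compiled graph, a fastmath-enabled backend is
        # free to reassociate/perturb the float arithmetic.
        im_shape = torch.tensor(image.shape[-2:])
        min_size = torch.min(im_shape).to(dtype=torch.float32)
        max_size = torch.max(im_shape).to(dtype=torch.float32)
        return torch.min(self_min_size / min_size, self_max_size / max_size).item()

    def scale_factor_eager_style(image: torch.Tensor, self_min_size: float, self_max_size: float) -> float:
        # The "normal way": plain Python arithmetic on the shape. This never
        # enters the graph, so fastmath cannot change the result.
        min_size = float(min(image.shape[-2:]))
        max_size = float(max(image.shape[-2:]))
        return min(self_min_size / min_size, self_max_size / max_size)

In eager mode both produce the same scale factor; the difference is that the second keeps the size arithmetic out of the traced graph entirely.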

@pytorch-bot (bot) commented May 16, 2023:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/7592

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit d8043a9:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@ezyang ezyang requested a review from NicolasHug May 16, 2023 00:52
ezyang added a commit to pytorch/pytorch that referenced this pull request on May 16, 2023:

The bulk of the heavy lifting is happening in pytorch/vision#7592

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: d8f8c954c3d7c45f595847c642d56f97e3322b6f
Pull Request resolved: #101477
@NicolasHug (Member) left a comment:

Thanks for the PR @ezyang. I am growing wary of the maintenance cost we're facing with recent PRs related to PT 2.0 support (#7587, now this PR). Both PRs add a separate implementation for an existing function.

Do we expect a lot more of these PT 2.0 support issues in the future? And if yes, is there an alternative to what we're currently doing, which is to duplicate all implementations?

Supporting the cross product of JIT x ONNX x all_platforms x <insert your preferred tech here> has been a massive challenge in torchvision (and I'm being diplomatic), and I fear that adding yet another factor to that is going to be... err... difficult.

if self.training:
    if self._skip_resize:
        return image, target
    size = random.choice(self.min_size)
@NicolasHug (Member) commented May 16, 2023:

Tests are failing because this needs an import. But please use the RNG from torch instead of Python's, so that users can fully control randomness just by calling torch.manual_seed() and similar mechanisms.
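
A minimal sketch of that suggestion (the candidate sizes here are made up); torch.randint draws from torch's global generator, so torch.manual_seed() controls it:

    import torch

    min_size = (480, 512, 544)  # hypothetical candidate sizes
    # torch-RNG replacement for random.choice(min_size):
    size = min_size[int(torch.randint(len(min_size), (1,)))]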

@ezyang (Contributor, Author) commented May 16, 2023:

> PT 2.0 support (#7587)

I want to treat this and #7587 separately. #7587 has nothing to do with PT2 support per se; it's entirely about support for deterministic algorithms (which I ran into while working on PT2, sure, but it stands on its own).

On the subject of deterministic algorithms, there really are two main approaches for adding a deterministic version of a CUDA kernel that uses gpuAtomicAdd: (1) you can write a decomposition (which is what the PR does), or (2) you can write another copy of the CUDA kernel by hand that doesn't have atomic adds in it (the easiest approach is to change the iteration space from grad_output to grad_input, so that the summation happens from a single thread).

Writing the decomposition has the added benefit that you can use it to compile things in PT2 (though I don't actually do this in the PR), and it can serve as a nice, pure-Python reference implementation for testing and experimentation. So it seems preferable to banging out another CUDA kernel. This seems... like a fair trade for "code duplication"? In the limit, we'd be applying this treatment to every custom operator in torchvision. A long-term vision for PT2 is that you wouldn't need to write hand-written CUDA code at all; you could write the pure Python code and generate a kernel automatically from it, but we're still a little bit away from that.
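
To make the iteration-space flip concrete, here is a toy sketch; a 1-D gather stands in for the real kernel, and none of this is the actual code from #7587:

    import torch

    def gather_backward_atomic(grad_out: torch.Tensor, idx: torch.Tensor, num_in: int) -> torch.Tensor:
        # Iterate over grad_output and scatter-add into grad_input. With
        # duplicate indices, the CUDA analogue is gpuAtomicAdd, so the float
        # accumulation order (and hence the result) is nondeterministic.
        grad_in = torch.zeros(num_in, dtype=grad_out.dtype, device=grad_out.device)
        return grad_in.index_add_(0, idx, grad_out)

    def gather_backward_decomposed(grad_out: torch.Tensor, idx: torch.Tensor, num_in: int) -> torch.Tensor:
        # Flip the iteration space: compute each grad_input element as a
        # single fixed-order reduction over grad_output. Deterministic, and
        # usable as a pure-Python reference implementation.
        mask = idx.unsqueeze(0) == torch.arange(num_in, device=idx.device).unsqueeze(1)
        return (mask.to(grad_out.dtype) * grad_out.unsqueeze(0)).sum(dim=1)

With idx = torch.tensor([0, 2, 0]) and grad_out = torch.tensor([1., 2., 3.]), both return tensor([4., 0., 2.]); the decomposed version materializes a num_in x len(idx) mask, so it is a reference rather than a fast kernel.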

> Supporting the cross product of JIT x ONNX x all_platforms x <insert your preferred tech here> has been a massive challenge in torchvision (and I'm being diplomatic), and I fear that adding yet another factor to that is going to be... err... difficult.

I think the question I would ask you is, if you didn't have to support TorchScript JIT / ONNX, what would this code look like? I argue that your code would look like the new version I've posted: why would you intentionally create a tensor just to do shape computation and convert it back out again? The version of the code here is the clear, idiomatic PyTorch eager implementation of the function.

@ezyang (Contributor, Author) commented May 16, 2023:

Attempted simplifying the duplication

ezyang added several more commits to pytorch/pytorch that referenced this pull request on May 16, 2023, each noting:

The bulk of the heavy lifting is happening in pytorch/vision#7592

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

cc voznesenskym penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire

ghstack-source-id: 28e7047b3ed1058cf3e8009cd29e7146cacc4426
Pull Request resolved: #101477
@NicolasHug (Member) left a comment:
Thanks for trying to minimize the code duplication, @ezyang. As discussed offline, the ONNX tests are red, but I'll stamp to unblock.

ezyang added further commits to pytorch/pytorch that referenced this pull request on May 20, 2023:

The bulk of the heavy lifting is happening in pytorch/vision#7592

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: 3a058371e45a21cf35cdb91ce1d313295f2d80ff
Pull Request resolved: #101477
@ezyang ezyang merged commit 300a909 into main May 20, 2023
@github-actions (bot) commented:
Hey @ezyang!

You merged this PR, but no labels were added. The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

ezyang added more commits to pytorch/pytorch that referenced this pull request on May 20, 2023 (one titled "…krcnn"):

The bulk of the heavy lifting is happening in pytorch/vision#7592

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: e4b86e6f0b9aad0142e227f01b3bb3561cd272cd
Pull Request resolved: #101477
pytorchmergebot pushed a commit to pytorch/pytorch that referenced this pull request May 21, 2023
The bulk of the heavy lifting is happening in
pytorch/vision#7592

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

Pull Request resolved: #101477
Approved by: https://github.com/voznesenskym
facebook-github-bot pushed a commit that referenced this pull request May 23, 2023
…nsor ops (#7592)

Summary: Signed-off-by: Edward Z. Yang <ezyang@meta.com>

Reviewed By: vmoens

Differential Revision: D46071415

fbshipit-source-id: bd1575ba95700e29f5565b720d9d1be070736fe8
ezyang added a commit to ezyang/vision that referenced this pull request Sep 7, 2023
This is a small follow-up to pytorch#7592 that makes this Dynamo exportable.

Signed-off-by: Edward Z. Yang <ezyang@meta.com>
@fmassa fmassa deleted the maskrcnn-descale branch September 26, 2023 08:53