Broadcast operations in aten::masked_fill and element_wise operations #1654
Conversation
@apbose we need tests to verify these changes. Also, the TS tests are failing.
Yes @narendasan, I am looking into debugging the failing TS test.
// Check whether self's extra leading dimensions are all 1, i.e.
// whether they can be squeezed away to match other's rank.
int diff = selfDim.size() - otherDim.size();
bool canSqueeze = true;
for (int i = 0; i < diff; i++) {
  if (selfDim[i] != 1) {
    canSqueeze = false;
  }
}
Does this work if diff < 0?
Yes, it probably should, since the condition above ensures that self has more dimensions than other. But this logic is leading to dimensions getting mishandled in the BERT model, so I need to remove it.
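For reference, a minimal Python sketch of the same check (the helper name and plain shape lists are illustrative, not the converter's API), with an explicit diff < 0 guard for the case raised above:

def can_squeeze_leading_dims(self_dim, other_dim):
    # Extra leading dimensions self has over other.
    diff = len(self_dim) - len(other_dim)
    if diff < 0:
        # self has fewer dims than other: nothing to squeeze here.
        return False
    # Squeezable only when every extra leading dimension is 1.
    return all(d == 1 for d in self_dim[:diff])

print(can_squeeze_leading_dims([1, 1, 256, 1488], [256, 1488]))  # True
print(can_squeeze_leading_dims([2, 1, 256, 1488], [256, 1488]))  # False

Note that in the C++ snippet above, a negative diff simply skips the loop and leaves canSqueeze true.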
The above model fails in TorchScript.
Description
This PR addresses the broadcasting issue encountered in the NeMo Citrinet model (Bug 1573).
This PR makes two main changes:
The TensorRT broadcast rules are more limited than the Torch broadcast rules (#1629), so aten::masked_fill is modified to make the mask and self tensors broadcastable according to TensorRT rules. At present the mask tensor is only padded when it has fewer dimensions than the self tensor, not vice versa.
Modification: The mask tensor is unpadded when it has more dimensions than the self tensor and the extra leading dimensions are 1.
The element_wise operations broadcast the two tensors to the same number of dimensions.
Modification: The tensor with the extra dimensions is checked for removable leading 1 dimensions and unpadded accordingly; otherwise the other tensor is padded (a rough sketch of this decision follows below).
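To make the padding/unpadding decision above concrete, here is a rough sketch in plain Python over shape lists (match_rank is a made-up helper, not the actual converter code):

def match_rank(longer, shorter):
    # longer has more dimensions than shorter. Squeeze longer's
    # extra leading dimensions when they are all 1; otherwise pad
    # shorter with leading 1s so both shapes end up with equal rank,
    # as TensorRT's broadcast rules require.
    diff = len(longer) - len(shorter)
    if all(d == 1 for d in longer[:diff]):
        return longer[diff:], shorter            # unpad the longer shape
    return longer, [1] * diff + list(shorter)    # pad the shorter shape

# mask [1, 1, 1, 1488] vs self [1, 256, 1488]: the extra leading 1
# is squeezed away rather than padding self up to rank 4.
print(match_rank([1, 1, 1, 1488], [1, 256, 1488]))
# ([1, 1, 1488], [1, 256, 1488])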
The above model fails with the runtime error:
RuntimeError: expand(CUDABoolType{[1, 1, 1, 1488]}, size=[1, 256, 1488]): the number of sizes provided (3) must be greater or equal to the number of dimensions in the tensor (4)
out dimension = [1, 256, 1488]
mask dimension = [1, 1, 1, 1488]
Cases with output = out.masked_fill_(mask, 2) fail, while output = torch.masked_fill(out, mask, 2) passes.
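A minimal eager-PyTorch sketch of that asymmetry, using the shapes from the error above (the quoted message says CUDABoolType; on CPU the type name in the message differs):

import torch

out = torch.zeros(1, 256, 1488)
mask = torch.ones(1, 1, 1, 1488, dtype=torch.bool)

# In-place: self cannot be expanded up to the mask's higher rank,
# so this raises the expand() RuntimeError quoted above.
try:
    out.masked_fill_(mask, 2)
except RuntimeError as e:
    print(e)

# Out-of-place: both operands are broadcast, so this passes and
# yields the full broadcast shape instead of raising.
res = torch.masked_fill(out, mask, 2)
print(res.shape)  # expected: torch.Size([1, 1, 256, 1488])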