
fix/feat: Move convolution core to impl + add feature (FX converter refactor) #1972

Merged (1 commit) Jun 30, 2023

Conversation

gs-olive (Collaborator) commented Jun 2, 2023

Description

  • Centralize the convolution implementation in FX across all source IR variants, including support for conv1d, quantized convolution, and other configurations (a hedged sketch of this centralization appears after this description)
  • Update reference implementations across the stack to use the centralized utility and remove the individually replicated implementations
  • Allow conv layers to take bias inputs in FX, per new functionality from TRT
  • Enable pass-through of build errors in Dynamo e2e tests so that errors are not hidden (this PR fixes a bug which prevented that pass-through)

Fixes #1954
Addresses first bug in #1565
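
For illustration only, a minimal sketch of what a single shared convolution helper might look like. This is not the code added by the PR: the function name, signature, and the ITensor-bias handling are assumptions, and the real implementation uses the repo's own helpers (to_numpy, get_trt_tensor, TRTTensor) quoted in the review thread below.

```python
from typing import Optional, Sequence, Union

import numpy as np
import tensorrt as trt
import torch


def convolution_impl(
    network: trt.INetworkDefinition,
    name: str,
    input_val: trt.ITensor,
    weight: Union[torch.Tensor, np.ndarray],
    bias: Optional[Union[torch.Tensor, np.ndarray, trt.ITensor]] = None,
    stride: Sequence[int] = (1, 1),
    padding: Sequence[int] = (0, 0),
    dilation: Sequence[int] = (1, 1),
    groups: int = 1,
) -> trt.ITensor:
    """Hedged sketch of a centralized N-dimensional convolution converter.

    Conv1d handling (unsqueezing inputs/weights to 2D) is omitted for brevity.
    """
    # Constant weights are handed to TRT as NumPy arrays.
    if isinstance(weight, torch.Tensor):
        weight = weight.detach().cpu().numpy()

    # Bias may be a constant (folded into the layer) or, per the new TRT
    # functionality this PR enables, a runtime ITensor.
    bias_const = None
    bias_tensor = None
    if isinstance(bias, torch.Tensor):
        bias_const = bias.detach().cpu().numpy()
    elif isinstance(bias, np.ndarray):
        bias_const = bias
    elif isinstance(bias, trt.ITensor):
        bias_tensor = bias

    layer = network.add_convolution_nd(
        input=input_val,
        num_output_maps=weight.shape[0],
        kernel_shape=weight.shape[2:],
        kernel=weight,
        bias=bias_const if bias_const is not None else trt.Weights(),
    )
    layer.stride_nd = stride
    layer.padding_nd = padding
    layer.dilation_nd = dilation
    layer.num_groups = groups
    layer.name = name

    if bias_tensor is not None:
        # Assumption: an ITensor bias is attached as a layer input rather than
        # a constant; consult the TensorRT docs for the correct input index.
        layer.set_input(2, bias_tensor)

    return layer.get_output(0)
```

Each per-IR converter (aten, acc, nn) would then only marshal its arguments and call the shared helper.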

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)

Checklist:

  • [x] My code follows the style guidelines of this project (you can use the linters)
  • [x] I have performed a self-review of my own code
  • [x] I have commented my code, particularly in hard-to-understand areas and hacks
  • [x] I have made corresponding changes to the documentation
  • [x] I have added tests to verify my fix or my feature
  • [x] New and existing unit tests pass locally with my changes
  • [x] I have added the relevant labels to my PR so that the relevant reviewers are notified

@gs-olive gs-olive self-assigned this Jun 2, 2023
@github-actions github-actions bot requested a review from yinghai June 2, 2023 16:19
@gs-olive gs-olive added the `component: dynamo` and `Story: Dynamo Compile Improvements` labels Jun 2, 2023
@gs-olive gs-olive requested review from narendasan and frank-wei and removed request for yinghai June 2, 2023 16:29
@github-actions github-actions bot requested a review from yinghai June 2, 2023 16:29
@gs-olive gs-olive added the `WIP` (work is in progress, pull request should not be merged yet) label Jun 4, 2023
@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch 7 times, most recently from 5348ac2 to 075a028 Compare June 5, 2023 02:07
@gs-olive gs-olive removed the `WIP` label Jun 5, 2023
@gs-olive gs-olive requested review from wushirong and removed request for yinghai June 5, 2023 15:36
@gs-olive gs-olive changed the title fix: Allow FX convolution layers to take bias inputs fix/feat: Move convolution core to impl + add feature (FX converter refactor) Jun 5, 2023
@gs-olive gs-olive requested a review from apbose June 5, 2023 20:17
Comment on lines +60 to +64
```python
# Process bias terms
if isinstance(bias, torch.Tensor):
    # Transform the bias constant into a Numpy array
    bias = to_numpy(bias)

elif isinstance(bias, TRTTensor):
    bias = get_trt_tensor(network, bias, f"{name}_bias")
```

gs-olive (Collaborator, Author):

I did not add an unsqueeze operation to the bias term, since the TRT requirement for the bias is that it have a number of elements equal to the number of output features of the convolution; the same bias used for Conv1D would therefore work for Conv2D, as the number of output features is fixed.

Contributor:

Just to clarify, do you mean 1D or 2D conv? For 1D, we need the bias to be unsqueezed.

gs-olive (Collaborator, Author) commented Jun 9, 2023:

I initially meant all conv layers, since this documentation seems to indicate we just need the number of elements in the bias Tensor to be correct, not necessarily the dimensions; but if the bias needs to be unsqueezed for 1D, I can add that functionality back. I am wondering whether the intended unsqueeze should be in the first dimension (torch.unsqueeze(bias, 0)) or the last dimension (torch.unsqueeze(bias, -1))?

Note: I think initially, it was torch.unsqueeze(bias, 0), while the weights and inputs were unsqueezed in the last dimension
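
For reference, the two options produce different shapes but the same element count; a small illustration (the out_channels value of 6 is arbitrary and chosen only for this example):

```python
import torch

# Hypothetical Conv1d bias with out_channels = 6.
bias = torch.randn(6)                    # shape: torch.Size([6])
print(torch.unsqueeze(bias, 0).shape)    # torch.Size([1, 6])
print(torch.unsqueeze(bias, -1).shape)   # torch.Size([6, 1])
# Either way the element count stays at 6, matching the number of output
# features, which is the TRT requirement cited above.
```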

Contributor:

I am wondering if TRT does the broadcast internally, since the unit test for 1D works well even though you did not unsqueeze it.

gs-olive (Collaborator, Author):

I verified on a small sample that Conv1D with bias compiles and runs inference successfully without unsqueezing the bias term.
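
The exact sample is not included in the thread; a minimal check along these lines, assuming the FX lowering entry point torch_tensorrt.fx.compile with default lowering settings, would be:

```python
import torch
import torch_tensorrt

# Assumed minimal reproduction, not the author's script: a Conv1d with bias,
# lowered through the FX path and compared against eager PyTorch.
model = torch.nn.Conv1d(3, 6, kernel_size=3, bias=True).eval().cuda()
inputs = [torch.randn(1, 3, 32).cuda()]

trt_model = torch_tensorrt.fx.compile(model, inputs)

with torch.no_grad():
    ref = model(*inputs)
    out = trt_model(*inputs)

# The engine building without error and the outputs matching closely is the
# behavior reported in the comment above.
print(torch.max(torch.abs(ref - out)))
```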

Collaborator:

Should we have a unit test for this just to catch if TRT behavior changes?

gs-olive (Collaborator, Author) commented Jun 22, 2023:

I could add one, but it would likely be very similar to this case, which interprets, builds, and runs inference on a Conv1D model with TRT both with and without bias:

```python
class TestConvolutionConverter(AccTestCase):
    @parameterized.expand(
        [
            ("default", 1),
            param("no_bias", 1, bias=False),
            ("tuple_parameters", 1, (1), (1)),
            param("non_zero_padding", 1, padding=1),
            param("dilation", 1, dilation=2),
            param("groups", 1, groups=3),
        ]
    )
    def test_conv1d(
        self,
        _,
        kernel_size,
        stride=1,
        padding=0,
        dilation=1,
        groups=1,
        bias=True,
    ):
        class TestModule(torch.nn.Module):
            def __init__(self):
                super().__init__()
                self.conv = torch.nn.Conv1d(
                    3, 6, kernel_size, stride, padding, dilation, groups, bias
                )

            def forward(self, x):
                return self.conv(x)

        inputs = [torch.randn(1, 3, 32)]
        self.run_test(
            TestModule(),
            inputs,
            expected_ops={acc_ops.conv1d},
            test_explicit_precision=True,
        )
```

A breaking TRT change to that specific case should cause the accuracy check in the above test to fail.

@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch 2 times, most recently from a6b5e6c to a21d778 Compare June 21, 2023 16:36
@narendasan narendasan (Collaborator) left a comment:

Mostly organization stuff

py/torch_tensorrt/fx/converters/convolution.py (two review comments, outdated and resolved)
@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch from a21d778 to 69e8d33 Compare June 22, 2023 23:14
@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch from 69e8d33 to de9938e Compare June 23, 2023 03:04
@gs-olive gs-olive requested a review from narendasan June 23, 2023 03:04
@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch from de9938e to b7eea6f Compare June 23, 2023 04:06
@gs-olive gs-olive added the `WIP` (work is in progress, pull request should not be merged yet) label Jun 23, 2023
@github-actions github-actions bot requested a review from wushirong June 23, 2023 04:57
@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch from b7eea6f to 0f5be88 Compare June 23, 2023 05:01
@gs-olive gs-olive removed the `WIP` label Jun 23, 2023
- Centralize convolution implementation in FX, shared across all source
IRs (aten, acc, nn)
- Enable pass-through of build errors in e2e tests to ensure errors are
not being hidden
- Allow conv layers to take bias inputs in FX, per new functionality
from TRT
- Remove separate `convolution.py` file and centralize `nn` converters
to a single file
@gs-olive gs-olive force-pushed the enable_build_failures_e2e branch from 0f5be88 to 834064e Compare June 23, 2023 05:09
gs-olive (Collaborator, Author) commented Jun 26, 2023

Hi @wushirong, thank you for the review. I was wondering if you could have another look at the changes: in response to review comments from @narendasan, I've moved the convolution.py contents to nn_ops_converters.py for code cleanliness and organization, and removed unused imports.

Labels
cla signed, component: api [Python], component: dynamo, component: fx, fx, Story: Dynamo Compile Improvements

Projects
None yet

Development
Successfully merging this pull request may close these issues:

✨[Feature] + 🐛 [Bug] Allow ITensor biases in aten.convolution converters

5 participants