[microNPU] Add support for TFLite PAD #13732
Conversation
A separate nn.pad relay operator is legalized to an Ethos-U depthwise_conv2d operator. For ethosu_depthwise_conv2d the hardware only supports padding up to [31, 31, 32, 32], so a pad is only legalized on the NPU when its size is within these limits.
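A minimal sketch of the limit check this implies (the helper name and the (top, left, bottom, right) ordering of the limits are assumptions for illustration, not the actual TVM pass):

def pad_fits_npu_limits(pad_width):
    """pad_width is NHWC-style: ((0, 0), (top, bottom), (left, right), (0, 0))."""
    top, bottom = pad_width[1]
    left, right = pad_width[2]
    # Assumed hardware bounds: at most 31 before (top/left) and
    # at most 32 after (bottom/right) on each spatial axis.
    return top <= 31 and left <= 31 and bottom <= 32 and right <= 32

For example, pad_fits_npu_limits(((0, 0), (1, 1), (2, 2), (0, 0))) is True and such a pad can be legalized, while a top pad of 40 would keep the operator on the CPU.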
Thanks @alexey-yazev, I think it's good to go, but I'll leave it open for a little longer in case anybody else wants to have a look.
channels_map = {
    "NHWC": 3,
}
IIRC this one-entry channels_map is a historic relic that could go and simplify the code a little bit, but as it is present in other operators, it's probably a clean-up task for some other time.
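A rough before/after of that suggested clean-up (the surrounding code is assumed for illustration, not taken from this PR):

# before: a dict lookup that only ever has one key
channels_map = {"NHWC": 3}
channel_axis = channels_map["NHWC"]

# after: NHWC is the only supported layout here, so the axis is a constant
channel_axis = 3  # channels are the last axis in NHWC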
Thanks @alexey-yazev, LGTM! After looking at how padding is lowered in Vela I think there might be a couple more opportunities for optimization, although it seems out of scope for this PR. Just a couple of things to consider in the future:
- In some cases it's possible to fuse an nn.pad with the following operation. As an example, we currently fuse nn.pad -> qnn.conv2d ([microNPU] Optimize separate padding operation for conv2d #11468); however, it seems a similar approach is also possible for average pooling (see: https://git.mlplatform.org/ml/ethos-u/ethos-u-vela.git/tree/ethosu/vela/tflite_graph_optimiser.py#n1413).
- With the current implementation nn.pad does not get offloaded if the provided padding exceeds [31, 31, 32, 32]. If these dimensions are exceeded, we might be able to use multiple average pooling operations, similar to https://git.mlplatform.org/ml/ethos-u/ethos-u-vela.git/tree/ethosu/vela/tflite_graph_optimiser.py#n1500 (a rough sketch of the splitting idea follows below).
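A hedged sketch of the splitting idea from the second bullet (illustrative only; the helper is hypothetical and not part of this PR): a pad that exceeds a per-side hardware limit could be decomposed into a chain of pads that each fit, each of which then legalizes on its own.

def split_pad(amount, limit):
    """Split one per-side pad amount into chunks no larger than limit."""
    chunks = []
    while amount > 0:
        step = min(amount, limit)
        chunks.append(step)
        amount -= step
    return chunks

# e.g. a top pad of 70 with a per-side limit of 31 becomes three stacked operations:
assert split_pad(70, 31) == [31, 31, 8]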
Thanks @alexey-yazev, @ekalda!
Oops, I forgot to ask, would it be possible to add a legalization test under:
Hello @lhutton1, thanks for the review!
Yes, this was the first option we tried to implement. But in the Vela implementation this is done by "copies IFM to the right place inside the OFM" using the write_offset attribute of the created AvgPool operation. In TVM, the Vela API operations are derived from the NpuOperation class, which does not have a write_offset attribute, so we cannot replicate Vela's convert_pad() function. We also tried to implement PAD legalization using the Concatenate operation but encountered an error; it seems the cascader must be turned off for Concatenate to work. For example, the cascader is disabled in test_tflite_concat() (if the cascader is enabled, we get the same error as with Concatenate). So far, the most feasible option seems to be using several depthwise_conv2d operators if the padding exceeds [31, 31, 32, 32]. But of course, I do not have all the knowledge about this; maybe there are other options?
The test was added in the PR.
Thanks @arina-grovety for the explanation, just following up on some of the questions... I suspect this is a case of needing to expose this functionality from within Vela; I'll see if we can make this happen for a future Vela release. The concatenate error does indeed sound like a separate issue in itself. It might be worth investigating the reason for that at some point.
cc @leandron, @ekalda, @lhutton1