Add Q16x8 Depthwise Conv Support #2140
Conversation
Adds support for 16-bit activations + 8-bit weights for depthwise convolution in the reference kernel. Uses 64-bit bias to match TFLite. Tested: depthwise_conv_test
For ARC, CEVA, and Xtensa, add a reference fallback for q16x8 depthwise convolution. This allows these platforms to run the same models and fixes the failing tests.
The Xtensa depthwise conv was correct for eval, but the HiFi prepare expects 8-bit IO. Add an early exit in the prepare to skip the HiFi and P6 prepare paths (those evals will never be called on q16 data). BUG=2141
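The early-exit shape described above can be sketched like this. The stub types and the `ran_hifi_prepare` flag are stand-ins for illustration only; the real signatures live in TFLite Micro's headers:

```cpp
#include <cassert>

// Minimal stand-ins so the sketch is self-contained.
enum TfLiteStatus { kTfLiteOk, kTfLiteError };
enum TfLiteType { kTfLiteInt8, kTfLiteInt16 };

// Hypothetical Prepare: after the common depthwise-conv prepare has run,
// return early for int16 input so the HiFi/P6-specific prepare (which
// expects 8-bit IO) is skipped. Int16 eval falls back to the reference
// kernel, so the platform-specific prepare is never needed for q16 data.
TfLiteStatus XtensaPrepare(TfLiteType input_type, bool* ran_hifi_prepare) {
  // ... common DepthwiseConvPrepare would run here ...
  if (input_type == kTfLiteInt16) {
    *ran_hifi_prepare = false;
    return kTfLiteOk;  // reference kernel handles q16x8 eval
  }
  *ran_hifi_prepare = true;  // HiFi/P6 prepare for 8-bit IO
  return kTfLiteOk;
}
```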
LGTM, just one minor comment clarification and will approve.
@@ -48,6 +48,16 @@ void* Init(TfLiteContext* context, const char* buffer, size_t length) {

TfLiteStatus Prepare(TfLiteContext* context, TfLiteNode* node) {
  TF_LITE_ENSURE_OK(context, DepthwiseConvPrepare(context, node));
  // Use only the default depthwise convolution for int16 input.
Can we expand this comment a bit to explain why we only need to do the default for int16? Specifically, we should mention that we fall back to the reference kernels for int16, so there's no need to call the Xtensa-specific Prepare methods.
Thanks for the review. Updated. I once again forgot the BUG line on this commit, but it should work overall with a squash commit (using the top-level message above).
Adds support for 16-bit activations + 8-bit weights for depthwise convolution in the reference kernel. Uses 64-bit bias to match TFLite. Also adds a passthrough to the q16x8 reference kernel for Xtensa, CEVA, and ARC (CMSIS already has its own implementation).
Tested:
depthwise_conv_test
BUG=2141