feat(compression): implement tensor decompression in op fully_connected #3006
Conversation
Implement tensor decompression in op fully_connected. Extend tests to validate operation on compressed tensors. Fix the effect on memory_arena_threshold_test by setting a different expected value for the persistent buffer allocation when compression is configured in. The allocation was allowed to vary by 3%; however, compression adds ~10%. Set the expected value to the measured value when compression is configured in. BUG=part of tensorflow#2636
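For illustration, a minimal sketch of how that expected-value switch could look in the threshold test (the constant name and byte counts below are placeholders, not the figures from memory_arena_threshold_test.cc):

#ifdef USE_TFLM_COMPRESSION
// Placeholder value: the allocation measured with compression compiled in.
// The ~10% overhead from compression metadata exceeds the test's 3%
// tolerance, so the expected value is pinned to the measurement.
constexpr size_t kExpectedPersistentBufferSize = 11400;
#else
// Placeholder value: the baseline allocation without compression.
constexpr size_t kExpectedPersistentBufferSize = 10400;
#endif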
The HiFi compression tests seem to fail for this.
@suleshahid @rkuester These tests pass in my local environment, but my environment is not identical to the compress-testing branch or this PR. I found two files with a merge error in the compress-testing branch: fully_connected_test.cc and transpose_conv_test.cc. They had a templated method that should not have been templated. However, I don't see how that would cause the test to fail. I have updated the compress-testing branch with the correctly merged code. I diffed compress-testing against my own branch and found no other relevant differences.
That still failed. I'm in the process of testing my ultimate destination branch in CI to see whether the failure is due to something I've left out of this PR's commit.
@rkuester The test script requires the optimized kernels, but the PR does not have compression support for the optimized kernels.
I see. That's a surprising way to fail: build and runtime success, but wrong results. Is there a way we can transform this into a failure to build (an extension that should support compression having a new signature or something), or at least a better failure at runtime (an extension noticing that the length of the tensor doesn't make sense, etc.)?
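To make the runtime variant of that suggestion concrete, here is a minimal sketch (the function and its parameters are hypothetical, not part of this PR): an extension could verify that the byte length of the weights buffer matches the tensor's shape before using it, so a still-compressed buffer fails fast instead of silently producing wrong results.

#include <cstddef>

// A decompressed buffer must hold exactly num_elements * element_size
// bytes; a still-compressed buffer is smaller, so this check converts
// silent wrong results into an early, visible failure.
inline bool TensorLengthIsPlausible(std::size_t buffer_bytes,
                                    std::size_t num_elements,
                                    std::size_t element_size) {
  return buffer_bytes == num_elements * element_size;
}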
@@ -115,9 +143,18 @@ TfLiteStatus FullyConnectedEval(TfLiteContext* context, TfLiteNode* node) {
          tflite::micro::GetTensorShape(input),
          tflite::micro::GetTensorData<float>(input),
          tflite::micro::GetTensorShape(filter),
#ifdef USE_TFLM_COMPRESSION
          tflite::micro::GetTensorData<float>(micro_context, filter,
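(For context, the diff view truncates this call; it continues with the compression metadata and scratch-buffer arguments, roughly as below. The argument names are assumptions based on this branch's compression API, not verbatim from the diff.)

          tflite::micro::GetTensorData<float>(micro_context, filter,
                                              weights_comp_td,
                                              data.weights_scratch_index),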
Catching this a bit late, but in the new GetTensorData function, we check and return nullptr if the tensor is nullptr. However, in the existing code, it's a TFLITE_DCHECK, which means we will fail here, since we expected data. I think it's better to do it that way, because otherwise we could end up trying to use the nullptr somewhere.
This is valid for cases where the bias tensor is an optional input to the operator.
In that case I would prefer that we have another overload for the optional case. Since this basically changes the code even for non-compression execution (in the case where compression is globally enabled), it will be safer to keep the nullptr DCHECK.
If you think that will take too long, for now we could just add a single DCHECK for the weights data in the #ifdef compression code above.
Seeing how many files use it, it's probably easiest to just add the GetOptionalTensorData overload.
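For concreteness, a minimal sketch of that overload, assuming the compression-aware GetTensorData(micro_context, tensor, compression_data, scratch_buffer_handle) in this branch DCHECKs on a null tensor (the signature is assumed, not copied from the final code):

template <typename T>
const T* GetOptionalTensorData(MicroContext* micro_context,
                               const TfLiteEvalTensor* tensor,
                               const CompressionTensorData* compression_data,
                               int scratch_buffer_handle) {
  // Optional inputs (e.g. the FULLY_CONNECTED bias) may legitimately be
  // absent: return nullptr rather than tripping the required-tensor DCHECK.
  return tensor == nullptr
             ? nullptr
             : GetTensorData<T>(micro_context, tensor, compression_data,
                                scratch_buffer_handle);
}

Required tensors would keep going through GetTensorData and its DCHECK, so non-compression execution paths stay unchanged.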
in progress.
# override KERNEL_OPTIMIZATION_LEVEL to enable higher performance
# Xtensa intrinsics.
$(KERNEL_OBJDIR)$(TENSORFLOW_ROOT)tensorflow/lite/micro/kernels/xtensa/decompress.o: $(TENSORFLOW_ROOT)tensorflow/lite/micro/kernels/xtensa/decompress.cc
Curious if it's not sufficient to put this in MICROLITE_CC_KERNEL_SRCS? I understand that a dedicated rule was required when the decompression routine was in micro_context.cc, but I'm not sure if it's needed now.
Cadence requires us to use special compile options just for this one file. Using -O3 on all kernels does NOT work.
@ddavis-2015, I'll let you comment on these reviews. (Note: I'm mostly passing through the C++ code on David's behalf at this point.)
Fixes for FULLY_CONNECTED optional bias tensor.
Implement tensor decompression in op fully_connected. Extend
tests to validate operation on compressed tensors.
Include Xtensa extensions for decompression, as they are required
by the tests.
Fix the effect on memory_arena_threshold_test by setting a
different expected value for the persistent buffer allocation
when compression is configured in. The allocation was allowed to
vary by 3%; however, compression adds ~10%. Set the expected
value to the measured value when compression is configured in.
BUG=part of #2636