fix wrong quantization target in weight quantization #4038
Conversation
```diff
@@ -221,15 +220,13 @@ def quantize_input(self, *inputs, wrapper, **kwargs):
         self.record(wrapper, 'input', inputs)
         return inputs

-    def quantize_weight(self, wrapper, **kwargs):
+    def quantize_weight(self, weight, wrapper, **kwargs):
```
Could you explain more about this change? `weight` can be obtained from `wrapper`, so why pass it again?
It's about code readability. Since we already pass the weight to be quantized (`new_weight`) to `quant_grad` here, it is better to use it directly instead of obtaining it from the wrapper. The developer can then easily see that `quantize_weight` is a big op that simulates quantization and takes the origin/BN-folded weight as input. I think using `wrapper.weight` would make it harder to understand the structure of the training graph.
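For concreteness, here is a minimal, self-contained sketch of the data flow described above. `DummyQuantizer` and `DummyWrapper` are illustrative stand-ins, not NNI's actual classes: the wrapper prepares the weight to be quantized (the `new_weight` that would go to `quant_grad`) and hands it to `quantize_weight` directly, so the quantizer never has to reach back into the wrapper for it.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class DummyQuantizer:
    def quantize_weight(self, weight, wrapper, **kwargs):
        # `weight` is exactly the tensor prepared by the wrapper's forward
        # (origin or BN-folded weight); no need to read it from the wrapper.
        scale = weight.abs().max() / 127 + 1e-12
        return torch.clamp(torch.round(weight / scale), -128, 127) * scale


class DummyWrapper(nn.Module):
    def __init__(self, module, quantizer):
        super().__init__()
        self.module = module
        self.quantizer = quantizer

    def forward(self, x):
        # new_weight is the weight actually used in this forward pass;
        # it is passed to quantize_weight directly.
        new_weight = self.module.weight
        quantized_weight = self.quantizer.quantize_weight(new_weight, self)
        return F.linear(x, quantized_weight, self.module.bias)


wrapper = DummyWrapper(nn.Linear(4, 2), DummyQuantizer())
print(wrapper(torch.randn(1, 4)).shape)  # torch.Size([1, 2])
```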
LGTM. Since this PR also changes …
Have added a unit test about the interface of …
This PR contains two things:
1. Pass the weight to be quantized to `quantize_weight` directly, instead of obtaining it from the wrapper (see the diff above).
2. Since the weight actually used in the forward pass (e.g. the BN-folded weight) lives in `module.weight`, the QAT quantizer should quantize `module.weight` instead of `module.old_weight`.
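As an illustration of the `module.weight` vs. `module.old_weight` point, the sketch below mimics, in simplified form and with assumed attribute handling rather than NNI's actual wrapper code, how the original parameter can be kept aside while `weight` is replaced by the tensor used in the forward pass; quantizing `old_weight` would miss that replacement entirely.

```python
import torch
import torch.nn as nn

conv = nn.Conv2d(3, 8, 3)

# Keep the original fp32 parameter under a different name ...
conv.old_weight = nn.Parameter(conv.weight.detach().clone())
del conv._parameters['weight']

# ... and make `weight` a plain tensor refreshed before each forward pass,
# e.g. with BatchNorm statistics folded in (a scalar stand-in here).
conv.weight = conv.old_weight.detach() * 1.1

# The forward pass (and therefore inference) sees conv.weight, so that is
# the tensor QAT should quantize; conv.old_weight no longer matches it.
print(torch.equal(conv.weight, conv.old_weight))  # False
```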