This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

fix(speedup): refactor the execution logic of functions in speedup #5107

Closed
wants to merge 9 commits into from

Conversation

@Louis-J Louis-J (Contributor) commented Sep 2, 2022

Description

In speedup, we use a non-general trick to handle the prim ops: we analyze the relationships between the prim ops, delete all of them, and finally connect the non-prim ops directly. We also assume that every intermediate variable is a tensor, a tuple of tensors, or a list of tensors. These assumptions cause many problems in various models.

So it is better to actually execute all the prim ops. In this PR, I remove the deletion of prim ops and execute them recursively. The assumption about variable types is also removed, so the behavior is now closer to the original execution.

However, this changes a lot of code and is untested for now. More work is needed.
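The "execute the prim ops for real, recursively" idea can be sketched with a toy graph. This is purely illustrative under assumed names: the `Node` class, `execute` function, and op handling below are hypothetical stand-ins, not the PR's actual code.

```python
# Illustrative sketch only (hypothetical names, not NNI code): instead of
# deleting prim ops and rewiring around them, evaluate each node's prim-op
# inputs recursively, so intermediate values keep arbitrary Python types.

class Node:
    def __init__(self, op, inputs=(), value=None):
        self.op = op              # a prim-op kind string, or a real callable
        self.inputs = list(inputs)
        self.value = value        # cached result once executed

def execute(node):
    """Recursively execute a node, evaluating its inputs first."""
    if node.value is not None:
        return node.value
    args = [execute(i) for i in node.inputs]
    if node.op == 'prim::TupleConstruct':
        node.value = tuple(args)          # prim op executed for real
    elif node.op == 'prim::ListConstruct':
        node.value = list(args)
    elif callable(node.op):               # a real module/function call
        node.value = node.op(*args)
    return node.value

# usage: a prim::TupleConstruct feeding a real op
leaf = Node('const', value=3)
tup = Node('prim::TupleConstruct', inputs=[leaf, Node('const', value=4)])
out = Node(lambda t: sum(t), inputs=[tup])
```

Because the tuple is actually constructed rather than elided, the downstream op receives the same Python type it would see in the original execution.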

#5097

Test Options

  • fast test
  • full test - HPO
  • full test - NAS
  • full test - compression

Checklist

  • test case
  • doc

How to test

@@ -105,4 +105,5 @@ In addition, for the convolutional layers that have more than one filter group,
``dependency-aware pruner`` will also try to prune the same number of channels for each filter group.
Overall, this pruner prunes the model according to the L1 norm of each filter and tries to meet the topological constraints (channel dependency, etc.) to improve the final speed gain after the speedup process.

Operations that will be recognized as having channel dependencies: add/sub/mul/div, addcmul/addcdiv, logical_and/or/xor
Contributor:

.?
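As background for the doc line above, the channel dependency created by elementwise ops such as add/sub/mul/div can be illustrated with a toy shape check. This is an illustration only, not NNI's dependency-analysis code; the function name is hypothetical.

```python
# Illustrative sketch (not NNI code): elementwise ops require operands with
# matching channel counts, so the conv layers producing those operands must
# keep the same pruned channels -- they form one dependency set.

def elementwise_shapes_compatible(shape_a, shape_b):
    """Shapes are (N, C, H, W); add/sub/mul/div etc. need matching C."""
    return shape_a[1] == shape_b[1]

# Two 64-channel feature maps can be added; if one conv is pruned to 30
# channels and the other to 32, the add breaks -- hence the dependency.
ok = elementwise_shapes_compatible((1, 64, 8, 8), (1, 64, 8, 8))
broken = elementwise_shapes_compatible((1, 30, 8, 8), (1, 32, 8, 8))
```

This is why the pruner must prune such dependent layers jointly rather than independently.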

@@ -725,8 +717,6 @@ def _build_graph(self):

# associate module name with their trace graph nodes
for node in graph.nodes():
if node.kind() == CONSTANT_KIND:
continue
Contributor:

When should we skip `CONSTANT_KIND` and when should we not? I found that you did not remove all of the `if node.kind() == CONSTANT_KIND:` checks in the code.
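One way to avoid scattering (and missing) the constant-kind check across traversal sites would be a single filtering helper. This is only a sketch of that suggestion with duck-typed fake nodes; `iter_real_nodes` is a hypothetical name, not part of the PR.

```python
# Hedged sketch: centralize the CONSTANT_KIND skip in one generator so every
# traversal site filters constants the same way (names are hypothetical).

CONSTANT_KIND = 'prim::Constant'

def iter_real_nodes(nodes):
    """Yield only non-constant nodes from a graph traversal."""
    for node in nodes:
        if node.kind() == CONSTANT_KIND:
            continue
        yield node

# usage with duck-typed stand-ins for torch._C.Node
class FakeNode:
    def __init__(self, kind):
        self._kind = kind
    def kind(self):
        return self._kind

nodes = [FakeNode('prim::Constant'), FakeNode('aten::add'), FakeNode('prim::Constant')]
kept = [n.kind() for n in iter_real_nodes(nodes)]
```

With such a helper, the decision of when constants are skipped lives in one place instead of being repeated at every loop.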

if node.type == 'module':
inputs_name = node.inputs
else:
inputs_name = [val_node.debugName() for val_node in node.key_node.inputs()]
Contributor:

What is `key_node`? Maybe we need a doc explaining the important attributes of a node.

elif isinstance(obj, dict):
return {k: recr_detacher(v) for k, v in obj.items()}
else:
return obj
Contributor:

Can `obj` be a customized data type that has a `detach` function? For example:

try:
    return obj.detach()
except AttributeError:
    return obj
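Combining the diff's container handling with the duck-typed `detach` suggested above gives a complete recursive detacher. This is a sketch of the combined idea, not the PR's final code; the behavior for custom detachable types is an assumption based on the review suggestion.

```python
# Hedged sketch: recursive detacher covering lists, tuples, dicts, tensors,
# and any custom object exposing detach() (per the review suggestion).

def recr_detacher(obj):
    if isinstance(obj, (list, tuple)):
        # preserve the container type while detaching each element
        return type(obj)(recr_detacher(v) for v in obj)
    if isinstance(obj, dict):
        return {k: recr_detacher(v) for k, v in obj.items()}
    try:
        return obj.detach()   # tensors and custom detachable types
    except AttributeError:
        return obj            # plain Python values pass through unchanged

# usage with a duck-typed stand-in for a detachable object
class FakeTensor:
    def detach(self):
        return 'detached'

result = recr_detacher({'a': [1, FakeTensor()], 'b': (2,)})
```

Note the try/except makes the check duck-typed, so any object with a `detach` method is handled without an explicit `isinstance(obj, torch.Tensor)` test.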

@@ -297,16 +318,15 @@ def update_indirect_sparsity(self, node):
debug_name = auto_infer.input_debugname[in_id]

last_output = self.internal_result[debug_name]
# if isinstance(last_output, torch.Tensor):
# TODO what if last output is tuple/list of tensor
Contributor:

So what if the last output is a tuple/list of tensors?
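One possible answer to the TODO is to recurse over nested containers so that each tensor leaf receives the same per-tensor treatment. This is an illustrative sketch under assumed names (`apply_to_tensors`, the duck-typed tensor check), not the PR's implementation.

```python
# Hedged sketch: apply a per-tensor update to every tensor-like leaf inside
# arbitrarily nested lists/tuples, leaving other values untouched.

def apply_to_tensors(obj, update_one):
    if isinstance(obj, (list, tuple)):
        return type(obj)(apply_to_tensors(v, update_one) for v in obj)
    if hasattr(obj, 'detach'):        # duck-typed "is a tensor" stand-in
        return update_one(obj)
    return obj

# usage with duck-typed fake tensors
class FakeTensor:
    def __init__(self, v):
        self.v = v
    def detach(self):
        return self

seen = []
apply_to_tensors(
    [FakeTensor(1), (FakeTensor(2), 'meta')],
    lambda t: seen.append(t.v) or t,   # record each tensor leaf visited
)
```

With this shape, the single-tensor case and the tuple/list case share one code path instead of needing the commented-out `isinstance` special case.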

@Louis-J Louis-J added v3.0 and removed v2.9.1 labels Oct 19, 2022
@liuzhe-lz liuzhe-lz added v3.1 and removed v3.0 labels Nov 23, 2022
@Louis-J Louis-J closed this Dec 6, 2022
@Louis-J Louis-J (Contributor, Author) commented Dec 6, 2022

I totally refactored the old speedup in https://github.com/Louis-J/nni/tree/refactor_speedup.

@Louis-J Louis-J (Contributor, Author) commented Dec 6, 2022

Speedup v2 is now runnable, so it is better to reassess whether refactoring the old speedup is still suitable.

@Lijiaoa Lijiaoa (Contributor) commented Feb 16, 2023

#5143
