Support the Resnet/Squeezenet/Mobilenet for speedup #2579

zheng-ningxin · 2020-06-19T05:35:25Z

In this pr, the speedup module will support the add/cat operations and the convolution layers that have more than 1 group. I have tested the speedup module on the resnet18, squeezenet1_1, and mobilenetv_2 and it works fine.

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

model should be set to eval mode before the jit.trace call. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

In the original way, addmm will also triger the dependency set searching, which may lead to a wrong dependency set. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

…p_conflict

The name of the node is not a unique identifier globally. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

mask_conflict can fix the mask conflict of the layers that has channel dependency. This part should be called before the speedup function, so that, the speedup module can handle the model with residual connection/concat operations. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update the interface. if we alreay have the traced graph of the model we donnot need to trace the model again. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add unittest for tools in analysis_utils to verify the correctness of the visulization, channel dependency, and mask conflict. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

…is_utils

zheng-ningxin · 2020-06-19T09:41:15Z

I find another problem in the TorchModuleGraph (#2581). It may be too late to fix this problem before this release(code freeze at 6.22), but fortunately, there are not many models with this problem. I'll try to fix it in the next release. Thanks~

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

chicm-ms · 2020-06-21T06:51:54Z

src/sdk/pynni/tests/test_compression_utils.py

@@ -11,13 +11,13 @@

 from nni.compression.torch import L1FilterPruner


Would you please add test cases to verify the model speedup correctness for resnet, squeezenet and mobilenet like this test case?
https://github.com/microsoft/nni/blob/master/src/sdk/pynni/tests/test_model_speedup.py#L106

QuanluZhang · 2020-06-22T02:32:08Z

src/sdk/pynni/nni/_graph_utils.py

+        input_shapes = [t.type().sizes() for t in input_tensors]
+        cat_info['in_shape'] = input_shapes
+        return cat_info
+
    def _extract_shape_info(self, node):


this function is written by me, it is only for view (pretty limited). maybe we can generalize this function to extract different module's shape if needed in future.

QuanluZhang · 2020-06-22T02:53:15Z

src/sdk/pynni/nni/_graph_utils.py

+        Returns
+        -------
+        dict
+            Include auxiliary information for the cat operation.


might be better to explain the content of the dict

QuanluZhang · 2020-06-22T03:06:44Z

src/sdk/pynni/nni/_graph_utils.py

+        # after the build_index function.
+        input_order = []
+        list_construct_cpp = list(cpp_node.inputs())[0].node()
+        input_tensors = list(list_construct_cpp.inputs())


so the order of tensors returned by .inputs() is the order of input arguments?

According to my observation and experimental results, yes it is.
However, because jit itself lacks documentation, I have no documentation to support this point.
I will read the source code of jit and double check it.

QuanluZhang · 2020-06-22T04:34:17Z

src/sdk/pynni/nni/compression/torch/speedup/compressor.py

        self.torch_graph = build_module_graph(model, dummy_input)

-    def infer_module_mask(self, module_name, mask=None, in_shape=None, out_shape=None):
+    def infer_module_mask(self, module_name, last_module, mask=None, in_shape=None, out_shape=None):
        """


please update docstring accordingly

QuanluZhang · 2020-06-22T07:37:54Z

src/sdk/pynni/nni/compression/torch/utils/mask_conflict.py

+
+    Parameters
+    ----------
+    model : torch.nn.Module


please use consistent order with input arguments

QuanluZhang · 2020-06-22T08:05:39Z

src/sdk/pynni/nni/compression/torch/utils/mask_conflict.py

+        """
+        torch.save(self.masks, path)
+
+class CatMaskPadding(MaskFix):


could you explain the logic of cat mask padding in docstring to deliver the high level idea of how the conflict is resolved?

QuanluZhang · 2020-06-22T08:09:55Z

src/sdk/pynni/nni/compression/torch/utils/mask_conflict.py

+                # no layer is pruned
+                continue
+            elif count == len(layers):
+                # all the layers have been pruned


even all the layers have been pruned, is it possible their masks are still not consistent?

QuanluZhang · 2020-06-22T08:13:18Z

src/sdk/pynni/nni/compression/torch/utils/mask_conflict.py

+            for layer in layers:
+                module = name_to_module[layer]
+                w_shape = module.weight.data.size()
+                w_mask = torch.ones(w_shape).to(device)


so the mask is all ones?

Cat concatenates the input masks as the output mask, so when part of the input layers are not pruned, we still need to pass the masks of these not-pruned layers(all ones) to the cat operation to ensure the shape of the final output mask is right.

QuanluZhang · 2020-06-22T08:18:21Z

src/sdk/pynni/nni/compression/torch/utils/mask_conflict.py

-        graph : torch._C.Graph
+        masks : dict
+            a dict object that stores the masks
+        graph : torch._C.torch.jit.TopLevelTracedModule


inconsistent order with input arguments

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

src/sdk/pynni/nni/compression/torch/speedup/compress_modules.py

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Ningxin added 30 commits May 14, 2020 01:26

Add analysis tools for sensitivity and topology.

0a4b7b0

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Reformat the code and add several small new features.

712d982

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add the flops information rendering for the visulization.

202593c

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add the depedency rendering feature.

8a7a799

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Update the interface of the SensitivityAnalysis

362441a

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add sensitivity rendering feature.

e69e78f

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add copyright and license.

5823276

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Remove the unrelated files.

4a70d79

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Fix some typos.

fc95dd7

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Fix a small issue.

a90c35e

model should be set to eval mode before the jit.trace call. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Fix a small issue.

1909ff0

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Fix bug.

2d13dda

In the original way, addmm will also triger the dependency set searching, which may lead to a wrong dependency set. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add compatibility with versions prior to torch-1.4.0.

0e79624

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Update shape_dependency.

d998d17

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Merge branch 'master' of https://github.com/microsoft/nni into speedu…

1c1eb89

…p_conflict

Find a bug in _graph_utils.

5181fe8

The name of the node is not a unique identifier globally. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Update the interface.

6029603

update the interface. if we alreay have the traced graph of the model we donnot need to trace the model again. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add unit test for analysis_utils.

9beb1e2

Add unittest for tools in analysis_utils to verify the correctness of the visulization, channel dependency, and mask conflict. Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Fix the format warnings from pylint.

6b25ff3

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add dependencies.

d0bda49

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

comment the visualization test temporarily.

4154cf0

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update

83f0b26

Skip the test when the torch version is too old.

388056c

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update

ccbcc6c

update according to the review comments.

4ce8255

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update according to review comments.

0f70f67

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add docs for analysis_utils.

2eac259

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update rst

810f20e

Merge branch 'master' of https://github.com/microsoft/nni into analys…

dcdc736

…is_utils

Ningxin added 3 commits June 19, 2020 12:21

fix pylint errors.

dfc9d2e

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update

a84dab7

update doc.

5d003e8

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

This was linked to issues Jun 20, 2020

pruned model size no change and inference time is even longer #2225

Closed

Support for more architecture and functions #2335

Closed

QuanluZhang mentioned this pull request Jun 20, 2020

Support for more architecture and functions #2335

Closed

chicm-ms reviewed Jun 21, 2020

View reviewed changes

QuanluZhang reviewed Jun 22, 2020

View reviewed changes

QuanluZhang approved these changes Jun 22, 2020

View reviewed changes

Ningxin added 5 commits June 23, 2020 03:51

Add the integration test for the speedu module.

93bcb36

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

add support for adaptive_avg_pool2d

5c1acad

add the support for reshape operation.

4c0334e

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

update the doc string.

70573f8

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

Add the absolute threshold for the unitest.

682a351

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

chicm-ms reviewed Jun 23, 2020

View reviewed changes

src/sdk/pynni/nni/compression/torch/speedup/compress_modules.py Show resolved Hide resolved

Ningxin added 3 commits June 24, 2020 00:03

Add assert to make sure input channels are evenly pruned.

221329a

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

mute the progress bar when download the pretrained model.

0cdd850

Signed-off-by: Ningxin <Ningxin.Zheng@microsoft.com>

use a small batch size

e11c36d

chicm-ms approved these changes Jun 24, 2020

View reviewed changes

chicm-ms merged commit e6817d2 into microsoft:master Jun 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support the Resnet/Squeezenet/Mobilenet for speedup #2579

Support the Resnet/Squeezenet/Mobilenet for speedup #2579

zheng-ningxin commented Jun 19, 2020

zheng-ningxin commented Jun 19, 2020

chicm-ms Jun 21, 2020

zheng-ningxin Jun 22, 2020

QuanluZhang Jun 22, 2020 •

edited

Loading

QuanluZhang Jun 22, 2020

QuanluZhang Jun 22, 2020

zheng-ningxin Jun 22, 2020

QuanluZhang Jun 22, 2020

QuanluZhang Jun 22, 2020

QuanluZhang Jun 22, 2020

QuanluZhang Jun 22, 2020

QuanluZhang Jun 22, 2020

zheng-ningxin Jun 22, 2020

QuanluZhang Jun 22, 2020

		@@ -11,13 +11,13 @@

		from nni.compression.torch import L1FilterPruner

Support the Resnet/Squeezenet/Mobilenet for speedup #2579

Support the Resnet/Squeezenet/Mobilenet for speedup #2579

Conversation

zheng-ningxin commented Jun 19, 2020

zheng-ningxin commented Jun 19, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

QuanluZhang Jun 22, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

QuanluZhang Jun 22, 2020 •

edited

Loading