[Bugfix][Relay][Strategy] Enable compile time transformation of weights matrix for arm_cpu NHWC quantized conv2d #15584

Anndrey24 · 2023-08-17T15:39:11Z

Fixed an arm_cpu strategy bug that caused tensorization errors when the `AlterOpLayout` pass was applied to the quantized NHWC conv2d schedules, as discovered in #10724. With the fix in place, `AlterOpLayout` can now be enabled for these schedules so that the weight matrix is transformed at compile time rather than at runtime, as it was before.

I also modified the padding in `Conv2DGemmWeightTransformRel` and `interleave_transpose_weights` to reflect the changes made in #13669, and updated the `AlterOpLayout` tests accordingly.
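To illustrate the idea of moving the weight transform out of the inference path, here is a minimal pure-Python sketch. It is not the TVM implementation: `transform_weights` and `gemm_with_transformed_weights` are hypothetical names, and the transform is a simplified stand-in (zero-padding to tile multiples plus a transpose, without the interleaving into blocks that the real schedules perform).

```python
def transform_weights(weights, tile_k, tile_n):
    """Zero-pad a K x N matrix to tile multiples and transpose it to N x K.

    Simplified stand-in for the real interleave/transpose step (assumption).
    """
    k, n = len(weights), len(weights[0])
    padded_k = k + (-k) % tile_k  # round K up to a multiple of tile_k
    padded_n = n + (-n) % tile_n  # round N up to a multiple of tile_n
    return [
        [weights[row][col] if row < k and col < n else 0 for row in range(padded_k)]
        for col in range(padded_n)
    ]


def gemm_with_transformed_weights(data, weights_t, k, n):
    """Multiply M x K data by weights stored in the padded, transposed layout."""
    return [
        [sum(row[i] * weights_t[j][i] for i in range(k)) for j in range(n)]
        for row in data
    ]


weights = [[1, 2, 3], [4, 5, 6]]  # K = 2, N = 3
data = [[1, 1]]                   # M = 1, K = 2

# "Runtime" path: the weights are transformed again on every call.
runtime_out = gemm_with_transformed_weights(
    data, transform_weights(weights, 4, 4), k=2, n=3
)

# "Compile time" path: transform once, then reuse the constant for every call.
precomputed = transform_weights(weights, 4, 4)
compile_time_out = gemm_with_transformed_weights(data, precomputed, k=2, n=3)

assert runtime_out == compile_time_out
```

Both paths produce the same result; the difference is only where the cost of the transform is paid.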

cc @ekalda @lhutton1 @leandron

tvm-bot · 2023-08-17T15:39:15Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @shingjan (see #10317 for details)

Generated by tvm-bot

lhutton1

Thanks @Anndrey24, this looks like a great improvement! We have test coverage for the AlterOpLayout pass, which is great, but it would also be good to have a test that checks no weight transform happens at runtime. Is it possible to extend the parametrized test at line 60 of tvm/tests/python/relay/strategy/test_select_implementation.py (at commit 8b37d4d), which begins with `@pytest.mark.parametrize(`, to check that the schedule that gets selected is "conv2d_NHWC_quantized_*_without_transform.arm_cpu"?
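The name check suggested above could be expressed with a glob match. This is only a sketch of the assertion, not the actual test: `skips_runtime_weight_transform` is a hypothetical helper, and the example implementation names are illustrative strings rather than output from a real strategy lookup.

```python
from fnmatch import fnmatchcase

# Pattern quoted in the review comment.
WITHOUT_TRANSFORM = "conv2d_NHWC_quantized_*_without_transform.arm_cpu"


def skips_runtime_weight_transform(impl_name: str) -> bool:
    """Return True if the selected implementation expects pre-transformed weights."""
    # fnmatchcase is case-sensitive on every platform, unlike fnmatch.
    return fnmatchcase(impl_name, WITHOUT_TRANSFORM)


# Illustrative names (assumptions), standing in for what a selection test would return.
assert skips_runtime_weight_transform(
    "conv2d_NHWC_quantized_interleaved_without_transform.arm_cpu"
)
assert not skips_runtime_weight_transform("conv2d_NHWC_quantized_interleaved.arm_cpu")
```

In a real test, the string passed to the helper would come from the implementation selected after running the AlterOpLayout pass.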

Updated the int8 conv2d implementation selection test to include more targets and to run the AlterOpLayout pass such that its respective changes of schedule are captured.

lhutton1

Great work @Anndrey24! LGTM

ekalda

Thanks @Anndrey24, very well spotted!

lhutton1 · 2023-08-23T08:12:41Z

Thanks @Anndrey24 @ekalda!

Refactored out a piece of common functionality from the `conv2d_gemm_weight_transform` and `interleave_transpose_weights` functions, which has previously led to bugs stemming from changes made to only one but not the other, like in apache#15584. Determining the necessary padding for the interleaved and transposed weights matrix has now been separated into a new utility function, allowing future changes to be reflected in both callers.
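The refactoring described above can be sketched as one padding helper shared by two callers, so that shape inference and the actual transform cannot drift apart. All names here (`get_weight_padding`, `infer_transformed_shape`, `pad_weights`) and the tile sizes are hypothetical and chosen for illustration; they are not the functions or values used in TVM.

```python
def get_weight_padding(k, n, tile_k, tile_n):
    """Return (pad_k, pad_n) that round K and N up to whole tiles."""
    return (-k) % tile_k, (-n) % tile_n


def infer_transformed_shape(k, n, tile_k, tile_n):
    """Caller 1: predict the blocked shape of the transformed weights."""
    pad_k, pad_n = get_weight_padding(k, n, tile_k, tile_n)
    return ((n + pad_n) // tile_n, (k + pad_k) // tile_k, tile_n, tile_k)


def pad_weights(weights, tile_k, tile_n):
    """Caller 2: zero-pad a K x N matrix using the same helper."""
    k, n = len(weights), len(weights[0])
    pad_k, pad_n = get_weight_padding(k, n, tile_k, tile_n)
    padded_rows = [row + [0] * pad_n for row in weights]
    padded_rows += [[0] * (n + pad_n) for _ in range(pad_k)]
    return padded_rows


# Because both callers share one helper, the padded matrix always fits
# the shape that the inference step predicts.
padded = pad_weights([[1, 2, 3]], tile_k=2, tile_n=2)
blocks_n, blocks_k, tile_n, tile_k = infer_transformed_shape(1, 3, 2, 2)
assert len(padded) == blocks_k * tile_k
assert len(padded[0]) == blocks_n * tile_n
```

A change to the padding rule then only needs to be made in `get_weight_padding`.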

Anndrey24 added 2 commits August 17, 2023 13:23

Perform linting

e865e10

github-actions bot requested review from ekalda, leandron and lhutton1 August 17, 2023 15:39

lhutton1 reviewed Aug 18, 2023

Anndrey24 added 3 commits August 21, 2023 08:29

Update implementation selection test

efad3ba

Perform linting

3ba11a5

Clean up test

d258723

lhutton1 approved these changes Aug 22, 2023

ekalda approved these changes Aug 22, 2023

lhutton1 merged commit d0c94d4 into apache:main Aug 23, 2023

ysh329 mentioned this pull request Oct 18, 2023

[Release] v0.14.0 Release Candidate Notes #15948

Closed

Anndrey24 mentioned this pull request Nov 6, 2023

[TOPI] Reduce code redundancy in conv2d weights transformation #16080

Merged

Anndrey24 deleted the conv2d-alter-op-fix branch November 8, 2023 14:47
