[Vulkan] Add cooperative matrix support #14817
Conversation
Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.
Generated by tvm-bot
Yeah, hi Mei, it's interesting that we worked on this extension around the same time! I'll look through this PR and think about how to integrate our work. BTW, I'm aware that AMD's Windows Vulkan driver supports this extension. Is the Linux driver going to get support for it as well?
Yes, the Linux driver supports it, though let me check whether that support has made it into a release.
The Linux driver released in April includes support for this extension.
Do you mean AMDVLK or AMDGPU Pro? I looked into the AMDVLK source code but didn't find support for this extension.
AMDGPU Pro.
@mei-ye Can you split the topi / relay / ms changes from this PR and send them later? And please add a minimal test case for cooperative matrix, such as the matmul test in my PR.
That's still open for discussion. If people prefer your approach, I'll close my PR, in which case I want your PR to be easier to review.
Add SPIR-V code generation for the "SPV_NV_cooperative_matrix" extension. Add a matrix multiplication unit test.
@tvm-bot rerun
ICHECK(ele_dtype.is_float()) << "Only floating point fragment accumulator is supported";
spirv::SType ele_stype = builder_->GetSType(ele_dtype);
spirv::SType& fragment_type = fragment_info_[buffer_node].stype;
double init = static_cast<uint64_t>(Downcast<FloatImm>(op->args[5])->value);
why cast to uint64?
I can't recall a good reason. Removing this cast works fine. Should I reset and re-patch?
*
* If support is present, can perform cooperative matrix operations. If
* support is not present, codegen will throw exception on
* attempting to perform cooperative matrix.
perform cooperative matrix operations
I'll address the nit issues in my upcoming PR.
I've updated my Vulkan 4K matmul test https://github.com/masahi/tensorir-experiment/blob/master/vk_cooperative_matrix_nv/test_4k.py to use this extension support. It gets ~90 TFLOPS on an RTX 4080. The cool part is that by changing the target from vk to cuda, the exact same schedule / script works with the same performance: https://github.com/masahi/tensorir-experiment/blob/master/vk_cooperative_matrix_nv/test_4k.py#L171-L172
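For readers who don't want to open the script, here is a minimal sketch of the point being made. This is not the actual test_4k.py: the schedule construction is elided, and the helper name, shapes, and target strings are assumptions for illustration only.

```python
# Minimal sketch, assuming `sch` is an already-constructed tvm.tir.Schedule
# of a 4096x4096 fp16 matmul with fp32 accumulation.  Only the target string
# differs between the Vulkan and CUDA runs; the schedule itself is reused.
import numpy as np
import tvm

def build_and_run(sch, a_np, b_np, target_str):
    """Build the scheduled matmul for `target_str` and run it once."""
    target = tvm.target.Target(target_str)
    dev = tvm.device(target.kind.name, 0)
    lib = tvm.build(sch.mod, target=target)
    a = tvm.nd.array(a_np, dev)
    b = tvm.nd.array(b_np, dev)
    c = tvm.nd.array(np.zeros((4096, 4096), dtype="float32"), dev)
    lib(a, b, c)
    return c.numpy()

# Swapping backends is just a different target string:
# out_vk   = build_and_run(sch, a_np, b_np, "vulkan -from_device=0")
# out_cuda = build_and_run(sch, a_np, b_np, "cuda")
```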
Auto tensorization on vk seems to work as well, but the result is not correct after tuning. For CUDA the result is correct after auto-tensorization tuning, so this is a Vulkan-specific issue. @mei-ye You can try the auto tensorization experiment using my branch https://github.com/masahi/tvm/tree/vk-auto-tensorize and this script https://github.com/masahi/tensorir-experiment/blob/vk-auto-tensorize/vk_cooperative_matrix_nv/test_4k.py. I'm curious whether the accuracy issue is specific to NVIDIA or applies to AMD as well.
[Vulkan] Add SPIR-V code generation for "SPV_NV_cooperative_matrix" extension
Add an im2col implementation for direct Conv2D. Currently only 16x16x16 FP16 wmma fragments with FP32 intermediates are supported. Add "min_design_space" as a parameter to specify a minimum design space for meta scheduler tuning. Add "use_int32_const" as a parameter to use the int32 type for constants. Allow target queries to be called from the schedules so that sampling is constrained to produce legal schedules. Do not allow the reuse of buffers with different dtypes. Add a unit test, test_wmma.py.
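For context on the im2col part of this change, here is a hedged NumPy illustration, not the code added in this PR: im2col flattens each receptive field into one row so that a direct Conv2D becomes a single large matmul, which is what lets the 16x16x16 FP16 wmma fragments with FP32 accumulation be reused for convolution. The NHWC layout, stride 1, and lack of padding below are simplifying assumptions for exposition.

```python
# Hedged illustration of im2col-based Conv2D (not the TOPI schedule in this PR).
import numpy as np

def conv2d_im2col(data, weight):
    # data:   (N, H, W, C)   float16 activations
    # weight: (KH, KW, C, O) float16 filters
    n, h, w, c = data.shape
    kh, kw, _, o = weight.shape
    oh, ow = h - kh + 1, w - kw + 1  # stride 1, no padding

    # Gather each receptive field into one row: (N*OH*OW, KH*KW*C)
    cols = np.empty((n * oh * ow, kh * kw * c), dtype=data.dtype)
    idx = 0
    for b in range(n):
        for y in range(oh):
            for x in range(ow):
                cols[idx] = data[b, y:y + kh, x:x + kw, :].reshape(-1)
                idx += 1

    # The convolution is now one matmul with FP32 accumulation,
    # the shape the wmma fragments can tile.
    out = cols.astype("float32") @ weight.reshape(kh * kw * c, o).astype("float32")
    return out.reshape(n, oh, ow, o)
```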