Add a matmul test from int8, bf16 #2718
Conversation
Force-pushed from cbc7049 to 0bd49d4
I generally approve of this PR, but could we split it into two PRs -- one that refactors without functionally changing anything, and a separate one that adds the new modes? PS: I think we only need to narrow down the range for the case of FP16 accumulation.
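For context, the narrowing the reviewer refers to could look roughly like the sketch below; the helper name and the scale factor are illustrative assumptions, not code from the PR:

```python
import torch

def make_operands(m, k, n, op_dtype, acc_dtype):
    # Hypothetical helper: shrink the value range only when accumulating in
    # float16, so the fp32-accumulated torch reference stays comparable.
    scale = 0.1 if acc_dtype == torch.float16 else 1.0
    a = (torch.randn(m, k) * scale).to(op_dtype)
    b = (torch.randn(k, n) * scale).to(op_dtype)
    return a, b
```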
In this PR we are simplifying the matmul test without changing the logic. It's the first PR from splitting triton-lang#2718. Follow-ups will add the `output_dtype` parameter and the bf16, int8 matmul test. Basically:
- I cleaned/refactored the files a bit.
- I renamed `dot_out_dtype` to `acc_dtype` because it was confusing for me.
- I added `supported_acc_dtypes` for the allowed accumulator types in relation to the types of the operands a and b (a rough sketch follows below).
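A minimal sketch of what such a `supported_acc_dtypes` table might look like; the concrete entries here are assumptions for illustration, not the exact mapping added in the PR:

```python
import torch

# Hypothetical mapping from (a_dtype, b_dtype) to the accumulator dtypes the
# test accepts; the first entry doubles as the default. Entries are
# illustrative assumptions, not the PR's actual table.
supported_acc_dtypes = {
    (torch.float16, torch.float16): (torch.float32, torch.float16),
    (torch.bfloat16, torch.bfloat16): (torch.float32,),
    (torch.float32, torch.float32): (torch.float32,),
    (torch.int8, torch.int8): (torch.int32,),
}

def default_acc_dtype(a_dtype, b_dtype):
    # The first allowed accumulator type is treated as the default.
    return supported_acc_dtypes[(a_dtype, b_dtype)][0]
```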
In this PR we are adding a matmul test from `int8`, `bf16`.
- First, I included two new params:
  - `acc_dtype`: so users of the test class can specify the type used internally in the dot, instead of the one chosen by default from the two operand types. There are several restrictions on these types anyway.
  - `output_dtype`: the return type of the matmul. I included a few tests for the case of a dot between two `float16` operands.
- I had to modify test_matmul to use a small range of values to prevent numerical issues. When testing with two `float16` operands and `acc_dtype` `float16`, since I can't force torch to use `float16` internally (it uses `float32`), I was having precision issues when comparing the results with triton. Anyway, the goal of this test shouldn't be testing precision.
- I also needed to include `torch.int8` in the possible datatypes.
- I cleaned/refactored the files a bit.
- Finally, I tried to simplify the logic of matmul a bit, because after adding these two parameters it was hard to follow why we needed every part of the code, so I included a `supported_acc_dtypes` for the allowed accumulator types in relation to the types of the operands a and b. (A rough test skeleton is sketched below.)
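Putting the two parameters together, a pytest-style skeleton in the spirit of this PR could look like the following sketch; the kernel launch is replaced by a stand-in, and the parameter combinations and tolerances are assumptions rather than the test's actual matrix:

```python
import pytest
import torch

@pytest.mark.parametrize("ab_dtype, acc_dtype, output_dtype", [
    (torch.float16, torch.float32, torch.float16),
    (torch.float16, torch.float16, torch.float16),   # narrowed-range case
    (torch.bfloat16, torch.float32, torch.bfloat16),
    (torch.int8, torch.int32, torch.int32),
])
def test_matmul_dtypes(ab_dtype, acc_dtype, output_dtype):
    m, k, n = 64, 64, 64
    if ab_dtype == torch.int8:
        # Small integer operands keep int32 accumulation exact.
        a = torch.randint(-3, 3, (m, k), dtype=ab_dtype)
        b = torch.randint(-3, 3, (k, n), dtype=ab_dtype)
        ref = (a.to(torch.int64) @ b.to(torch.int64)).to(output_dtype)
    else:
        # Narrow the range only for fp16 accumulation, since the torch
        # reference always accumulates in fp32.
        scale = 0.1 if acc_dtype == torch.float16 else 1.0
        a = (torch.randn(m, k) * scale).to(ab_dtype)
        b = (torch.randn(k, n) * scale).to(ab_dtype)
        ref = (a.to(torch.float32) @ b.to(torch.float32)).to(output_dtype)
    # In the real test this is where the Triton matmul kernel would be
    # launched with acc_dtype / output_dtype; the reference stands in here.
    out = ref.clone()
    torch.testing.assert_close(out.to(torch.float32), ref.to(torch.float32))
```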
Force-pushed from 0bd49d4 to 87be2ae
Sorry for taking so long to reply. I will split it into 3 PRs: the first without any real change (#2760), the second to add `acc_dtype` and `return_dtype` set by users, and the last to add the int8, bfloat16 test.
I had a discussion about this topic with @gflegar in openxla#6 (comment). I don't have a strong opinion; I'm ok with doing what you suggest, but it makes the test a bit more complex when we shouldn't be testing precision here. What do you think?