[AMD] Map PreciseSqrtOp and PreciseDivFOp to LLVM instructions #3369

jayfurmanek · 2024-03-13T18:41:48Z

This PR maps PreciseSqrtOp and PreciseDivFOp to LLVM instructions for the AMD backend.

These "Precise" ops are currently defined as round-to-nearest-even, which is the default rounding mode of the LLVM instructions for the AMD backend.
Alternatively, we could call into the AMD `ocml.bc`. This works for sqrt, but `__ocml_div_{rm}_f32` is currently unimplemented.
If further "Precise" math ops are added that use different rounding modes or otherwise don't map to LLVM ops, we can revisit this.
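
For readers unfamiliar with the rounding mode mentioned above, here is a minimal host-side sketch of what round-to-nearest-even means for float32 division and square root. It is only an illustration, not code from this PR or from the AMD backend. The helper names `to_f32`, `div_rn`, and `sqrt_rn` are hypothetical. It assumes the host rounds double-to-float conversions to nearest-even (the IEEE 754 default) and relies on the fact that computing a float32 division or square root in float64 and then rounding once to float32 yields the correctly rounded float32 result.

```python
import math
import struct


def to_f32(x: float) -> float:
    """Round a Python float (float64) to the nearest float32 value.

    Assumption: the native double-to-float conversion used by struct
    follows the default IEEE 754 mode, round-to-nearest, ties-to-even.
    """
    return struct.unpack("f", struct.pack("f", x))[0]


def div_rn(a: float, b: float) -> float:
    """Hypothetical reference for a round-to-nearest-even float32 division."""
    return to_f32(to_f32(a) / to_f32(b))


def sqrt_rn(a: float) -> float:
    """Hypothetical reference for a round-to-nearest-even float32 sqrt."""
    return to_f32(math.sqrt(to_f32(a)))


# 1 + 2**-24 lies exactly halfway between two float32 neighbours,
# 1.0 and 1 + 2**-23; ties-to-even picks the one with an even mantissa.
tie_low = to_f32(1.0 + 2.0**-24)

# 1 + 3 * 2**-24 lies halfway between 1 + 2**-23 and 1 + 2**-22.
tie_high = to_f32(1.0 + 3.0 * 2.0**-24)
```

In this sketch `tie_low` rounds down to 1.0 and `tie_high` rounds up to 1 + 2**-22, since those are the neighbours whose last mantissa bit is zero. A reference like this could be used to compare kernel outputs bit for bit on a handful of inputs.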


[AMD] Map PreciseSqrtOp and PreciseDivOp to LLVM instructions

a2bc8a1

jayfurmanek requested a review from ptillet as a code owner March 13, 2024 18:41

jayfurmanek changed the title ~~[AMD] Map PreciseSqrtOp and PreciseDivOp to LLVM instructions~~ [AMD] Map PreciseSqrtOp and PreciseDivFOp to LLVM instructions Mar 13, 2024

zahimoud approved these changes Mar 13, 2024


ThomasRaoux merged commit 62893c1 into triton-lang:main Mar 14, 2024
4 checks passed
