[AMD] ElementwiseOpToLLVM: Do not convert types if they are equal #3091
Essentially, this patch enables @ptillet
This patch disables casting. Did you mean "pass"?
It disables only "intermediate" casting, the rest is in place:
@binarman in principle yes, in practice there are many subtleties related to FP8 and rounding modes. Even our Nvidia backend isn't handling this super well today
@joviliast @binarman Can you elaborate on the motivation of this PR? Why does converting between the same type cause FMA failures?
Because the case where the internal types are the same is simply not supported, and it fails at compile time as an unsupported conversion.
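To illustrate the idea in the PR title, here is a minimal, self-contained C++ sketch. It is not the actual Triton or MLIR code: `ElemType`, `convertElem`, and `convertIfNeeded` are hypothetical names, the conversion itself is modeled as an identity on `float` (no rounding), and the unsupported case throws at run time, whereas the real failure happens while compiling the kernel.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical element types; NOT Triton's real type system.
enum class ElemType { F16, BF16, F32 };

// Stand-in for a converter that only knows specific (src, dst) pairs.
// It has no entry for src == dst, so that case is reported as unsupported.
// The numeric conversion is simulated as an identity (assumption).
float convertElem(float v, ElemType src, ElemType dst) {
  if (src == ElemType::F16 && dst == ElemType::F32) return v;
  if (src == ElemType::BF16 && dst == ElemType::F32) return v;
  if (src == ElemType::F32 && dst == ElemType::F16) return v;
  if (src == ElemType::F32 && dst == ElemType::BF16) return v;
  throw std::invalid_argument("unsupported conversion");
}

// Guarded wrapper: skip the conversion entirely when the types are equal,
// so the unsupported same-type path is never reached.
float convertIfNeeded(float v, ElemType src, ElemType dst) {
  if (src == dst) return v;
  return convertElem(v, src, dst);
}
```

With the guard in place, a same-type request returns the value unchanged, while genuinely different types still go through the converter.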
Can you make a minimized lit test out of this failure? That will help everybody understand (and will prevent regressions).
@ThomasRaoux, done.
LGTM, thanks for adding a test
fce1785 to a69da12
LGTM
Lit tests under the amd dir are not tested.
Thanks for catching that. Is someone working on fixing this?
This commit fixes failure in python/tutorials/03-matrix-multiplication.py for FMA cases. Signed-off-by: joviliast <iveselov.nn@gmail.com>
Signed-off-by: joviliast <iveselov.nn@gmail.com>
It actually works. Sorry for blocking the merge.
…iton-lang#3091) This commit fixes failure in python/tutorials/03-matrix-multiplication.py for FMA cases, also fixes mixed dot for FMA cases. Tested on Navi31 --------- Signed-off-by: joviliast <iveselov.nn@gmail.com>
This commit fixes failure in python/tutorials/03-matrix-multiplication.py for FMA cases,
also fixes mixed dot for FMA cases.
Tested on Navi31