Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OpenBLAS build fails on Sapphire Rapids due to "Too many LAPACK tests failed" #880

Closed
robert-mijakovic opened this issue Oct 14, 2023 · 4 comments
Milestone

Comments

@robert-mijakovic
Copy link

robert-mijakovic commented Oct 14, 2023

Hi guys,

while building OpenBLAS on Sapphire Rapids running Ubuntu 22.04LTS, I faced "Too many LAPACK tests failed due to non-numerical errors: 55 (> 0)".

                        -->   LAPACK TESTING SUMMARY  <--
SUMMARY                 nb test run     numerical error         other error
================        ===========     =================       ================
REAL                    1328283         0       (0.000%)        0       (0.000%)
DOUBLE PRECISION        1325997         11      (0.001%)        0       (0.000%)
COMPLEX                 760371          160     (0.021%)        55      (0.007%)
COMPLEX16               771518          48      (0.006%)        0       (0.000%)

--> ALL PRECISIONS      4186169         219     (0.005%)        55      (0.001%)

Are we too restrictive on tolerance?
I will look further to see whether I can find a trace of recent patches in OpenBLAS. So far, I haven't seen anything.
For the moment, I will stick to --ignore-test-fail when installing OpenBLAS 0.3.24 on Sapphire Rapids.

Best regards,
Robert

@robert-mijakovic robert-mijakovic changed the title OpenBLAS build fails on Sapphire Rapids due to "oo many LAPACK tests failed" OpenBLAS build fails on Sapphire Rapids due to "Too many LAPACK tests failed" Oct 16, 2023
@boegel boegel added this to the 4.x milestone Nov 8, 2023
@boegel
Copy link
Member

boegel commented Nov 8, 2023

@robert-mijakovic Can you check whether the changes in easybuilders/easybuild-easyconfigs#19159 are sufficient to fix this problem?

cc @Flamefire

@Flamefire
Copy link
Contributor

Flamefire commented Nov 8, 2023

I assume though because easybuilders/easybuild-easyconfigs#19021 is an exact duplicate of this and that is fixed by easybuilders/easybuild-easyconfigs#19159 which I verified on our new Sapphire Rapids

@robert-mijakovic
Copy link
Author

I managed to get access to the machine again and I'm rebuilding the toolchain.
I will let you know the outcome immediately once I finish the test.
However, as @Flamefire emphasized, I don't expect to observe any issues with the testing phase as easybuilders/easybuild-easyconfigs#19021 is a duplicate of his easybuilders/easybuild-easyconfigs#19159.

@robert-mijakovic
Copy link
Author

I can confirm that easybuilders/easybuild-easyconfigs#19159 fixes the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants