MILC code (master branch) with QUDA version 1.1.0 ---> HISQ fattening unitarization error #62

Open
lcosmai opened this issue Nov 3, 2023 · 5 comments

lcosmai commented Nov 3, 2023

I successfully compiled the MILC code (master branch) with QUDA version 1.1.0 using CUDA v11.8.

The QUDA compilation passed all the tests.

I compiled the su3_rhmc_hisq target for ks_imp_rhmc.

I then launched a test job on 4 nodes, each with 4 Nvidia A100 GPUs.

The job aborted with the following error:


ERROR: Error in unitarization component of the hisq fattening: 1048576 failures (/leonardo/pub/userexternal/lcosmai0/AREA_COMPILAZIONE_QUDA/quda-1.1.0/lib/interface_quda.cpp:4154 in computeKSLinkQuda())

Could you please provide any suggestions on how to resolve this issue?

Best regards,
Leonardo

james-simone commented Nov 3, 2023 via email

lcosmai commented Nov 3, 2023

Hi Jim,

Thanks for your suggestion.

Unfortunately, when I tried to compile milc_qcd-develop, I received the following error messages:

../generic_ks/gauss_smear_ks_QUDA.c: In function 'gauss_smear_delete_2link_QUDA':
../generic_ks/gauss_smear_ks_QUDA.c:51:3: warning: implicit declaration of function 'qudaFreeTwoLink' [-Wimplicit-function-declaration]
   51 |   qudaFreeTwoLink();
      |   ^~~~~~~~~~~~~~~
../generic_ks/gauss_smear_ks_QUDA.c: In function 'gauss_smear_v_field_QUDA':
../generic_ks/gauss_smear_ks_QUDA.c:106:3: error: unknown type name 'QudaTwoLinkQuarkSmearArgs_t'
  106 |   QudaTwoLinkQuarkSmearArgs_t qsmear_args;
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~
../generic_ks/gauss_smear_ks_QUDA.c:107:14: error: request for member 'n_steps' in something not a structure or union
  107 |   qsmear_args.n_steps = iters;
      |              ^
../generic_ks/gauss_smear_ks_QUDA.c:108:14: error: request for member 'width' in something not a structure or union
  108 |   qsmear_args.width = width;
      |              ^
../generic_ks/gauss_smear_ks_QUDA.c:109:14: error: request for member 'compute_2link' in something not a structure or union
  109 |   qsmear_args.compute_2link = compute_2link_temp;
      |              ^
../generic_ks/gauss_smear_ks_QUDA.c:110:14: error: request for member 'delete_2link' in something not a structure or union
  110 |   qsmear_args.delete_2link = 0;
      |              ^
../generic_ks/gauss_smear_ks_QUDA.c:111:14: error: request for member 't0' in something not a structure or union
  111 |   qsmear_args.t0 = t0;
      |              ^
../generic_ks/gauss_smear_ks_QUDA.c:112:14: error: request for member 'laplaceDim' in something not a structure or union
  112 |   qsmear_args.laplaceDim = laplaceDim;
      |              ^
../generic_ks/gauss_smear_ks_QUDA.c:115:3: warning: implicit declaration of function 'qudaTwoLinkGaussianSmear' [-Wimplicit-function-declaration]
  115 |   qudaTwoLinkGaussianSmear( MILC_PRECISION, MILC_PRECISION, (void*) t_links, (void*) src, qsmear_args );
      |   ^~~~~~~~~~~~~~~~~~~~~~~~
make[1]: *** [../generic_ks/Make_template:384: gauss_smear_ks_QUDA.o] Error 1
make[1]: Leaving directory '/leonardo/pub/userexternal/lcosmai0/AREA_COMPILAZIONE_MILC/milc_qcd-develop/ks_imp_rhmc'
make: *** [Make_template:223: su3_rhmc_hisq] Error 2

james-simone commented Nov 3, 2023 via email

maddyscientist (Contributor) commented

@lcosmai, you also need to use the develop branch of QUDA from GitHub. We haven't made a release tag since the two-link smearing support was merged in. Thanks.
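
Editor's note: the compile errors quoted earlier in this thread all name identifiers that the installed QUDA headers did not declare. A minimal sketch of a pre-build check follows, assuming you can read the text of the QUDA interface header that MILC includes (the header location is not given in this thread, so the path is left to the caller). The function names `missing_symbols` and `check_header` are hypothetical helpers written for this illustration and are not part of QUDA or MILC; the three identifiers are taken from the build log above.

```python
import re

# Identifiers the MILC develop build log reported as undeclared.
REQUIRED_SYMBOLS = (
    "qudaFreeTwoLink",
    "QudaTwoLinkQuarkSmearArgs_t",
    "qudaTwoLinkGaussianSmear",
)


def missing_symbols(header_text, required=REQUIRED_SYMBOLS):
    """Return the required identifiers that never occur as whole words in header_text."""
    return [
        name
        for name in required
        if re.search(r"\b" + re.escape(name) + r"\b", header_text) is None
    ]


def check_header(path, required=REQUIRED_SYMBOLS):
    """Read a header file (path chosen by the caller) and list the missing identifiers."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return missing_symbols(fh.read(), required)
```

Calling `check_header` on the interface header of the QUDA installation you intend to link against returns an empty list only when all three identifiers appear somewhere in that file. This is a plain text search, not a parse of the C declarations, so it can only flag an obvious version mismatch before a long build.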

lcosmai commented Nov 8, 2023

Following your suggestions, I successfully compiled the develop branch of QUDA with OPENMPI (-DQUDA_MPI=ON) and the develop branch of the MILC code (-DQUDA_SMEAR_GAUSS_TWOLINK=ON).

I also tested the MILC code on a GPU cluster equipped with 4 NVIDIA Ampere GPUs, 64GB HBM2, and 32 Intel Ice Lake cores per node.

I appreciate your kind support.
