CP2K v2024.1 #297

Closed
8 tasks done
laraPPr opened this issue Feb 27, 2024 · 9 comments · Fixed by easybuilders/easybuild-easyconfigs#20989 · May be fixed by easybuilders/easybuild-easyconfigs#20798
Labels
  • difficulty: medium (software that likely requires a fair amount of effort to support)
  • priority: medium
  • site:t1_ugent_hortense (software installation request for Tier-1 Hortense)
  • site:ugent (software installation request for UGent Tier-2)
  • update

Comments

@laraPPr
Collaborator

laraPPr commented Feb 27, 2024

laraPPr added the labels difficulty: medium, priority: medium, update, site:ugent, site:t1_ugent_hortense on Feb 27, 2024
pavelToman added a commit that referenced this issue Jun 11, 2024
pavelToman self-assigned this Jun 11, 2024
@pavelToman
Collaborator

pavelToman commented Jun 11, 2024

@boegel
Contributor

boegel commented Jul 4, 2024

Initial attempt by @pavelToman results in lots of failing tests with 2024.1 (after unsetting OMP_PROC_BIND), see https://github.com/vscentrum/vsc-software-stack/blob/wip/297_CP2K/test_error_1.txt:

------------------------------- Summary --------------------------------
Number of FAILED  tests 56
Number of WRONG   tests 0
Number of CORRECT tests 31
Total number of   tests 87

That may be because too many threads/ranks are used by default, resulting in very limited memory being available.

Next step would be to try with tests/do_regtest.py --mpiranks 2 --ompthreads 2 and see if that improves the situation.

@smoors Do you have any ideas/feedback here?

@boegel
Contributor

boegel commented Jul 4, 2024

Instructions for running CP2K test suite are available at https://www.cp2k.org/dev:regtesting

@boegel
Contributor

boegel commented Jul 5, 2024

Something I overlooked initially: they also wanted to include https://github.com/electronic-structure/SIRIUS as a dependency for CP2K.

@pavelToman Can you take a look at that one?

@pavelToman
Collaborator

pavelToman commented Jul 10, 2024

> Initial attempt by @pavelToman results in lots of failing tests with 2024.1 (after unsetting OMP_PROC_BIND), see https://github.com/vscentrum/vsc-software-stack/blob/wip/297_CP2K/test_error_1.txt:
>
> ------------------------------- Summary --------------------------------
> Number of FAILED  tests 56
> Number of WRONG   tests 0
> Number of CORRECT tests 31
> Total number of   tests 87
>
> That may be because too many threads/ranks are used by default, resulting in very limited memory being available.
>
> Next step would be to try with tests/do_regtest.py --mpiranks 2 --ompthreads 2 and see if that improves the situation.
>
> @smoors Do you have any ideas/feedback here?

I let it run overnight, but it freezes at the test step on this command:
python /tmp/vsc47063/easybuild/build/CP2K/2024.1/foss-2023a/cp2k-2024.1/tests/do_regtest.py --maxtasks 1 --mpiranks 2 --ompthreads 2 Linux-x86-64-foss psmp

  • with the SLURM environment variables unset
  • with OMP_PROC_BIND unset
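
For anyone trying to reproduce this outside EasyBuild, the environment cleanup described in the two bullets can be sketched in Python roughly as follows. This is only an illustration, not what the easyblock does: the helper name `scrubbed_env` is hypothetical, and treating "SLURM envs" as every variable whose name starts with `SLURM_` is an assumption.

```python
import os


def scrubbed_env(env=None):
    """Return a copy of `env` without SLURM_* variables and OMP_PROC_BIND.

    Assumption: "SLURM envs" means every variable prefixed with SLURM_.
    """
    source = os.environ if env is None else env
    return {
        key: value
        for key, value in source.items()
        if not key.startswith("SLURM_") and key != "OMP_PROC_BIND"
    }


if __name__ == "__main__":
    env = scrubbed_env()
    # Show which variables of the current session would be dropped.
    dropped = sorted(set(os.environ) - set(env))
    print("variables that would be dropped:", dropped)
```

The resulting dictionary could then be passed as `env=` to `subprocess.run` when launching `do_regtest.py`, so the parent shell stays untouched.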

@hajgato
Collaborator

hajgato commented Jul 16, 2024

I have installed CP2K from easybuilders/easybuild-easyconfigs#20798 with --skip-test-step, and it runs with my CP2K test, both from an interactive session (2 nodes, 8 ppn) and from a job script (with vsc-mympirun).

@hajgato
Collaborator

hajgato commented Jul 16, 2024

There are lots of failed jobs with:

cp2k.psmp: /tmp/vsc43020/easybuild/CP2K/2024.1/foss-2023a/cp2k-2024.1/src/dbm/dbm_distribution.c:42: dbm_dist_1d_new: Assertion `0 <= coords[i] && coords[i] < dist->nranks' failed.
cp2k.psmp: /tmp/vsc43020/easybuild/CP2K/2024.1/foss-2023a/cp2k-2024.1/src/dbm/dbm_distribution.c:42: dbm_dist_1d_new: Assertion `0 <= coords[i] && coords[i] < dist->nranks' failed

@hajgato
Collaborator

hajgato commented Jul 16, 2024

When the regtest is run with --maxtasks a --mpiranks b --ompthreads c, the constraint is a >= b*c.
By default, a is taken from /proc/cpuinfo. Therefore, if less than a full node is requested for a job, the regtest would run overcommitted. In general, a should be the same as the number of parallel builds (the number after make -j).
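
A minimal sketch of that rule, assuming the cores visible to the current process are a reasonable stand-in for the job allocation (this holds only where the scheduler pins the job to its cores). The helper names `available_cores` and `regtest_args` are hypothetical; only the three flags come from this thread. Note that the command quoted earlier in this thread (--maxtasks 1 --mpiranks 2 --ompthreads 2) does not satisfy a >= b*c.

```python
import os


def available_cores():
    """Count the cores this process may run on.

    Assumption: the affinity mask reflects the job allocation rather than
    the whole node, unlike a plain read of /proc/cpuinfo.
    """
    try:
        return len(os.sched_getaffinity(0))  # Linux-only call
    except AttributeError:
        return os.cpu_count() or 1


def regtest_args(mpiranks, ompthreads, maxtasks=None):
    """Build the regtest flags, refusing layouts where a < b*c."""
    if maxtasks is None:
        maxtasks = available_cores()
    needed = mpiranks * ompthreads
    if maxtasks < needed:
        raise ValueError(
            f"--maxtasks {maxtasks} is smaller than mpiranks*ompthreads = {needed}"
        )
    return [
        "--maxtasks", str(maxtasks),
        "--mpiranks", str(mpiranks),
        "--ompthreads", str(ompthreads),
    ]


if __name__ == "__main__":
    try:
        print(" ".join(regtest_args(2, 2)))
    except ValueError as err:
        print("layout does not fit:", err)
```

Passing `maxtasks` explicitly (for example the value used for make -j) overrides the detected core count.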

4 participants