PBS and NAMD SMP #105

Closed
jimboid opened this issue Jun 22, 2018 · 0 comments · Fixed by #111

jimboid commented Jun 22, 2018

On ARCHER, NAMD SMP jobs run into intermittent problems with the ncpus and/or mpiprocs values passed to the "-l" directive. This needs further investigation. The problem appears to stem from corespernode having to be set to 1, which also sets ncpus to 1. There are several possibilities here:

  1. Go back to not issuing ncpus or mpiprocs with the "-l" directive by default, and let users specify them when necessary, since whether they are needed varies from machine to machine.

  2. Check whether something is actually missing from these NAMD jobs that would let Longbow correctly assign ncpus, i.e. the -d argument to aprun.

  3. Make corespernode and mpiprocs separate parameters, with mpiprocs defaulting to corespernode unless it is set explicitly. This should provide a better way to calculate ncpus from corespernode and mpiprocs (most users won't need to worry about it, only those using NAMD or those who wish to undersubscribe nodes).
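
A rough sketch of how option 3 might look, to make the defaulting rule concrete. This is not Longbow's actual code: `build_select` is a hypothetical helper, and it assumes that ncpus should always reserve the whole node (i.e. equal corespernode) while undersubscription is expressed through mpiprocs alone.

```python
import math


def build_select(cores, corespernode, mpiprocs=None):
    """Build a PBS "-l select" line (hypothetical helper, not Longbow's code)."""
    # Option 3: mpiprocs falls back to corespernode unless set explicitly.
    if mpiprocs is None:
        mpiprocs = corespernode
    if not 1 <= mpiprocs <= corespernode:
        raise ValueError("mpiprocs must be between 1 and corespernode")

    # Assumption: ncpus always reserves the whole node, and undersubscription
    # is expressed only by lowering mpiprocs.
    nodes = math.ceil(cores / mpiprocs)
    return "#PBS -l select={0}:ncpus={1}:mpiprocs={2}".format(
        nodes, corespernode, mpiprocs)


# Fully subscribed job: 48 MPI ranks on 24-core nodes.
print(build_select(48, 24))
# NAMD SMP style job: 2 ranks, one per node, each with the full node.
print(build_select(2, 24, mpiprocs=1))
```

With this split, corespernode can stay at the machine's real value (24 in the example) and only mpiprocs changes for the SMP case, so ncpus is no longer forced to 1.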

jimboid added the bug label Jun 22, 2018
jimboid added this to the v1.5.2 milestone Jun 22, 2018
jimboid self-assigned this Jun 22, 2018
jimboid changed the title from "pbs and NAMD SMP" to "PBS and NAMD SMP" Jun 22, 2018
jimboid added a commit that referenced this issue Aug 9, 2018
Users can now supply "mpiprocs" in their job or host conf files to
undersubscribe without altering the corespernode parameter, which would
often cause issues.

closes #105