You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When submitting a job defining num_cores_per_machine, the scheduler is unable to submit the job because --ncpus-per-task is set to a float, which SLURM doesn't understand. The workchain gets then paused forever because it is unsuccesful at submitting the job.
Steps to reproduce
Using a slurm scheduler in a computer with mpiprocs_per_machine defined:
submit(CalculationClass, options={"resources": {"num_machines": x ,"num_cores_per_machine": y})
Your environment
Operating system [e.g. Linux]: Linux
Python version [e.g. 3.7.1]: 3.6.9
aiida-core version [e.g. 1.2.1]: 1.4.2
Other relevant software versions, e.g. Postres & RabbitMQ
Describe the bug
When submitting a job defining
num_cores_per_machine
, the scheduler is unable to submit the job because--ncpus-per-task
is set to a float, which SLURM doesn't understand. The workchain gets then paused forever because it is unsuccesful at submitting the job.Steps to reproduce
Using a slurm scheduler in a computer with
mpiprocs_per_machine
defined:Your environment
Other relevant software versions, e.g. Postres & RabbitMQ
Additional context
I think it's related to this part of the code:
aiida-core/aiida/schedulers/plugins/slurm.py
Lines 133 to 139 in 9ff07c1
As I understand, this division should return an integer, but the opposite check is performed. I don't know, maybe I'm wrong.
The text was updated successfully, but these errors were encountered: