-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issues getting SLURM + future to work #1359
Comments
You could give some targets a drake/inst/templates/hpc/slurm_batchtools.tmpl Lines 27 to 29 in 5748292
I am not sure what the default memory would be without this. By the way,
Turns out these kinds of errors are reproducible in library(brew)
library(drake)
drake_hpc_template_file("slurm_batchtools.tmpl", to = tempdir())
path <- file.path(tempdir(), "slurm_batchtools.tmpl")
log.file <- "x"
job.name <- "y"
uri <- "uri"
resources <- list(walltime = 60)
brew(file = path)
#> Error in parse(text = code, srcfile = NULL): 18:42: unexpected ')'
#> 17: .brew.cat(22,24)
#> 18: cat( if (!is.null(resources$walltime)) { )
#> ^ Created on 2021-02-27 by the reprex package (v1.0.0) Maybe I just need to update the SLURM template file. |
I just updated Any particular reason you are using |
This is an old project with like 400 targets and I am not sure if I want to put in the work to port it to {targets}. I wanted to explore if transient workers could me in this project. I am sometimes blocking the whole HPC with persistent workers for many days and at some point most workers are idle. But I found out that the current implementation of transient workers via {future.batchtool} is quite slow and does not support array execution and other stuff (e.g. I then picked up the discussion for transient workers in clustermq via {future.clustermq} in mschubert/clustermq#86 and HenrikBengtsson/future#204 and playing around a bit now (even though I am not really having a clear plan 😄 ). |
Prework
Description
I was trying out future (batchtools) + SLURM to play around with transient workers in contrast to clustermq + SLURM.
I got a bit confused on the following points:
drake_config(template - list())
is only valid for clustermq (took me hours to find this :/) but it is stated in the help page so my failure 😆future (batchtools) + SLURM
I need to add aresources
column to every target? Otherwise this target will run sequentially? I am asking this because I could not get it running yet and therefore could not observe the behavior. If this applies, how much memory is devoted to every target?_drake.R
and has a future.batchtools template here. I did not see how the resources for the individual workers were specified though 🤔r_make()
. Is the template still valid? Inspecting the part I do not see a parsing error actually so I am wondering why this error occurs.Maybe you can still help with some pointers getting me running here - I might be missing something obvious 🤔
Reprex
The same issue arises when I try to use the drake+slurm+batchtools examples with
_drake.R
:The text was updated successfully, but these errors were encountered: