job patterns for partitioning lists and mapping onto them #2297

zulissimeta · 2024-06-21T23:12:17Z

Summary of Changes

This draft PR adds job patterns that are common in high-throughput workflows. When running many jobs on the same flow (say result = map(my_job, range(1000))) , many workflow managers will get sad (network/db/load issues) with too many jobs. This PR is inspired by the dask bag partitions and map operator.

For example:

@job
def testjob(**kwargs):
    print(kwargs)

@flow
def testflow():
    num_partitions = 2
    result = map_partitioned_lists(
        testjob,
        num_partitions=num_partitions,
        test_arg_1=partition([1,2,3,4,5], num_partitions),
        test_arg_2=partition(["a", "b", "c","d","e"], num_partitions),
    )

testflow()

should yield:

{'test_arg_1': 1, 'test_arg_2': 'a'}
{'test_arg_1': 2, 'test_arg_2': 'b'}
{'test_arg_1': 3, 'test_arg_2': 'c'}
{'test_arg_1': 4, 'test_arg_2': 'd'}
{'test_arg_1': 5, 'test_arg_2': 'e'}

but run in only two jobs (instead of 5).

In addition, by using a specified number of partitions, logic flows can quickly move from one step to the next without waiting for any intermediate results.

many_results = generate_many_objects()

# Cannot continue until many_results is generated as the number of jobs to be generated is unknown
final_results = [analysis_job(result) for result in results]

while

many_results = generate_many_objects()
partitions_results = partition(many_results, num_partitions=10)
final_results = mapped_partitioned_lists(analysis_job, result=partitioned_results, num_partitions=10)

is fine since it is clear there is a first job, a second partition job that yields 10 tasks, and a final mapping job that has 10 jobs (one per partition).

buildbot-princeton · 2024-06-21T23:12:19Z

Can one of the admins verify this patch?

codecov · 2024-06-21T23:23:05Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.07%. Comparing base (6a5d2c8) to head (31e2a26).

❗ Current head 31e2a26 differs from pull request most recent head 4cbae69

Please upload reports for the commit 4cbae69 to get more accurate results.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2297   +/-   ##
=======================================
  Coverage   99.07%   99.07%           
=======================================
  Files          82       83    +1     
  Lines        3445     3464   +19     
=======================================
+ Hits         3413     3432   +19     
  Misses         32       32

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Andrew-S-Rosen

Thanks, @zulissimeta! This is an interesting PR.

If I understand correctly, it seems like this is logic that would need to be put within the @flows themselves, right? So, the ideal use case here is someone is importing a pre-made @job (e.g. quacc.recipes.mlp.core import relax_job) and making a custom @flow for themselves from that, right? It would perhaps be a bit difficult to justify when/where to add this logic within pre-made @flows in quacc itself (i.e. in quacc.recipes).

Some (non-substantial) comments below while I await your reply.

src/quacc/wflow_tools/job_patterns.py

tests/core/wflow/test_job_patterns.py

src/quacc/wflow_tools/job_patterns.py

tests/core/wflow/test_job_patterns.py

zulissimeta · 2024-06-22T01:20:08Z

Thanks, @zulissimeta! This is an interesting PR.

If I understand correctly, it seems like this is logic that would need to be put within the @flows themselves, right? So, the ideal use case here is someone is importing a pre-made @job (e.g. quacc.recipes.mlp.core import relax_job) and making a custom @flow for themselves from that, right? It would perhaps be a bit difficult to justify when/where to add this logic within pre-made @flows in quacc itself (i.e. in quacc.recipes).

Some (non-substantial) comments below while I await your reply.

Yes, exactly! If you want a flow that does many jobs in parallel (for example, if you made a bulk_to_adsorbates_flow that generated hundreds of possible adsorbate configuration and had ML potentials to make relaxations fast), this would be helpful.

Partitioning and batching would also be helpful if doing lots of inference; then you could partition a list, and apply a function to take batches and do ML inference quickly, rather than running one MLP relaxation as a separate job.

Andrew-S-Rosen · 2024-06-22T01:23:25Z

Got it, thanks! Since it is pretty independent from existing recipe logic, I don't have much reservation about this. We will just want to add a brief section to the documentation somewhere highlighting how it can be used since it's somewhat of an "advanced" (but useful!) feature. I am happy to take care of the docs though.

Andrew-S-Rosen · 2024-06-24T17:28:11Z

Thanks! Setting this to auto-merge now.

first commit for common minibatch and map job patterns

fce3b4d

pre-commit auto-fixes

78e4132

Andrew-S-Rosen reviewed Jun 22, 2024

View reviewed changes

Andrew-S-Rosen self-assigned this Jun 22, 2024

zulissimeta and others added 10 commits June 22, 2024 02:02

requested edits

f461768

pre-commit auto-fixes

04c5f80

Fix docstring formatting

25f7d6a

pre-commit auto-fixes

edcab92

Merge branch 'main' into batched_tasks

1186bc3

Cosmetic cleanup

9c161c6

Minor docstring cleanup

31e2a26

Don't use assert in source code

b36f5e2

Use a match statement in tests

4cbae69

Update test_map_partitions.py

9bef2f3

Andrew-S-Rosen enabled auto-merge (squash) June 24, 2024 17:28

Andrew-S-Rosen mentioned this pull request Jun 24, 2024

Add a documentation section for advanced job patterns #2300

Open

Andrew-S-Rosen disabled auto-merge June 24, 2024 17:31

Andrew-S-Rosen merged commit c0ab453 into Quantum-Accelerators:main Jun 24, 2024
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

job patterns for partitioning lists and mapping onto them #2297

job patterns for partitioning lists and mapping onto them #2297

zulissimeta commented Jun 21, 2024 •

edited

Loading

buildbot-princeton commented Jun 21, 2024

codecov bot commented Jun 21, 2024 •

edited

Loading

Andrew-S-Rosen left a comment

zulissimeta commented Jun 22, 2024

Andrew-S-Rosen commented Jun 22, 2024

Andrew-S-Rosen commented Jun 24, 2024

job patterns for partitioning lists and mapping onto them #2297

job patterns for partitioning lists and mapping onto them #2297

Conversation

zulissimeta commented Jun 21, 2024 • edited Loading

Summary of Changes

buildbot-princeton commented Jun 21, 2024

codecov bot commented Jun 21, 2024 • edited Loading

Codecov Report

Andrew-S-Rosen left a comment

Choose a reason for hiding this comment

zulissimeta commented Jun 22, 2024

Andrew-S-Rosen commented Jun 22, 2024

Andrew-S-Rosen commented Jun 24, 2024

zulissimeta commented Jun 21, 2024 •

edited

Loading

codecov bot commented Jun 21, 2024 •

edited

Loading