Parallel tabular tasks #41

suzhoum · 2023-07-21T14:25:58Z

Issue #, if available:

Description of changes:
This PR adds a few feature improvements to tabular module

pre-install framework (e.g. AutoGluon:stable) in setup.sh so that docker container is built with the required version of framework.
get amlb task and fold information from amlb_user_dir or the default amlb resources/, and generate config combination for Batch jobs based on individual fold in aws mode.
Mounting of amlb_user_dir to pre-configured directories was required to use the amlb_user_dir content during docker build, and in lambda function.
remove Python3.8 support to align with AMLB.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

… build

For tabular, we also support custom framework defined by amlb_user_dir. By mounting the amlb_user_dir to custom_configs/amlb_configs and lambda dir src/autogluon/bench/cloud/aws/batch_stack/lambdas, we are able to make user_dir visible to the package.

use default and custom amlb config to determine amlb tasks and specific fold to run (capped by `folds`)

yinweisu · 2023-07-27T19:51:51Z

src/autogluon/bench/cloud/aws/batch_stack/lambdas/lambda_function.py

@@ -89,7 +121,85 @@ def save_configs(configs: dict, uid: str):
    return config_file_path


-def process_combination(combination, keys, metrics_bucket, batch_job_queue, batch_job_definition):
+def clone_automlbenchmark_repo():


Does this function clone the whole repo or only the resource folder? Seems like it's later. If so, consider renaming the function and provide a comment on why we are doing this. Currently, it's kind of confusing

yinweisu · 2023-07-27T19:54:59Z

src/autogluon/bench/cloud/aws/batch_stack/lambdas/lambda_function.py

+    with open(file, "r") as f:
+        amlb_benchmark_configs = yaml.safe_load(f)
+        for item in amlb_benchmark_configs:
+            folds = min(item.get("folds", default_max_folds), default_max_folds)


Is this correct? If the user specified folds=20, we will only get the default_max_folds, which is 10

default_max_folds is actually the amlb config folds in contraints.yaml. For example, if constraint == 'test', the folds, aka default_max_folds will be set to 2, and even if the user specified folds=20 for individual task, we will only run max 2 folds.

yinweisu · 2023-07-27T20:03:47Z

src/autogluon/bench/cloud/aws/batch_stack/lambdas/lambda_function.py

+    # Iterate through the combinations and the second set of keys
+    for combo in specific_key_combinations:
+        for benchmark, tasks in config["module_configs"]["tabular"]["fold_to_run"].items():
+            for task, fold_numbers in tasks.items():
+                for fold_num in fold_numbers:
+                    new_config = {key: config[key] for key in common_keys}
+                    new_config.update(dict(zip(specific_keys, combo)))
+                    new_config["amlb_benchmark"] = benchmark
+                    new_config["amlb_task"] = task
+                    new_config["fold_to_run"] = fold_num
+                    job_id, config_s3_path = process_combination(
+                        new_config, metrics_bucket, batch_job_queue, batch_job_definition
+                    )
+                    job_configs[job_id] = config_s3_path


So this basically is generating a config for each fold in each task in each benchmark? And the config would only contain keys that are presented in the original config? It takes some time to understand these code, maybe consider adding some high level comment?

That's a good idea. What you described was mostly correct. Since only amlb_benchmark was required, if amlb_task does not have keys, we assume all tasks in the amlb_benchmark will be run; similarly, if folds_to_run does not have any key, we assume all folds in amlb_task will be run. If the aforementioned have keys, we only populate folds_to_run for the existing keys.

yinweisu · 2023-07-27T20:09:13Z

src/autogluon/bench/cloud/aws/batch_stack/lambdas/lambda_function.py

+    job_configs = {}
+
+    # Generate combinations for the first set of keys
+    specific_key_combinations = list(


These are actually values instead of keys right? I saw on line 302 it was zipped with the keys. This naming is kind of confusing

yinweisu

LGTM!

suzhoum force-pushed the parallel_tabular_tasks branch 8 times, most recently from 2203f12 to 9b73052 Compare July 27, 2023 02:21

suzhoum added 4 commits July 27, 2023 02:26

refactor

97ca3b7

ignore hidden directories

c422388

fix aggregation ray issue

f0eb151

install framework in setup script for tabular

14904a9

suzhoum force-pushed the parallel_tabular_tasks branch 8 times, most recently from f38bf32 to 0ca757b Compare July 27, 2023 18:42

suzhoum added 6 commits July 27, 2023 19:07

add option to run a specific fold in amlb

d982ec9

refactor download utils

31616d5

update path convention for configs uploaded

716e22e

skip setup when split_id is present as it has been done during docker…

a3c33ba

… build

Update lambda function

d3bf082

use default and custom amlb config to determine amlb tasks and specific fold to run (capped by `folds`)

suzhoum force-pushed the parallel_tabular_tasks branch 2 times, most recently from c3a0a92 to b05595e Compare July 27, 2023 19:17

suzhoum requested a review from yinweisu July 27, 2023 19:18

suzhoum marked this pull request as ready for review July 27, 2023 19:18

suzhoum added 2 commits July 27, 2023 19:28

update sample config

8e979f9

update test

fe48990

suzhoum force-pushed the parallel_tabular_tasks branch from b05595e to 250de48 Compare July 27, 2023 19:28

suzhoum added 3 commits July 27, 2023 19:43

remove python3.8 support since amlb does not support <python3.9

99b3693

lint

c45ebbc

umount after run

7e3878f

suzhoum force-pushed the parallel_tabular_tasks branch from 250de48 to 7e3878f Compare July 27, 2023 19:43

yinweisu reviewed Jul 27, 2023

View reviewed changes

suzhoum force-pushed the parallel_tabular_tasks branch 2 times, most recently from 0dd6020 to 3783112 Compare July 27, 2023 20:59

address comments

91e7840

suzhoum force-pushed the parallel_tabular_tasks branch from 3783112 to 91e7840 Compare July 27, 2023 21:28

yinweisu approved these changes Jul 27, 2023

View reviewed changes

suzhoum merged commit 52eee49 into autogluon:master Jul 27, 2023
3 checks passed

suzhoum mentioned this pull request Aug 2, 2023

Aggregating results has error due to pydantic version #39

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallel tabular tasks #41

Parallel tabular tasks #41

suzhoum commented Jul 21, 2023 •

edited

Loading

yinweisu Jul 27, 2023

yinweisu Jul 27, 2023

suzhoum Jul 27, 2023 •

edited

Loading

yinweisu Jul 27, 2023

suzhoum Jul 27, 2023

yinweisu Jul 27, 2023

suzhoum Jul 27, 2023

yinweisu left a comment

Parallel tabular tasks #41

Parallel tabular tasks #41

Conversation

suzhoum commented Jul 21, 2023 • edited Loading

yinweisu Jul 27, 2023

Choose a reason for hiding this comment

yinweisu Jul 27, 2023

Choose a reason for hiding this comment

suzhoum Jul 27, 2023 • edited Loading

Choose a reason for hiding this comment

yinweisu Jul 27, 2023

Choose a reason for hiding this comment

suzhoum Jul 27, 2023

Choose a reason for hiding this comment

yinweisu Jul 27, 2023

Choose a reason for hiding this comment

suzhoum Jul 27, 2023

Choose a reason for hiding this comment

yinweisu left a comment

Choose a reason for hiding this comment

suzhoum commented Jul 21, 2023 •

edited

Loading

suzhoum Jul 27, 2023 •

edited

Loading