Eliminate short-circuiting for loading from local #3600

Infernaught · 2023-09-12T21:09:27Z

Previously, we would raise an error if a local path was passed as base_model in our config. This PR adds a check that verifies whether the local path is valid before checking whether it is a valid HF path.
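For readers skimming the thread, the ordering described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in ludwig/schema/llms/base_model.py: the function name `resolve_base_model` and the injected `is_hub_model` callable are hypothetical stand-ins, and only the `os.path.isdir` check comes from the diff in this PR.

```python
import os
from typing import Callable


def resolve_base_model(
    model_name: str,
    is_hub_model: Callable[[str], bool] = lambda name: False,
) -> str:
    """Return a usable model reference, preferring an existing local directory.

    `is_hub_model` is a hypothetical stand-in for whatever Hub lookup the
    real validator performs; it is injected so the sketch runs offline.
    """
    # Local check first: an existing directory is accepted as-is,
    # so it never reaches the Hub lookup below.
    if os.path.isdir(model_name):
        return model_name

    # Only then fall back to treating the value as a Hub model ID.
    if is_hub_model(model_name):
        return model_name

    raise ValueError(
        f"base_model '{model_name}' is neither an existing local directory "
        "nor a model ID that could be resolved on the Hugging Face Hub."
    )
```

The point of the ordering is that a valid local directory returns early instead of being rejected by a Hub-only check.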

ludwig/schema/llms/base_model.py

arnavgarg1

It would be good to add a test for this in test_llm.py that downloads weights to a local directory, uses that path as the base_model name, and makes sure there are no errors when we call ModelConfig.from_dict. Alternatively, I wouldn't be opposed to the test training for 3-4 steps instead. I'd be okay with either, but a test would be good.

github-actions · 2023-09-12T22:57:57Z

Unit Test Results

  6 files ±0   6 suites ±0 39m 56s ⏱️ ±0s
31 tests ±0 26 ✔️ ±0   5 💤 ±0 0 ❌ ±0
82 runs ±0 66 ✔️ ±0 16 💤 ±0 0 ❌ ±0

Results for commit fddd82d. ± Comparison against base commit c6964f0.

♻️ This comment has been updated with latest results.

justinxzhao

If you rebase, you should be able to get a clean CI

arnavgarg1 · 2023-09-15T19:55:23Z

ludwig/schema/llms/base_model.py

+            if os.path.isdir(model_name):
+                return model_name


Hmm, wondering if we should add some more checks here, the most basic one being that the directory should not be empty. In an ideal world, we would also add validation to ensure that the model config exists in this directory and that it can be initialized correctly from this directory using the same code block from line 57.

We can do this in a fast-follow, but for completeness I think these additional checks are important. What do you think? @Infernaught @justinxzhao
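A minimal sketch of what such checks could look like, using only the standard library. The helper name `validate_local_model_dir` is hypothetical, and the sketch assumes the model config is stored as `config.json` (the file name Hugging Face `save_pretrained` writes). It only checks that the file exists and parses as JSON; it does not attempt to initialize the config through Transformers.

```python
import json
import os


def validate_local_model_dir(path: str) -> str:
    """Fail fast with an actionable message if `path` cannot hold a model.

    Hypothetical helper: assumes the config lives in `config.json`.
    """
    if not os.path.isdir(path):
        raise ValueError(
            f"'{path}' is not a directory. Pass the directory that contains "
            "the saved model files."
        )

    # Most basic check: the directory must not be empty.
    if not os.listdir(path):
        raise ValueError(
            f"'{path}' is empty. Download or save the model weights and "
            "config into this directory first."
        )

    config_path = os.path.join(path, "config.json")
    if not os.path.isfile(config_path):
        raise ValueError(
            f"No config.json found in '{path}'. Re-save the model so that "
            "its config is written next to the weights."
        )

    # Catch a truncated or corrupted config before any model loading starts.
    try:
        with open(config_path, encoding="utf-8") as f:
            json.load(f)
    except json.JSONDecodeError as e:
        raise ValueError(
            f"'{config_path}' is not valid JSON ({e}). Re-download or "
            "re-save the model config."
        ) from e

    return path
```

Each failure names the path and suggests a fix, so a user-side mistake surfaces before any model loading is attempted.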

Cool. Justin and I expect HF to give us a failure message if the model objects are bad, so it's probably not that bad if we don't have these checks for the time being. I'm going to merge this for now, but I definitely think we should keep thinking about how to properly verify this.

Okay with the merge for now as well, but I do think we should follow up with custom validation here and raise errors that clearly explain what went wrong and how to fix it.

I do think we should add these checks now because they're honestly trivial to add and we always want to fail fast. There's nothing wrong with leaving it up to HF, but without these checks I guarantee that Ludwig users will message us asking why they're seeing cryptic import errors or not-found errors from HF when the actual problem is a user-side error. The reason I personally like these kinds of validation checks is that they let us get ahead of the problem and provide clear, crisp resolution steps so that users can unblock themselves, and that is what I would lean towards.

Eliminate short-circuiting for loading from local

c5fa844

Infernaught requested review from justinxzhao and arnavgarg1 September 12, 2023 21:09

arnavgarg1 reviewed Sep 12, 2023

ludwig/schema/llms/base_model.py (outdated)

arnavgarg1 reviewed Sep 12, 2023

Infernaught added 2 commits September 13, 2023 18:12

Clarify comment on config validation

8c9ca60

Add test for local path loading

a4f5adc

justinxzhao approved these changes Sep 15, 2023

Merge branch 'master' of github.com:ludwig-ai/ludwig into local_path_fix

fddd82d

arnavgarg1 reviewed Sep 15, 2023

Infernaught merged commit 4806254 into master Sep 15, 2023
16 checks passed

Infernaught deleted the local_path_fix branch September 15, 2023 21:14
