-
Notifications
You must be signed in to change notification settings - Fork 196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix/errors for some models including Mistral-7b and Flan-ul2 #388
Fix/errors for some models including Mistral-7b and Flan-ul2 #388
Conversation
lachlancahill
commented
Nov 18, 2023
- Use os.path.basename for all OS compatibility including Windows.
- Identify if the hub has downloaded to a snapshots subfolder and move to root directory before committing so that commits work.
…ub the binary files write to a snapshots/<uuid> subdirectory. Then when the files are committed, they cannot be found and the commit fails with an error.
…bility with all OS including Windows.
Thanks a lot for your contribution @lachlancahill . Happy to include the Windows fix (my bad on that path splitting). For the snapshots thing I don't really like it as it is (anyone could have used any path there). |
My pleasure, thank you for your great feedback. I've removed the 'snapshots' change from the PR though worth keeping in mind that if we take out that whole block we need to add back the original To provide more detail on the snapshots issue, while investigating errors I noticed the folder structure for Mistral (similar for flan-ul2) was like the below in the temporary directory:
I think the version I was working with had a step which checked the converted files, and this non-standard folder structure was causing issues (I think the checking function was looking in the root folder for the .safetensors files). In the latest version it looks like you have removed that check which I think makes this change unnecessary. Thanks for all your great work, this is such a valuable resource. 😄 |
Yes I removed the check because we're now pretty confident about the conversions as they exist, and removing it allows to convert models with much less RAM than the actual model requires. I would keep it if I knew a given machine had enough RAM to run the check, but it seems too involved/unreliable atm. |