Sharing Models through the Hugging Face Hub #86

osanseviero · 2021-08-24T20:31:50Z

Hi CRFM team!

Mistral is very exciting! I see you currently share your model checkpoints through links to a hosted server. Would you be interested in sharing the pretrained models in the Hugging Face Hub? We already have a similar collaboration with the Stanford NLP group (see org).

The Hub offers free hosting of over 20K models, and it would make your work more accessible and visible to the rest of the community. Some of the benefits of sharing your models would be:

forget about the pain of managing the hosting
built-in versioning
commit history and diffs
repos provide useful metadata about their tasks, languages, metrics, etc that is useful for discoverability but also to understand the model

Creating the repos and adding new models should be a relatively straightforward process if you've used Git before. This is a step-by-step guide explaining the process in case you're interested. Please let us know if you would be interested and if you have any questions.

In a future we could also integrate this to our Inference API so users can play with the models directly in the browser with our widgets.

Happy to hear your thoughts,
Omar and the Hugging Face team

cc @lewtun @anton-l @LysandreJik

siddk · 2021-08-24T20:34:32Z

Hey @osanseviero - this is something we were talking about, but one clarification; would you be able to host all 610 checkpoints for each of the 10 runs (6100 checkpoints total, ~22TB)?

We figured this could get complicated (and expensive), but if you can do it, we can go through the process! Would also let us push our slightly tweaked/stable GPT-2 model definition up to HF as well!

osanseviero · 2021-08-24T20:53:14Z

Hey @siddk! That's great to hear. Yes, we're up to host all your checkpoints if you would really like to share all of them. The way I would suggest to do it is to have one repository for each experiment and do a new commit for each checkpoint. Users would then be able to load the checkpoint from a given revision and the working widget would use the latest checkpoint, which should correspond to step 400,000.

siddk · 2021-08-24T21:25:58Z

Sounds great. I think we need to do a bit of clean-up on our side this week then, but we will start the process above early next week! Looking forward to working through this with you.

And a heads up - this probably won't be the last set of models we train 🙂 ! Looking forward to fostering a stronger relationship with HF as we keep exploring!

dlwh · 2022-03-10T19:52:19Z

@siddk what's the definition of done here? is it having uploaded 6100 model checkpoints? (as opposed to the 10 that are there now?) Are we going to do that?

siddk · 2022-03-10T20:11:09Z

These should all be done - if you look at the different branches, you should see all 610 checkpoints (see here: https://huggingface.co/stanford-crfm/arwen-gpt2-medium-x21/tree/main).

dlwh · 2022-03-10T20:15:53Z

ok, our readme.md says git clone https://huggingface.co/stanford-crfm/arwen-x21-checkpoint-400000 so we should probably fix

dlwh · 2022-03-10T21:45:01Z

i'm closing this and i opened #123

siddk mentioned this issue Sep 1, 2021

Adding Mistral Checkpoints to HF Hub huggingface/huggingface_hub#300

Closed

osanseviero mentioned this issue Sep 8, 2021

Upcasting of attention computation for reliable pretraining of GPT-2 models huggingface/transformers#13463

Closed

siddk mentioned this issue Sep 15, 2021

Add Mistral GPT-2 Stability Tweaks huggingface/transformers#13573

Merged

siddk changed the title ~~Sharing models through the Hugging Face Hub~~ Sharing Models through the Hugging Face Hub Sep 19, 2021

dlwh added this to the Mistral V2 milestone Mar 10, 2022

dlwh mentioned this issue Mar 10, 2022

Generate Model Cards for models #118

Closed

dlwh mentioned this issue Mar 10, 2022

update documentation on accessing new Mistral checkpoints on HF Hub #123

Closed

dlwh closed this as completed Mar 10, 2022

osanseviero mentioned this issue Mar 16, 2022

Document model repo best practices huggingface/hub-docs#53

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sharing Models through the Hugging Face Hub #86

Sharing Models through the Hugging Face Hub #86

osanseviero commented Aug 24, 2021

siddk commented Aug 24, 2021 •

edited

Loading

osanseviero commented Aug 24, 2021

siddk commented Aug 24, 2021

dlwh commented Mar 10, 2022

siddk commented Mar 10, 2022

dlwh commented Mar 10, 2022

dlwh commented Mar 10, 2022

Sharing Models through the Hugging Face Hub #86

Sharing Models through the Hugging Face Hub #86

Comments

osanseviero commented Aug 24, 2021

siddk commented Aug 24, 2021 • edited Loading

osanseviero commented Aug 24, 2021

siddk commented Aug 24, 2021

dlwh commented Mar 10, 2022

siddk commented Mar 10, 2022

dlwh commented Mar 10, 2022

dlwh commented Mar 10, 2022

siddk commented Aug 24, 2021 •

edited

Loading