[PyTorchModelHubMixin] Fix saving model with shared tensors #2086

NielsRogge · 2024-03-04T13:51:35Z

This PR uses save_model instead of save_file in order to properly save shared tensors for the PyTorchModelHubMixin.

In the future, this may be replaced by save_file again when we support saving sharded checkpoints, which deduplicates shared tensors as in the Transformers library.

HuggingFaceDocBuilderDev · 2024-03-04T13:57:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

codecov-commenter · 2024-03-04T13:57:15Z

Codecov Report

Attention: Patch coverage is 0% with 6 lines in your changes are missing coverage. Please review.

Project coverage is 82.92%. Comparing base (0ab9391) to head (3cc9f71).

❗ Current head 3cc9f71 differs from pull request most recent head eb0111f. Consider uploading reports for the commit eb0111f to get more accurate results

Files	Patch %	Lines
src/huggingface_hub/hub_mixin.py	0.00%	6 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2086      +/-   ##
==========================================
+ Coverage   82.90%   82.92%   +0.02%     
==========================================
  Files         102      102              
  Lines        9480     9477       -3     
==========================================
  Hits         7859     7859              
+ Misses       1621     1618       -3

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Wauplin · 2024-03-05T11:03:56Z

Thanks for the PR @NielsRogge. I've added a test + used safetensors.torch.load_model to correctly load models with shared tensors. Everything should be fine now.

Regarding load_model, it does not support the device parameter at the moment. I have open a PR to fix this in safetensors directly: huggingface/safetensors#449. Will add back proper support once merged. Current workaround is to load on cpu first and then move to gpu (=> increase loading time but 🤷)

Wauplin

Let's get this merged :)

* Use save_model * Use model * add tests + comments + load_model * move to device --------- Co-authored-by: Lucain Pouget <lucainp@gmail.com>

NielsRogge added 2 commits March 4, 2024 13:58

Use save_model

5b48a2e

Use model

2849871

NielsRogge changed the title ~~Feature/add save model~~ [PyTorchModelHubMixin] Use save_model instead of save_file Mar 4, 2024

NielsRogge requested a review from Wauplin March 4, 2024 13:56

add tests + comments + load_model

3cc9f71

move to device

eb0111f

Wauplin approved these changes Mar 6, 2024

View reviewed changes

Wauplin changed the title ~~[PyTorchModelHubMixin] Use save_model instead of save_file~~ [PyTorchModelHubMixin] Fix saving model with shared tensors Mar 6, 2024

Wauplin merged commit 4286577 into huggingface:main Mar 6, 2024
14 checks passed

Wauplin added a commit that referenced this pull request Mar 6, 2024

[PyTorchModelHubMixin] Fix saving model with shared tensors (#2086)

0d331b8

* Use save_model * Use model * add tests + comments + load_model * move to device --------- Co-authored-by: Lucain Pouget <lucainp@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[PyTorchModelHubMixin] Fix saving model with shared tensors #2086

[PyTorchModelHubMixin] Fix saving model with shared tensors #2086

NielsRogge commented Mar 4, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Mar 4, 2024

codecov-commenter commented Mar 4, 2024 •

edited

Loading

Wauplin commented Mar 5, 2024 •

edited

Loading

Wauplin left a comment •

edited

Loading

[PyTorchModelHubMixin] Fix saving model with shared tensors #2086

[PyTorchModelHubMixin] Fix saving model with shared tensors #2086

Conversation

NielsRogge commented Mar 4, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Mar 4, 2024

codecov-commenter commented Mar 4, 2024 • edited Loading

Codecov Report

Wauplin commented Mar 5, 2024 • edited Loading

Wauplin left a comment • edited Loading

Choose a reason for hiding this comment

NielsRogge commented Mar 4, 2024 •

edited

Loading

codecov-commenter commented Mar 4, 2024 •

edited

Loading

Wauplin commented Mar 5, 2024 •

edited

Loading

Wauplin left a comment •

edited

Loading