FIX Multiple adapters and modules_to_save #1615
Merged
Resolves #1574.
Previously, we had the bug that if we had multiple adapters, some with `modules_to_save` and others without, then when trying to switch to an adapter without `modules_to_save`, the `ModulesToSaveWrapper` would raise an error because it could not find that adapter. Now, when it detects this, the wrapper is just disabled (so the original weights are used).
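To illustrate the scenario, here is a minimal sketch; the model name, adapter names, and target modules are placeholder assumptions, not taken from the PR:

```python
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, get_peft_model

base = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")

# First adapter wraps the classifier head via modules_to_save.
config_a = LoraConfig(target_modules=["query", "value"], modules_to_save=["classifier"])
model = get_peft_model(base, config_a, adapter_name="adapter_a")

# Second adapter has no modules_to_save.
config_b = LoraConfig(target_modules=["query", "value"])
model.add_adapter("adapter_b", config_b)

# Previously this raised inside ModulesToSaveWrapper because it has no entry
# for "adapter_b"; now the wrapper is simply disabled and the original
# classifier weights are used.
model.set_adapter("adapter_b")
```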
Moreover, we had the issue that when we were using classes such as `PeftModelForSequenceClassification`, we implicitly added the classifier layers to `model.modules_to_save`. However, this would only add a new `ModulesToSaveWrapper` instance for the first adapter being initialized. When initializing a 2nd adapter via `add_adapter`, this information was ignored. To fix this, I now update `peft_config.modules_to_save` to explicitly add the classifier layers. This is a departure from how this worked previously, but I couldn't find a better way to ensure that this bug was fixed (LMK if you have a suggestion).
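A rough sketch of the scenario this addresses, again with placeholder model and adapter names and assuming the sequence classification task type:

```python
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, TaskType, get_peft_model

base = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)

# No explicit modules_to_save: for SEQ_CLS, the classifier head is added implicitly.
config_a = LoraConfig(task_type=TaskType.SEQ_CLS, target_modules=["query", "value"])
model = get_peft_model(base, config_a, adapter_name="adapter_a")

# Previously, the implicit classifier entry only produced a ModulesToSaveWrapper
# for "adapter_a"; adding a 2nd adapter ignored it. With the fix,
# peft_config.modules_to_save is updated explicitly, so "adapter_b" also gets
# its own copy of the classifier head.
config_b = LoraConfig(task_type=TaskType.SEQ_CLS, target_modules=["query", "value"])
model.add_adapter("adapter_b", config_b)

print(model.peft_config["adapter_a"].modules_to_save)
```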
Finally, there was a bug in `add_weighted_adapter` when we were merging multiple adapters with `modules_to_save`. Previously, when we called `model.add_weighted_adapter`, the LoRA weights were merged and a new `ModulesToSaveWrapper` was added for the new adapter based on the first `LoraConfig` of the two adapters. This `ModulesToSaveWrapper` is just a copy of the original weights. Thus, when we switch to the newly merged adapter, we just use the original weights for `modules_to_save`. This doesn't make a lot of sense and is probably surprising for the user. Now, we raise an error when we detect this to alert the user to this fact.
Note that when only one of the adapters to be merged has `modules_to_save`, this does not raise an error; instead, that module is used.
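A short sketch of the new `add_weighted_adapter` behavior, continuing the hypothetical setup from above (where both adapters wrap the classifier); the exact error type and message are not spelled out here:

```python
# Both "adapter_a" and "adapter_b" wrap the classifier via modules_to_save,
# so merging them would previously just copy the original classifier weights.
# With this PR, that case raises an error instead.
model.add_weighted_adapter(
    adapters=["adapter_a", "adapter_b"],
    weights=[0.5, 0.5],
    adapter_name="merged",
)

# If only one of the merged adapters had modules_to_save, no error would be
# raised and that adapter's module would be used for the merged adapter.
```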
Edit: If this is merged, we should add a note to the next release notes about possible changes, as this results in a different model for the same config (though it's a rather specific edge case). We should clarify that the previous behavior was erroneous and the new one is correct.