QDoRA: Support DoRA with BnB quantization #1518
Conversation
WIP: Adds support for DoRA on 4-bit and 8-bit quantized models with BnB. For now, merging is not implemented; I'll investigate this next.
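For context, here is a rough sketch of how DoRA on a bnb-quantized model would be set up with the `use_dora=True` flag; the model id and target modules below are placeholders, not prescribed by this PR:

```py
# Hedged QDoRA sketch (model id and target_modules are placeholders).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# load the base model in 4-bit with bitsandbytes
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
base_model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-125m", quantization_config=bnb_config
)

# use_dora=True enables DoRA on the targeted (quantized) linear layers
lora_config = LoraConfig(use_dora=True, target_modules=["q_proj", "v_proj"])
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()
```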
Looks great to me, thank you @BenjaminBossan for adding quantization support for DoRA!
Thank you @BenjaminBossan for adding support for using DoRA with bnb-quantized layers, and for the thorough tests! 🤗
docs/source/developer_guides/lora.md (Outdated)

```py
from peft import LoraConfig

config = LoraConfig(use_dora=True, ...)
```

DoRA should work with weights quantized with bitsandbytes ("QDoRA"). Issues have been reported when using QDoRA with DeepSpeed Zero2.
This should go in a Caveats section where all such notes can be collated in one place.
I added a caveats section directly below, within the DoRA section. Is this what you had in mind?
Thank you @BenjaminBossan! ✨
Adds support for DoRA on 4-bit and 8-bit quantized models with BnB. Merging also works, with the usual caveats for quantized weights (results are not 100% identical), but it's not worse than vanilla LoRA.
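As an illustration, a hedged sketch of what merging a trained QDoRA adapter could look like with PEFT's existing merge_and_unload API; the model id and adapter path are placeholders:

```py
# Hedged sketch of merging a QDoRA adapter into a bnb-quantized base model.
# Merging into quantized weights is lossy, which is why the results are not
# 100% identical to the unmerged model (see the note above).
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import PeftModel

base_model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-125m",  # placeholder model id
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
)
peft_model = PeftModel.from_pretrained(base_model, "path/to/qdora-adapter")
merged_model = peft_model.merge_and_unload()
```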
QDoRA seems to be working for me. However, I am noticing a large slowdown of around 2x when comparing QLoRA and QDoRA. I am not sure if that is expected or not, but this seems as good a place as any to share this finding.
Yes, QDoRA unfortunately requires an additional dequantization step on the quantized weights to calculate the weight norm. I wouldn't expect this to slow down training by 2x, but a significant slowdown is expected. Maybe you can run a profiler to check further if you think it's worth investigating.
You can also create new issues or discussions (if there aren't already existing ones) for this type of question.
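To make the extra cost concrete, here is a rough, non-authoritative sketch of the additional work a DoRA layer does on quantized weights; this is not PEFT's actual code, and `dequantize_fn` stands in for whatever bitsandbytes dequantization routine applies:

```py
# Rough sketch (not PEFT's implementation) of why QDoRA is slower than QLoRA:
# DoRA needs the norm of W + scaling * B @ A, but W is stored quantized,
# so it must be dequantized first.
import torch

def dora_weight_norm(quantized_weight, lora_A, lora_B, scaling, dequantize_fn):
    # dequantize_fn is assumed to wrap the appropriate bitsandbytes routine,
    # e.g. turning a 4-bit weight back into a dense bf16 tensor
    weight = dequantize_fn(quantized_weight)       # extra step compared to QLoRA
    merged = weight + scaling * (lora_B @ lora_A)  # W + delta_W
    # one norm per output feature (rows of an nn.Linear weight)
    return torch.linalg.norm(merged, dim=1)
```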
Don't pass load_in_8bit to AutoModel.from_pretrained; instead, use BitsAndBytesConfig. There was already a PR to clean this up (huggingface#1552), but a slightly later PR (huggingface#1518) re-added this usage.
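For reference, a minimal sketch of the recommended pattern; the model id is a placeholder:

```py
# Pass a BitsAndBytesConfig via quantization_config instead of passing
# load_in_8bit=True directly to from_pretrained.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-125m", quantization_config=bnb_config
)
```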
Adds support for DoRA on 4-bit and 8-bit quantized models with bitsandbytes. Merging also works, with the usual caveats for quantized weights (results are not 100% identical), but it's not worse than vanilla LoRA.
I did some quick tests and could see the expected memory savings with bnb. As with DoRA on non-quantized layers, using DoRA on quantized layers leads to a moderate increase in runtime.