@AugustRush I can reproduce this error. I think it happens because of how and when we quantise the model and load the LoRA weights. I will look into this in more detail later...
In the non-local-path (quantize-at-load-time) case: since the LoRA weights are assumed to be non-quantized, we first merge them into the yet-to-be-quantized model and then quantize the whole thing. But when the original model is already quantized, we cannot merge the non-quantized LoRA weights into it the same way.
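One conceptual direction (an untested sketch, not mflux's actual code; the function and argument names are illustrative) would be to dequantize each affected weight, apply the LoRA delta in full precision, and then quantize again:

```python
import mlx.core as mx

def merge_lora_into_quantized(w_q, scales, biases, lora_a, lora_b,
                              lora_scale=1.0, group_size=64, bits=8):
    # Recover the full-precision weight from its packed representation.
    w = mx.dequantize(w_q, scales, biases, group_size=group_size, bits=bits)
    # Apply the LoRA update in full precision, where the shapes line up.
    w = w + lora_scale * (lora_b @ lora_a)
    # Re-quantize the merged weight; returns (w_q, scales, biases).
    return mx.quantize(w, group_size=group_size, bits=bits)
```

The round trip through dequantize/quantize adds a little extra quantization error on top of the already-quantized base weights, but for a baked-in LoRA that is probably acceptable.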
At the moment I am not fully sure how easy that is to fix in practice, but one option I can add (which is nice to have regardless) is for save.py to support the --lora-paths and --lora-scales arguments. That way you could save a merged version of the weights with the LoRA baked in, and when you later run the model you would not need to specify the LoRA files at all. One obvious downside is of course that you cannot easily swap LoRAs afterwards.
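For illustration, the flow could look something like this once save.py supports those arguments (hypothetical; only --lora-paths and --lora-scales are part of the proposal, the other flags and paths are placeholders):

```bash
# 1. Save a quantized model with the LoRA merged into the weights
python save.py --model dev --quantize 8 \
    --lora-paths /path/to/my_lora.safetensors --lora-scales 1.0 \
    --path /path/to/merged_model

# 2. Later runs load the merged model directly, with no --lora-paths needed
```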
weight = transWeight + lora_scale * (lora_b @ lora_a)
~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
ValueError: Shapes (3072,768) and (3072,3072) cannot be broadcast.
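For context, the (3072, 768) shape in the error is the packed quantized weight, while the LoRA delta is still the full (3072, 3072) matrix, so the addition cannot broadcast. A small standalone sketch (assuming MLX 8-bit quantization, which packs four 8-bit values into each uint32 along the last axis) reproduces the mismatch:

```python
import mlx.core as mx

w = mx.random.normal((3072, 3072))            # full-precision weight
w_q, scales, biases = mx.quantize(w, bits=8)  # packed quantized weight
print(w_q.shape, w_q.dtype)                   # (3072, 768) uint32

lora_a = mx.random.normal((16, 3072))         # illustrative rank-16 LoRA
lora_b = mx.random.normal((3072, 16))
delta = lora_b @ lora_a
print(delta.shape)                            # (3072, 3072): cannot be added to w_q
```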