
add changes for lora adapter support and /v1/models endpoint #121

Merged (2 commits) on Oct 31, 2024

Conversation

@sven-knoblauch (Contributor) commented Oct 9, 2024

Small changes to add the LORA_MODULES env variable (supporting one LoRA adapter) as a solution for #119, with the format: {"name": "xxx", "path": "xxx/xxxxx", "base_model_name": "xxx/xxxx"}

Also changes the /v1/models endpoint to return all models.
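For illustration, a minimal sketch of how a worker could consume such an env variable, assuming the JSON format described above (the variable names here are placeholders, not necessarily what this PR uses):

```python
import json
import os

# Illustrative only: read a single LoRA adapter definition from LORA_MODULES,
# e.g. {"name": "...", "path": "...", "base_model_name": "..."}
raw = os.getenv("LORA_MODULES")
lora_module = json.loads(raw) if raw else None
if lora_module:
    print(lora_module["name"], lora_module["path"], lora_module["base_model_name"])
```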

sven-knoblauch marked this pull request as draft on October 9, 2024 11:42
sven-knoblauch marked this pull request as ready for review on October 9, 2024 12:53
@pandyamarut (Collaborator) commented

Thanks for the PR, @sven-knoblauch. Can you please also describe how you tested it?

@sven-knoblauch (Contributor, Author) commented

I built a Docker container from the given Dockerfile (on Docker Hub: svenknob/runpod-vllm-worker) and tested it on RunPod serverless. It worked with a custom-trained LoRA adapter (added in the RunPod GUI as the LORA_MODULES env variable) on an AWQ Mistral model. The LoRA adapter is also visible in the /v1/models endpoint.
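A rough sketch of what such a test could look like against a RunPod serverless endpoint; the base URL, API key, and adapter name below are placeholders, and the OpenAI-compatible route is assumed rather than taken from this PR:

```python
import requests

# Placeholders: replace with your own endpoint ID, API key, and adapter name.
BASE_URL = "https://api.runpod.ai/v2/<endpoint_id>/openai/v1"
HEADERS = {"Authorization": "Bearer <runpod_api_key>"}

# The LoRA adapter should appear alongside the base model in /v1/models.
models = requests.get(f"{BASE_URL}/models", headers=HEADERS).json()
print([m["id"] for m in models["data"]])

# Request inference against the adapter by using its "name" as the model.
resp = requests.post(
    f"{BASE_URL}/chat/completions",
    headers=HEADERS,
    json={
        "model": "<lora_adapter_name>",
        "messages": [{"role": "user", "content": "Hello"}],
    },
)
print(resp.json())
```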

pandyamarut merged commit 6e8696c into runpod-workers:main on Oct 31, 2024

@nerdylive123 commented

Hi, is there documentation for this env variable's usage? In the markdown, perhaps?

@nielsrolf commented

Is there a publicly available Docker image that contains this PR?

@sven-knoblauch (Contributor, Author) commented

I added a pull request to update the README: #130.
Usage is similar to the "original" vLLM server. The env variable name is LORA_MODULES and the format is {"name": "xxx", "path": "xxx/xxxx", "base_model_name": "xxx/xxxx"}, where name is the model name that HTTP requests target, path is the Hugging Face path of the adapter, and base_model_name is the name of the base model it was trained on.

For now you can use my Docker image svenknob/runpod-vllm-worker until an official one has been published.
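A hedged example of what a concrete LORA_MODULES value might look like; the adapter and base model names below are made up, chosen only to match the documented format:

```python
import json

# Made-up names, following the format described above.
lora_modules = {
    "name": "my-adapter",                             # model name used in requests
    "path": "my-org/my-lora-adapter",                 # Hugging Face path of the adapter
    "base_model_name": "mistralai/Mistral-7B-v0.1",   # base model it was trained on
}
print(json.dumps(lora_modules))  # set this JSON string as the LORA_MODULES env variable
```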

4 participants