-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multilora docs #22865
base: gh-pages
Are you sure you want to change the base?
Multilora docs #22865
Conversation
ded2bd8
to
f7e59ab
Compare
|
||
## Schema | ||
|
||
The schema for the file is as follows |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could we instead point users to the schema inside the repo so we don't need to copy the contents every time there is a change?
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/lora/adapter_format/adapter_schema.fbs
|
||
#### Return value | ||
|
||
`OgaResult` containing an error message if the adapter failed to unload. This can occur if the method is called with an adapter that is not already loaded. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
`OgaResult` containing an error message if the adapter failed to unload. This can occur if the method is called with an adapter that is not already loaded. | |
`OgaResult` containing an error message if the adapter failed to unload. This can occur if the method is called with an adapter that is not already loaded or has been marked active by a `OgaGenerator` still in use. |
This example is shown in Python, but you can also use the C/C++ API, the C# API, and the Java API (_coming soon!_) | ||
|
||
```python | ||
import onnxruntime_genai as og |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While this is ok, shall we update model-qa.py
in onnxruntime-genai to add support for loading adapters?
Staged here: