WIP: New python based entry point for containers #1686
@SlyEcho This is the entry point change I mentioned. |
Does it allow passing arguments with spaces? |
I'm not sure what exactly you mean. My implementation makes some changes. But, for example, these work:
But I wouldn't really want this specific version merged. I would prefer subcommands over those args, as I wouldn't have to worry about overloading arguments between run/quantize/etc. That's not to say it's impossible, but subcommands offer flexibility you don't have with multiple arguments. |
That's what I meant.
Couldn't it be done by just checking |
Advantage of python Now you can do that with plain conditional checks on argv elements, but you would also have to implement some other logic to make it robust. And that puts an unnecessary burden on maintainers. What I would propose is something like this:
Each one of those would have its own args, so no chance of overloading. Actually, I'm a bit ashamed I haven't proposed it already. Edit: I'll make an asciinema demo later today, when I have more time. |
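To illustrate the subcommand idea, here is a minimal sketch using the standard library's `argparse`. It is not the script from this PR: the program name `entrypoint`, the subcommand names, the flags, and the default quantization type are all assumptions chosen for illustration. It only shows how each subcommand gets its own namespace of arguments, so the same flag can be reused without overloading.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # "entrypoint" is a hypothetical program name, not the PR's actual script.
    parser = argparse.ArgumentParser(prog="entrypoint")
    subparsers = parser.add_subparsers(dest="command", required=True)

    # Hypothetical "run" subcommand with its own arguments.
    run = subparsers.add_parser("run", help="run inference on a model")
    run.add_argument("--model", required=True)
    run.add_argument("--prompt", default="")

    # Hypothetical "quantize" subcommand; it reuses --model without
    # clashing with the flag of the same name under "run".
    quantize = subparsers.add_parser("quantize", help="quantize a model")
    quantize.add_argument("--model", required=True)
    quantize.add_argument("--type", default="q4_0")  # illustrative default

    return parser


# Explicit argv lists are used here so the sketch does not depend on sys.argv.
parser = build_parser()
run_args = parser.parse_args(
    ["run", "--model", "model.gguf", "--prompt", "Hello there, world"]
)
quantize_args = parser.parse_args(["quantize", "--model", "model.gguf"])
```

Because the arguments arrive as separate argv elements, a value containing spaces (the prompt above) stays a single argument, and `quantize_args` has no `prompt` attribute at all since that flag belongs only to `run`.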
I have the promised demo.[0] It's an asciinema[1] recording, so you can play it in a terminal. I do recommend using the [0] https://gist.github.com/jpodivin/ef4d037c21bfc2ce0a9f91b1d3f29ea5 |
5020781 to 89e7976
@SlyEcho wdyt? |
I have not had time to test it yet. |
89e7976 to 048fd14
@SlyEcho So are we moving this out of WIP? Or should I just close it? |
I think it's still worth it. There are a couple of things that have changed:
1. A lot more Dockerfiles.
2. The model format changed, so now we have .gguf files. |
Right. I'll take a look at it.
On Tue, Aug 29, 2023 at 6:55 PM Henri Vasserman wrote:
I think it's still worth it.
There are a couple of things that have changed:
1. A lot more Dockerfiles.
2. The model format changed, so now we have .gguf files.
Signed-off-by: Jiri Podivin <jpodivin@gmail.com>
048fd14 to bb0cadf
@SlyEcho I've updated the script to work with the new model format and the server binary. I've also replaced the entry point in the other container files. I've tested standard file quantization, conversion, run, and the server binary with open-llama. |