
Custom inference endpoints #306

Closed
himsgpt opened this issue Jul 31, 2023 · 6 comments
Labels: duplicate (This issue or pull request already exists), enhancement (New feature or request)

Comments

@himsgpt commented Jul 31, 2023

Problem

Many enterprises are building their own LLMs. Can we use these instead of ChatGPT, Hugging Face, etc.? SageMaker is one option, but I should be able to provide just an inference endpoint and use it for prompts in JupyterLab.

Proposed Solution

Allow Jupyter AI to be configured with a custom inference endpoint, so that prompts from JupyterLab are sent to an enterprise-hosted model rather than to ChatGPT, Hugging Face, etc. A sketch of what such an integration might look like is below.
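For illustration, here is a minimal sketch of the kind of integration being requested: a custom LangChain LLM wrapper that forwards prompts to an HTTP inference endpoint. The endpoint URL and the JSON payload shape (`prompt` in, `generated_text` out) are hypothetical assumptions, not an existing API.

```python
# A minimal sketch of the requested integration, assuming a hypothetical
# HTTP inference endpoint that accepts {"prompt": ...} and returns
# {"generated_text": ...}. The URL and payload shape are illustrative only.
from typing import Any, List, Optional

import requests
from langchain.llms.base import LLM


class CustomEndpointLLM(LLM):
    """LangChain wrapper around an enterprise-hosted inference endpoint."""

    # Hypothetical, e.g. "https://llm.internal.example.com/generate"
    endpoint_url: str

    @property
    def _llm_type(self) -> str:
        return "custom-endpoint"

    def _call(self, prompt: str, stop: Optional[List[str]] = None, **kwargs: Any) -> str:
        # POST the prompt to the endpoint and return the generated text.
        response = requests.post(self.endpoint_url, json={"prompt": prompt}, timeout=60)
        response.raise_for_status()
        return response.json()["generated_text"]
```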

Additional context

himsgpt added the enhancement label on Jul 31, 2023
welcome bot commented Jul 31, 2023

Thank you for opening your first issue in this project! Engagement like this is essential for open source projects! 🤗

If you haven't done so already, check out Jupyter's Code of Conduct. Also, please try to follow the issue template, as it helps other community members contribute more effectively.

You can meet the other Jovyans by joining our Discourse forum. There is also an intro thread there where you can stop by and say Hi! 👋

Welcome to the Jupyter community! 🎉

@JasonWeill (Collaborator) commented
Are you asking about locally hosted models? If so, this feature request is being discussed in #190.

himsgpt (Author) commented Aug 1, 2023

Yes. Enterprises have been building and deploying custom LLMs on premises. We therefore want to use these custom endpoints in JupyterLab instead of ChatGPT, etc.

@JasonWeill (Collaborator) commented
@himsgpt Thanks for clarifying! Locally-hosted and on-prem models are high priorities for organizations that can't rely on third-party models for generative AI. Let's keep the conversation going in #190.

JasonWeill added the duplicate label on Aug 1, 2023
@c3-viral-lakhani commented

@JasonWeill We have our own inference endpoint with a model already deployed. Is it possible to configure the extension to point to this endpoint, or is there another way to use it directly?

Does the model have to come from GPT4All in order to be used with the jupyter-ai extension?

@JasonWeill (Collaborator) commented
@c3-viral-lakhani In #322, @dlqqq added support for OpenAI proxies. GPT4All is another way to use local models. If you have another locally deployed model, I recommend filing an issue and/or opening a pull request to add support for it to Jupyter AI, to ensure that the magic commands and chat UI work with your own endpoint. Thanks for your interest!
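To make the OpenAI-proxy pattern mentioned above concrete, here is a minimal sketch using LangChain's ChatOpenAI for illustration. The proxy URL below is a hypothetical placeholder for an enterprise service that speaks the OpenAI API; see #322 for the exact configuration that Jupyter AI itself exposes.

```python
# A minimal sketch of pointing an OpenAI-compatible client at a proxy,
# shown with LangChain's ChatOpenAI. The base URL is a hypothetical
# placeholder; consult #322 for Jupyter AI's own proxy configuration.
from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(
    openai_api_base="https://llm-proxy.internal.example.com/v1",  # hypothetical proxy URL
    openai_api_key="placeholder-key",  # many proxies still expect a (possibly dummy) key
    model_name="gpt-3.5-turbo",
)

print(llm.predict("Summarize this notebook cell in one sentence."))
```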
