Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more summarization models #2

Closed
enjalot opened this issue Feb 15, 2024 · 6 comments
Closed

Add more summarization models #2

enjalot opened this issue Feb 15, 2024 · 6 comments
Labels
enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed python
Milestone

Comments

@enjalot
Copy link
Owner

enjalot commented Feb 15, 2024

We could easily add more models to the list of chat models used for summarization:
https://github.com/enjalot/latent-scope/blob/main/latentscope/models/chat_models.json

There are plenty of small open source LLMs on HuggingFace that could be added to the list as long as the transformers provider is compatible.

We should also add more Chat API models, currently only OpenAI and Mistral are implemented in providers but Cohere and Together should be easy to add.

Comments on this issue for which models to add would be a helpful start. Especially if there are fine-tuned small models focused on summarization (my experience with out-of-the box 1B and 7B is that their quality isn't great).

@enjalot enjalot added enhancement New feature or request help wanted Extra attention is needed python good first issue Good for newcomers labels Feb 15, 2024
@enjalot
Copy link
Owner Author

enjalot commented May 15, 2024

I've updated in 0.2.3 to add Llama 3 8b instruct and removed some older transformer chat models that didn't perform well. It seems to do at least as well as gpt-3.5 did.

Adding more transformer models should still be done, but it's a little more complicated as some of them seem to have different prompt formats.

I still want to add Claude and other API providers.

@enjalot
Copy link
Owner Author

enjalot commented Jul 26, 2024

Ideally we use a similar HF hub search that was recently enabled for the embedding models. the only issue would be automatically sorting the chat templating for summarizing. I'm sure there is a library for this we could use

@enjalot
Copy link
Owner Author

enjalot commented Sep 28, 2024

Perhaps we use https://github.com/dottxt-ai/outlines
to simplify prompt template across models and get more reliable structured output (currently doing some hacks to get just the label back from the model)

@enjalot enjalot added this to the 1.0 milestone Sep 29, 2024
@enjalot
Copy link
Owner Author

enjalot commented Oct 1, 2024

I think we can search for "conversational" + "text-generation-interface" to get useful instruct models:
https://huggingface.co/models?other=conversational,text-generation-inference&sort=downloads

@meuleman
Copy link

meuleman commented Oct 3, 2024

#66 adds basic support for ollama, which provides a ton of models that can be run on a user's local machine (chat & embed): https://ollama.com/library

enjalot added a commit that referenced this issue Nov 20, 2024
@enjalot
Copy link
Owner Author

enjalot commented Dec 13, 2024

this was accomplished with better huggingface support as well as ollama support

@enjalot enjalot closed this as completed Dec 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed python
Projects
None yet
Development

No branches or pull requests

2 participants