Add more summarization models #2
Comments
In 0.2.3 I added Llama 3 8B Instruct and removed some older transformer chat models that didn't perform well. It seems to do at least as well as gpt-3.5 did. Adding more transformer models should still be done, but it's a little more complicated, as some of them seem to have different prompt formats. I still want to add Claude and other API providers.
Ideally we'd use the same HF hub search that was recently enabled for the embedding models. The only issue would be automatically sorting out the chat templating for summarizing. I'm sure there is a library for this we could use.
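To illustrate the chat-templating problem mentioned above: different instruct models wrap the same user message in different special-token formats, so one hard-coded prompt string won't work across models. A minimal sketch (the two template strings reflect the real Llama 3 Instruct and ChatML conventions, but `format_prompt` is a hypothetical helper, not latent-scope code):

```python
# Sketch: why summarization prompts can't share one template.
# Different instruct models expect different special-token wrappers.
# format_prompt is a hypothetical helper, not part of latent-scope.

TEMPLATES = {
    # Llama 3 Instruct header/eot token format
    "llama3": (
        "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\n"
        "{user}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n"
    ),
    # ChatML, used by e.g. Qwen and several fine-tunes
    "chatml": "<|im_start|>user\n{user}<|im_end|>\n<|im_start|>assistant\n",
}

def format_prompt(style: str, user: str) -> str:
    """Wrap a user message in the given chat style."""
    return TEMPLATES[style].format(user=user)

if __name__ == "__main__":
    msg = "Summarize: cats, dogs, hamsters"
    print(format_prompt("llama3", msg))
    print(format_prompt("chatml", msg))
```

In practice the `transformers` library already solves this: `tokenizer.apply_chat_template` reads each model's template from its `tokenizer_config.json`, so we wouldn't need to hard-code formats like these.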
Perhaps we use https://github.com/dottxt-ai/outlines
I think we can search for the "conversational" + "text-generation-inference" tags to get useful instruct models.
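The tag search above can go through the hub's public `/api/models` endpoint, which accepts repeated `filter` parameters. A stdlib-only sketch that builds the query URL (`build_search_url` is a hypothetical helper, not latent-scope code; in practice the `huggingface_hub` library wraps this endpoint):

```python
import urllib.parse

# Sketch: build the HF hub API query for the tag search suggested above.
# The /api/models endpoint and its repeated "filter" parameter are the
# public hub API; build_search_url is a hypothetical helper.

HUB_API = "https://huggingface.co/api/models"

def build_search_url(tags, limit=20):
    """Return a URL listing models that carry every tag in `tags`."""
    query = urllib.parse.urlencode(
        [("filter", t) for t in tags] + [("limit", str(limit))]
    )
    return f"{HUB_API}?{query}"

url = build_search_url(["conversational", "text-generation-inference"])
# Fetch with urllib.request.urlopen(url) and JSON-decode the result
# to get a list of matching model repos.
```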
#66 adds basic support for ollama, which provides a ton of models that can be run on a user's local machine (chat & embed): https://ollama.com/library |
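Ollama exposes a local REST API (`POST /api/chat` on port 11434) that takes an OpenAI-style message list. A stdlib-only sketch of building such a request for summarization (`build_request` is a hypothetical helper, not the code from #66; the endpoint and payload shape follow Ollama's documented API):

```python
import json

# Sketch of calling Ollama's local REST API for summarization.
# The /api/chat endpoint and payload shape follow Ollama's documented
# API; build_request is a hypothetical helper, not code from #66.

OLLAMA_URL = "http://localhost:11434/api/chat"

def build_request(model: str, text: str) -> bytes:
    """JSON body asking a local Ollama model for a one-sentence summary."""
    payload = {
        "model": model,
        "messages": [
            {"role": "user", "content": f"Summarize in one sentence: {text}"}
        ],
        "stream": False,  # ask for a single JSON response, not a stream
    }
    return json.dumps(payload).encode("utf-8")

body = build_request("llama3", "a cluster of documents about pets")
# To run for real: POST `body` to OLLAMA_URL with urllib.request and
# read response["message"]["content"] from the decoded JSON reply.
```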
This was accomplished with better Hugging Face support as well as Ollama support.
We could easily add more models to the list of chat models used for summarization:
https://github.com/enjalot/latent-scope/blob/main/latentscope/models/chat_models.json
There are plenty of small open-source LLMs on Hugging Face that could be added to the list, as long as they are compatible with the transformers provider.
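As a sketch, a new entry in chat_models.json might look like the following. The field names here are guesses for illustration only; the real schema is whatever the file at the link above uses, and the model name is just an example of a small instruct model:

```json
{
  "provider": "transformers",
  "name": "HuggingFaceH4/zephyr-7b-beta",
  "params": {}
}
```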
We should also add more chat API models: currently only OpenAI and Mistral are implemented in providers, but Cohere and Together should be easy to add.
Comments on this issue about which models to add would be a helpful start, especially if there are fine-tuned small models focused on summarization (my experience with out-of-the-box 1B and 7B models is that their quality isn't great).