
Add API to summarize text #38578

Closed
ChristophWurst opened this issue Jun 1, 2023 · 14 comments · Fixed by #38854
Comments

@ChristophWurst
Member

How to use GitHub

  • Please use the 👍 reaction to show that you are interested in the same feature.
  • Please don't comment if you have no relevant information to add. It's just extra noise for everyone subscribed to this issue.
  • Subscribe to receive notifications on status change and new comments.

Is your feature request related to a problem? Please describe.

As a Nextcloud developer, I want to be able to summarize text for the users of the app, e.g. to get the gist of an email thread.

Describe the solution you'd like

Provide an OCP API, and optionally an OCS API, where an app can send a long text and receive a short summary. The API has to be optional because not every installation will have a backend for language processing.
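To make the "optional API" idea concrete, here is a minimal sketch (the real API would live in PHP/OCP; the names `ISummaryProvider` and `SummaryManager` are purely illustrative, not the actual interface): a consumer must check whether a backend is registered before requesting a summary, since not every installation has one.

```typescript
// Illustrative sketch only — not the actual OCP interface.
interface ISummaryProvider {
  summarize(text: string): string;
}

class SummaryManager {
  private provider: ISummaryProvider | null = null;

  // An app providing language processing registers itself here.
  registerProvider(provider: ISummaryProvider): void {
    this.provider = provider;
  }

  // Consumers must check availability first: the API is optional.
  hasProvider(): boolean {
    return this.provider !== null;
  }

  summarize(text: string): string {
    if (this.provider === null) {
      throw new Error('No summarization backend available');
    }
    return this.provider.summarize(text);
  }
}

// Toy provider: "summarizes" by keeping only the first sentence.
const manager = new SummaryManager();
const available = manager.hasProvider(); // false before registration
manager.registerProvider({
  summarize: (text) => text.split('. ')[0] + '.',
});
const summary = manager.summarize('First point. Second point. Third point.');
```

The key design point is the availability check: a consuming app (like Mail) degrades gracefully when no provider is installed instead of failing.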

Describe alternatives you've considered

N/A

Additional context

cc @DaphneMuller

@marcelklehr
Member

Word of caution: the new llm app will be rather slow, at least for now (i.e. ~10+ minutes for a summary), so it would behoove us not to let users generate summaries on demand, but to restrict use cases to scenarios where we prepare the summary in advance.

@ChristophWurst ChristophWurst added 3. to review Waiting for reviews 2. developing Work in progress and removed 1. to develop Accepted and waiting to be taken care of 3. to review Waiting for reviews labels Jun 28, 2023
@ChristophWurst
Member Author

In today's discussion with @AndyScherzinger and @marcoambrosini we identified some open questions

  1. How does the runtime complexity scale with the size of the input? Is it constant? In other words, would a thread of three messages take roughly as much time to summarize as a thread of 25 messages?
  2. With the uncertain quality of the output text, we wonder whether the output format can be influenced, e.g. to not just dump a few sentences but bullet points with important info or "hard facts".
    1. If the output format is not just unformatted sentences but paragraphs, lists or similar, would it make sense to specify whether e.g. Markdown or HTML is returned, to make processing and rendering deterministic?
  3. How will the language model work with localization? Does it detect the source language? Are languages other than English supported? Do we need to specify the source text language?
  4. Do we want users to be able to opt-out of AI features, e.g. when they are not happy with the results?
  5. Where does the LLM processing happen and could it block the execution of any other background jobs?

@marcelklehr
Member

marcelklehr commented Jun 29, 2023

  1. On CPU it's linear or worse, because there's no parallel processing. On GPU I believe it can be constant, but the length of the output affects runtime negatively as well. In the current iteration we're only targeting CPUs.
  2. I think this is unlikely to work consistently. The current model especially cannot produce well-formed markup.
  3. The current model supports English only.
  4. I would recommend this.
  5. LLM processing currently happens on the same machine as Nextcloud, on the CPU, in sequential background tasks (no two LLM tasks are allowed to run in parallel).

If you'd like to try the model yourself (which I recommend, to suss out what it is and isn't capable of), you can install https://gpt4all.io on your computer. Currently I'm using the GPT4All-v1.3-groovy model; better models will become available in the coming weeks.
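The sequential-execution constraint from point 5 can be sketched as a toy FIFO queue (illustrative TypeScript only, not the actual Nextcloud background-job code): tasks run strictly one after another, so observed concurrency never exceeds one.

```typescript
// Toy model of "no two LLM tasks are allowed to run in parallel":
// a FIFO queue that executes each task to completion before the next.
type Task = () => void;

class SequentialQueue {
  private running = 0;
  maxObservedConcurrency = 0;

  run(tasks: Task[]): void {
    for (const task of tasks) {
      this.running++;
      this.maxObservedConcurrency = Math.max(
        this.maxObservedConcurrency,
        this.running,
      );
      task(); // each task finishes before the next one starts
      this.running--;
    }
  }
}

const order: number[] = [];
const queue = new SequentialQueue();
queue.run([() => order.push(1), () => order.push(2), () => order.push(3)]);
```

A consequence worth noting for API consumers: with tasks serialized like this, a single slow summary delays every task queued behind it, which is another argument for preparing summaries in advance rather than generating them on demand.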

@marcoambrosini
Member

  1. If these are the current limitations, I'd suggest flipping it into an "opt in at your own risk" feature rather than just giving the ability to opt out, or waiting until the technology is more reliable.

@DaphneMuller

But the features will only be available / visible to the user anyway if the LLM app is enabled, right? Wouldn't a warning in the readme of the LLM app then be enough?

@AndyScherzinger
Member

I also wouldn't do opt-in, since that lowers the adoption rate; I'd rather have opt-out. For the moment, like Daphne said, it can be "managed" via having the app enabled or disabled. If the results are bad, they will be bad across the user base, and if they are good, they will be good across the user base (very simplified, I know). So at some point the app will be activated/deactivated by admins. So yeah, I also think info in the readme should be enough, maybe also the app description in info.xml, since that is shown in the app store.

@marcoambrosini
Member

How about active by default if the LLM app is enabled, but still give the ability to opt out by just hiding the component in the front end? This could simply be saved in browser storage.
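A minimal sketch of that idea, assuming a made-up storage key, and using a `Map` as a stand-in for `window.localStorage` so the snippet is self-contained:

```typescript
// Sketch of a per-browser opt-out flag. The key name is hypothetical,
// and a Map stands in for window.localStorage so this runs outside a browser.
const storage = new Map<string, string>();

const OPT_OUT_KEY = 'mail_summary_opt_out'; // hypothetical key, not a real setting

function setSummaryOptOut(optOut: boolean): void {
  storage.set(OPT_OUT_KEY, optOut ? '1' : '0');
}

function shouldShowSummary(): boolean {
  // Default to showing the component when no preference is stored (opt-out model).
  return storage.get(OPT_OUT_KEY) !== '1';
}

const shownByDefault = shouldShowSummary();
setSummaryOptOut(true);
const shownAfterOptOut = shouldShowSummary();
```

One trade-off of browser storage: the preference is per device, not per account, so a user who opts out on their laptop would still see summaries on their phone.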

@marcelklehr
Member

Also worth noting: integration_openai, and perhaps replicate, might implement this API as well. Since they also have other uses, "opt out == disable app" might not be the way to go.

@DaphneMuller

ok, then it is up to product management to decide. @jancborchardt @AndyScherzinger

@AndyScherzinger
Member

That would be @jancborchardt's and @karlitschek's call. I still think we shouldn't provide a user-level opt-in/opt-out option.

@jancborchardt
Member

Since we have more and more "Assistant" features, it might be nice to have a dedicated section in the settings on that.

There we can:

  • Group all settings related to assistance, AI and such in one easily findable section
  • Talk about Ethical AI and mark the relevant features accordingly
  • Offer people to opt out of individual Assistant features (agree with @AndyScherzinger it should be opt-out in this case)

@DaphneMuller

DaphneMuller commented Jul 4, 2023

@jancborchardt can you make sure this is aligned where necessary with Frank so the devs can just implement it, and provide the necessary mock-ups / details / requirements for the devs? We can then see who can do the work.

@ChristophWurst
Member Author

To fulfill the updated requirements from nextcloud/mail#8508 (comment) the new API has to provide rich text as a result, not just plain text. We could go with Markdown or HTML. @marcelklehr would that be doable with the available LLMs?

@marcelklehr
Member

As mentioned in the kick-off meeting, rich-text/HTML is kinda unreliable with the current model. I think as a first iteration we can only rely on plain text.
