
Rich markdown formatting (including streaming) in any mode with --rich #571

Open · wants to merge 11 commits into main

Conversation

@gianlucatruda commented Sep 12, 2024

Overview

Fixes #12

This builds on the excellent foundation that @juftin laid down in #278 and

  1. fixes a subtle bug that was causing some tests to fail, and
  2. resolves the merge conflicts caused by changes merged since #278 (rich printing) was proposed.

I love llm and use it constantly. My only gripe has been the lack of rich formatting in the terminal. I recently used rich for a project and found it excellent, so I was excited to add this to llm. I found #278 was open but dormant, so I decided to nudge things along.

@simonw thanks for your amazing tools and awesome blog!

Screenshots

[Three screenshots: SCR-20240912-rpqh, SCR-20240912-rpxr, SCR-20240912-rqws]

@gianlucatruda (Author)

Here's a demo gif of streaming working with rich output:
[Demo GIF: llm-with-rich-demo]

@gianlucatruda mentioned this pull request Sep 12, 2024
@gianlucatruda (Author)

Update: I've added pytest tests to make sure that --rich mode works as intended. I've also incorporated the release 0.16 commits. All 185 tests pass.

@simonw is there anything else this needs in order to be merged? That would allow you to close #12
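
For readers curious what such a test could look like, here's a hypothetical sketch (assuming llm's click-based CLI is importable as llm.cli.cli and that this PR registers a --rich option on the prompt command; the PR's actual tests may differ):

# Hypothetical sketch, not the PR's actual test code.
from click.testing import CliRunner

from llm.cli import cli  # assumed import path for llm's click entrypoint

def test_prompt_accepts_rich_flag():
    runner = CliRunner()
    result = runner.invoke(cli, ["prompt", "--help"])
    assert result.exit_code == 0
    # The --rich option added by this PR should appear in the help text.
    assert "--rich" in result.output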

@irthomasthomas

What is the benefit versus piping to something? I like the idea of keeping the main project as light as possible.

[Screenshot: Screenshot_20240916_180124-1]

@dzmitry-kankalovich

@irthomasthomas likely the difference is in syntax highlighting of a partial / streamed LLM response.

I've been piping llm's output to glow for the past few months, but the drawback is that you only see the result once streaming has completed; until then there's no output at all. It's a subpar UX when you have to wait tens of seconds to see the result.

As I understand @gianlucatruda's examples here, this PR solves that particular problem.

@gianlucatruda (Author) commented Sep 16, 2024

> I like the idea of keeping the main project as light as possible.

I normally would agree, @irthomasthomas. But as @dzmitry-kankalovich correctly points out, piping breaks streaming, which is a major drawback to usability. I think this justifies the choice.
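
To make the buffering problem concrete, here's a contrived sketch (not code from this PR or from glow) of a pipe consumer that, like glow, reads all of stdin before emitting anything; nothing renders until llm closes the pipe:

# Contrived illustration of a buffering pipe consumer; not this PR's code.
import sys

def buffering_renderer():
    # read() blocks until the writer closes the pipe, i.e. until the
    # entire LLM response has finished streaming.
    text = sys.stdin.read()
    sys.stdout.write(text)  # only now does any output appear

if __name__ == "__main__":
    buffering_renderer()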

@irthomasthomas commented Sep 16, 2024 via email

@gianlucatruda (Author) commented Sep 16, 2024

> Are you talking about the ANSI codes being injected?

@irthomasthomas When you pipe the output of llm to another application that renders markdown (which may involve injecting ANSI codes), you have to wait for the entire LLM response, which can take several seconds or even minutes. And in chat mode it's not possible at all. So piping isn't a viable solution.

This PR enables llm to do the rich text rendering itself in a way that supports response streaming. That means the user sees the llm output in real time, rendered prettily, as it arrives from the LLM. It also allows rich text streaming to work in chat mode (as seen in my screenshots).

Overall, this PR adds functionality to llm that is not possible when piping to other tools. It's a massive upgrade to the user experience and something that has been requested by many people for a long time.
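
For anyone unfamiliar with how this works, here's a minimal sketch of the streaming approach (assuming rich's Live and Markdown APIs; the function and variable names are illustrative, not the PR's actual code):

# Minimal sketch of streaming markdown rendering with rich; not this
# PR's implementation.
from rich.live import Live
from rich.markdown import Markdown

def render_streaming_markdown(chunks):
    full_response = ""
    with Live(Markdown(""), refresh_per_second=8) as live:
        for chunk in chunks:
            full_response += chunk
            # Live redraws the whole renderable on each update, so the
            # partial markdown is re-parsed and displayed immediately
            # rather than buffered until the stream ends.
            live.update(Markdown(full_response))

render_streaming_markdown(["# Hello\n", "\nSome **bold** ", "text."])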

@irthomasthomas commented Sep 16, 2024 via email

@dzmitry-kankalovich

@irthomasthomas I just checked, and indeed highlight, unlike glow, does process streamed responses.

However, it doesn't render markdown. It just highlights markdown blocks (hence the name, I guess), but it doesn't render them, at least not the way glow or rich do.

That's somewhat better than plain text, but it doesn't feel as convenient as the alternatives.

@gianlucatruda (Author)

> That's not true. In the example I gave, I'm using highlight and that displays the rendered markdown as it streams in.

@irthomasthomas

  1. Can you link to the source for installing highlight? If it's cli-highlight, then I'm unable to replicate the streaming you claim.
  2. Your screenshots with piping to highlight show syntax highlighting, not the markdown rendering that #12 (Markdown renderer support) asks for and this PR provides.
  3. Can you provide an example showing evidence that streaming works with piping in both normal and chat modes?

@gianlucatruda (Author)

@simonw let me know if you have any feedback on this PR. Happy to make any changes necessary.


def print_response(response, stream=True, rich=False):
    # These nested ifs are necessary!? Only way this works.
    if stream:

This could also work

live.update(
    Markdown(full_response)
    if rich
    else Text(full_response)
)
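
(For context: rich's Live redraws whatever renderable it holds on each update, so swapping between Markdown and Text per update is all the branch needs to do. The suggestion above would collapse the inner rich/plain branch into a single live.update() call, which could flatten the nested ifs the comment complains about.)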
