Improve Handling Of Long Outputs #328

jlewi · 2024-10-25T21:04:36Z

Problem

Cell outputs can be very long. For example, if we run a query (gcloud, SQL, etc...) the output could be very verbose. This output could eat up the entire context allocated for the input document. As a result, we might not have sufficiently meaningful context to prompt the model.

There was another bug in our doc tailer. We were applying character limits to the rendered markdown. We were imposing this by tailing the lines. This could produce invalid markdown. For example, we might end up truncating the document in the middle of a code block so we wouldn't have the opening triple quotes for the code block. We might also include the output of the code block without including the code that it is output for.

Solution

First, we impose character limits in a way that is aware of cell boundaries. We move truncation into the Block to Markdown conversion. The conversion now takes the maximum length for the output string. The conversion routine then figures out how much to allocate to the contents of the cell and its outputs. This allows truncation to happen in a way that can respect cell boundaries.

Second, if we truncate the code block or output we output a string indicating that the output was truncated. We want the model to know that output was truncated. We update our prompt to tell the LLM to look for truncated output and to potentially deal with this by running commands that will provide less verbose output.

Fix #299

standard-input

No issues flagged.
Standard Input can make mistakes. Check important info.

netlify · 2024-10-25T21:04:54Z

✅ Deploy Preview for foyle canceled.

Name	Link
🔨 Latest commit	`b75a7ee`
🔍 Latest deploy log	https://app.netlify.com/sites/foyle/deploys/671c0b33e8efe30008b10ae2

jlewi added 4 commits October 23, 2024 17:51

First attempt at tailing the blocks.

d3fa40f

Move truncation into the markdown to make it block aware.

2d11a49

Update the prompt

93cdf54

Fix lint.

a472bbe

standard-input bot reviewed Oct 25, 2024

View reviewed changes

Minor fixes.

d217aee

jlewi enabled auto-merge (squash) October 25, 2024 21:10

Fix prompt.

b75a7ee

jlewi merged commit 3f8fa1a into main Oct 25, 2024
5 checks passed

jlewi deleted the jlewi/longoutputs branch October 25, 2024 21:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Handling Of Long Outputs #328

Improve Handling Of Long Outputs #328

jlewi commented Oct 25, 2024

standard-input bot left a comment

netlify bot commented Oct 25, 2024 •

edited

Loading

Improve Handling Of Long Outputs #328

Improve Handling Of Long Outputs #328

Conversation

jlewi commented Oct 25, 2024

Problem

Solution

standard-input bot left a comment

Choose a reason for hiding this comment

netlify bot commented Oct 25, 2024 • edited Loading

✅ Deploy Preview for foyle canceled.

netlify bot commented Oct 25, 2024 •

edited

Loading