Fix local server regressions caused by Jinja PR #3256

cebtenzzre · 2024-12-11T00:02:59Z

1. Don't ignore content of assistant messages in history

This was originally fixed in #2929 (the first time it worked), but was broken again in #3147. We should write an automated test for this so we don't break it again.

This is a simple test of the message history using the local server:

user: What is your name?
assistant: My name is Bob.
user: Could you repeat that?

After #2929, the model remembers its name, but on current main it forgets. This PR fixes that. See also: #2602

2. Don't report garbage token counts and incorrect stop reason

Additionally, this PR fixes an uninitialized struct that was causing the token counts to be unreasonably high garbage values and the stop reason to be incorrect. Now all of the local server tests pass (including the test that was previously marked as XFAIL).

3. Don't leave previous conversations in the LLM's context

If you send the my-name-is-Bob test above to the LLM in one request, and then in another request send only this:

user: What were we talking about?

The LLM should not mention the Bob test, since it's not part of this conversation. But there was some missing code in the Jinja PR to actually use the server's local list of messages instead of the entire chat view's contents (which reflects previous conversations).

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

TEMPORARY.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre added 2 commits December 10, 2024 18:44

fix local server ignoring content of assistant messages in history

5f0e395

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

changelog: add this PR

ac533a2

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre marked this pull request as ready for review December 11, 2024 00:04

cebtenzzre requested a review from manyoso December 11, 2024 00:04

cebtenzzre added 3 commits December 11, 2024 13:31

chatllm: fix uninitialized variable causing bad token counts

979918d

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

remove xfail marker from test that now passes

11048ce

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

changelog: mention new changes

587e8f8

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

manyoso approved these changes Dec 12, 2024

View reviewed changes

server: don't prompt model with the entire history of previous chats

186fcb8

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre changed the title ~~Fix local server ignoring assistant messages in history~~ Fix local server regressions caused by Jinja PR Dec 12, 2024

changelog: mention new fix

a0ae59b

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre requested a review from manyoso December 12, 2024 20:54

cebtenzzre mentioned this pull request Dec 12, 2024

Clarification on chat API behaviour #3267

Closed

cebtenzzre linked an issue Dec 12, 2024 that may be closed by this pull request

Clarification on chat API behaviour #3267

Closed

cebtenzzre added a commit that referenced this pull request Dec 13, 2024

local server fixes (#3256)

d28c0ea

TEMPORARY.

replace comment with docstring

86ce58c

Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre merged commit f67b370 into main Dec 13, 2024
4 of 8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix local server regressions caused by Jinja PR #3256

Fix local server regressions caused by Jinja PR #3256

cebtenzzre commented Dec 11, 2024 •

edited

Loading

Fix local server regressions caused by Jinja PR #3256

Fix local server regressions caused by Jinja PR #3256

Conversation

cebtenzzre commented Dec 11, 2024 • edited Loading

1. Don't ignore content of assistant messages in history

2. Don't report garbage token counts and incorrect stop reason

3. Don't leave previous conversations in the LLM's context

cebtenzzre commented Dec 11, 2024 •

edited

Loading