chore(participant): refactor chat participant state VSCODE-583 #810

Anemy · 2024-09-09T21:08:35Z

Previously we were storing a lot of data on the participant. This meant that things like running generated code would reference the current state of the participant, regardless of which chat and which chat message the runnable code was clicked in. While we could work around this in some cases, avoiding coupling app state with the chat messages should let us be more flexible in our parsing and handling, while avoiding possible bugs with stale state.

This is a draft: I currently have all of the tests commented out, there are a lot of stray todos, and there's still more work to do around passing chat history. We currently pass a lot of unhelpful messages from our chat history to the model's chat completion. We also want the user to be able to ask another query, or change the requested database or collection, while they're being asked for it; this namespace parsing isn't very flexible yet.

… history. Remove state items

…ther metadata history messages

src/participant/participant.ts

src/participant/prompts/history.ts

nirinchev

The overall restructuring seems fine to me and I like the removal of the state. I seem to have stumbled onto a corner-y case that resulted in an endless loop of prompts. Other than that, I'm not sure I'm a huge fan of the fact we're asking the model to extract the namespace every time, but I understand why we do it and don't have a brilliant idea for an alternative. I'll think about it a bit more and see if I come up with something.

src/participant/participant.ts

src/participant/prompts/namespace.ts

src/participant/participant.ts

…-participant-to-not-keep-state

src/participant/participant.ts

src/participant/prompts/history.ts

alenakhineika · 2024-09-12T11:55:11Z

From the pr description:

We also want the user to be able to ask another query or change the database or collection requested when they're asking

This was already possible before this refactoring. You could ask the participant to change the database or collection name or update the query.

Anemy · 2024-09-12T17:48:34Z

Going to remove the regex for the namespace in the chat history from this pr. We were using it as a fallback for when the model couldn't find the namespace in the chat history. Let's see how the model performs without it, and if needed we can create a ticket to add something like it back in.

alenakhineika · 2024-09-13T11:06:58Z

I think the flow is now broken somehow. It doesn't understand when I select a collection name from the list and asks for it again. After I click on the collection name for the second time, it asks for a database name.

Anemy · 2024-09-13T15:54:19Z

@alenakhineika That's from removing the namespace-from-history regex we added. The model does not do a good job of looking into previous messages to find the namespace. It mostly uses the last user prompt. We could do some other workarounds here. In the POC for this a while back I tried adding all of the user's messages from the chat history into one message to give more context. There's a bit of a tradeoff there, but it might end up giving us a better result for the namespace request.
I think the first step here is storing some chat state as a way to track the namespace over time. Similar to what this pr is removing, however it would be per-chat rather than a singleton. I created a ticket describing it a bit more: https://jira.mongodb.org/browse/VSCODE-607

If it's alright with you and @nirinchev I'd like to move that to a separate pr. I've pushed a few todos in the code to mention that.

alenakhineika · 2024-09-13T16:07:38Z

The problem happens when you select the collection name for the first time. You connect, click on a database name, then click on a collection name and it doesn't go further, it keeps asking for a namespace and doesn't generate a query. Model is actually doing pretty well by looking at old messages. In the merged code we relied on the namespace in the state, now we can do the same with metadata in response history. There is probably some small issue that needs to be found. I don't think we can merge this as is since the query generation is not working.

…ction name from history when it exists

Anemy · 2024-09-13T16:58:20Z

@alenakhineika Looks like the prompt ordering in the namespace caused those issues. Can you see if it's now fixed? I had moved it to the 2nd to last message, and just moved it back to where it was previously, the first message in the chat we send to the model. 🤦

alenakhineika · 2024-09-13T19:06:19Z

We shouldn't merge this pr as a temporary solution, and we'd better tackle all this with https://jira.mongodb.org/browse/VSCODE-607. It breaks the flow on multiple occasions and limits already merged functionality. Currently, you cannot iterate on prompts:

Gaurab and the docs team already use the feature branch, so we shouldn't merge something that works poorly and leaves them unable to use it. Yes, I agree that the merged code works well only with one chat conversation and this should be fixed, but let's do it the right way, without merging code that also does not cover all corner cases and breaks what is already working.

alenakhineika · 2024-09-13T19:13:30Z

Let's use the vscode context storage to move these values from ParticipantController and store them per chat conversation:

_queryGenerationState?: QUERY_GENERATION_STATE;
_databaseName?: string;
_collectionName?: string;
_schema?: string;
_sampleDocuments?: Document[];

I agree that keeping _queryGenerationState may not be ideal, but if this helps to keep track of what is happening in the conversation, maybe this is not so bad. We could split this work into chunks/PRs:

1. The part that you did to open a playground according to the generated code block
2. Move the state from ParticipantController to the vscode context storage to have different states per chat conversation
3. Explore how to get rid of _queryGenerationState (which can be done even in October)
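To illustrate the second step, here is a minimal sketch of state kept per chat conversation instead of on a singleton. It is only a sketch under stated assumptions: `ChatStateStore`, its methods, and the example chat ids are hypothetical names, and a plain in-memory `Map` stands in for the vscode context storage mentioned above.

```typescript
// Hypothetical per-chat state. The field names mirror some of the fields
// listed above; the type and class names are assumptions for illustration.
type ChatState = {
  databaseName?: string;
  collectionName?: string;
  schema?: string;
};

class ChatStateStore {
  // An in-memory Map stands in for the vscode context storage here.
  private readonly _states = new Map<string, ChatState>();

  // Returns a copy of the state for one chat, or an empty state if none exists.
  get(chatId: string): ChatState {
    return { ...(this._states.get(chatId) ?? {}) };
  }

  // Merges the given fields into the state of a single chat only.
  update(chatId: string, patch: ChatState): ChatState {
    const next: ChatState = { ...this.get(chatId), ...patch };
    this._states.set(chatId, next);
    return { ...next };
  }

  // Drops everything stored for one chat.
  clear(chatId: string): void {
    this._states.delete(chatId);
  }
}

// Two conversations keep independent namespaces.
const store = new ChatStateStore();
store.update('chat-a', { databaseName: 'sales' });
store.update('chat-b', { databaseName: 'hr', collectionName: 'people' });
```

Because every read and write is keyed by the chat id, clicking a runnable code block in one conversation cannot pick up the namespace selected in another.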

alenakhineika · 2024-09-14T10:16:20Z

Here I put together what I meant by using the last metadata as the source of the chat conversation state: #816

Let me know if this makes sense to you.
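For readers following along, the idea of using the last metadata as the source of state could look roughly like the sketch below. This is not the code from #816: `HistoryTurn`, `NamespaceMetadata`, and `lastNamespaceFromHistory` are hypothetical names, and the real history types come from the VS Code chat API.

```typescript
// Hypothetical shapes standing in for the chat history types.
type NamespaceMetadata = { databaseName?: string; collectionName?: string };

type HistoryTurn =
  | { kind: 'request'; prompt: string }
  | { kind: 'response'; metadata?: NamespaceMetadata };

// Walks the history from newest to oldest and returns the metadata of the
// most recent response that recorded a database name, if there is one.
function lastNamespaceFromHistory(
  history: readonly HistoryTurn[]
): NamespaceMetadata | undefined {
  for (const turn of [...history].reverse()) {
    if (turn.kind === 'response' && turn.metadata?.databaseName) {
      return turn.metadata;
    }
  }
  return undefined;
}

// Example conversation: the last response carries no metadata, so the
// lookup falls back to the response before it.
const exampleHistory: HistoryTurn[] = [
  { kind: 'request', prompt: 'find all users' },
  { kind: 'response', metadata: { databaseName: 'app' } },
  { kind: 'request', prompt: 'users' },
  { kind: 'response', metadata: { databaseName: 'app', collectionName: 'users' } },
  { kind: 'request', prompt: 'only active ones' },
  { kind: 'response' },
];

const recovered = lastNamespaceFromHistory(exampleHistory);
```

Since the namespace travels with each conversation's own history, nothing needs to be stored on the participant itself.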

…me when it's clicked

alenakhineika · 2024-09-16T07:56:09Z

The last changes look good 👍 Only the empty state should be fixed as we discussed in DMs.

nirinchev

A few nits from me. Unfortunately, wasn't able to finish it before dinner time, but will wrap up the review later today.

src/mdbExtensionController.ts

src/participant/constants.ts

nirinchev · 2024-09-16T14:48:32Z

src/participant/participant.ts

+    let collectionName: string | undefined = _collectionName;
+    if (!collectionName) {


This reads awkwardly - what's the purpose of _collectionName here and why are we defining a new collectionName variable of the same type as the argument? It feels like I'm missing something, but this method would feel more natural if we had:

async selectCollectionWithParticipant({
  // ...
  collectionName,
}: {
  // ...
}): Promise<boolean> {
  if (!collectionName) {
    // ...
  }
}

I made them two variables since when the collection name isn't present we set it with the quick pick.

collectionName = await this._selectCollectionWithQuickPick(databaseName);

We could avoid the extra declaration and assign to the variable that's passed in. I didn't do that because I find assigning to variables passed in as arguments also looks a bit awkward. It wouldn't happen here, but I think it starts to bring in a pattern of possibly mutating the argument, which we don't want. It would only be an issue with non-primitives, so again, not here.

Would you prefer if it's written as:

async selectCollectionWithParticipant({
  chatId,
  databaseName,
  collectionName,
}: {
  chatId: string;
  databaseName: string;
  collectionName?: string;
}): Promise<boolean> {
  if (!collectionName) {
    collectionName = await this._selectCollectionWithQuickPick(databaseName);
    if (!collectionName) {
      return false;
    }
  }

I'm cool with either.

src/participant/participant.ts

nirinchev · 2024-09-16T15:13:55Z

src/participant/participant.ts

+      void vscode.commands.executeCommand('workbench.action.chat.open', {
+        query: '@MongoDB /query',
+      });


Can we add a helper for this with more strongly defined types? I can see we're using it in a bunch of places and it's useful to extract the knowledge of the command in a single place.

To be honest, I would prefer to keep it as is because it is transparent, and it is a common pattern in vscode to call such commands this way.

Added a helper function for this, with a comment as to why we do it. Doesn't do much on the typing side though. Were you thinking of having the helper do /query based on parameters? That's something we could do.
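As a rough illustration of what a more strongly typed helper might look like, here is a sketch. It is not the helper added in this PR: `ParticipantCommand`, `buildParticipantQuery`, and `openChatWithParticipant` are hypothetical names, and the command executor is passed in as a parameter so the sketch does not depend on the `vscode` module.

```typescript
// Hypothetical typed wrapper around the chat open command shown in the diff.
// The list of slash commands is an assumption for illustration.
type ParticipantCommand = 'query' | 'docs';

// Minimal stand-in for the signature of a command executor.
type ExecuteCommand = (
  command: string,
  ...args: unknown[]
) => PromiseLike<unknown>;

const CHAT_OPEN_COMMAND = 'workbench.action.chat.open';
const PARTICIPANT_HANDLE = '@MongoDB';

// Builds the chat input, e.g. '@MongoDB /query', with an optional message.
function buildParticipantQuery(
  command: ParticipantCommand,
  message = ''
): string {
  return [PARTICIPANT_HANDLE, `/${command}`, message.trim()]
    .filter((part) => part.length > 0)
    .join(' ');
}

// Keeps the knowledge of the command id and argument shape in one place.
function openChatWithParticipant(
  executeCommand: ExecuteCommand,
  command: ParticipantCommand,
  message?: string
): PromiseLike<unknown> {
  return executeCommand(CHAT_OPEN_COMMAND, {
    query: buildParticipantQuery(command, message),
  });
}
```

Inside the extension the executor would presumably be `vscode.commands.executeCommand`. Restricting the command to a union type means a misspelled slash command fails at compile time rather than silently opening the chat with the wrong text.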

… chat message

alenakhineika

✨

…-participant-to-not-keep-state

Anemy · 2024-09-16T22:25:46Z

@nirinchev going to merge this in to unblock folks, it's been in flight a while and would cause a good amount of conflicts if we keep it unmerged. Please feel free to leave more comments, I'll see to them in a follow up.
Thanks for all the help on this one y'all!

nirinchev · 2024-09-16T23:26:12Z

Yeah, no worries, I didn't see anything majorly broken, so we can always fix these nits as drive-by's.

Anemy added 3 commits September 9, 2024 16:53

chore(participant): upadte how we parse the namespace to use the chat…

555e9c1

… history. Remove state items

fixup: update comments

805ad16

fixup: organize history better, remove connection name messages and o…

22c9266

…ther metadata history messages

Anemy commented Sep 10, 2024

src/participant/participant.ts (resolved)

fixup: remove comments, unused

ca1cc0c

nirinchev reviewed Sep 11, 2024

Anemy changed the title ~~chore(participant): update namespace parsing to use history, remove state from chat participant VSCODE-583~~ WIP chore(participant): update namespace parsing to use history, remove state from chat participant VSCODE-583 Sep 11, 2024

nirinchev reviewed Sep 12, 2024

fixup: type chat results, re-add tests

035e2c3

Anemy changed the title ~~WIP chore(participant): update namespace parsing to use history, remove state from chat participant VSCODE-583~~ chore(participant): update namespace parsing to use history, remove state from chat participant VSCODE-583 Sep 12, 2024

Anemy requested a review from alenakhineika September 12, 2024 02:00

Anemy marked this pull request as ready for review September 12, 2024 02:00

Merge branch 'VSCODE-528-mongodb-copilot' into VSCODE-583-update-chat…

0610539

…-participant-to-not-keep-state

alenakhineika reviewed Sep 12, 2024

src/participant/participant.ts (resolved)

alenakhineika reviewed Sep 12, 2024

src/participant/participant.ts (outdated, resolved)

alenakhineika reviewed Sep 12, 2024

src/participant/participant.ts (outdated, resolved)

alenakhineika reviewed Sep 12, 2024

src/participant/prompts/history.ts (outdated, resolved)

fixup: return on database found in history, cleanup

4116991

fixup: remove manual namespace parsing, defer to a possible later time

eec8f16

Anemy changed the title ~~chore(participant): update namespace parsing to use history, remove state from chat participant VSCODE-583~~ chore(participant): remove state from chat participant VSCODE-583 Sep 12, 2024

Anemy changed the title ~~chore(participant): remove state from chat participant VSCODE-583~~ WIP chore(participant): remove state from chat participant VSCODE-583 Sep 12, 2024

Anemy changed the title ~~WIP chore(participant): remove state from chat participant VSCODE-583~~ chore(participant): remove state from chat participant VSCODE-583 Sep 12, 2024

Anemy added 2 commits September 12, 2024 20:31

fixup: remove extra metadata we aren't using

86d0322

fixup: lint

b30f6fc

fixup: add todos for chat namespace store

34c819b

fixup: move namespace assistant prompt to first message, remove conne…

a0d5d8f

…ction name from history when it exists

alenakhineika mentioned this pull request Sep 14, 2024

chore(participant): move state to history VSCODE-583 #816

Closed

11 tasks

fixup: add chat metadata store for storing collection and database na…

a5bec03

…me when it's clicked

Anemy changed the title ~~chore(participant): remove state from chat participant VSCODE-583~~ chore(participant): refactor chat participant state VSCODE-583 Sep 15, 2024

fixup: remove todo comment

785a1f7

fixup: on empty prompt when asking for db or collection, re-ask

0c41851

nirinchev reviewed Sep 16, 2024

alenakhineika mentioned this pull request Sep 16, 2024

feat(participant): implement the docs command VSCODE-570 #817

Merged

11 tasks

Anemy added 3 commits September 16, 2024 12:36

fixup: better typing, use extension constants, use helper for running…

2f7dc8b

… chat message

fixup: improved types

c5ac7a7

fixup: better typing for metadata constants

b9fdb57

alenakhineika approved these changes Sep 16, 2024

Merge branch 'VSCODE-528-mongodb-copilot' into VSCODE-583-update-chat…

d5aaad7

…-participant-to-not-keep-state

Anemy merged commit 764b8d1 into VSCODE-528-mongodb-copilot Sep 16, 2024
3 checks passed

Anemy deleted the VSCODE-583-update-chat-participant-to-not-keep-state branch September 16, 2024 22:25
