Langchain based ask approaches not compatible with 0613 (Chat Completions) #541

pamelafox · 2023-08-17T17:46:10Z

This issue is for a: (mark with an `x`)

- [X] bug report -> please search issues before submitting
- [ ] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce

We recently changed to version from 0301 to 0613, since Azure isn't allowing new 0301 deployments. Unfortunately, 0613 only supports the new Chat Completions API, not the old Completions API, and the LangChain agents all assume use of the Completions API.

I have a branch that attempts to update the LangChain code to use Chat Completions, but am still QAing it.

The text was updated successfully, but these errors were encountered:

ghost · 2023-08-22T06:39:40Z

Hi @pamelafox : Will be waiting for an update on this, we are trying to deploy it in a corporate scenario, I managed to build an ARM template from the bicep template you have shared and removed role permissions required, I am using 2 models 1. Chat Gpt 35 turbo with version 0301 and chat gpt 35 turbo 16k with 0613 to deploy this but fail to deploy the template with an error saying 'standard' is not part of the 0613 version, if I remove the scale standard and deploy it fails the validation.

pamelafox · 2023-08-22T13:59:08Z

To clarify:

The sample includes 4 different RAG (Retrieval-Augmented Generation) approaches: ChatReadRetrieveRead, ReadDecomposeAsk, ReadRetrieveRead, RetrieveThenRead. The two default approaches are ChatReadRetrieveRead and RetrieveThenRead, and they are both working very well with the Chat Completion APIs. The other two approaches use Langchain and the current code only works with the older Completion API (0301). Those approaches can be deleted from the code/UI, and the app would still work.

Is the problem that you definitely need to use those other two approaches for your particular use case, or is the problem that you can't deploy 0613? Do you have the latest main.bicep and cognitiveservices.bicep? The method of specifying capacity changed a few months ago.

ghost · 2023-08-24T02:21:31Z

Hi Pamela, Thank you for getting back really quickly. Our org currently doesn't allow a bicep file download (.exe file download), so I had to convert the bicep file into an ARM template, remove permissions and roles, and then deploy. I am attaching the ARM template. Regarding your note on the 4 RAG approaches I might just need the first two, how do I modify the code, the app backend code in Python that I need to modify? Where can I get more information about this as the readme doesn't detail this? Lastly, deploy to Azure functionality was super useful on other examples can we expect something like that for this?
OpenAI_template_RAG_ARM_with dummy values.txt

ghost · 2023-08-24T07:29:39Z

And on your comment on capacity, even though I use the template where the set capacity is 30, I still get an error while deploying that 120 tokens are necessary, Do I need to configure any additional settings?

pamelafox · 2023-08-24T13:17:43Z

There is currently an issue (Azure/bicep-types-az#1660) where we can't deploy a capacity greater than what's remaining in our account, even if the deployments will replace whats in the account. So what I do in that case is go into the Azure OpenAI studio, edit each deployment so that it has 1 TPM, and then try azd up again.

nikhilno1 · 2023-09-02T10:20:19Z

Hi @aparnasharmav, could you please provide more details on this -> "removed role permissions required". I suppose this is done for below requirement?

Your Azure Account must have Microsoft.Authorization/roleAssignments/write permissions, such as User Access Administrator or Owner.

I'd also like to get rid of this requirement if I can.

Thanks.

pamelafox · 2023-10-05T16:12:43Z

No longer an issue as they have been removed.

Co-authored-by: Ian Seabock (Centific Technologies Inc) <v-ianseabock@microsoft.com>

pamelafox changed the title ~~Lang-chain based ask approaches not compatible with 0613 (Chat Completions)~~ Langchain based ask approaches not compatible with 0613 (Chat Completions) Aug 17, 2023

pamelafox self-assigned this Aug 17, 2023

pamelafox mentioned this issue Sep 4, 2023

[bug] error when running any langchain based approach #593

Closed

pamelafox mentioned this issue Sep 18, 2023

Add support for an optional login and document level access control system. #624

Merged

pamelafox closed this as completed Oct 5, 2023

ratkinsoncinz pushed a commit to cinzlab/azure-search-openai-demo that referenced this issue Oct 6, 2024

[fix] Duplicate Punctuation (Azure-Samples#541)

1e401ac

Co-authored-by: Ian Seabock (Centific Technologies Inc) <v-ianseabock@microsoft.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Langchain based ask approaches not compatible with 0613 (Chat Completions) #541

Langchain based ask approaches not compatible with 0613 (Chat Completions) #541

pamelafox commented Aug 17, 2023

ghost commented Aug 22, 2023

pamelafox commented Aug 22, 2023

ghost commented Aug 24, 2023

ghost commented Aug 24, 2023

pamelafox commented Aug 24, 2023

nikhilno1 commented Sep 2, 2023

pamelafox commented Oct 5, 2023

Langchain based ask approaches not compatible with 0613 (Chat Completions) #541

Langchain based ask approaches not compatible with 0613 (Chat Completions) #541

Comments

pamelafox commented Aug 17, 2023

This issue is for a: (mark with an x)

Minimal steps to reproduce

ghost commented Aug 22, 2023

pamelafox commented Aug 22, 2023

ghost commented Aug 24, 2023

ghost commented Aug 24, 2023

pamelafox commented Aug 24, 2023

nikhilno1 commented Sep 2, 2023

pamelafox commented Oct 5, 2023

This issue is for a: (mark with an `x`)