From 817ff2be60c91397a909d60b0145ee11fd8ad2f5 Mon Sep 17 00:00:00 2001 From: Roger Barreto <19890735+RogerBarreto@users.noreply.github.com> Date: Wed, 7 Aug 2024 14:27:55 +0100 Subject: [PATCH 1/2] ADR Draft OK --- .../0051-dotnet-azure-model-as-a-service.md | 46 +++++++++++++++++++ 1 file changed, 46 insertions(+) create mode 100644 docs/decisions/0051-dotnet-azure-model-as-a-service.md diff --git a/docs/decisions/0051-dotnet-azure-model-as-a-service.md b/docs/decisions/0051-dotnet-azure-model-as-a-service.md new file mode 100644 index 000000000000..a646d1840115 --- /dev/null +++ b/docs/decisions/0051-dotnet-azure-model-as-a-service.md @@ -0,0 +1,46 @@ +--- +# These are optional elements. Feel free to remove any of them. +status: proposed +contact: rogerbarreto +date: 2024-08-07 +deciders: rogerbarreto, markwallace-microsoft +consulted: taochen +--- + +# Support Connector for .Net Azure Model-as-a-Service (Azure AI Studio) + +## Context and Problem Statement + +There has been a demand from customers to use and support natively models deployed in [Azure AI Studio - Serverless APIs](https://learn.microsoft.com/en-us/azure/ai-studio/how-to/model-catalog-overview#model-deployment-managed-compute-and-serverless-api-pay-as-you-go), This mode of consumption operates on a pay-as-you-go basis, typically using tokens for billing purposes. Clients can access the service via the [Azure AI Model Inference API](https://learn.microsoft.com/en-us/azure/ai-studio/reference/reference-model-inference-api?tabs=azure-studio) or client SDKs. + +At present, there is no official support for [Azure AI Studio](https://learn.microsoft.com/en-us/azure/ai-studio/what-is-ai-studio). The purpose of this ADR is to examine the constraints of the service and explore potential solutions to enable support for the service via the development of a new AI connector. + +## Azure Inference Client library for .NET + +The Azure team has a new client library, namely [Azure.AI.Inference](https://github.com/Azure/azure-sdk-for-net/blob/Azure.AI.Inference_1.0.0-beta.1/sdk/ai/Azure.AI.Inference/README.md) in .Net, for effectively interacting with the service. While the service API is OpenAI-compatible, it is not permissible to use the OpenAI and the Azure OpenAI client libraries for interacting with the service as they are not independent with respect to both the models and their providers. This is because Azure AI Studio features a diverse range of open-source models, other than OpenAI models. + +### Limitations + +Currently is known that the first version of the client SDK will only support: `Chat Completion` and `Text Embedding Generation` and `Image Embedding Generation` with `TextToImage Generation` planned. + +There are no current plans to support `Text Completion`. + +## AI Connector + +### Namespace options + +- `Microsoft.SemanticKernel.Connectors.AzureAI` +- `Microsoft.SemanticKernel.Connectors.AzureAIInference` +- `Microsoft.SemanticKernel.Connectors.AzureAIModelInference` + +Decision: `Microsoft.SemanticKernel.Connectors.AzureAIInference` + +### Support for model-specific parameters + +Models can possess supplementary parameters that are not part of the default API. The service API and the client SDK enable the provision of model-specific parameters. Users can provide model-specific settings via a dedicated argument along with other settings, such as `temperature` and `top_p`, among others. + +Azure AI Inference specialized `PromptExecutionSettings`, will support those customizable parameters. + +### Feature Branch + +The development of the Azure AI Inference connector will be done in a feature branch named `feature-connectors-azureaiinference`. From e5b32b539b796af4355000fd851ba265f0235d1c Mon Sep 17 00:00:00 2001 From: Roger Barreto <19890735+RogerBarreto@users.noreply.github.com> Date: Wed, 7 Aug 2024 15:07:30 +0100 Subject: [PATCH 2/2] Update 0051-dotnet-azure-model-as-a-service.md Fix Text Generation Modality naming. --- docs/decisions/0051-dotnet-azure-model-as-a-service.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/decisions/0051-dotnet-azure-model-as-a-service.md b/docs/decisions/0051-dotnet-azure-model-as-a-service.md index a646d1840115..b023838d5128 100644 --- a/docs/decisions/0051-dotnet-azure-model-as-a-service.md +++ b/docs/decisions/0051-dotnet-azure-model-as-a-service.md @@ -23,7 +23,7 @@ The Azure team has a new client library, namely [Azure.AI.Inference](https://git Currently is known that the first version of the client SDK will only support: `Chat Completion` and `Text Embedding Generation` and `Image Embedding Generation` with `TextToImage Generation` planned. -There are no current plans to support `Text Completion`. +There are no current plans to support `Text Generation` modality. ## AI Connector