LLM Service Factory

LLM Service Factory is a Python module designed to simplify interaction with multiple large language model (LLM) providers. It supports Azure OpenAI, OpenAI, and Hugging Face, with built-in automatic token usage tracking. The architecture is modular and extensible, making it easy to add new providers, models, and functionality.

Features

  • Multi-provider Support: Azure OpenAI, Hugging Face, and OpenAI services are currently supported.
  • Singleton Design: Ensures only one instance of an LLM service is created per unique configuration.
  • Token Usage Tracking: Automatically tracks the number of tokens used by each model and calculates the cost, stored in a tokens_usage.json file.
  • Extensible: Easily add new providers and models by extending the base class.

Installation

  1. Clone the repository:

    git clone https://github.com/RamonKaspar/LLM-Service-Factory.git
  2. Install the required dependencies:

    pip install -r requirements.txt
  3. Set up environment variables: Create a .env file in the project root directory to store your API keys and other configurations. You can use the .env.example file as a template.
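For illustration, a .env might look like the sketch below. The variable names here are hypothetical; use the exact keys listed in .env.example.

# Hypothetical variable names -- consult .env.example for the real keys
AZURE_OPENAI_API_KEY=your-azure-key
AZURE_OPENAI_ENDPOINT=https://your-resource.openai.azure.com/
OPENAI_API_KEY=your-openai-key
HUGGINGFACE_API_KEY=your-huggingface-key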

Usage

Initializing Services

You can use the LLMServiceFactory to get an instance of the desired LLM service. Each service instance is a singleton, meaning that the same instance is reused if requested with the same parameters.

from LLMServiceFactory import LLMServiceFactory

# Initialize an Azure OpenAI service
azure_service = LLMServiceFactory.get_service(
    model_name="gpt-4o",
    provider="AzureOpenAI"
)

# Initialize a Hugging Face service
hf_service = LLMServiceFactory.get_service(
    model_name="mistralai/Mistral-7B-Instruct-v0.2",
    provider="HuggingFace"
)

# You can also use the OpenAI provider or easily add a new one by extending the llm_interface.py base class
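
Because of the singleton design, requesting a service with identical parameters returns the same object. A quick sanity check:

# Requesting the same configuration again yields the cached instance
same_service = LLMServiceFactory.get_service(
    model_name="gpt-4o",
    provider="AzureOpenAI"
)
assert same_service is azure_service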

Making Requests

Once you have a service instance, you can use it to make requests to the underlying LLM. Each service supports different request types such as chat completions, JSON completions, tool usage, and even vision-related tasks.

# Make a request to Azure OpenAI
messages = [{"role": "user", "content": "Explain the theory of relativity."}]
response = azure_service.make_request(messages)
print(response)
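
The message format mirrors the familiar chat schema shown above, so a system prompt plus conversation history can be passed the same way (assuming the service forwards the message list to the provider unchanged):

# System prompt plus user turn, passed as one message list
messages = [
    {"role": "system", "content": "You are a concise physics tutor."},
    {"role": "user", "content": "Explain the theory of relativity."},
]
response = azure_service.make_request(messages)
print(response)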

Token Usage Tracking

Token usage is automatically tracked for each model and stored in tokens_usage.json. Tracking covers both prompt and completion tokens, as well as an overall cost computed from predefined per-model rates.

{
  "gpt-4o": {
    "prompt_tokens": 410020,
    "completion_tokens": 64425
  },
  "overall_cost": 2.978635499999999,
  "gpt-35-turbo-16k": {
    "prompt_tokens": 90032,
    "completion_tokens": 5756
  }
}
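
Since the file is plain JSON, accumulated usage can be inspected directly. A minimal sketch based on the structure above:

import json

# Load the usage file written by the token tracker
with open("tokens_usage.json") as f:
    usage = json.load(f)

# Per-model entries hold token counts; overall_cost is a top-level scalar
for model, stats in usage.items():
    if model == "overall_cost":
        continue
    print(f"{model}: {stats['prompt_tokens']} prompt, {stats['completion_tokens']} completion")
print(f"Overall cost: ${usage['overall_cost']:.2f}")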

Adding a New Provider

To add support for a new LLM provider (e.g., Google Gemini), follow these steps; a skeletal sketch appears after the list:

  1. Create a class that extends the base class defined in llm_service/llm_interface.py.
  2. Implement the initialize_client, make_request, and other required methods.
  3. Register the new service in the LLMServiceFactory.
  4. Add the necessary API keys in the .env file.
  5. Add cost rates in the utils/token_tracker.py file (if you want to track the costs).
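
A skeletal sketch of steps 1 and 2. The base-class name and import path here are assumptions; check llm_service/llm_interface.py for the exact abstract methods:

# Hypothetical skeleton -- the real base class lives in llm_service/llm_interface.py
from llm_service.llm_interface import LLMInterface  # assumed class name

class GeminiService(LLMInterface):
    def initialize_client(self):
        # Create the provider's SDK client here, reading API keys from .env
        ...

    def make_request(self, messages):
        # Translate the shared message format into a Gemini API call
        # and return the completion text
        ...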

License

This project is licensed under the MIT License - see the LICENSE file for details.
