deepset-ai/rag-with-nvidia-nims

Build Air-Gapped RAG with Nvidia NIMs and Haystack

📚 This repository is accompanied by our article "Building RAG Applications with NVIDIA NIM and Haystack on K8s"

Info: This repo is set up to use models hosted at https://build.nvidia.com/.

These models are ready to use: create an API key on the platform and you can call them directly. The project is set up so that you can swap in your own NIM deployments by setting the model name and api_url in the NvidiaGenerator, NvidiaDocumentEmbedder, and NvidiaTextEmbedder components.
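For example, pointing the generator at a self-hosted NIM might look like the sketch below. The component name, type path, model name, and URL are placeholders; copy the actual `type` line from your own rag.yaml and use your NIM deployment's endpoint:

```yaml
components:
  generator:
    # Keep the type line exactly as it appears in your existing rag.yaml.
    type: haystack_integrations.components.generators.nvidia.generator.NvidiaGenerator
    init_parameters:
      model: meta/llama3-8b-instruct        # placeholder: your deployed model
      api_url: http://my-nim-host:8000/v1   # placeholder: your NIM endpoint
```

The same `model` / `api_url` override applies to the NvidiaDocumentEmbedder and NvidiaTextEmbedder components.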

๐Ÿ‘ฉ๐Ÿปโ€๐Ÿณ We also provide a notebook on Haystack Cookbooks that provide the same code and setup, only expecting self-hosted NIMs


Run with Docker

  1. `pip install -r requirements.txt`
  2. Create a `.env` file and add `NVIDIA_API_KEY` (if you're using hosted models via https://build.nvidia.com/)
  3. `docker-compose up`
  4. `hayhooks deploy rag.yaml`
  5. Go to `localhost:1416/docs` to interact with your RAG pipeline
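Once deployed, hayhooks exposes the pipeline over HTTP. The exact input schema depends on how rag.yaml names its components, so inspect localhost:1416/docs first; the `question` field and the `/rag/run` path below are hypothetical. A minimal sketch of building a request body:

```python
import json

# Hypothetical input for the deployed RAG pipeline -- check the schema
# shown at http://localhost:1416/docs before sending real requests.
payload = {"question": "What does the ChipNemo paper describe?"}
body = json.dumps(payload)

print(body)
# Send it with, for example:
#   curl -X POST http://localhost:1416/rag/run \
#        -H "Content-Type: application/json" \
#        -d '{"question": "What does the ChipNemo paper describe?"}'
```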

File Structure

  • indexing.py: This script preprocesses, embeds, and writes ChipNemo.pdf into a Qdrant database
  • rag.py: This script runs a RAG pipeline with a NIM LLM and retrieval model.
  • Dockerfile: This is used by the docker-compose file to install dependencies
  • docker-compose.yml: This is the docker compose file we use to spin up a container for hayhooks (Haystack pipeline deployment) and Qdrant
  • rag.yaml: The serialized RAG pipeline, the YAML equivalent of rag.py. We use this to deploy our pipeline with hayhooks
  • Open In Colab: This notebook shows how to set up your components to use self-hosted NIMs.