RAG Application with LangChain, AWS Opensearch and AWS Bedrock

This repository contains a full RAG application using Terraform as IaC, LangChain as framework, AWS Bedrock as LLM and Embedding Models, AWS OpenSearch as a vector database, and deployment on AWS OpenSearch endpoint.

Main Steps

Data Ingestion: Load data to an Opensearch Index
Embedding and Model: Bedrock Titan
Vector Store and Endpoint: Opensearch
IaC: Terraform
data: original pdf document and generated json file with embeddings

Feel free to ⭐ and clone this repo 😉

Tech Stack

Project Structure

The project has been structured with the following files:

terraform: IaC
tests: unittest and mock tests
src: scripts with the app logic
requirements.txt: project requirements
Makefile: command for testing, linting and formating
pyproject.toml: linting/formatting requirements

Project Set Up

The Python version used for this project is Python 3.11.

Clone the repo (or download it as a zip file):

git clone https://github.com/benitomartin/aws-bedrock-opensearch-langchain.git

Create the virtual environment named main-env using Conda with Python version 3.10:
```
conda create -n main-env python=3.11
conda activate main-env
```

Install the requirements.txt:

pip install -r requirements.txt

or

make req

Create infrastructure from the terraform folder. This can take up to 30 minutes

conda install conda-forge::terraform
terraform init
terraform plan
terraform apply

Generate embeddings from documents:
```
python src/generate_embeddings.py
```
Create Index:
```
python src/create_index.py
```

Ingest documents into index:

python src/ingest_docs_with_embeddings.py

Test the app to get a reply:
```
python src/app.py
```

The app contains a question. You can change it accordingly to test other scenarios.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG Application with LangChain, AWS Opensearch and AWS Bedrock

Tech Stack

Project Structure

Project Set Up

About

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
src		src
terraform		terraform
tests		tests
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt

benitomartin/aws-bedrock-opensearch-langchain

Folders and files

Latest commit

History

Repository files navigation

RAG Application with LangChain, AWS Opensearch and AWS Bedrock

Tech Stack

Project Structure

Project Set Up

About

Topics

Resources

Stars

Watchers

Forks

Languages