This is a demo RAG system, which is basically a QA bot which uses solid data to answer questions, rather than relying solely on it's own LLM knowledge.
The System is based on the following components:
- Code - spring boot framework with spring ai.
- Local models deployment, including chat and embeddings generation: ollama.
- Vector Database, for storing embeddings and querying them: postgres pgvector. All components used are free and open source.
- docker, make sure to login to dockerhub.
simply download the file "docker-compose.yml" and run docker-compose up
.