
Anime Recommender: RAG-based LLM

Pipeline Status

Dockerfile

Dockerhub link

Final Demo Video

click here

Project Purpose

There are several sources for anime recommendations, such as MyAnimeList and Reddit. However, the scores on these sites can be biased, and anime fans like myself often go down a rabbit hole digging through subreddits to find the perfect anime to watch. This project aims to solve that problem by providing a chatbot interface where users can ask for anime recommendations based on their preferences. The chatbot uses an LLM to generate responses and provide recommendations.

Architecture Diagram

[Architecture diagram]
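In place of the diagram, here is a minimal sketch of the request flow it depicts: the Streamlit frontend posts the user's preferences to the FastAPI backend, which augments the prompt with retrieved anime context and forwards it to the llamafile model server (the MODEL_URL seen in the Docker command below). The retrieve_context helper and the payload shape here are illustrative assumptions, not the repository's actual code:

# Illustrative sketch of the RAG flow; not the repository's actual implementation.
import requests

MODEL_URL = "http://127.0.0.1:8080/v1/chat/completions"  # llamafile's OpenAI-compatible endpoint

def retrieve_context(preferences: str) -> str:
    # Hypothetical retrieval step: look up anime metadata and reviews relevant
    # to the user's preferences (e.g. from a vector store) and return them as text.
    return "Relevant anime synopses and ratings would go here."

def recommend(preferences: str) -> str:
    context = retrieve_context(preferences)
    payload = {
        "model": "llava-v1.5-7b",  # llamafile accepts an arbitrary model name here
        "messages": [
            {"role": "system", "content": f"Use this context to recommend anime:\n{context}"},
            {"role": "user", "content": preferences},
        ],
    }
    response = requests.post(MODEL_URL, json=payload, timeout=300)
    response.raise_for_status()
    return response.json()["choices"][0]["message"]["content"]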

Instructions

Setup

  1. Download llava-v1.5-7b-q4.llamafile (4.29 GB).

  2. Open your computer's terminal.

  3. If you're using macOS, Linux, or BSD, grant your computer permission to execute this new file (you only need to do this once):

chmod +x llava-v1.5-7b-q4.llamafile

  4. If you're on Windows, rename the file by adding ".exe" to the end.

  5. Run the llamafile, e.g.:

./llava-v1.5-7b-q4.llamafile

  6. Your browser should open automatically and display a chat interface. (If it doesn't, open your browser and point it at http://localhost:8080.)

  7. When you're done chatting, return to your terminal and hit Control-C to shut down llamafile.

  8. Clone the repository:

git clone git@github.com:rootsec1/anime-recommendation-rag.git

  9. Change directory into the cloned repository:

cd anime-recommendation-rag

  10. Install the dependencies:

pip install -r requirements.txt
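With the dependencies installed, you can sanity-check that the llamafile server from steps 5-6 is still up before starting the backend. A minimal check (the URL comes from step 6; the snippet itself is not part of the repository):

import requests

# llamafile's built-in web UI should respond on port 8080 if the model server is running.
assert requests.get("http://localhost:8080", timeout=10).ok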

Running the application

  1. Start the FastAPI server:

PYTHONPATH=. fastapi dev backend/server.py

(or run it using the Docker image)

docker run --platform linux/x86_64 -e MODEL_URL=http://127.0.0.1:8080/v1/chat/completions -p 8000:8000 --network="host" abhishekwl/anime-recommendation-backend

  2. Start the Streamlit frontend application:

streamlit run frontend/ui.py
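With both servers running, you can also exercise the backend directly instead of going through the Streamlit UI. A minimal sketch, assuming the request body is a JSON object describing the user's preferences (the field name is an assumption; check backend/server.py for the real schema):

import requests

BASE = "http://localhost:8000"

# Health check (the GET / endpoint load-tested in the report below).
print(requests.get(f"{BASE}/", timeout=30).json())

# Ask for a recommendation. The payload shape here is hypothetical.
resp = requests.post(
    f"{BASE}/recommend-anime",
    json={"preferences": "dark fantasy with a strong plot, similar to Attack on Titan"},
    timeout=300,  # generation can take minutes on CPU (see the Locust report below)
)
print(resp.json())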

Testing the application

  1. Run the tests:

PYTHONPATH=. pytest backend/test_server.py --disable-pytest-warnings -v

[Screenshot of unit tests]
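The tests live in backend/test_server.py. For orientation, a minimal sketch of what such a test can look like with FastAPI's TestClient, assuming the FastAPI instance in backend/server.py is named app (an assumption, not confirmed by this README):

from fastapi.testclient import TestClient

from backend.server import app  # assumes the FastAPI instance is named `app`

client = TestClient(app)

def test_health_check():
    # GET / is the health check endpoint referenced in the load test below.
    response = client.get("/")
    assert response.status_code == 200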

Application Screenshots

[Application screenshot]

Performance Report (Load Testing using Locust)

| Type | Name | Request Count | Failure Count | Median (ms) | Average (ms) | Min (ms) | Max (ms) | Avg Content Size (bytes) | Requests/s | Failures/s |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GET | / | 386 | 0 | 5000.0 | 5100.0 | 4900.0 | 5200.0 | 0.0 | 7.5583 | 0.0 |
| POST | /recommend-anime | 54 | 0 | 155000.0 | 151417.85 | 71608.70 | 227882.48 | 1821.0 | 0.237 | 0.0 |
|  | Aggregated | 440 | 0 | 155000.0 | 26668.36 | 4900.0 | 227882.48 | 1821.0 | 7.7953 | 0.0 |

Response time percentiles (ms):

| Name | 50% | 66% | 75% | 80% | 90% | 95% | 98% | 99% | 99.9% | 99.99% | 100% |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| / | 5000 | 5050 | 5100 | 5120 | 5150 | 5180 | 5190 | 5195 | 5200 | 5200 | 5200 |
| /recommend-anime | 155000 | 155000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 |
| Aggregated | 5000 | 5050 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 | 228000 |

This performance report summarizes the results of load testing conducted on our API endpoints using Locust. The tests were performed on a MacBook Air with the following specifications:

Hardware Overview:

   Model Name: MacBook Air
   Model Identifier: Mac14,2
   Chip: Apple M2
   Total Number of Cores: 8 (4 performance and 4 efficiency)
   Memory: 16 GB
   System Firmware Version: 10151.121.1
   OS Loader Version: 10151.121.1

This setup was used to assess the API's responsiveness and reliability under concurrent requests.

  • GET / Endpoint: The health check endpoint was tested with 386 requests. The average response time was approximately 5.1 seconds, with no failures, showcasing consistent performance under load.

  • POST /recommend-anime Endpoint: The recommendation endpoint was tested with 54 requests. The response times exhibited significant variance, with a median of 155 seconds. Despite the higher latency, the endpoint successfully handled all requests without any failures.

The test results indicate that the system can handle multiple requests concurrently without errors, though optimizations may be required to reduce response times for the recommendation endpoint under heavier loads.
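For reference, a load test like the one above can be reproduced with a locustfile along these lines. The two endpoints match the report; the task weights, wait times, and payload are assumptions, not the exact configuration used:

from locust import HttpUser, between, task

class AnimeApiUser(HttpUser):
    # Pause 1-5 seconds between tasks; the original test's pacing is unknown.
    wait_time = between(1, 5)

    @task(5)
    def health_check(self):
        self.client.get("/")

    @task(1)
    def recommend(self):
        # Hypothetical payload; check backend/server.py for the real schema.
        self.client.post(
            "/recommend-anime",
            json={"preferences": "cozy slice-of-life anime"},
        )

Run it against the backend with: locust -f locustfile.py --host http://localhost:8000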