Skip to content

aidotse/reranker-inference

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reranker Inference Service

Reranker inference service intended for use with the Digital Assistant. Simply hosts a reranker model using HuggingFace transformers and exposes a prediction endpoint.

Build

make build

Running

To run in the project use

make run

When running in production, use

docker volume create hf_cache  # If not exists
docker run -it -p 5000:5000 -v hf_cache:/app/hf_cache --gpus all -e API_KEY=<token> ghcr.io/aidotse/reranker-inference:latest

Push

make push

About

API service for reranker inference

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages