Pinned
-
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
-
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
-
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
-
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
-
apache/tvm
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators.
-
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs.