简体中文 | English
FastDeploy provides end-to-end serving deployment built on Triton Inference Server. The backend uses FastDeploy's high-performance Runtime module and integrates FastDeploy's pre- and post-processing modules, so models can be deployed quickly with an easy-to-use workflow and excellent performance.
FastDeploy also provides an easy-to-use Python serving deployment method; refer to the PaddleSeg deployment example for its usage, or the sketch below.
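As a minimal sketch of that flow (the `fastdeploy simple_serving` entry point and the `server:app` module layout follow the FastDeploy simple-serving examples and are assumptions here; check the linked example for the authoritative steps):

```shell
# Assumption: a server.py in the current directory defines a FastDeploy
# simple-serving `app` object, as in the PaddleSeg simple-serving example
fastdeploy simple_serving --app server:app
```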
Environment requirements:

- Linux
- If using a GPU image, NVIDIA Driver >= 470 is required (for GPUs of the older Tesla architecture, such as the T4, NVIDIA Driver 418.40+, 440.33+, 450.51+, or 460.27+ also works)
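Before pulling a GPU image, the installed driver version can be verified on the host:

```shell
# Print the NVIDIA driver version for each visible GPU
nvidia-smi --query-gpu=driver_version --format=csv,noheader
```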
CPU images support serving deployment of Paddle/ONNX models on CPU only; supported inference backends include OpenVINO, Paddle Inference, and ONNX Runtime:
```shell
docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.4-cpu-only-21.10
```
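A minimal sketch of running the CPU image (the container name and mount path are illustrative; `fastdeployserver` is the Triton-based server binary shipped in the FastDeploy serving images, and `/serving/models` stands for a model repository prepared as described in the documents below):

```shell
# Start the CPU image and mount the current directory into the container
docker run -it --name fd_cpu_serving -v $PWD:/serving \
  registry.baidubce.com/paddlepaddle/fastdeploy:1.0.4-cpu-only-21.10 bash

# Inside the container: launch the server against a prepared model repository
fastdeployserver --model-repository=/serving/models
```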
GPU images support serving deployment of Paddle/ONNX models on both GPU and CPU; supported inference backends include OpenVINO, TensorRT, Paddle Inference, and ONNX Runtime:
```shell
docker pull registry.baidubce.com/paddlepaddle/fastdeploy:1.0.4-gpu-cuda11.4-trt8.5-21.10
```
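The GPU image is started the same way, with GPUs passed through (assumes the NVIDIA Container Toolkit is installed so that `--gpus` works; names and paths are again illustrative):

```shell
# Start the GPU image with all GPUs visible to the container
docker run -it --gpus all --name fd_gpu_serving -v $PWD:/serving \
  registry.baidubce.com/paddlepaddle/fastdeploy:1.0.4-gpu-cuda11.4-trt8.5-21.10 bash

# Inside the container: pin a GPU and launch the server
CUDA_VISIBLE_DEVICES=0 fastdeployserver --model-repository=/serving/models
```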
Users can also build the image themselves according to their own needs; refer to the following documents:
- How to Prepare Serving Model Repository
- Serving Deployment Configuration for Runtime
- Demo of Serving Deployment
- Client Access Instruction (see the client sketch after this list)
- Serving deployment visualization
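Once a server is running, clients talk to the standard Triton endpoints. A minimal health and metadata check, assuming the default Triton HTTP port 8000 and an illustrative model name:

```shell
# Check that the server is live and ready to accept requests
curl -v localhost:8000/v2/health/ready

# Query metadata of a deployed model (the model name `yolov5` is illustrative)
curl localhost:8000/v2/models/yolov5
```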
Serving deployment examples are provided for the following tasks and models:

| Task | Model |
|---|---|
| Classification | PaddleClas |
| Detection | PaddleDetection |
| Detection | ultralytics/YOLOv5 |
| NLP | PaddleNLP/ERNIE-3.0 |
| NLP | PaddleNLP/UIE |
| Speech | PaddleSpeech/PP-TTS |
| OCR | PaddleOCR/PP-OCRv3 |