简体中文 | English
This example demonstrates how to perform model inference via the CLI, Python, and C++, using the YOLOv8s model as an example.

> [!IMPORTANT]
> If you want to use the EfficientRotatedNMS plugin to run inference with an OBB model, please refer to Building TensorRT Custom Plugins for guidance.
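The custom plugin must be visible to TensorRT before an OBB engine can be deserialized. Below is a minimal sketch of how that registration typically looks from Python, assuming the plugin build produces a shared library; the library path is a placeholder, and the engine file is the one built later in this example:

```python
import ctypes

import tensorrt as trt

# Load the shared library containing the custom plugin so that its creator
# registers itself with TensorRT. The path is a placeholder; point it at the
# artifact produced by your plugin build.
ctypes.CDLL("/path/to/your/libcustom_plugins.so")

logger = trt.Logger(trt.Logger.INFO)
# Register all available plugin creators (built-in and the one loaded above).
trt.init_libnvinfer_plugins(logger, "")

# With the plugin registered, the OBB engine should deserialize cleanly
# instead of failing with a "plugin not found" error.
with open("models/yolov8s-obb.engine", "rb") as f, trt.Runtime(logger) as runtime:
    engine = runtime.deserialize_cuda_engine(f.read())
print("engine loaded:", engine is not None)
```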
- Download the YOLOv8s model and save it to the `models` folder.
- Use the following command to export the model to ONNX format with the EfficientNMS plugin:

    ```bash
    trtyolo export -w models/yolov8s.pt -v yolov8 -o models
    ```

    After running the above command, a `yolov8s.onnx` file will be generated in the `models` folder (a quick sanity check of the export is sketched after these steps). Next, use the `trtexec` tool to convert the ONNX file to a TensorRT engine:

    ```bash
    trtexec --onnx=models/yolov8s.onnx --saveEngine=models/yolov8s.engine --fp16
    ```
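A quick way to confirm that the exporter actually inserted the NMS plugin node is to inspect the ONNX graph. Here is a minimal sketch using the `onnx` package, where the op-type string to look for is an assumption about what `trtyolo export` emits:

```python
import onnx

# Load the exported graph and collect the op types it contains.
model = onnx.load("models/yolov8s.onnx")
op_types = {node.op_type for node in model.graph.node}

# Assumption: the exporter inserts a node whose op type mentions EfficientNMS
# (e.g. "EfficientNMS_TRT"); adjust the check if your export differs.
nms_ops = [op for op in op_types if "EfficientNMS" in op]
print("NMS plugin nodes found:", nms_ops or "none")
```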
- Download the YOLOv8s-obb model and save it to the `models` folder.
- Use the following command to export the model to ONNX format with the EfficientRotatedNMS plugin:

    ```bash
    trtyolo export -w models/yolov8s-obb.pt -v yolov8 -o models
    ```

    After running the above command, a `yolov8s-obb.onnx` file will be generated in the `models` folder. Next, use the `trtexec` tool to convert the ONNX file to a TensorRT engine:

    ```bash
    trtexec --onnx=models/yolov8s-obb.onnx --saveEngine=models/yolov8s-obb.engine --fp16
    ```

    Once both engines are built, the sketch below shows one way to inspect their inputs and outputs.
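This sanity check deserializes an engine and prints its input and output tensors. It assumes TensorRT 8.5 or newer (for the tensor-name API); the OBB engine additionally requires the custom plugin library to be loaded first, as in the earlier sketch:

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.INFO)
# For the OBB engine, also ctypes.CDLL(...) the plugin library beforehand.
trt.init_libnvinfer_plugins(logger, "")

def describe(engine_path: str) -> None:
    """Deserialize an engine and print its I/O tensor names and shapes."""
    with open(engine_path, "rb") as f, trt.Runtime(logger) as runtime:
        engine = runtime.deserialize_cuda_engine(f.read())
    for i in range(engine.num_io_tensors):
        name = engine.get_tensor_name(i)
        mode = engine.get_tensor_mode(name)  # INPUT or OUTPUT
        print(engine_path, mode, name, engine.get_tensor_shape(name))

describe("models/yolov8s.engine")
```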
- Download the coco128 dataset.
- After extraction, move the images from the `coco128/images/train2017` folder to the `images` folder for inference.
- Download the DOTA-v1.0 dataset.
- After extraction, move the images from the `part1/images` folder to the `images` folder for inference. A small helper for both of these moves is sketched below.
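Moving a few thousand files by hand is tedious, so here is a minimal sketch of the move step using only the standard library. The source paths match the two datasets above; adjust the suffix filter if your extraction layout differs:

```python
import shutil
from pathlib import Path

def move_images(src_dir: str, dst_dir: str = "images") -> None:
    """Move all image files from src_dir into dst_dir (created if missing)."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    for path in Path(src_dir).iterdir():
        if path.suffix.lower() in {".jpg", ".jpeg", ".png", ".bmp"}:
            shutil.move(str(path), dst / path.name)

# coco128 (detection) and DOTA-v1.0 part1 (OBB) layouts from the steps above.
move_images("coco128/images/train2017")
move_images("part1/images")
```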
- Use the `trtyolo` command-line tool for inference. Run the following command to view the help information:

    ```bash
    trtyolo infer --help
    ```

- Run the following commands for inference:

    > [!NOTE]
    > The `--cudaGraph` option, introduced in version 4.0, can further accelerate the inference process, but it only supports static models. From version 4.2 onwards, OBB model inference is supported, and the new `-m, --mode` option selects between Detection and OBB models.

    ```bash
    # Detection Model
    trtyolo infer -e models/yolov8s.engine -m 0 -i images -o output -l labels_det.txt --cudaGraph

    # Oriented Bounding Box Model
    trtyolo infer -e models/yolov8s-obb.engine -m 1 -i images -o output -l labels_obb.txt --cudaGraph
    ```

    The inference results will be saved to the `output` folder, along with visualized detections. Both commands read class names from a label file passed via `-l`; a sketch of creating one is shown below.
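The label-file format is an assumption here: one class name per line, in the index order the model was trained with. A minimal sketch (the three names are placeholders, not a full class list):

```python
# Assumed format: one class name per line, ordered by class index.
# Placeholder names only; use the full class list your model was trained on
# (e.g. the 80 COCO classes for yolov8s, the DOTA-v1.0 categories for OBB).
names = ["person", "bicycle", "car"]

with open("labels_det.txt", "w", encoding="utf-8") as f:
    f.write("\n".join(names) + "\n")
```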
- Use the `tensorrt_yolo` library for Python inference. The sample script `detect.py` is ready to use.
- Run the following commands for inference:

    > [!NOTE]
    > The `--cudaGraph` option can further accelerate the inference process, but it only supports static models.

    ```bash
    # Detection Model
    python detect.py -e models/yolov8s.engine -m 0 -i images -o output -l labels_det.txt --cudaGraph

    # Oriented Bounding Box Model
    python detect.py -e models/yolov8s-obb.engine -m 1 -i images -o output -l labels_obb.txt --cudaGraph
    ```
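To embed inference in your own script rather than invoke `detect.py`, the sketch below shows the general shape such code takes. Every `tensorrt_yolo` name in it (`DeployDet`, `generate_labels_with_colors`, `visualize`) is an assumption about the library's API and may differ between versions; treat `detect.py` in this directory as the authoritative reference:

```python
import cv2

# ASSUMPTION: these names are a guess at the tensorrt_yolo API surface;
# check detect.py for the names your installed version actually exposes.
from tensorrt_yolo.infer import DeployDet, generate_labels_with_colors, visualize

model = DeployDet("models/yolov8s.engine")          # detection engine from above
labels = generate_labels_with_colors("labels_det.txt")

image = cv2.imread("images/example.jpg")            # hypothetical image path
result = model.predict(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))

vis = visualize(image, result, labels)              # draw boxes and labels
cv2.imwrite("output/example.jpg", vis)
```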
- Ensure that the project has been compiled according to the Deploy Compilation Guide.
- Use `xmake` to compile `detect.cpp` into an executable:

    ```bash
    xmake f -P . --tensorrt="/path/to/your/TensorRT" --deploy="/path/to/your/TensorRT-YOLO"
    xmake -P . -r
    ```

    After compilation, the executable will be generated in the `build` folder at the project root.

- Run the following commands for inference:

    > [!NOTE]
    > The `--cudaGraph` option can further accelerate the inference process, but it only supports static models.

    ```bash
    # Detection Model
    xmake run -P . detect -e models/yolov8s.engine -m 0 -i images -o output -l labels_det.txt --cudaGraph

    # Oriented Bounding Box Model
    xmake run -P . detect -e models/yolov8s-obb.engine -m 1 -i images -o output -l labels_obb.txt --cudaGraph
    ```
By following the steps above, you can successfully complete model inference.