Can the MLServer runtime be replaced by Triton? #185

OvervCW · 2022-07-11T16:22:14Z

ModelMesh currently uses the MLServer runtime to serve sklearn, xgboost, and lightgbm models. However, it seems like recent versions of Triton now support all of those model types as well.

Does that mean that ModelMesh could support all types of models using a single runtime? And if so, would it be easy to change the adapter to handle this?

njhill · 2022-07-22T10:38:15Z

@OvervCW there shouldn't be anything stopping you from using these other kinds of models with Triton. It would be great if you could try it out!

Hopefully all that should be needed is to:

1. Update the `triton-2.x` ServingRuntime spec to include additional `supportedModelFormats` as needed.
2. Reference the appropriate type in your Predictor, and either specify the runtime name explicitly or remove other runtimes, like MLServer, which also support the same type (or set `autoSelect` to `false` in the right places in those other runtimes).
3. Make sure to include a full Triton model configuration pbtxt file in the storage location that the Predictor's path points to (like this).
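The three steps above can be sketched roughly as follows. This is a minimal illustration, not the project's tooling: the helper names, the `fil/xgboost-example` path, the `localMinIO` secret key, the feature count, and the batch size are all assumptions, and the pbtxt fields follow the conventions of Triton's FIL backend (`backend: "fil"`, `input__0`, `output__0`, a `model_type` parameter). The manifests are emitted as JSON, which `kubectl apply -f` accepts.

```python
import json

TRITON_RUNTIME = "triton-2.x"  # runtime name mentioned in the thread


def runtime_format_entry(name, version=None):
    """Step 1: an entry to append under spec.supportedModelFormats
    of the Triton ServingRuntime."""
    # autoSelect=False keeps Triton from being picked automatically for
    # this format, so another runtime such as MLServer stays the default.
    entry = {"name": name, "autoSelect": False}
    if version is not None:
        entry["version"] = version
    return entry


def predictor_manifest(name, model_type, path, secret_key):
    """Step 2: a Predictor that names the Triton runtime explicitly."""
    return {
        "apiVersion": "serving.kserve.io/v1alpha1",
        "kind": "Predictor",
        "metadata": {"name": name},
        "spec": {
            "modelType": {"name": model_type},
            "runtime": {"name": TRITON_RUNTIME},
            "path": path,  # hypothetical storage path
            "storage": {"s3": {"secretKey": secret_key}},
        },
    }


def fil_config_pbtxt(model_type, num_features, num_outputs=1, max_batch_size=8192):
    """Step 3: text of a config.pbtxt for Triton's FIL backend.
    All numeric values here are placeholders to adapt to the real model."""
    params = {"model_type": model_type, "output_class": "false"}
    param_lines = ",\n".join(
        f'  {{ key: "{k}" value: {{ string_value: "{v}" }} }}'
        for k, v in params.items()
    )
    return (
        'backend: "fil"\n'
        f"max_batch_size: {max_batch_size}\n"
        f'input [ {{ name: "input__0" data_type: TYPE_FP32 dims: [ {num_features} ] }} ]\n'
        f'output [ {{ name: "output__0" data_type: TYPE_FP32 dims: [ {num_outputs} ] }} ]\n'
        "instance_group [ { kind: KIND_CPU } ]\n"
        f"parameters [\n{param_lines}\n]\n"
    )


print(json.dumps(runtime_format_entry("xgboost"), indent=2))
print(json.dumps(
    predictor_manifest("example-xgboost", "xgboost", "fil/xgboost-example", "localMinIO"),
    indent=2,
))
print(fil_config_pbtxt("xgboost", num_features=126))
```

Naming the runtime in the Predictor means the `autoSelect` settings of other runtimes do not need to change; the alternative from step 2 is to leave the runtime name out and adjust `autoSelect` instead.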

ckadner · 2024-01-20T00:06:29Z

@rafvasq -- since you updated the Triton serving runtime last year, it might be worth trying this out and documenting it. WDYT?

rafvasq · 2024-01-26T20:56:34Z

@ckadner I confirmed that, with a few changes, Triton is able to deploy those models too. I think it'd be worth updating the docs to reflect that, which includes a couple of small updates to the example models via kserve/modelmesh-minio-examples#7.

Related to kserve/modelmesh-serving#485 and kserve/modelmesh-serving#185, this PR expands on `lightgbm` and `xgboost` examples to show that they can be deployed with Triton (in addition to MLServer).

---------

Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com>

#### Motivation

Triton introduced [support for more model frameworks last year](https://developer.nvidia.com/blog/real-time-serving-for-xgboost-scikit-learn-randomforest-lightgbm-and-more/) and can support xgboost, lightgbm, and more. This PR adds examples and docs to advertise this.

#### Modifications

- Add newly supported models to Triton runtime config, setting `autoSelect: false`.
- Add an example ISVC config for Triton-served XGBoost model.
- Update example-models doc to reflect example models added in kserve/modelmesh-minio-examples#7
- Update model-formats README to reflect framework support and framework-specific docs to show example ISVC using Triton.
- Add FVTs for lightgbm and xgboost deployment on Triton runtime

#### Result

Closes #185

---------

Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com>
Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
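For readers who want a picture of what an ISVC config for a Triton-served XGBoost model might look like, here is a hedged sketch. It is not the example file added by the PR: the function name, the resource name, the storage key, and the storage path are hypothetical, and the layout assumes the usual ModelMesh `InferenceService` shape with the `serving.kserve.io/deploymentMode: ModelMesh` annotation.

```python
import json


def triton_isvc(name, model_format, storage_key, storage_path, runtime="triton-2.x"):
    """Build an InferenceService manifest that pins the Triton runtime."""
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {
            "name": name,
            # Routes the ISVC to ModelMesh rather than another deployment mode.
            "annotations": {"serving.kserve.io/deploymentMode": "ModelMesh"},
        },
        "spec": {
            "predictor": {
                "model": {
                    "modelFormat": {"name": model_format},
                    # Needed because the Triton entries use autoSelect: false.
                    "runtime": runtime,
                    "storage": {"key": storage_key, "path": storage_path},
                }
            }
        },
    }


# Hypothetical values; replace the key and path with real storage details.
print(json.dumps(
    triton_isvc("example-xgboost-triton", "xgboost", "localMinIO", "fil/xgboost-example"),
    indent=2,
))
```

Because the Triton format entries are added with `autoSelect: false`, the `runtime` field has to be set explicitly; without it, a runtime that auto-selects the format (such as MLServer) would serve the model instead.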


njhill added the question label (Further information is requested) Jul 22, 2022

ckadner added the documentation (Improvements or additions to documentation) and good first issue (Good for newcomers) labels Jan 20, 2024

rafvasq self-assigned this Jan 22, 2024

This was referenced Jan 26, 2024

docs: Update model support with Triton's FIL backend #484 (Closed)

feat: Adds and refactors for Triton FIL examples kserve/modelmesh-minio-examples#7 (Merged)

rafvasq mentioned this issue Jan 29, 2024

feat: Update Triton model support #485 (Merged)

rafvasq closed this as completed in #485 Mar 13, 2024
