-
Notifications
You must be signed in to change notification settings - Fork 118
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can the MLServer runtime be replaced by Triton? #185
Comments
@OvervCW there shouldn't be anything stopping you using these other kinds of models with Triton, would be great if you could try it out! Hopefully all that should be needed is to:
|
@rafvasq -- since you updated the Triton serving runtime last year, it might be worth trying this out and documenting it. WDYT? |
@ckadner I confirmed that with a few changes, Triton's able to deploy those models too. Think it'd be worth updating the docs to reflect that which includes a couple of small updates to the example models via kserve/modelmesh-minio-examples#7. |
Related to kserve/modelmesh-serving#485 and kserve/modelmesh-serving#185, this PR expands on `lightgbm` and `xgboost` examples to show that they can be deployed with Triton (in addition to MLServer). --------- Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com>
#### Motivation Triton introduced [support for more model frameworks last year](https://developer.nvidia.com/blog/real-time-serving-for-xgboost-scikit-learn-randomforest-lightgbm-and-more/) and can support xgboost, lightgbm, and more. This PR adds examples and docs to advertise this. #### Modifications - Add newly supported models to Triton runtime config, setting `autoSelect: false`. - Add an example ISVC config for Triton-served XGBoost model. - Update example-models doc to reflect example models added in kserve/modelmesh-minio-examples#7 - Update model-formats README to reflect framework support and framework-specific docs to show example ISVC using Triton. - Add FVTs for lightgbm and xgboost deployment on Triton runtime #### Result Closes #185 --------- Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com> Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
#### Motivation Triton introduced [support for more model frameworks last year](https://developer.nvidia.com/blog/real-time-serving-for-xgboost-scikit-learn-randomforest-lightgbm-and-more/) and can support xgboost, lightgbm, and more. This PR adds examples and docs to advertise this. #### Modifications - Add newly supported models to Triton runtime config, setting `autoSelect: false`. - Add an example ISVC config for Triton-served XGBoost model. - Update example-models doc to reflect example models added in kserve/modelmesh-minio-examples#7 - Update model-formats README to reflect framework support and framework-specific docs to show example ISVC using Triton. - Add FVTs for lightgbm and xgboost deployment on Triton runtime #### Result Closes kserve#185 --------- Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com> Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com> Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
ModelMesh currently uses the MLServer runtime to serve sklearn, xgboost, and lightgbm models. However, it seems like recent versions of Triton now support all of those model types as well.
Does that mean that ModelMesh could support all types of models using a single runtime? And if so, would it be easy to change the adapter to handle this?
The text was updated successfully, but these errors were encountered: