-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
doc: Add custom runtime with PVC example #495
Conversation
1085963
to
613e31c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @zhlsunshine! I think having a complete end-to-end example for this will be very helpful. I've done a first-pass and reviewed some edits that I think will benefit the doc.
I also think that since this is an E2E example along with ready-to-use code, it can be added to /docs/examples
instead, maybe in its own directory called /docs/examples/python-custom-runtime
.
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
.../python-based-custom-runtime-example/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
08f4e21
to
e4d8169
Compare
Hi @rafvasq, thanks for reviewing and comments, I moved this e2e example into |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @zhlsunshine, this looks great. Just the small nit, run make run fmt
to get past the Lint check, and rebase.
docs/examples/python-custom-runtime/Python-Based-Custom-Runtime-with-Model-Stored-on-PVC.md
Outdated
Show resolved
Hide resolved
3c233a2
to
786c913
Compare
Hi @rafvasq rebasing done! |
#### Motivation Triton introduced [support for more model frameworks last year](https://developer.nvidia.com/blog/real-time-serving-for-xgboost-scikit-learn-randomforest-lightgbm-and-more/) and can support xgboost, lightgbm, and more. This PR adds examples and docs to advertise this. #### Modifications - Add newly supported models to Triton runtime config, setting `autoSelect: false`. - Add an example ISVC config for Triton-served XGBoost model. - Update example-models doc to reflect example models added in kserve/modelmesh-minio-examples#7 - Update model-formats README to reflect framework support and framework-specific docs to show example ISVC using Triton. - Add FVTs for lightgbm and xgboost deployment on Triton runtime #### Result Closes kserve#185 --------- Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com> Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com> Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
…ording to comments Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
This is to move away from CVE-20204-24786: infinite loop vulnerability. Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com> Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
dd9fdab
to
1a8cc6c
Compare
Hi @rafvasq, I also completed the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @zhlsunshine!
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rafvasq, zhlsunshine The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
1 similar comment
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: rafvasq, zhlsunshine The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/lgtm |
Motivation
I'm very happy to quickly install
kserve modelmesh
by following quickstart, but I encountered the problem when I want to write a python-based customServingRuntime
for modelmnist-svm.joblib
which is also used in many guides and docs, and I also opened an issue for it. However, I can hardly find the complete process which is available online, including the community. So I pieced together some information to make it's easier for user to do it.Modifications
Result
The process for writing a python-based custom
ServingRuntime
is completed, hope it can be a easy start for fresh user.