Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

doc: Add custom runtime with PVC example #495

Merged
merged 8 commits into from
Mar 22, 2024

Conversation

zhlsunshine
Copy link

Motivation

I'm very happy to quickly install kserve modelmesh by following quickstart, but I encountered the problem when I want to write a python-based custom ServingRuntime for model mnist-svm.joblib which is also used in many guides and docs, and I also opened an issue for it. However, I can hardly find the complete process which is available online, including the community. So I pieced together some information to make it's easier for user to do it.

Modifications

  1. Add a REAMD for describing the complete process
  2. Add a folder for custom ServingRuntime, including the python code, Dockerfile and required library configuration file.

Result

The process for writing a python-based custom ServingRuntime is completed, hope it can be a easy start for fresh user.

Copy link
Member

@rafvasq rafvasq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you @zhlsunshine! I think having a complete end-to-end example for this will be very helpful. I've done a first-pass and reviewed some edits that I think will benefit the doc.

I also think that since this is an E2E example along with ready-to-use code, it can be added to /docs/examples instead, maybe in its own directory called /docs/examples/python-custom-runtime.

@zhlsunshine
Copy link
Author

Hi @rafvasq, thanks for reviewing and comments, I moved this e2e example into /docs/examples/python-custom-runtime as you suggested, and change the doc according to your comments.

@rafvasq rafvasq self-requested a review March 20, 2024 18:08
Copy link
Member

@rafvasq rafvasq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @zhlsunshine, this looks great. Just the small nit, run make run fmt to get past the Lint check, and rebase.

@zhlsunshine
Copy link
Author

zhlsunshine commented Mar 21, 2024

Thanks @zhlsunshine, this looks great. Just the small nit, run make run fmt to get past the Lint check, and rebase.

Hi @rafvasq rebasing done!

rafvasq and others added 7 commits March 22, 2024 11:27
#### Motivation
Triton introduced [support for more model frameworks last
year](https://developer.nvidia.com/blog/real-time-serving-for-xgboost-scikit-learn-randomforest-lightgbm-and-more/)
and can support xgboost, lightgbm, and more. This PR adds examples and
docs to advertise this.

#### Modifications
- Add newly supported models to Triton runtime config, setting
`autoSelect: false`.
- Add an example ISVC config for Triton-served XGBoost model.
- Update example-models doc to reflect example models added in
kserve/modelmesh-minio-examples#7
- Update model-formats README to reflect framework support and
framework-specific docs to show example ISVC using Triton.
- Add FVTs for lightgbm and xgboost deployment on Triton runtime

#### Result
Closes kserve#185

---------

Signed-off-by: Rafael Vasquez <raf.vasquez@ibm.com>
Signed-off-by: Rafael Vasquez <rafvasq21@gmail.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
…ording to comments

Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
This is to move away from CVE-20204-24786: infinite loop vulnerability.

Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
Signed-off-by: zhlsunshine <huailong.zhang@intel.com>
@zhlsunshine
Copy link
Author

Hi @rafvasq, I also completed the golangci-lint process, submitted the formatted docs.

Copy link
Member

@rafvasq rafvasq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @zhlsunshine!

Copy link

oss-prow-bot bot commented Mar 22, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: rafvasq, zhlsunshine

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

1 similar comment
Copy link

oss-prow-bot bot commented Mar 22, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: rafvasq, zhlsunshine

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@rafvasq
Copy link
Member

rafvasq commented Mar 22, 2024

/lgtm

@oss-prow-bot oss-prow-bot bot added the lgtm label Mar 22, 2024
@rafvasq rafvasq changed the title The Python-Based Custom Runtime with Model Stored on Persistent Volume Claim doc: Add custom runtime with PVC example Mar 22, 2024
@rafvasq rafvasq merged commit 70cdc3e into kserve:main Mar 22, 2024
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants