Added docs for raw deployment autoscaling. #312

andyi2it · 2023-11-06T02:39:20Z

"Fixes #303" Update Autoscaling docs for Raw deployment mode

Proposed Changes

netlify · 2023-11-06T02:39:24Z

✅ Deploy Preview for elastic-nobel-0aef7a ready!

Name	Link
🔨 Latest commit	`8135ecd`
🔍 Latest deploy log	https://app.netlify.com/sites/elastic-nobel-0aef7a/deploys/6548ac23ad6ec4000887d949
😎 Deploy Preview	https://deploy-preview-312--elastic-nobel-0aef7a.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

kserve-oss-bot · 2023-11-06T02:39:27Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: andyi2it
To complete the pull request process, please assign theofpa after the PR has been reviewed.
You can assign the PR to them by writing /assign @theofpa in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>

yuzisun · 2023-11-26T15:19:03Z

docs/modelserving/autoscaling/autoscaling.md

+        serving.kserve.io/deploymentMode: RawDeployment
+        serving.kserve.io/autoscalerClass: hpa
+        serving.kserve.io/metric: cpu
+        serving.kserve.io/targetUtilizationPercentage: "80"


these are the annotations for the old schema

also document the possible supported metric type for RawDeployment mode

yuzisun · 2023-11-26T15:19:48Z

docs/modelserving/autoscaling/autoscaling.md

+### HPA in Raw Deployment
+
+When using Kserve with the `RawDeployment` mode, Knative is not installed. In this mode, if you deploy an `InferenceService`, Kserve uses **Kubernetes’ Horizontal Pod Autoscaler (HPA)** for autoscaling instead of **Knative Pod Autoscaler (KPA)**. For more information about Kserve's autoscaler, you can refer [`this`](https://kserve.github.io/website/master/modelserving/v1beta1/torchserve/#knative-autoscaler)


better to refer to the official Knative autoscaler doc.

yuzisun · 2023-11-26T15:20:56Z

docs/modelserving/autoscaling/autoscaling.md

+The default for scaleMetric is `concurrency` and possible values are `concurrency`, `rps`, `cpu` and `memory`.
+
+## Autoscaler for Kserve's Raw Deployment Mode


Maybe worth separate page for this, this doc is a bit too long.

kserve-oss-bot requested review from alexagriffith and theofpa November 6, 2023 02:39

andyi2it added 2 commits November 6, 2023 14:34

Added docs for raw deployment autoscaling.

c781ccf

Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>

Schema order changed.

8135ecd

Signed-off-by: Andrews Arokiam <andrews.arokiam@ideas2it.com>

andyi2it force-pushed the issue-303 branch from 480f4d3 to 8135ecd Compare November 6, 2023 09:04

yuzisun reviewed Nov 26, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added docs for raw deployment autoscaling. #312

Added docs for raw deployment autoscaling. #312

andyi2it commented Nov 6, 2023 •

edited

Loading

netlify bot commented Nov 6, 2023 •

edited

Loading

kserve-oss-bot commented Nov 6, 2023

yuzisun Nov 26, 2023

yuzisun Nov 26, 2023

yuzisun Nov 26, 2023

yuzisun Nov 26, 2023

		### HPA in Raw Deployment

		When using Kserve with the `RawDeployment` mode, Knative is not installed. In this mode, if you deploy an `InferenceService`, Kserve uses Kubernetes’ Horizontal Pod Autoscaler (HPA) for autoscaling instead of Knative Pod Autoscaler (KPA). For more information about Kserve's autoscaler, you can refer [`this`](https://kserve.github.io/website/master/modelserving/v1beta1/torchserve/#knative-autoscaler)

		The default for scaleMetric is `concurrency` and possible values are `concurrency`, `rps`, `cpu` and `memory`.

		## Autoscaler for Kserve's Raw Deployment Mode

Added docs for raw deployment autoscaling. #312

Are you sure you want to change the base?

Added docs for raw deployment autoscaling. #312

Conversation

andyi2it commented Nov 6, 2023 • edited Loading

Proposed Changes

netlify bot commented Nov 6, 2023 • edited Loading

✅ Deploy Preview for elastic-nobel-0aef7a ready!

kserve-oss-bot commented Nov 6, 2023

yuzisun Nov 26, 2023

Choose a reason for hiding this comment

yuzisun Nov 26, 2023

Choose a reason for hiding this comment

yuzisun Nov 26, 2023

Choose a reason for hiding this comment

yuzisun Nov 26, 2023

Choose a reason for hiding this comment

andyi2it commented Nov 6, 2023 •

edited

Loading

netlify bot commented Nov 6, 2023 •

edited

Loading