v0.14 release blog #431

Open · wants to merge 2 commits into main

Conversation

israel-hdez (Contributor)

Proposed Changes

Notes

I haven't tried the model cache. So, I'm not sure the provided YAML is correct.

Contribute blog article for v0.14 release

Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>

netlify bot commented Dec 9, 2024

Deploy Preview for elastic-nobel-0aef7a ready!

| Name | Link |
|------|------|
| 🔨 Latest commit | 05f73b6 |
| 🔍 Latest deploy log | https://app.netlify.com/sites/elastic-nobel-0aef7a/deploys/67588b55b8da2a0009890a7b |
| 😎 Deploy Preview | https://deploy-preview-431--elastic-nobel-0aef7a.netlify.app |

@israel-hdez (Contributor Author)

@yuzisun @greenmoon55 Could you please review?

@yuzisun (Member) commented Dec 10, 2024

Thanks @israel-hdez !!

Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>
```yaml
modelSize: 1Gi
nodeGroup: nodegroup1
sourceModelUri: gs://kfserving-examples/models/sklearn/1.0/model
```

Maybe add the isvc with an example explaining how to use this?

Contributor Author

AFAIK, the InferenceService doesn't change and you use it normally (i.e., you would still use gs://kfserving-examples/models/sklearn/1.0/model for storageUri).

The difference you would notice is that the model will be fetched/mounted from the cache instead of being downloaded.
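
For reference, a minimal InferenceService using the same URI might look like the sketch below (the name and model format are illustrative, not taken from the blog draft):

```yaml
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: sklearn-iris            # illustrative name
spec:
  predictor:
    model:
      modelFormat:
        name: sklearn
      # Same URI as the LocalModelCache's sourceModelUri; when they match,
      # the model is served from the node-local cache instead of being
      # downloaded again.
      storageUri: gs://kfserving-examples/models/sklearn/1.0/model
```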

Contributor Author

I think I can add a brief note about what I just wrote.


This release also includes several enhancements and changes:

### What's New?
Member

@sivanantha321 @andyi2it Would it also be good to add the binary extension support and response header support?
#419

Contributor Author

I somehow thought that the binary extension was an enhancement of the Inference Client. So, to better understand: should I add it under the inference client heading, or is it good here as a bullet under What's New?

Member

The binary extension is not part of the inference client effort; it implements the binary extension as part of the Open Inference Protocol, along with FP16 support.

Member

* Allow PVC storage to be mounted in ReadWrite mode via an annotation [#3687](https://github.com/kserve/kserve/issues/3687)

### What's Changed?
* Added `hostIPC` field to `ServingRuntime` CRD, for supporting more than one GPU in Serverless mode [#3791](https://github.com/kserve/kserve/issues/3791)
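
(For illustration only, the new field might be set on a ServingRuntime like the sketch below; the top-level placement of `hostIPC` and all names/images are assumptions for this example, not taken from the release notes.)

```yaml
apiVersion: serving.kserve.io/v1alpha1
kind: ServingRuntime
metadata:
  name: multi-gpu-runtime                       # illustrative name
spec:
  hostIPC: true                                 # assumed placement of the new field
  supportedModelFormats:
    - name: huggingface
  containers:
    - name: kserve-container
      image: example.registry/llm-runtime:latest  # illustrative image
```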
Member

I think it is good to add a section for LLM runtime support to cover the changes that are part of the 0.14 release:

* vLLM 0.6.x support
* Health endpoint for the vLLM backend
* Shared memory volume support for the vLLM backend
* Chat completion template file support
* `trust_remote_code` support for the vLLM and HF backends

Contributor Author

Do you mean a dedicated section with LLM-related enhancements, or should those be listed here under What's Changed?

Member

Yes, it's worth calling it out separately. cc @sivanantha321 to check whether the list of changes is correct.

Member

Ray is now an optional dependency and the way it is integrated has changed. It is worth mentioning this as a breaking change. kserve/kserve#3834
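
For SDK users this presumably means installing Ray explicitly as an extra, e.g. something like `pip install kserve[ray]`; the exact extra name is an assumption here and should be confirmed against kserve/kserve#3834.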
