-
Notifications
You must be signed in to change notification settings - Fork 509
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[k8s] SkyServe on Kubernetes #3377
Conversation
…o serve_k8s_playground # Conflicts: # sky/clouds/kubernetes.py # sky/serve/core.py
…o serve_k8s_playground # Conflicts: # sky/cli.py
…o serve_k8s_playground
…t-org/skypilot into serve_k8s_playground # Conflicts: # sky/serve/core.py # sky/serve/replica_managers.py
Thanks for the reviews @Michaelvll! Ready for another look. If it looks ok at a high level, I'll start running smoke tests. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the update @romilbhardwaj and sorry for the delay! LGTM. Could you help resolve the conflict? If the tests work fine, I think it is good to go.
@@ -308,11 +308,12 @@ Let's bring up a real LLM chat service with FastChat + Vicuna. We'll use the `Vi | |||
conda activate chatbot | |||
|
|||
echo 'Starting controller...' | |||
python -u -m fastchat.serve.controller > ~/controller.log 2>&1 & |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: should we also mention serving on kubernetes a bit in the serving doc and refer to the cloud-permissions for the setup?
We may also want to explicitly list out why people should use skyserve on kubernetes than just using deployments, such as:
- easier customization for load balancer and autoscaling policy
- simple interface and easy to setup
- spillover to clouds.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good idea - filed #3518, will submit a separate PR for docs.
…o serve_k8s_final # Conflicts: # sky/backends/backend_utils.py # sky/cli.py # sky/utils/controller_utils.py # sky/utils/schemas.py
Thanks @Michaelvll! Addressed comments. Running smoke and backcompat tests now:
|
Update - had to add
Smoke tests pass now, this is ready to be merged. |
tests/test_yamls/minimal.yaml
Outdated
@@ -2,6 +2,7 @@ name: min | |||
|
|||
setup: | | |||
echo "running setup" | |||
sudo apt-get install -y jq |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would be nice to move this to test_minimal
in test_smoke.py
only. : )
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good point, since this minimal.yaml is also used in a bunch of other tests. moved jq installation to test_minimal.yaml.
Adds SkyServe support on Kubernetes. Replicas and controller can now be run on Kubernetes. Supersedes #3109.
Changes in this PR:
get_endpoints
to backend_utilsget_endpoints
API in serve controller and LBTested:
bash format.sh