Commit

Merge branch 'GoogleCloudPlatform:main' into updates
vivianrwu authored Oct 9, 2024
2 parents 728e283 + b0588cc commit da99beb
Showing 2 changed files with 2 additions and 1 deletion.
@@ -51,6 +51,7 @@ spec:
       - name: vllm
         ports:
         - containerPort: 80
+          name: metrics
         image: "vllm/vllm-openai:v0.5.5"
         command: ["python3", "-m", "vllm.entrypoints.openai.api_server"]
         args: ["--model", "${model_id}", "--tensor-parallel-size", "${gpu_count}", "--port", "80", "--swap-space", "${swap_space}", "--disable-log-requests"]
@@ -8,5 +8,5 @@ spec:
     matchLabels:
       app: vllm
   endpoints:
-  - port: 80
+  - port: metrics
     interval: 15s
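
For context, the two hunks pair up: the container port gets the name `metrics`, and the scrape endpoint then refers to that name instead of the literal `80`. A minimal sketch of what the monitoring manifest might look like after this commit is shown below. The `apiVersion`, `kind`, and `metadata.name` are assumptions (the diff only shows the `spec` portion), written here as a Google Cloud Managed Service for Prometheus `PodMonitoring` resource.

```yaml
# Sketch only: apiVersion, kind, and metadata are assumed, not taken from the diff.
apiVersion: monitoring.googleapis.com/v1
kind: PodMonitoring
metadata:
  name: vllm-podmonitoring   # hypothetical name
spec:
  selector:
    matchLabels:
      app: vllm              # matches the pods running the vllm container
  endpoints:
  - port: metrics            # must match the containerPort's `name` in the pod spec
    interval: 15s
```

With a named port, the monitoring manifest would not need to change if the container's port number changes later, as long as the port keeps the name `metrics`.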
