Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement how we gather GPU types. #49

Merged
merged 2 commits into from
Mar 26, 2024

Conversation

naved001
Copy link
Collaborator

Ths cluster has multiple types of GPUs. To determine the type of GPUs we query the node labels where a pod requesting GPU was run, from that we can find exact model of the GPU.

This commit updates the query that to get the node labels when we gather GPU usage. The merge metrics method is updated to keep the gpu_type. The GPU type for lenovo A100s is added and a corresponding SU is also created.

Ths cluster has multiple types of GPUs. To determine the type of GPUs we
query the node labels where a pod requesting GPU was run, from that we
can find exact model of the GPU.

This commit updates the query that to get the node labels when we gather GPU
usage. The merge metrics method is updated to keep the gpu_type. The GPU type
for lenovo A100s is added and a corresponding SU is also created.
@naved001 naved001 force-pushed the calculate-gpu-costs branch from fe49c3c to 1451c1e Compare March 26, 2024 20:19
…ter nor

do we know what it would be named, so I removing it. We can add it if it's added
to the cluster.

Also update the test to include A100SXM4 instead of A2
@naved001 naved001 force-pushed the calculate-gpu-costs branch from c4c3aa1 to 902162e Compare March 26, 2024 20:42
@naved001 naved001 merged commit 87259b3 into CCI-MOC:main Mar 26, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant