Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

basic checking of container resources #59

Merged
merged 2 commits into from
Sep 16, 2024

Conversation

dgrove-oss
Copy link
Collaborator

@dgrove-oss dgrove-oss commented Sep 13, 2024

(base) dgrove@Dave's IBM Mac cluster-checker % node checker.js
CLUSTER QUEUE         GPU QUOTA   GPU USAGE   ADMITTED WORKLOADS   PENDING WORKLOADS
team1-cluster-queue          16           0                    0                   1

Total GPU count in cluster:        24
Unschedulable GPU count:         -  0
Schedulable GPU count:           = 24

Nominal GPU quota:                 16
Slack GPU quota:                 +  8
Total GPU quota:                 = 24

GPU usage by admitted workloads:    0
Borrowed GPU count:                 0

WARNING: workload "team1/appwrapper-sample-job-7fd73" has a container requesting 2T memory
WARNING: workload "team1/appwrapper-sample-job-7fd73" has a container requesting 2 roce_gdr but only 1 GPUs
WARNING: workload "team1/appwrapper-sample-job-7fd73" has a container requesting 20 cpus but only 1 GPUs
WARNING: workload "team1/appwrapper-sample-job-7fd73" has a container requesting 2T memory but only 1 GPUs

@dgrove-oss dgrove-oss merged commit 951c1a7 into project-codeflare:main Sep 16, 2024
1 check passed
@dgrove-oss dgrove-oss deleted the resources branch September 16, 2024 17:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant