-
Notifications
You must be signed in to change notification settings - Fork 109
Rename GPU related parameters and split CUDACapability classad in two (min, max) #12416
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Jenkins results:
|
Jenkins results:
|
Jenkins results:
|
Jenkins results:
|
Classads are injected just fine, but the macros feature does not work because it requires at least HTCondor 23.8.1, and we are using 23.0.3
EDIT: Okay, it looks like my testbed agent is the one with 23.0.3 for some reason, other testbed agents and productions agents seem to work with 24.0.6 which should be okay. I will test on a different agent. |
@amaltaro I finished my test. Everything works well with 24.0.6. Here are the classads injected:
And here is the Requirements expression changed:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome! Thank you, @khurtado
ad['My.DESIRED_GPUMemoryMB'] = str(job['gpuRequirements']['GPUMemoryMB']) | ||
# CUDACapabilities is a list of strings, with each string matching this regex: r"^\d+.\d$" | ||
# E.g.: ["1.0", "10.0", "2.1"] | ||
cudaCapabilities = sorted(job['gpuRequirements']['CUDACapabilities'], key=float) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
neat!
@khurtado actually, can you please provide a short summary in the PR description? |
@amaltaro I have updated the description and the documentation: |
Thank you, Kenyi! |
Fixes #11942
Status
Ready
Description
Rename GPU related parameters and split CUDACapability, following #11942 (comment)
A summary of the changes:
Also, following: https://htcondor.readthedocs.io/en/latest/man-pages/condor_submit.html#gpus_minimum_capability , we need to:
The above will create this classad automatically from the macros above:
and the above expression will be added to the general
Requirements
expression automatically as well.HTCondor versions earlier than August 2024 won't append these expressions to the Requirements expressions. This works well with HTCondor 24.0.6 (used in production).
Is it backward compatible (if not, which system it affects?)
YES