feat: add gpu numa node attribute#1165
Conversation
Signed-off-by: Erez Freiberger <enoodle@gmail.com>
Signed-off-by: Erez Freiberger <enoodle@gmail.com>
✅ Deploy Preview for dra-driver-nvidia-gpu ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
|
Welcome @enoodle! |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: enoodle The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
Hi @enoodle. Thanks for your PR. I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with Regular contributors should join the org to skip this step. Once the patch is verified, the new status will be reflected by the I understand the commands that are listed here. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
| "k8s.io/utils/ptr" | ||
| ) | ||
|
|
||
| const compatibilityNumaNodeAttribute resourceapi.QualifiedName = "dra.net/numaNode" |
There was a problem hiding this comment.
currently, dra-net, dra-cpu and dra-sriov-vf all seem to use same prefix to advertise this attribute. This could be an intermittent solution until the derived attributes and list-attributes features are GA to allow use cases where topology alignment using numaNode make sense. @varunrsekar WDYT?
There was a problem hiding this comment.
I’m not a fan of the short-term solution. GPU DRA driver does not own the “dra.net/numaNode” attribute. If anything, we should wait for this KEP kubernetes/enhancements#6073 to be merged and use the standardized attribute under the resource.kubernetes.io domain.
There was a problem hiding this comment.
i see that the standard attribute is now considered for 1.37. Thought there was a push back on this in favor of derived attributes feature which would have been only alpha in 1.37. @enoodle if kubernetes/enhancements#6073 is approved, then we can immediately use the same prefix in our driver. We have to propose same with the other drivers too.
What type of PR is this?
/kind feature
What this PR does / why we need it:
This adds numaNode attribute to GPU resource which is similar to the attribute created by https://github.com/kubernetes-sigs/dra-driver-cpu
This will allow creating claims of both CPU and GPU that request to be on the same NUMA node without necessarily involving network cards or depending on the network card to be on the same PCIe as the GPU.
Which issue(s) this PR is related to:
N/A
Special notes for your reviewer:
Does this PR introduce a user-facing change?
Additional documentation (design docs, usage docs, etc.):
Checklist
make check testpasses locallymake check-generatepasses ifapi/changed (CRDs, deepcopy, informers, listers, clientset)make check-modulespasses ifgo.mod/go.sumchangeddeployments/helm) updated if flags, RBAC, or defaults changed