Skip to content

Revert daemon status updates from SSA to optimistic concurrency#835

Merged
klueska merged 1 commit into
kubernetes-sigs:mainfrom
klueska:remove-ssa
Jan 28, 2026
Merged

Revert daemon status updates from SSA to optimistic concurrency#835
klueska merged 1 commit into
kubernetes-sigs:mainfrom
klueska:remove-ssa

Conversation

@klueska
Copy link
Copy Markdown
Contributor

@klueska klueska commented Jan 28, 2026

Replace Server-Side Apply (SSA) with traditional Update/UpdateStatus
calls using optimistic concurrency in the compute-domain-daemon. This
aligns with the pre-SSA implementation pattern and simplifies the
conflict resolution model by relying on Kubernetes resource versions.

Key changes:

  • Back out SSA patching in cdstatus.go and cdclique.go, restore direct
    Update/UpdateStatus calls with optimistic concurrency
  • Add ensureCliqueExists() to create CDClique with proper labels and
    owner reference, using workqueue retry logic
  • Rename EnsureNodeInfoInCD to syncNodeInfoToCD for naming symmetry
    with syncDaemonInfoToClique
  • Add ComputeDomain UID label check in cdclique.go to match the UID
    validation pattern in cdstatus.go
  • Add 'update' verb to RBAC for computedomaincliques resource

The CDClique is created with labels:

  • resource.nvidia.com/computeDomain:
  • resource.nvidia.com/computeDomain.cliqueID:

return nil, fmt.Errorf("error getting next available index: %w", err)
}
myDaemon.Index = idx
klog.V(4).Infof("syncDaemonInfoToClique: IMEX daemon not started yet, DNS index collision with %v, picked new index: %d", other, idx)
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

bye bye code

@jgehrcke
Copy link
Copy Markdown
Contributor

I had a quick look at the last commit (the SSA removal). Already looks good.

Signed-off-by: Kevin Klues <kklues@nvidia.com>
Copy link
Copy Markdown
Contributor

@jgehrcke jgehrcke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great!

(full test suite passes)

@klueska klueska merged commit 16a3de4 into kubernetes-sigs:main Jan 28, 2026
16 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants