Description
What steps did you take:
Reported from other users:
Installed kapp-controller (as a part of a larger product, TAP) on a new GKE cluster.
What happened:
During the installation, the Kubernetes control plane became unavailable for several minutes. This caused package installs to enter a ReconcileFailed state. Eventually, when the API server became available, packages reconciled again to completion.
What did you expect:
The installation works without any control plane unavailability.
Anything else you would like to add:
- This identified behavior may happen on newly provisioned clusters that have not gone through GKE API server autoscaling (API server sizing in GKE is automatic, non-configurable, and is not determined based on size or number of nodes). Once GKE scales up the API server, the current install will continue and any subsequent installs succeed without interruption. Could this be improved by adjusting the kapp-controller's concurrency config (default is 10, what if we reduced it to 5)?
- If it's difficult to replicate with minimal components (such as kapp-controller on its own), try larger distributions (such as TAP).
Environment:
- kapp Controller version: v0.30.0 (latest)
Vote on this request
This is an invitation to the community to vote on issues, to help us prioritize our backlog. Use the "smiley face" up to the right of this comment to vote.
👍 "I would like to see this addressed as soon as possible"
👎 "There are other more important things to focus on right now"
We are also happy to receive and review Pull Requests if you want to help working on this issue.
Metadata
Metadata
Assignees
Labels
Type
Projects
Status