
rpk: add high disk usage report on 'cluster health'#29480

Merged
r-vasquez merged 1 commit into redpanda-data:dev from r-vasquez:disk-on-health-report
Feb 7, 2026


@r-vasquez

Contributor

The high disk usage field in the health report was introduced in 5b5286f and already released in 25.3.5; this PR surfaces it in rpk cluster health.

Solves INC-1051

Example

$ rpk cluster health
CLUSTER HEALTH OVERVIEW
=======================
Healthy:                          false
Unhealthy reasons:                [high_disk_usage_nodes]
Controller ID:                    0
All nodes:                        [0]
Nodes down:                       []
Nodes in recovery mode:           []
Nodes with high disk usage:       [0]
Leaderless partitions (0):        []
Under-replicated partitions (0):  []
Cluster UUID:                     1e0e7ba3-b3ca-4e6b-a8ca-453170d6b0af
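
For scripts that consume this text output, the new line can be picked out with a small parser. The following is a minimal sketch, not part of this PR: the function name parseHighDiskUsageNodes is hypothetical, and it assumes the line keeps the label shown above and that node IDs inside the brackets are separated by spaces or commas (only the single-ID form `[0]` appears in the example).

```go
package main

import (
	"bufio"
	"fmt"
	"strconv"
	"strings"
)

// parseHighDiskUsageNodes scans `rpk cluster health` text output and returns
// the node IDs on the "Nodes with high disk usage" line. The bool result is
// false when the line is absent, e.g. output from an rpk without this change.
// Hypothetical helper; the separator handling is an assumption.
func parseHighDiskUsageNodes(output string) ([]int, bool, error) {
	const label = "Nodes with high disk usage:"
	sc := bufio.NewScanner(strings.NewReader(output))
	for sc.Scan() {
		line := strings.TrimSpace(sc.Text())
		if !strings.HasPrefix(line, label) {
			continue
		}
		// Strip the label and the surrounding brackets.
		list := strings.TrimSpace(strings.TrimPrefix(line, label))
		list = strings.TrimSuffix(strings.TrimPrefix(list, "["), "]")
		// Accept either spaces or commas between IDs.
		fields := strings.FieldsFunc(list, func(r rune) bool {
			return r == ' ' || r == ','
		})
		ids := []int{}
		for _, f := range fields {
			id, err := strconv.Atoi(f)
			if err != nil {
				return nil, true, fmt.Errorf("unexpected node ID %q: %w", f, err)
			}
			ids = append(ids, id)
		}
		return ids, true, nil
	}
	return nil, false, sc.Err()
}

func main() {
	sample := "Healthy:                          false\n" +
		"Nodes with high disk usage:       [0]\n"
	ids, found, err := parseHighDiskUsageNodes(sample)
	fmt.Println(ids, found, err) // prints: [0] true <nil>
}
```

Returning a separate "found" flag lets a caller tell an empty list apart from output produced by an rpk version that does not print the line at all.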

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v25.3.x
  • v25.2.x
  • v25.1.x

Release Notes

Features

  • rpk cluster health: high_disk_usage_nodes is now reported.

@r-vasquez r-vasquez requested a review from kbatuigas as a code owner January 30, 2026 21:53
Copilot AI review requested due to automatic review settings January 30, 2026 21:53
@r-vasquez r-vasquez requested a review from a team as a code owner January 30, 2026 21:53
@r-vasquez r-vasquez requested review from c-julin and removed request for a team January 30, 2026 21:53

Copilot AI left a comment

Contributor


Pull request overview

This PR adds support for displaying nodes with high disk usage in the rpk cluster health command output. The feature was previously added in commit 5b5286f and released in version 25.3.5, and this change surfaces that information in the CLI.

Changes:

  • Added HighDiskUsageNodes field to the health response structure
  • Updated the health overview output to display nodes with high disk usage
  • Updated protobuf and dependency versions to support the new field
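
The first two changes can be illustrated roughly as follows. This is a sketch under stated assumptions, not the code from health.go: only the field name HighDiskUsageNodes comes from this PR, while the healthOverview type, its other fields, and the helper names are hypothetical stand-ins. The use of text/tabwriter for column alignment is likewise an assumption about how such output could be produced.

```go
package main

import (
	"fmt"
	"strings"
	"text/tabwriter"
)

// healthOverview is a hypothetical, trimmed-down stand-in for the health
// response. Only HighDiskUsageNodes is named in the PR; the rest is
// illustrative.
type healthOverview struct {
	IsHealthy          bool
	UnhealthyReasons   []string
	AllNodes           []int
	NodesDown          []int
	HighDiskUsageNodes []int
}

// formatNodeList renders node IDs in Go's default slice format, e.g. "[0 3]".
// A nil or empty slice renders as "[]".
func formatNodeList(ids []int) string {
	return fmt.Sprintf("%v", ids)
}

// renderHealthOverview builds an aligned key/value report that includes the
// high disk usage line next to the existing node lists.
func renderHealthOverview(h healthOverview) (string, error) {
	var sb strings.Builder
	tw := tabwriter.NewWriter(&sb, 0, 8, 2, ' ', 0)
	fmt.Fprintf(tw, "Healthy:\t%v\n", h.IsHealthy)
	fmt.Fprintf(tw, "Unhealthy reasons:\t%v\n", h.UnhealthyReasons)
	fmt.Fprintf(tw, "All nodes:\t%s\n", formatNodeList(h.AllNodes))
	fmt.Fprintf(tw, "Nodes down:\t%s\n", formatNodeList(h.NodesDown))
	fmt.Fprintf(tw, "Nodes with high disk usage:\t%s\n", formatNodeList(h.HighDiskUsageNodes))
	if err := tw.Flush(); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	out, err := renderHealthOverview(healthOverview{
		IsHealthy:          false,
		UnhealthyReasons:   []string{"high_disk_usage_nodes"},
		AllNodes:           []int{0},
		HighDiskUsageNodes: []int{0},
	})
	if err != nil {
		panic(err)
	}
	fmt.Print(out)
}
```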

Reviewed changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| src/go/rpk/pkg/cli/cluster/health.go | Added HighDiskUsageNodes field to struct and print logic to display nodes with high disk usage |
| src/go/rpk/go.mod | Updated dependencies including protobuf definitions and rpadmin library to versions that support high disk usage reporting |
| buf.gen.yaml | Updated core module reference to newer version containing high disk usage definitions |

@r-vasquez r-vasquez force-pushed the disk-on-health-report branch from 1b3170a to 9b48d17 Compare February 6, 2026 23:09
@vbotbuildovich

Collaborator

CI test results

test results on build#80350
| test_class | test_method | test_arguments | test_kind | job_url | test_status | passed | reason | test_history |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| NodesDecommissioningTest | test_decommission_status | null | integration | https://buildkite.com/redpanda/redpanda/builds/80350#019c353f-ded9-4343-b788-9fdb6adec6ad | FLAKY | 10/11 | Test PASSES after retries. No significant increase in flaky rate (baseline=0.0484, p0=1.0000, reject_threshold=0.0100. adj_baseline=0.1382, p1=0.2260, trust_threshold=0.5000) | https://redpanda.metabaseapp.com/dashboard/87-tests?tab=142-dt-individual-test-history&test_class=NodesDecommissioningTest&test_method=test_decommission_status |

@r-vasquez r-vasquez merged commit 103c225 into redpanda-data:dev Feb 7, 2026
24 of 28 checks passed
@vbotbuildovich

Collaborator

/backport v25.3.x

@vbotbuildovich

Collaborator

Failed to create a backport PR to v25.3.x branch. I tried:

git remote add upstream https://github.com/redpanda-data/redpanda.git
git fetch --all
git checkout -b backport-pr-29480-v25.3.x-647 remotes/upstream/v25.3.x
git cherry-pick -x 9b48d17457

Workflow run logs.

@r-vasquez

Contributor Author

/backport v25.3.x

@vbotbuildovich

Collaborator

Failed to create a backport PR to v25.3.x branch. I tried:

git remote add upstream https://github.com/redpanda-data/redpanda.git
git fetch --all
git checkout -b backport-pr-29480-v25.3.x-829 remotes/upstream/v25.3.x
git cherry-pick -x 9b48d17457

Workflow run logs.
