Skip to content

[Serve][5/n] Add proxy health and request routing delay metrics#59238

Merged
abrarsheikh merged 10 commits intomasterfrom
59218-abrar-proxy
Dec 18, 2025
Merged

[Serve][5/n] Add proxy health and request routing delay metrics#59238
abrarsheikh merged 10 commits intomasterfrom
59218-abrar-proxy

Conversation

@abrarsheikh
Copy link
Contributor

@abrarsheikh abrarsheikh commented Dec 7, 2025

Signed-off-by: abrar <abrar@anyscale.com>
@abrarsheikh abrarsheikh added the go add ONLY when ready to merge, run all tests label Dec 7, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces several new metrics to improve the observability of Ray Serve. Specifically, it adds a histogram to measure the routing stats propagation delay from replicas to the controller, a gauge to monitor proxy health status, and another gauge to track when a proxy is in a draining state. The changes are implemented across deployment_state.py, proxy.py, and proxy_state.py, and are accompanied by new tests in test_metrics.py and test_metrics_2.py. The implementation is solid and the new metrics are a great addition. I've identified a minor issue in one of the new tests that could lead to flakiness and have suggested a correction.

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: abrar <abrar@anyscale.com>
@abrarsheikh abrarsheikh marked this pull request as ready for review December 16, 2025 05:33
@abrarsheikh abrarsheikh requested review from a team as code owners December 16, 2025 05:33
@ray-gardener ray-gardener bot added serve Ray Serve Related Issue observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling labels Dec 16, 2025
Signed-off-by: abrar <abrar@anyscale.com>
@abrarsheikh abrarsheikh enabled auto-merge (squash) December 18, 2025 18:10
@abrarsheikh abrarsheikh merged commit 224838f into master Dec 18, 2025
7 checks passed
@abrarsheikh abrarsheikh deleted the 59218-abrar-proxy branch December 18, 2025 19:28
Yicheng-Lu-llll pushed a commit to Yicheng-Lu-llll/ray that referenced this pull request Dec 22, 2025
peterxcli pushed a commit to peterxcli/ray that referenced this pull request Feb 25, 2026
…project#59238)

ray-project#59218

---------

Signed-off-by: abrar <abrar@anyscale.com>
Signed-off-by: peterxcli <peterxcli@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

go add ONLY when ready to merge, run all tests observability Issues related to the Ray Dashboard, Logging, Metrics, Tracing, and/or Profiling serve Ray Serve Related Issue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants