[Harbor] Add rate limit for trials/sec and max concurrency by CharlieFRuan · Pull Request #1074 · NovaSky-AI/SkyRL

CharlieFRuan · 2026-02-11T20:18:14Z

We add a rate_limiter.py implementation that supports "N trajectories/second" rate throttling and maximum concurrent trajectories throttling.

This may be needed when the environment (Daytona/Modal in Harbor's case) might have restrictions on the maximum number of sandbox running concurrently, or the sandbox creation rate; or we'd want smaller pressure on the inference engine (so that there's not much KV eviction happening)

We put the rate_limiter.py in a general util folder since it can be compatible with other custom generators.

gemini-code-assist

Code Review

The pull request introduces a robust rate and concurrency limiter for trajectory submissions, which is a valuable addition for managing environment constraints. The implementation uses a token bucket algorithm for rate limiting and an asyncio.Semaphore for concurrency control. I have provided feedback on the manual lock management in the AsyncRateLimiter class, which can be simplified to improve maintainability and robustness against cancellation issues.

skyrl-train/skyrl_train/utils/rate_limiter.py

devin-ai-integration

Devin Review found 1 potential issue.

View 5 additional findings in Devin Review.

skyrl-train/skyrl_train/utils/rate_limiter.py

devin-ai-integration

Devin Review found 1 new potential issue.

View 9 additional findings in Devin Review.

devin-ai-integration · 2026-02-11T20:52:45Z

skyrl-train/examples/terminal_bench/generator/terminal_bench_generator.py

+        # Initialize rate limiter
+        rate_limit_config = terminal_bench_cfg.get("rate_limit", None)
+        self._rate_limiter = create_rate_limiter(rate_limit_config)


🔴 SkyRL-specific rate_limit config leaks into Harbor's TrialConfig.model_validate(), potentially causing validation errors

When a user enables rate limiting by adding a rate_limit section to their terminal bench YAML config, this SkyRL-specific field is included in _harbor_config_template (created at terminal_bench_generator.py:63 via OmegaConf.to_container(terminal_bench_cfg, resolve=True)) and then passed to Harbor's TrialConfig.model_validate(config) at terminal_bench_generator.py:169.

Root Cause and Impact

At line 63, the entire terminal_bench_cfg is converted to a plain dict:

self._harbor_config_template = OmegaConf.to_container(terminal_bench_cfg, resolve=True)

At lines 92-94, rate_limit is read from the same config but NOT removed from the template:

rate_limit_config = terminal_bench_cfg.get("rate_limit", None) self._rate_limiter = create_rate_limiter(rate_limit_config)

Later, in terminal_bench_agent_loop at line 166-169, the template (still containing rate_limit) is deep-copied and passed to Harbor:

config = deepcopy(self._harbor_config_template) config["task"] = {"path": prompt} config["agent"]["kwargs"]["session_id"] = uuid4().hex trial_config = TrialConfig.model_validate(config)

If Harbor's TrialConfig Pydantic model is configured with extra='forbid' (a common setting for strict config validation), this will raise a ValidationError on every trial, causing all trajectories to fail. Even with extra='ignore', passing unrelated config to an external library's validator is unintended and fragile.

Impact: When a user enables rate limiting (the primary feature of this PR), every trial could fail with a Pydantic validation error, making the rate limiting feature unusable.

Suggested change

# Initialize rate limiter

rate_limit_config = terminal_bench_cfg.get("rate_limit", None)

self._rate_limiter = create_rate_limiter(rate_limit_config)

# Initialize rate limiter

rate_limit_config = terminal_bench_cfg.get("rate_limit", None)

self._rate_limiter = create_rate_limiter(rate_limit_config)

self._harbor_config_template.pop("rate_limit", None)

Was this helpful? React with 👍 or 👎 to provide feedback.

[Harbor] Add rate limit for trials/sec and max concurrency

0dcee68

CharlieFRuan mentioned this pull request Feb 11, 2026

[skyrl-train][Harbor] Upstream Harbor training code to main skyrl-train #866

Closed

fix lint

7b852be

gemini-code-assist bot reviewed Feb 11, 2026

View reviewed changes

skyrl-train/skyrl_train/utils/rate_limiter.py Outdated Show resolved Hide resolved

devin-ai-integration bot reviewed Feb 11, 2026

View reviewed changes

skyrl-train/skyrl_train/utils/rate_limiter.py Outdated Show resolved Hide resolved

CharlieFRuan added 3 commits February 11, 2026 20:37

address gemini comments

becb21f

devin comment address

e74a3d4

lint

ab84315

devin-ai-integration bot reviewed Feb 11, 2026

View reviewed changes

CharlieFRuan merged commit b5bb81e into NovaSky-AI:main Feb 11, 2026
3 checks passed

CharlieFRuan deleted the pr-rate-limit branch February 11, 2026 20:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Harbor] Add rate limit for trials/sec and max concurrency#1074

[Harbor] Add rate limit for trials/sec and max concurrency#1074
CharlieFRuan merged 5 commits intoNovaSky-AI:mainfrom
CharlieFRuan:pr-rate-limit

CharlieFRuan commented Feb 11, 2026 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

devin-ai-integration bot Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

CharlieFRuan commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

CharlieFRuan commented Feb 11, 2026 •

edited

Loading