fix(llm): cap retry-after delays by G7CNF · Pull Request #1351 · nearai/ironclaw

G7CNF · 2026-03-18T08:22:20Z

Clamps absurd Retry-After values to avoid effectively unbounded sleeps.

gemini-code-assist · 2026-03-18T08:22:42Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a potential issue where LLM providers could return excessively large Retry-After values, leading to effectively unbounded sleep durations. By introducing a maximum cap of one hour on these delays, the system now gracefully handles such responses, preventing prolonged service interruptions while still respecting reasonable rate-limiting directives.

Highlights

Introduced Retry-After Capping Logic: A new constant MAX_RETRY_AFTER_SECS (set to 3600 seconds or 1 hour) and a cap_retry_after function were added to prevent excessively long retry delays suggested by LLM providers.
Applied Capping to LLM Providers: The cap_retry_after function was integrated into the Retry-After header parsing logic for Anthropic, NearAI Chat, OpenAI Embeddings, and NearAI Embeddings providers, ensuring delays do not exceed the defined maximum.
Enhanced Test Coverage: Existing tests for Retry-After parsing were updated, and new tests were added to verify that large Retry-After values are correctly capped to MAX_RETRY_AFTER_SECS.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a cap on Retry-After delays from LLM providers to prevent excessively long sleep periods. A new function cap_retry_after is added in src/llm/retry.rs along with a MAX_RETRY_AFTER_SECS constant, and this is applied consistently across anthropic_oauth.rs, nearai_chat.rs, and workspace/embeddings.rs where Retry-After headers are parsed. The tests have also been updated to verify this new capping behavior. The changes are well-implemented and address the issue described. I have one suggestion to improve the robustness of the cap_retry_after function.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: be863db005

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

henrypark133

Review: Cap retry-after delays to prevent unbounded sleep

Solid defensive fix. A malicious or malformed Retry-After header could previously cause effectively unbounded sleep — now capped at 3600s.

Positives:

cap_retry_after() centralized in src/llm/retry.rs with clear doc comment
Applied consistently across all 4 parsing sites (anthropic_oauth, nearai_chat, OpenAI embeddings, NearAI embeddings)
Gemini's suggestion to use duration.min() instead of Duration::from_secs(duration.as_secs().min(...)) was a good catch — preserves sub-second precision
Tests cover both edge cases (u64::MAX and 0)

No missed sites, no security concerns. LGTM.

* fix(llm): cap retry-after delays * Update src/llm/retry.rs Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

fix(llm): cap retry-after delays

be863db

github-actions Bot added scope: llm LLM integration scope: workspace Persistent memory / workspace size: M 50-199 changed lines risk: low Changes to docs, tests, or low-risk modules contributor: core 20+ merged PRs labels Mar 18, 2026

gemini-code-assist Bot reviewed Mar 18, 2026

View reviewed changes

Comment thread src/llm/retry.rs Outdated

Update src/llm/retry.rs

0d4c7c4

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

chatgpt-codex-connector Bot reviewed Mar 18, 2026

View reviewed changes

Comment thread src/llm/retry.rs Outdated

henrypark133 approved these changes Mar 18, 2026

View reviewed changes

henrypark133 mentioned this pull request Mar 18, 2026

feat(web): add Omnisearch command palette UI (#1022) #1298

Closed

5 tasks

henrypark133 merged commit bedc71e into nearai:staging Mar 18, 2026
14 checks passed

BrewTestBot mentioned this pull request Mar 19, 2026

ironclaw 0.20.0 Homebrew/homebrew-core#273275

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(llm): cap retry-after delays#1351

fix(llm): cap retry-after delays#1351
henrypark133 merged 2 commits intonearai:stagingfrom
G7CNF:codex/issue-1287-retry-after-cap

G7CNF commented Mar 18, 2026

Uh oh!

gemini-code-assist Bot commented Mar 18, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

henrypark133 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

G7CNF commented Mar 18, 2026

Uh oh!

gemini-code-assist Bot commented Mar 18, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

henrypark133 left a comment

Choose a reason for hiding this comment

Review: Cap retry-after delays to prevent unbounded sleep

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants