fix(agent): treat tool_use/tool_result as atomic groups in history pruning by 5queezer · Pull Request #74 · 5queezer/hrafn

5queezer · 2026-04-02T06:14:32Z

Summary

History pruner Phase 1 (collapse) now handles multi-tool groups atomically: an assistant message followed by N consecutive tool messages is collapsed into a single summary instead of only handling 1:1 pairs
Phase 2 (budget enforcement) drops assistant + N tool messages as a unit, preventing orphaned tool_result messages
emergency_history_trim in history.rs gains the same atomic-group awareness
5 new tests verify the invariant that no tool message exists without a preceding assistant message

Upstream reference

Original PR: fix(agent): treat tool_use/tool_result as atomic groups in history pruning zeroclaw-labs/zeroclaw#4825 by @singlerider
Status: Closed without comment
Changes from original: Identical logic — Hrafn's history.rs and history_pruner.rs share the same codebase structure as zeroclaw. Only the test assertion content format string was adapted (uses em-dash in summary format).

Test plan

cargo fmt --all -- --check
cargo clippy --all-targets -- -D warnings
cargo test (all 11 history_pruner tests pass, full suite green)

🤖 Generated with Claude Code

Summary by CodeRabbit

Bug Fixes
- Enhanced conversation history management to ensure better preservation of interactive exchanges and maintain consistency during message pruning operations.

@singlerider

…uning Prevents orphaned tool_result messages from causing Anthropic 400 errors during tool-heavy conversations with low context budgets. Ported from zeroclaw-labs/zeroclaw#4825 by @singlerider. Original PR was closed without review. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

coderabbitai · 2026-04-02T06:14:46Z

Note

`.coderabbit.yaml` has unrecognized properties

CodeRabbit is using all valid settings from your configuration. Unrecognized properties (listed below) have been ignored and may indicate typos or deprecated fields that can be removed.

⚠️ Parsing warnings (1)

Validation error: Unrecognized key(s) in object: 'tools', 'path_filters', 'review_instructions'

⚙️ Configuration instructions

Please see the configuration documentation for more information.
You can also validate your configuration using the online YAML validator.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

📝 Walkthrough

Walkthrough

These changes modify message history management in two functions to treat assistant messages followed by consecutive tool messages as atomic units during trimming and pruning operations, preventing orphaned tool messages and maintaining tool_use/tool_result adjacency.

Changes

Cohort / File(s)	Summary
History Management `src/agent/history.rs`	Modified `emergency_history_trim` to drop assistant messages together with immediately following consecutive tool messages as atomic groups, preserving tool_use/tool_result adjacency.
History Pruning Logic `src/agent/history_pruner.rs`	Updated `prune_history` to treat assistant+consecutive tool messages as atomic groups in both tool collapsing (phase 1) and budget enforcement (phase 2) phases; adjusted tests to cover multi-tool collapsing and atomic group removal with invariants ensuring no orphaned tool messages.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Description check	⚠️ Warning	The description covers the key changes and test plan, but lacks the required 'What' section with issue closure, 'How to test' section with specific test commands, and the checklist items from the template.	Add a 'What' section linking the issue being closed, provide explicit 'How to test' instructions with cargo test commands, complete the checklist items, and indicate whether there are breaking changes or security implications.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately and concisely summarizes the main change: treating tool_use/tool_result as atomic groups in history pruning, which is the core objective of the PR.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch port/zc-4825-atomic-tool-pruning

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@src/agent/history_pruner.rs`:
- Around line 367-374: The loop enforcing "tool" messages must be immediately
preceded by an "assistant" is too strict for multi-tool groups; update the check
in the iterations over messages (the for (i, m) in messages.iter().enumerate()
loop and the analogous checks at the other spots noted) so that a "tool" is
allowed if the previous message is also a "tool" (i.e., only assert failure if i
== 0 or the previous message exists and is neither "assistant" nor "tool"); in
practice change the assertion to: if m.role == "tool" then ensure i > 0 and
(messages[i-1].role == "assistant" || messages[i-1].role == "tool"), otherwise
panic — apply the same logic to the other two loops referenced (lines ~401-409
and ~468-476) so multi-tool consecutive groups are accepted.
- Around line 153-167: The current assistant-group drop logic in the block
checking if messages[i].role == "assistant" removes the assistant plus all
following "tool" messages without checking the protected/keep_recent boundary;
change it so you only remove the group if every message in that assistant+tool
run has index >= protected (i.e., is unprotected). Concretely, compute the tool
run as you do with tool_count, then verify that i and i+1..i+tool_count are all
>= protected before removing any entries; if any message in that group is
protected, skip dropping the entire group (do not remove partial messages),
leaving messages, dropped_messages, and dropped_any unchanged for that group.
Reference symbols: messages, protected, tool_count, dropped_messages,
dropped_any in the assistant-role removal branch.
- Around line 113-133: The collapse logic currently counts only an unprotected
prefix of consecutive tool messages and then replaces the assistant with a
summary while leaving protected tool messages after that summary; change the
algorithm so you first scan the full consecutive tool run (starting at i+1) to
determine its length, then check whether every message in that run is
unprotected before performing the collapse; if any tool in the run is protected,
do not replace messages[i] or remove any messages in that run. Update the code
around variables/messages: messages[i], tool_count, protected, collapsed_pairs
and the ChatMessage replacement so collapsing happens only when the entire tool
run is unprotected.

In `@src/agent/history.rs`:
- Around line 92-103: The removal loop for assistant+tool groups can split a
tool-result group at the keep_recent cutoff; to fix, compute the cutoff as
history.len().saturating_sub(keep_recent) and only count/remove the assistant
plus trailing tool messages if the entire group lies strictly before that
cutoff: in the block using variables history, i, tool_count, keep_recent and
dropped, advance tool_count as you do but then check if (i + 1 + tool_count) <=
cutoff (or that the last tool index < cutoff) before performing the for-loop of
history.remove; if the group crosses into the protected suffix, skip removal
(and advance i appropriately) so you never orphan the protected suffix or
increment dropped for a partial group.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: a802f5bc-7e1b-4e24-b445-343ffc9f1036

📥 Commits

Reviewing files that changed from the base of the PR and between 2a7e20e and d2e687a.

📒 Files selected for processing (2)

src/agent/history.rs
src/agent/history_pruner.rs

coderabbitai · 2026-04-02T06:25:57Z

+            if messages[i].role == "assistant" && !protected[i] {
+                // Count consecutive tool messages following this assistant
+                let mut tool_count = 0;
+                while i + 1 + tool_count < messages.len()
+                    && messages[i + 1 + tool_count].role == "tool"
+                    && !protected[i + 1 + tool_count]
+                {
+                    tool_count += 1;
+                }
+                if tool_count > 0 {
+                    let summary =
+                        format!("[Tool exchange: {tool_count} tool call(s) — results collapsed]");
+                    messages[i] = ChatMessage {
+                        role: "assistant".to_string(),
+                        content: summary,
+                    };
+                    for _ in 0..tool_count {
+                        messages.remove(i + 1);
+                    }
+                    collapsed_pairs += tool_count;
+                    continue;


⚠️ Potential issue | 🔴 Critical

Avoid partial collapse when a multi-tool group straddles keep_recent.

Line 116 only counts unprotected tools. If the cutoff lands inside a multi-tool run, Lines 123-131 replace the original assistant with a summary and remove only the unprotected prefix. The remaining protected tool message(s) now follow a synthetic summary instead of the original tool-call payload, which can still violate provider pairing rules.

🛠️ Suggested fix

if messages[i].role == "assistant" && !protected[i] { - // Count consecutive tool messages following this assistant - let mut tool_count = 0; - while i + 1 + tool_count < messages.len() - && messages[i + 1 + tool_count].role == "tool" - && !protected[i + 1 + tool_count] - { - tool_count += 1; - } - if tool_count > 0 { + let mut group_end = i + 1; + while group_end < messages.len() && messages[group_end].role == "tool" { + group_end += 1; + } + + if group_end > i + 1 { + if protected[i + 1..group_end].iter().any(|p| *p) { + i = group_end; + continue; + } + + let tool_count = group_end - i - 1; let summary = format!("[Tool exchange: {tool_count} tool call(s) — results collapsed]"); messages[i] = ChatMessage { role: "assistant".to_string(), content: summary, }; - for _ in 0..tool_count { - messages.remove(i + 1); - } + drop(messages.drain(i + 1..group_end)); collapsed_pairs += tool_count; continue; } }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/agent/history_pruner.rs` around lines 113 - 133, The collapse logic currently counts only an unprotected prefix of consecutive tool messages and then replaces the assistant with a summary while leaving protected tool messages after that summary; change the algorithm so you first scan the full consecutive tool run (starting at i+1) to determine its length, then check whether every message in that run is unprotected before performing the collapse; if any tool in the run is protected, do not replace messages[i] or remove any messages in that run. Update the code around variables/messages: messages[i], tool_count, protected, collapsed_pairs and the ChatMessage replacement so collapsing happens only when the entire tool run is unprotected.

coderabbitai · 2026-04-02T06:25:57Z

+            if messages[i].role == "assistant" {
+                // Count following tool messages — drop as atomic group
+                let mut tool_count = 0;
+                while i + 1 + tool_count < messages.len()
+                    && messages[i + 1 + tool_count].role == "tool"
+                {
+                    tool_count += 1;
+                }
+                if tool_count > 0 {
+                    for _ in 0..=tool_count {
+                        messages.remove(i);
+                    }
+                    dropped_messages += 1 + tool_count;
+                    dropped_any = true;
+                    break;


⚠️ Potential issue | 🟠 Major

Don't let group dropping override keep_recent.

Lines 156-164 remove every consecutive tool after an unprotected assistant without consulting protected. If the recent window starts inside that run, Phase 2 deletes protected tool results just to keep the group atomic, which breaks the keep_recent contract.

🛠️ Suggested fix

if messages[i].role == "assistant" { - // Count following tool messages — drop as atomic group - let mut tool_count = 0; - while i + 1 + tool_count < messages.len() - && messages[i + 1 + tool_count].role == "tool" - { - tool_count += 1; - } - if tool_count > 0 { - for _ in 0..=tool_count { - messages.remove(i); - } - dropped_messages += 1 + tool_count; + let mut group_end = i + 1; + while group_end < messages.len() && messages[group_end].role == "tool" { + group_end += 1; + } + if group_end > i + 1 { + if protected[i + 1..group_end].iter().any(|p| *p) { + i = group_end; + continue; + } + + dropped_messages += group_end - i; + drop(messages.drain(i..group_end)); dropped_any = true; break; } }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

if messages[i].role == "assistant" {

// Count following tool messages — drop as atomic group

let mut tool_count = 0;

while i + 1 + tool_count < messages.len()

&& messages[i + 1 + tool_count].role == "tool"

{

tool_count += 1;

}

if tool_count > 0 {

for _ in 0..=tool_count {

messages.remove(i);

}

dropped_messages += 1 + tool_count;

dropped_any = true;

break;

if messages[i].role == "assistant" {

let mut group_end = i + 1;

while group_end < messages.len() && messages[group_end].role == "tool" {

group_end += 1;

}

if group_end > i + 1 {

if protected[i + 1..group_end].iter().any(|p| *p) {

i = group_end;

continue;

}

dropped_messages += group_end - i;

drop(messages.drain(i..group_end));

dropped_any = true;

break;

}

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/agent/history_pruner.rs` around lines 153 - 167, The current assistant-group drop logic in the block checking if messages[i].role == "assistant" removes the assistant plus all following "tool" messages without checking the protected/keep_recent boundary; change it so you only remove the group if every message in that assistant+tool run has index >= protected (i.e., is unprotected). Concretely, compute the tool run as you do with tool_count, then verify that i and i+1..i+tool_count are all >= protected before removing any entries; if any message in that group is protected, skip dropping the entire group (do not remove partial messages), leaving messages, dropped_messages, and dropped_any unchanged for that group. Reference symbols: messages, protected, tool_count, dropped_messages, dropped_any in the assistant-role removal branch.

coderabbitai · 2026-04-02T06:25:57Z

+        for (i, m) in messages.iter().enumerate() {
+            if m.role == "tool" {
+                assert!(
+                    i > 0 && messages[i - 1].role == "assistant",
+                    "tool message at index {i} has no preceding assistant"
+                );
+            }
+        }


⚠️ Potential issue | 🟡 Minor

The orphan invariant is too strict for multi-tool groups.

This PR explicitly supports assistant + N consecutive tool exchanges, but these loops require every remaining tool message to be immediately preceded by assistant. That will fail on a valid protected multi-tool group and makes it impossible to add the boundary-straddle regression that would catch the current keep_recent bug.

Also applies to: 401-409, 468-476

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/agent/history_pruner.rs` around lines 367 - 374, The loop enforcing "tool" messages must be immediately preceded by an "assistant" is too strict for multi-tool groups; update the check in the iterations over messages (the for (i, m) in messages.iter().enumerate() loop and the analogous checks at the other spots noted) so that a "tool" is allowed if the previous message is also a "tool" (i.e., only assert failure if i == 0 or the previous message exists and is neither "assistant" nor "tool"); in practice change the assertion to: if m.role == "tool" then ensure i > 0 and (messages[i-1].role == "assistant" || messages[i-1].role == "tool"), otherwise panic — apply the same logic to the other two loops referenced (lines ~401-409 and ~468-476) so multi-tool consecutive groups are accepted.

coderabbitai · 2026-04-02T06:25:57Z

+        } else if history[i].role == "assistant" {
+            // Count following tool messages — drop as atomic group
+            let mut tool_count = 0;
+            while i + 1 + tool_count < history.len().saturating_sub(keep_recent)
+                && history[i + 1 + tool_count].role == "tool"
+            {
+                tool_count += 1;
+            }
+            for _ in 0..=tool_count {
+                history.remove(i);
+                dropped += 1;
+            }


⚠️ Potential issue | 🔴 Critical

Don't split a tool group at the keep_recent boundary.

When keep_recent starts between two tool results, Line 95 stops counting at the cutoff but Lines 100-103 still remove the assistant and the unprotected prefix. That leaves the protected suffix orphaned, and src/agent/loop_.rs:2800-2840 retries immediately on dropped > 0, so the overflow-recovery path can still resend invalid history.

🛠️ Suggested fix

} else if history[i].role == "assistant" { - // Count following tool messages — drop as atomic group - let mut tool_count = 0; - while i + 1 + tool_count < history.len().saturating_sub(keep_recent) - && history[i + 1 + tool_count].role == "tool" - { - tool_count += 1; - } - for _ in 0..=tool_count { - history.remove(i); - dropped += 1; - } + let mut group_end = i + 1; + while group_end < history.len() && history[group_end].role == "tool" { + group_end += 1; + } + + let recent_start = history.len().saturating_sub(keep_recent); + if group_end > recent_start { + i = group_end; + continue; + } + + dropped += group_end - i; + drop(history.drain(i..group_end)); } else {

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/agent/history.rs` around lines 92 - 103, The removal loop for assistant+tool groups can split a tool-result group at the keep_recent cutoff; to fix, compute the cutoff as history.len().saturating_sub(keep_recent) and only count/remove the assistant plus trailing tool messages if the entire group lies strictly before that cutoff: in the block using variables history, i, tool_count, keep_recent and dropped, advance tool_count as you do but then check if (i + 1 + tool_count) <= cutoff (or that the last tool index < cutoff) before performing the for-loop of history.remove; if the group crosses into the protected suffix, skip removal (and advance i appropriately) so you never orphan the protected suffix or increment dropped for a partial group.

github-actions Bot added the agent label Apr 2, 2026

coderabbitai Bot reviewed Apr 2, 2026

View reviewed changes

5queezer merged commit 06bc5ab into master Apr 2, 2026
22 checks passed

5queezer mentioned this pull request Apr 2, 2026

fix(agent): prevent multi-tool group splitting at protection boundaries #79

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): treat tool_use/tool_result as atomic groups in history pruning#74

fix(agent): treat tool_use/tool_result as atomic groups in history pruning#74
5queezer merged 1 commit intomasterfrom
port/zc-4825-atomic-tool-pruning

5queezer commented Apr 2, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Apr 2, 2026 •

edited

Loading

`.coderabbit.yaml` has unrecognized properties

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot Apr 2, 2026

Uh oh!

coderabbitai Bot Apr 2, 2026

Uh oh!

coderabbitai Bot Apr 2, 2026

Uh oh!

coderabbitai Bot Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

5queezer commented Apr 2, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Upstream reference

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

.coderabbit.yaml has unrecognized properties

Walkthrough

Changes

Estimated code review effort

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

5queezer commented Apr 2, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Apr 2, 2026 •

edited

Loading

`.coderabbit.yaml` has unrecognized properties