feat(web): persist tool calls, restore approvals on thread switch, and UI fixes by henrypark133 · Pull Request #382 · nearai/ironclaw

henrypark133 · 2026-02-26T07:46:19Z

Summary

Persist tool calls as role="tool_calls" DB messages between user and assistant rows, rendered as collapsible summaries when loading history
Restore pending approvals on thread switch via pending_approval field in HistoryResponse
Fix UTF-8 truncation panic in result preview truncation (&s[..500] → char-boundary-safe truncate_preview())
Fix turn count query to count only user messages (PG + libSQL)
Show processing indicator (thinking dots) when switching back to a thread mid-inference
Auto-remove approval cards from DOM after user acts on them
Silence stale approval errors instead of showing "No pending approval request"
Remove confusing status bar text — inline activity cards handle visual feedback

Test plan

Send message triggering tool calls, verify tool activity cards appear live
Switch threads and back — tool call summaries persist in history
Restart server, load thread — tool calls load from DB correctly
Trigger tool approval, switch threads, switch back — approval card reappears
Click approve/deny — card shows confirmation then auto-removes after 1.5s
Send message, switch threads mid-processing, switch back — user message + thinking dots visible
Verify no "Done" or "No pending approval request" text in status bar
cargo clippy --all --all-features — zero warnings
cargo test — all tests pass

🤖 Generated with Claude Code

…d UI fixes - Persist tool call summaries as role="tool_calls" DB messages between user and assistant rows. Both build_turns_from_db_messages (server.rs and handlers/chat.rs) parse the user → tool_calls → assistant triple. - Enrich ToolCallInfo with result_preview and error fields; render tool call history in loadHistory with collapsible summaries and CSS. - Fix UTF-8 truncation panic: replace &s[..500] with truncate_preview() that respects char boundaries (4 call sites across 3 modules). - Fix turn count query (PG + libSQL) to count only user messages. - Add PendingApprovalInfo to HistoryResponse so the frontend can re-render approval cards when switching back to an awaiting thread. - Show processing indicator (thinking dots) when loadHistory loads an in-progress turn, so the UI isn't blank mid-inference after a switch. - Auto-remove approval cards from DOM 1.5s after user acts on them. - Silence stale "No pending approval request" errors (return no-op instead of error when thread is no longer in AwaitingApproval state). - Remove status bar text for normal processing flow; inline activity cards now handle all visual feedback. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

gemini-code-assist · 2026-02-26T07:46:45Z

Summary of Changes

Hello @henrypark133, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the user experience and robustness of the web interface by improving how tool calls and approvals are handled. It introduces persistent storage for tool call summaries, ensuring that conversational context is maintained across sessions and thread switches. Additionally, it refines the approval workflow, making it more intuitive and less error-prone, while also addressing several UI and backend stability issues, such as UTF-8 truncation and accurate turn counting.

Highlights

Tool Call Persistence: Tool call details are now stored in the database as 'role="tool_calls"' messages, appearing as collapsible summaries in the UI history.
Approval State Restoration: Pending tool approvals are now correctly restored and displayed when switching between threads.
Robust Text Truncation: Implemented a character-boundary-safe truncation function to prevent panics when previewing UTF-8 strings.
Accurate Turn Counting: The logic for counting conversation turns has been updated to only consider user messages, providing a more accurate representation.
Enhanced UI Feedback: A 'thinking dots' processing indicator is now shown when returning to a thread that is still mid-inference.
Streamlined Approval Workflow: Approval cards automatically disappear from the UI shortly after a user acts on them.
Graceful Error Handling: Stale or duplicate approval requests are now silently ignored instead of displaying an error message.
Simplified Status Bar: Redundant status bar text has been removed, as inline activity cards now provide sufficient visual feedback.

Changelog

src/agent/thread_ops.rs
- Added truncate_preview function for safe string truncation.
- Modified filter_map to ignore tool_calls role messages when building LLM conversation context.
- Introduced logic to capture and persist tool_calls after completing a turn.
- Implemented persist_tool_calls function to save tool call summaries to the database.
- Changed handle_approval_action to return an ok_with_message for stale approvals, silencing previous error messages.
- Integrated persist_tool_calls into the agentic loop's response handling.
src/channels/web/handlers/chat.rs
- Added truncate_preview function.
- Updated chat_history_handler to check for pending_approval when loading in-memory threads.
- Extended TurnInfo serialization to include result_preview and error for tool calls, and added pending_approval to HistoryResponse.
- Modified build_turns_from_db_messages to parse and include tool_calls messages from the database, supporting user -> tool_calls -> assistant patterns.
- Adjusted turn_count calculation to count only user messages.
- Added new test cases for build_turns_from_db_messages covering tool calls and malformed JSON.
src/channels/web/server.rs
- Added truncate_preview function.
- Updated chat_history_handler to check for pending_approval when loading in-memory threads.
- Extended TurnInfo serialization to include result_preview and error for tool calls, and added pending_approval to HistoryResponse.
- Modified build_turns_from_db_messages to parse and include tool_calls messages from the database, supporting user -> tool_calls -> assistant patterns.
- Adjusted turn_count calculation to count only user messages.
src/channels/web/static/app.js
- Removed setStatus calls, relying on inline activity.
- Added setTimeout to automatically remove approval cards after 1.5 seconds.
- Modified loadHistory to display tool_calls summaries and re-render pending_approval cards.
- Implemented logic to show a 'Processing...' indicator if a thread is mid-inference.
- Added addToolCallsSummary and createToolCallsSummaryElement functions to render collapsible tool call summaries.
src/channels/web/static/style.css
- Added new CSS rules for .tool-calls-summary, .tool-calls-header, .tool-calls-list, .tool-call-item, .tool-call-name, .tool-call-preview, .tool-call-error-text, and .tool-error to style the new tool call summary UI.
src/channels/web/types.rs
- Added result_preview and error fields to ToolCallInfo struct.
- Added pending_approval field to HistoryResponse struct.
- Defined new PendingApprovalInfo struct to represent pending approval details.
src/db/libsql/conversations.rs
- Updated the message_count query to specifically count only messages with role = 'user'.
src/history/store.rs
- Updated the message_count query to specifically count only messages with role = 'user'.

Activity

Sent message triggering tool calls, verified tool activity cards appear live.
Switched threads and back, confirmed tool call summaries persist in history.
Restarted server, loaded thread, verified tool calls load from DB correctly.
Triggered tool approval, switched threads, switched back, confirmed approval card reappears.
Clicked approve/deny, observed card showing confirmation then auto-removing after 1.5s.
Sent message, switched threads mid-processing, switched back, confirmed user message + thinking dots visible.
Verified no 'Done' or 'No pending approval request' text in status bar.
cargo clippy --all --all-features executed with zero warnings.
cargo test executed with all tests passing.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request introduces several valuable features and fixes, including persisting tool calls, restoring pending approvals on thread switch, and fixing a UTF-8 truncation panic. The implementation of these features is generally solid, and the addition of related tests is appreciated.

My main feedback concerns code duplication. The truncate_preview utility function has been added to multiple files, and there's significant duplication of request handlers. While the truncate_preview function should be moved to a shared utility, the larger refactoring of request handlers should be deferred to a follow-up task, aligning with our guidelines for PR scope. Please see the specific comments for details.

gemini-code-assist · 2026-02-26T07:51:24Z

+/// Truncate a string to at most `max_bytes` bytes at a char boundary, appending "...".
+fn truncate_preview(s: &str, max_bytes: usize) -> String {
+    if s.len() <= max_bytes {
+        return s.to_string();
+    }
+    // Walk backwards from max_bytes to find a valid char boundary
+    let mut end = max_bytes;
+    while !s.is_char_boundary(end) {
+        end -= 1;
+    }
+    format!("{}...", &s[..end])
+}


This truncate_preview function is also defined in src/channels/web/handlers/chat.rs and src/channels/web/server.rs. To improve maintainability and avoid code duplication, consider moving this function to a shared utility module so it can be reused across the codebase.

References

Consolidate related sequences of operations, such as creating, persisting, and scheduling a job, into a single reusable method to improve code consistency and maintainability.

gemini-code-assist · 2026-02-26T07:51:24Z

+/// Truncate a string to at most `max_bytes` bytes at a char boundary, appending "...".
+fn truncate_preview(s: &str, max_bytes: usize) -> String {
+    if s.len() <= max_bytes {
+        return s.to_string();
+    }
+    let mut end = max_bytes;
+    while !s.is_char_boundary(end) {
+        end -= 1;
+    }
+    format!("{}...", &s[..end])
+}


This file, src/channels/web/server.rs, appears to contain a significant amount of duplicated code from src/channels/web/handlers/chat.rs. For example, the truncate_preview function, chat_history_handler, and build_turns_from_db_messages are present in both files with identical changes in this PR. While addressing this duplication is important for maintainability, given that the primary goal of this pull request is to achieve correctness and feature parity, this large structural refactoring should be deferred and tracked as a follow-up task.

References

Defer large structural refactorings, like breaking down a large file, when the primary goal of a pull request is to achieve correctness and feature parity with an existing implementation. The refactoring should be tracked as a follow-up task.

serrrfirat

Summary

This is a well-structured PR that adds three valuable features: persisting tool call summaries to DB, restoring pending approvals on thread switch, and several UI polish fixes. The code is generally correct, handles edge cases well (malformed JSON, legacy data, stale approvals), and includes good test coverage for the new DB message parsing logic.

The most significant concern is the massive code duplication between server.rs and handlers/chat.rs — the same handlers, parsing logic, and utility functions exist in both files and must be kept in manual sync. This PR correctly applies changes to both copies, but this is a fragile arrangement that will eventually lead to divergence bugs. The truncate_preview function alone is copy-pasted three times.

The correctness changes are sound: the UTF-8 truncation fix prevents a real panic, the turn count query change correctly adapts to the new tool_calls message role, and the approval silencing is a reasonable UX trade-off (though logging the suppression would improve debuggability). The build_turns_from_db_messages parser handles all three message patterns gracefully and degrades well on malformed data.

Overall: approve with the strong recommendation to deduplicate the server.rs/handlers/chat.rs code and add tests for truncate_preview.

serrrfirat · 2026-02-26T08:46:55Z

    Ok(bound_addr)
 }

+/// Truncate a string to at most `max_bytes` bytes at a char boundary, appending "...".


truncate_preview is copy-pasted identically in three files

The truncate_preview function is defined with the exact same implementation in src/agent/thread_ops.rs, src/channels/web/handlers/chat.rs, and src/channels/web/server.rs. Any future fix (e.g., handling of the max_bytes=0 edge case, or adding grapheme cluster awareness) must be applied to all three copies. This is a divergence bug waiting to happen.

Suggested fix:

Extract `truncate_preview` into a shared utility module (e.g., `src/util.rs` or `src/channels/web/util.rs`) and import it from all three call sites.

Severity: medium · Confidence: high

Fixed in 06786c6 — extracted truncate_preview into src/channels/web/util.rs and removed all three copies. The shared version also adds an end > 0 guard for the zero-max-bytes edge case.

serrrfirat · 2026-02-26T08:46:55Z

@@ -730,12 +742,13 @@ async fn chat_history_handler(
            turns,


chat_history_handler and build_turns_from_db_messages are fully duplicated between server.rs and handlers/chat.rs

The entire chat_history_handler, build_turns_from_db_messages, and chat_threads_handler functions exist in both src/channels/web/server.rs and src/channels/web/handlers/chat.rs with identical logic. This PR correctly applied the same changes (pending_approval, tool_calls parsing, turn_count fix) to both copies, but this is extremely fragile — a future change applied to only one copy would silently create an inconsistency. The two handlers appear to be reached via different routing paths, making the divergence especially dangerous since bugs would be path-dependent.

Suggested fix:

Consolidate the handlers into one module (handlers/chat.rs is the better home) and route from server.rs to the shared implementation. If both entry points exist for a legitimate reason (e.g., different middleware stacks), at minimum extract `build_turns_from_db_messages` into a shared function since it's pure logic with no dependencies on the handler context.

Severity: high · Confidence: high

Fixed in 06786c6 — extracted build_turns_from_db_messages into src/channels/web/util.rs and both server.rs and handlers/chat.rs now import from the shared module. chat_history_handler still has two copies since handlers/chat.rs isn't wired up yet (it's behind #[allow(dead_code)] as an in-progress migration) — will consolidate fully when that migration lands.

serrrfirat · 2026-02-26T08:46:55Z


            if thread.state != ThreadState::AwaitingApproval {
-                return Ok(SubmissionResult::error("No pending approval request."));
+                // Stale or duplicate approval (tool already executed) — silently ignore.


Silencing stale approval errors masks real bugs

Changing SubmissionResult::error("No pending approval request.") to SubmissionResult::ok_with_message("") silences the response when a user submits an approval for a thread not in AwaitingApproval state or when take_pending_approval() returns None. While this improves UX for the common stale/duplicate approval case, it also means that if there's a real bug that causes approval state to be lost (e.g., a race condition in session management), the error signal is completely suppressed. The user's approval action silently succeeds with an empty message, and the tool never executes.

Suggested fix:

Consider logging at `debug!` or `info!` level when a stale approval is silently ignored, so the suppression is at least observable in logs. E.g.: `tracing::debug!("Ignoring stale approval for thread {}: state is {:?}", thread_id, thread.state);`

Severity: medium · Confidence: medium

Fixed in 06786c6 — added tracing::debug! with thread_id and current state for both stale approval paths (not in AwaitingApproval state, and no pending approval found).

serrrfirat · 2026-02-26T08:46:55Z

 use crate::error::Error;
 use crate::llm::ChatMessage;

+/// Truncate a string to at most `max_bytes` bytes at a char boundary, appending "...".


No unit test for truncate_preview despite it fixing a known panic

The truncate_preview function was introduced specifically to fix a UTF-8 truncation panic (&s[..500] on multi-byte characters). However, there are no tests verifying the fix works correctly with multi-byte strings. A regression here would reintroduce the panic.

Suggested fix:

Add tests covering: (1) a string shorter than max_bytes returns unchanged, (2) a pure-ASCII string is truncated at exactly max_bytes with '...' appended, (3) a multi-byte string (e.g., containing emoji or CJK characters) is truncated at a valid char boundary without panicking, (4) an empty string returns empty.

Severity: medium · Confidence: high

Fixed in 06786c6 — added 8 unit tests in src/channels/web/util.rs covering: short string (no-op), exact boundary, ASCII truncation, empty string, multi-byte char boundary (€), emoji (🦀), CJK characters (你好世界), and zero max_bytes.

serrrfirat · 2026-02-26T08:46:55Z

+        let store = match self.store() {
+            Some(s) => Arc::clone(s),
+            None => return,
+        };


persist_tool_calls and persist_assistant_response lack transactional ordering guarantee

The persist_tool_calls and persist_assistant_response are called sequentially but each performs its own ensure_conversation + add_conversation_message pair without a wrapping transaction. If the process crashes between persisting tool_calls and assistant response, the DB will have a tool_calls row without a following assistant row. build_turns_from_db_messages handles this gracefully (the turn is marked 'Failed'), but the tool_calls data would be orphaned relative to its turn.

In the normal case, since these are sequential awaits in a single-user gateway, interleaving is unlikely. But it's worth noting the lack of atomicity.

Suggested fix:

Consider wrapping tool_calls + assistant message persistence in a single DB transaction, or document that partial persistence is acceptable and handled by the reader.

Severity: low · Confidence: medium

Acknowledged — the reader (build_turns_from_db_messages) already handles partial persistence gracefully by marking turns without an assistant response as "Failed". Adding a wrapping transaction would require plumbing a transaction API through the Database trait which feels out of scope here. Leaving as-is with the understanding that partial persistence is handled by the reader.

serrrfirat · 2026-02-26T08:46:55Z

+            if let Some(next) = iter.peek()
+                && next.role == "tool_calls"
+            {
+                let tc_msg = iter.next().expect("peeked");


Malformed tool_calls JSON is silently consumed with no logging

In build_turns_from_db_messages, when the tool_calls message content fails to parse as Vec<serde_json::Value>, the tool_calls message is consumed (via iter.next()) but no log or diagnostic is emitted. The turn proceeds with empty tool_calls. While the graceful degradation is correct (and tested), silent data corruption can make debugging difficult.

Suggested fix:

Add a `tracing::warn!` when deserialization fails, including the conversation message ID, so corrupted data is discoverable in logs.

Severity: low · Confidence: medium

Fixed in 06786c6 — build_turns_from_db_messages (now in src/channels/web/util.rs) emits tracing::warn! with the message_id when deserialization fails.

serrrfirat · 2026-02-26T08:46:55Z

+/// Lightweight DTO for a pending tool approval (excludes context_messages).
+#[derive(Debug, Serialize)]
+pub struct PendingApprovalInfo {
+    pub request_id: String,


PendingApprovalInfo is not persisted to DB — implicit assumption

The PendingApprovalInfo is populated from in-memory thread state only. The history response returns pending_approval: None for all DB-sourced paths. This means if the server restarts while a thread is awaiting approval, the approval is silently lost — the user would need to re-send their message. The HistoryResponse struct suggests persistence support with the field, but the behavior is purely in-memory. This assumption should be documented.

Suggested fix:

Add a doc comment on the `pending_approval` field: `/// Only populated from in-memory state; not persisted to DB. Server restart clears pending approvals.`

Severity: low · Confidence: high

Fixed in 06786c6 — added doc comment on the pending_approval field: "Only populated from in-memory state; not persisted to DB. Server restart clears pending approvals."

serrrfirat · 2026-02-26T08:46:55Z

                        has_error: tc.error.is_some(),
+                        result_preview: tc.result.as_ref().map(|r| {
+                            let s = match r {
+                                serde_json::Value::String(s) => s.clone(),


Raw tool error strings exposed to frontend via persisted history

The error field from TurnToolCall is now serialized into the tool_calls DB content and returned to the frontend in ToolCallInfo.error. These error strings originate from e.to_string() on tool execution failures and could contain internal file paths, stack traces, or system-specific details. While these errors are already visible in real-time via SSE tool_completed events, persisting them in the DB and returning them in history responses increases their exposure surface — they're now available long after the original error occurred.

Suggested fix:

Consider truncating or sanitizing error strings before persisting (e.g., `truncate_preview(&error, 200)`) to limit exposure of potentially sensitive internal details.

Severity: low · Confidence: low

Fixed in 06786c6 — error strings are now truncated to 200 bytes via truncate_preview(&error, 200) before persisting to DB.

…dd diagnostics - Extract `truncate_preview` and `build_turns_from_db_messages` into shared `src/channels/web/util.rs`, removing three copies across thread_ops.rs, handlers/chat.rs, and server.rs - Add unit tests for `truncate_preview` (multi-byte, emoji, CJK, edge cases) and `build_turns_from_db_messages` (complete, incomplete, tool calls, malformed JSON, backward compat) - Add `tracing::debug!` when stale approvals are silently ignored - Add `tracing::warn!` when tool_calls JSON fails to parse from DB - Truncate persisted error strings to 200 bytes via `truncate_preview` - Document `pending_approval` field as in-memory only (not persisted to DB) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…d UI fixes (nearai#382)

github-actions Bot added scope: agent Agent core (agent loop, router, scheduler) scope: channel/web Web gateway channel size: XL 500+ changed lines risk: medium Business logic, config, or moderate-risk modules contributor: experienced 6-19 merged PRs labels Feb 26, 2026

style: cargo fmt

d626907

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

github-actions Bot added size: L 200-499 changed lines and removed size: XL 500+ changed lines labels Feb 26, 2026

gemini-code-assist Bot reviewed Feb 26, 2026

View reviewed changes

serrrfirat requested changes Feb 26, 2026

View reviewed changes

github-actions Bot added size: XL 500+ changed lines and removed size: L 200-499 changed lines labels Feb 26, 2026

serrrfirat added 2 commits February 27, 2026 08:31

Merge branch 'main' into worktree-persist-tool-calls

239ca77

Merge branch 'main' into worktree-persist-tool-calls

41fb044

serrrfirat self-requested a review February 27, 2026 06:43

serrrfirat approved these changes Feb 27, 2026

View reviewed changes

serrrfirat merged commit ddd01a6 into main Feb 27, 2026
14 checks passed

serrrfirat deleted the worktree-persist-tool-calls branch February 27, 2026 13:46

This was referenced Feb 27, 2026

chore: release v0.13.0 #385

Merged

🦞 OpenClaw 生态日报 2026-02-28 duanyytop/agents-radar#24

Open

🦞 OpenClaw 生态日报 2026-02-28 duanyytop/agents-radar#28

Open

bkutasi pushed a commit to bkutasi/ironclaw that referenced this pull request Mar 28, 2026

feat(web): persist tool calls, restore approvals on thread switch, an…

b2f0bdd

…d UI fixes (nearai#382)

Conversation

henrypark133 commented Feb 26, 2026

Summary

Test plan

Uh oh!

gemini-code-assist Bot commented Feb 26, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Feb 26, 2026

Choose a reason for hiding this comment

Uh oh!

serrrfirat left a comment

Choose a reason for hiding this comment

Summary

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants