cloudflare
diff --git a/‎.changeset/agent-tools-orchestration.md‎
Lines changed: 11 additions & 0 deletions b/‎.changeset/agent-tools-orchestration.md‎
Lines changed: 11 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 3 additions & 2 deletions b/‎README.md‎
Lines changed: 3 additions & 2 deletions
diff --git a/‎design/AGENTS.md‎
Lines changed: 1 addition & 0 deletions b/‎design/AGENTS.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎design/README.md‎
Lines changed: 1 addition & 0 deletions b/‎design/README.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎design/agent-tools.md‎
Lines changed: 52 additions & 0 deletions b/‎design/agent-tools.md‎
Lines changed: 52 additions & 0 deletions
diff --git a/‎docs/agent-tools.md‎
Lines changed: 184 additions & 0 deletions b/‎docs/agent-tools.md‎
Lines changed: 184 additions & 0 deletions
diff --git a/‎docs/chat-agents.md‎
Lines changed: 3 additions & 1 deletion b/‎docs/chat-agents.md‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎docs/index.md‎
Lines changed: 1 addition & 0 deletions b/‎docs/index.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/server-driven-messages.md‎
Lines changed: 2 additions & 2 deletions b/‎docs/server-driven-messages.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎docs/sub-agents.md‎
Lines changed: 6 additions & 0 deletions b/‎docs/sub-agents.md‎
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,11 @@
+---
+"agents": minor
+"@cloudflare/think": minor
+"@cloudflare/ai-chat": minor
+---
+
+Add agent tool orchestration for running Think and AIChatAgent sub-agents as
+retained, streaming tools from a parent agent. The new surface includes
+`runAgentTool`, `agentTool`, parent-side run replay and cleanup, Think and
+AIChatAgent child adapter support, and headless React/client event state
+helpers.
@@ -105,6 +105,7 @@ The agent is a Durable Object, so it needs a binding and a SQLite migration in `
 | **Persistent State**    | Syncs to all connected clients, survives restarts                               |
 | **Callable Methods**    | Type-safe RPC via the `@callable()` decorator                                   |
 | **Sub-agents**          | Parent/child DO composition via facets, nested routing, and typed parent lookup |
+| **Agent Tools**         | Run chat-capable sub-agents as tools with streaming child timelines             |
 | **Scheduling**          | One-time, recurring, and cron-based tasks                                       |
 | **WebSockets**          | Real-time bidirectional communication with lifecycle hooks                      |
 | **AI Chat**             | Message persistence, resumable streaming, server/client tool execution          |
@@ -142,7 +143,7 @@ The agent is a Durable Object, so it needs a binding and a SQLite migration in `
 The [`examples/`](examples) directory has 30+ self-contained demos. A non-exhaustive tour:
 
 - **Showcase** — [`playground/`](examples/playground) is the kitchen-sink app: state, callable methods, scheduling, chat, tools, MCP, workflows, email, voice — all in one UI
-- **Chat & assistants** — [`assistant/`](examples/assistant), [`workspace-chat/`](examples/workspace-chat), [`resumable-stream-chat/`](examples/resumable-stream-chat), [`structured-input/`](examples/structured-input), [`dynamic-tools/`](examples/dynamic-tools), [`multi-ai-chat/`](examples/multi-ai-chat)
+- **Chat & assistants** — [`assistant/`](examples/assistant), [`agents-as-tools/`](examples/agents-as-tools), [`workspace-chat/`](examples/workspace-chat), [`resumable-stream-chat/`](examples/resumable-stream-chat), [`structured-input/`](examples/structured-input), [`dynamic-tools/`](examples/dynamic-tools), [`multi-ai-chat/`](examples/multi-ai-chat)
 - **MCP** — [`mcp/`](examples/mcp), [`mcp-client/`](examples/mcp-client), [`mcp-worker/`](examples/mcp-worker), [`mcp-worker-authenticated/`](examples/mcp-worker-authenticated), [`mcp-elicitation/`](examples/mcp-elicitation), [`mcp-rpc-transport/`](examples/mcp-rpc-transport), [`webmcp/`](examples/webmcp)
 - **Code Mode & sandboxes** — [`codemode/`](examples/codemode), [`codemode-mcp/`](examples/codemode-mcp), [`codemode-mcp-openapi/`](examples/codemode-mcp-openapi), [`dynamic-workers/`](examples/dynamic-workers), [`dynamic-workers-playground/`](examples/dynamic-workers-playground), [`worker-bundler-playground/`](examples/worker-bundler-playground)
 - **Voice** — [`voice-agent/`](examples/voice-agent), [`voice-input/`](examples/voice-input), [`elevenlabs-starter/`](examples/elevenlabs-starter)
@@ -165,7 +166,7 @@ npm start
 - [`docs/`](docs) directory in this repo (synced upstream)
 - [Anthropic Patterns guide](guides/anthropic-patterns) — sequential, routing, parallel, orchestrator, evaluator
 - [Human-in-the-Loop guide](guides/human-in-the-loop) — approval workflows with pause/resume
-- [`design/`](design) — architecture and design decision records (chat API, sub-agents RFC, workspace, voice, browser tools, retries, and more)
+- [`design/`](design) — architecture and design decision records (chat API, sub-agents, agent tools, workspace, voice, browser tools, retries, and more)
 
 ## Repository Structure
 
 
@@ -82,6 +82,7 @@ Keep it concise. A few paragraphs is fine. These are records, not essays.
 | `retries.md`                            | design doc | Retry system — primitives, integration points, backoff strategy, tradeoffs            |
 | `visuals.md`                            | design doc | UI component library (Kumo), dark mode, custom patterns, routing integration          |
 | `workspace.md`                          | design doc | Workspace — hybrid SQLite+R2 filesystem, bash, symlinks, observability                |
+| `agent-tools.md`                        | design doc | Agent tools — chat sub-agent orchestration, parent registry, event replay             |
 | `sub-agent-routing.md`                  | design doc | Sub-agent routing as shipped — facets, nested URLs, registry, parent lookup, caveats  |
 | `rfc-sub-agents.md`                     | RFC        | Sub-agents — child DOs via facets, typed stubs, built into Agent (accepted)           |
 | `rfc-sub-agent-routing.md`              | RFC        | Sub-agent external addressability — nested URLs, `onBeforeSubAgent`, per-call bridge  |
 
@@ -12,6 +12,7 @@ The goal is to give contributors (and future-us) a quick way to understand _why_
 | [visuals.md](./visuals.md)                                                       | UI component library choice, Kumo usage, custom patterns                      |
 | [readonly-connections.md](./readonly-connections.md)                             | Readonly connection enforcement, storage, tradeoffs, and caveats              |
 | [workspace.md](./workspace.md)                                                   | Workspace — hybrid SQLite+R2 filesystem, bash, symlinks                       |
+| [agent-tools.md](./agent-tools.md)                                               | Agent tools — chat sub-agent orchestration, parent registry, replay           |
 | [sub-agent-routing.md](./sub-agent-routing.md)                                   | Sub-agent routing as shipped — facets, nested URLs, registry, parent lookup   |
 | [rfc-sub-agents.md](./rfc-sub-agents.md)                                         | RFC: Sub-agents — child DOs via facets, typed stubs, mixin API                |
 | [rfc-helper-sub-agent-orchestration.md](./rfc-helper-sub-agent-orchestration.md) | RFC: Agent tool orchestration — `runAgentTool`, `agentTool`, event forwarding |
 
@@ -0,0 +1,52 @@
+# Agent Tools
+
+Agent tools are the orchestration layer that lets a parent agent run a
+chat-capable sub-agent as part of a larger operation. The shipped V1 follows
+[`rfc-helper-sub-agent-orchestration.md`](./rfc-helper-sub-agent-orchestration.md).
+
+The parent owns a framework table, `cf_agent_tool_runs`, that records each
+logical run by `runId`: parent tool call id, child class, safe input preview,
+display order, status, summary, and terminal error metadata. The child remains a
+normal sub-agent facet and owns the full chat transcript plus resumable stream
+chunks. Think children use `cf_agent_tool_child_runs` to map `runId` to the
+underlying Think request and stream ids; AIChatAgent children use
+`cf_ai_chat_agent_tool_runs` to map `runId` to their `saveMessages()` request.
+
+`runAgentTool(Cls, options)` is the foundational API. It inserts the parent row
+before waking the child, starts the child adapter idempotently by `runId`,
+forwards child `UIMessageChunk` bodies to parent clients as
+`agent-tool-event` frames, records a terminal state, and retains the child facet
+for replay and drill-in. `agentTool(Cls, options)` is a small AI SDK tool
+factory layered on top for model-selected dispatch.
+
+The React surface is intentionally headless. `applyAgentToolEvent` reconstructs
+child `UIMessage.parts` from opaque chunk bodies and groups runs by parent tool
+call id; `useAgentToolEvents` subscribes to the existing parent connection and
+deduplicates replay/live races. Applications own layout, panels, and drill-in
+UI.
+
+V1 supports Think children and AIChatAgent children. Live child chunks cross
+Durable Object RPC as byte-encoded newline-delimited records; the parent decodes
+them and broadcasts `agent-tool-event` frames. Cancellation is bridged by
+parent-side cancellation callbacks rather than serializing `AbortSignal` across
+Durable Object RPC. If a parent restarts while a run is non-terminal, V1 replays
+stored chunks and marks the parent row `interrupted`; live-tail reattach is
+deferred.
+
+## Tradeoffs
+
+- Runs and facets are retained by default so refresh, drill-in, and debugging
+  work after completion. Applications must call `clearAgentToolRuns()` when
+  clearing chat history or enforcing retention.
+- The parent registry stores input previews, not raw inputs, to avoid creating a
+  second prompt store.
+- AIChatAgent agent-tool turns are headless. Server-side tools work normally,
+  but browser-provided client tools are not available unless the application
+  models the interaction as server-side state or a separate parent-mediated
+  workflow.
+
+## History
+
+- [`rfc-helper-sub-agent-orchestration.md`](./rfc-helper-sub-agent-orchestration.md)
+  — accepted V1 direction for `runAgentTool`, `agentTool`, event forwarding,
+  replay, and cleanup.
@@ -0,0 +1,184 @@
+# Agent Tools
+
+Agent tools let one chat agent dispatch another chat-capable sub-agent as part
+of its work. The child is a real sub-agent with its own Durable Object storage,
+messages, tools, resumable stream, and drill-in URL. The parent keeps a small
+run registry so clients can render the child timeline, replay it after refresh,
+and clean it up later.
+
+Agent tools support `@cloudflare/think` agents and `AIChatAgent` subclasses.
+`AIChatAgent` children run headlessly through `saveMessages()`, so they should
+use server-side tools. Browser-provided client tools are not available during an
+agent-tool turn unless you model that interaction as server-side state or a
+separate parent-mediated workflow.
+
+## Use an Agent as an AI SDK tool
+
+Use `agentTool()` when the parent model should decide when to call the helper.
+
+```ts
+import { Think } from "@cloudflare/think";
+import { agentTool } from "agents/agent-tools";
+import { z } from "zod";
+
+export class Researcher extends Think<Env> {
+  getSystemPrompt() {
+    return "Research the user's topic and end with a concise summary.";
+  }
+}
+
+export class Assistant extends Think<Env> {
+  getTools() {
+    return {
+      research: agentTool(Researcher, {
+        description: "Research one topic in depth.",
+        displayName: "Researcher",
+        inputSchema: z.object({
+          query: z.string().min(3)
+        })
+      })
+    };
+  }
+}
+```
+
+The child can also be an `AIChatAgent`:
+
+```ts
+import { AIChatAgent } from "@cloudflare/ai-chat";
+import { agentTool } from "agents/agent-tools";
+import { convertToModelMessages, stepCountIs, streamText } from "ai";
+import { z } from "zod";
+
+export class Summarizer extends AIChatAgent<Env> {
+  protected override formatAgentToolInput(input: { text: string }, request) {
+    return {
+      id: `agent-tool-${request.runId}-input`,
+      role: "user",
+      parts: [{ type: "text", text: `Summarize:\n\n${input.text}` }]
+    };
+  }
+
+  async onChatMessage() {
+    const result = streamText({
+      model: this.env.MODEL,
+      messages: await convertToModelMessages(this.messages)
+    });
+    return result.toUIMessageStreamResponse();
+  }
+}
+
+export class Assistant extends AIChatAgent<Env> {
+  async onChatMessage() {
+    const result = streamText({
+      model: this.env.MODEL,
+      messages: await convertToModelMessages(this.messages),
+      tools: {
+        summarize: agentTool(Summarizer, {
+          description: "Summarize long text in a separate retained agent.",
+          inputSchema: z.object({ text: z.string() })
+        })
+      },
+      stopWhen: stepCountIs(5)
+    });
+    return result.toUIMessageStreamResponse();
+  }
+}
+```
+
+The generated tool calls `this.runAgentTool(ChildAgent, ...)`, streams
+`agent-tool-event` frames on the parent WebSocket, and returns the child
+summary to the parent model. If the run fails, aborts, or is interrupted, the
+tool returns a structured failure instead of an empty success value.
+
+## Run an Agent tool imperatively
+
+Use `runAgentTool()` for deterministic workflows, scheduled work, HTTP
+handlers, or fan-out code.
+
+```ts
+const [a, b] = await Promise.allSettled([
+  this.runAgentTool(Researcher, {
+    input: { query: "HTTP/3" },
+    parentToolCallId: toolCallId,
+    displayOrder: 0
+  }),
+  this.runAgentTool(Researcher, {
+    input: { query: "gRPC" },
+    parentToolCallId: toolCallId,
+    displayOrder: 1
+  })
+]);
+```
+
+`runAgentTool()` is idempotent by `runId`. Passing the same `runId` never starts
+a duplicate child turn. Completed, failed, aborted, and interrupted runs are
+retained until you explicitly clear them.
+
+## Render child timelines in React
+
+`useAgentToolEvents()` is a headless hook. It subscribes to the existing parent
+connection, deduplicates replay/live races, applies child `UIMessageChunk`
+bodies to message parts, and groups sibling runs by parent tool call id.
+
+```tsx
+import { useAgent, useAgentToolEvents } from "agents/react";
+import { useAgentChat } from "@cloudflare/ai-chat/react";
+
+const agent = useAgent({ agent: "Assistant", name: userId });
+const { messages } = useAgentChat({ agent });
+const agentTools = useAgentToolEvents({ agent });
+
+for (const message of messages) {
+  for (const part of message.parts) {
+    if (part.type === "tool-call") {
+      const runs = agentTools.getRunsForToolCall(part.toolCallId);
+      // Render the child runs beside this tool call.
+    }
+  }
+}
+```
+
+Imperative runs without a parent tool call are available as
+`agentTools.unboundRuns`.
+
+## Drill in and gate access
+
+Agent tools are normal sub-agents. Connect to a retained child through the
+parent route:
+
+```ts
+useAgent({
+  agent: "Assistant",
+  name: userId,
+  sub: [{ agent: "Researcher", name: runId }]
+});
+```
+
+Gate external access with the parent registry so guessed run ids cannot spawn
+fresh child facets:
+
+```ts
+override async onBeforeSubAgent(_request, child) {
+  if (!this.hasAgentToolRun(child.className, child.name)) {
+    return new Response("Not found", { status: 404 });
+  }
+}
+```
+
+## Clear retained runs
+
+Runs and child facets are retained by default for refresh, drill-in, and later
+inspection. Delete them explicitly when clearing chat history or applying your
+own retention policy:
+
+```ts
+await this.clearAgentToolRuns();
+await this.clearAgentToolRuns({
+  status: ["completed", "error", "aborted", "interrupted"]
+});
+await this.clearAgentToolRuns({ olderThan: Date.now() - 7 * 24 * 60 * 60_000 });
+```
+
+If a retained run is still `starting` or `running`, cleanup cancels the child
+before deleting its facet.
@@ -355,7 +355,9 @@ if (result.status === "aborted") {
 
 The same `options.signal` is accepted by `continueLastTurn()`. See
 [`cloudflare/agents#1406`](https://github.com/cloudflare/agents/issues/1406)
-for the helper-as-sub-agent pattern that motivated the API.
+for the agent-tool orchestration pattern that motivated the API, and
+[Agent Tools](./agent-tools.md) for using `AIChatAgent` and Think subclasses as
+retained, streaming tools.
 
 ### `onChatResponse`
 
 
@@ -43,6 +43,7 @@
 - TODO: [AI SDK Integration](./ai-sdk.md) - Using Vercel AI SDK with agents
 - TODO: [TanStack Integration](./tanstack.md) - Using TanStack AI with agents
 - [Chat Agents](./chat-agents.md) - `AIChatAgent` class and `useAgentChat` React hook
+- [Agent Tools](./agent-tools.md) - Run chat-capable sub-agents as tools with streaming child timelines
 - [Server-Driven Messages](./server-driven-messages.md) - Autonomous agent workflows: scheduled follow-ups, queue processing, webhooks, chained reasoning
 - TODO: [Using AI Models](./using-ai-models.md) - OpenAI, Anthropic, Workers AI, and other providers
 - TODO: [RAG (Retrieval Augmented Generation)](./rag.md) - Vector search with Vectorize
 
@@ -416,10 +416,10 @@ Pre-aborted signals short-circuit before any model work runs.
 
 ### Limitations
 
-- **Signals cannot cross Durable Object boundaries.** `AbortSignal` is not an RPC-serializable type. Construct the controller inside the DO that calls `saveMessages`. To bridge a parent's abort intent into a child DO, return a `ReadableStream` from the child and let the parent cancel it — workerd propagates the cancel back to the source's `cancel` callback. See `examples/agents-as-tools` for the canonical helper-as-sub-agent pattern.
+- **Signals cannot cross Durable Object boundaries.** `AbortSignal` is not an RPC-serializable type. Construct the controller inside the DO that calls `saveMessages`. For Think child-agent orchestration, use [Agent Tools](./agent-tools.md); `runAgentTool()` bridges parent aborts into the child run. For lower-level custom RPC, return a `ReadableStream` from the child and let the parent cancel it — workerd propagates the cancel back to the source's `cancel` callback.
 - **Hibernation drops the listener.** The signal lives in memory. If the DO hibernates mid-turn and `chatRecovery` is enabled, the recovered turn calls `continueLastTurn()` internally without the original signal — an abort fired after restart has no effect on the recovered turn. This is true for top-level agents and sub-agents; sub-agent recovery still works, but the original caller's in-memory signal is gone. Override `onChatRecovery` (Think) or set `chatRecovery = false` for callers that need stronger guarantees.
 
-This is the integration point for helper-as-sub-agent patterns where the parent's AI SDK abort signal needs to propagate into a child DO's `saveMessages` call. See [`cloudflare/agents#1406`](https://github.com/cloudflare/agents/issues/1406) for the original use case.
+This is the integration point for agent-tool orchestration where the parent's AI SDK abort signal needs to propagate into a child DO's `saveMessages` call. See [`cloudflare/agents#1406`](https://github.com/cloudflare/agents/issues/1406) for the original use case.
 
 ## Important notes
 
 
@@ -4,6 +4,11 @@ Sub-agents are child Durable Objects colocated under a parent agent. Each sub-ag
 
 Use sub-agents when a single user or entity owns an open-ended set of long-lived agents — chats, documents, sessions, shards, projects — and you want each one to run in parallel with its own state while keeping one parent agent as the coordinator.
 
+If you want a parent chat agent to dispatch another chat-capable agent during a
+single turn and render that child's progress inline, use [Agent Tools](./agent-tools.md).
+Agent tools are built on sub-agents, but add a parent-side run registry,
+streaming `agent-tool-event` frames, replay, cancellation, and cleanup.
+
 ## Overview
 
 ```typescript
@@ -355,6 +360,7 @@ See [`examples/multi-ai-chat`](https://github.com/cloudflare/agents/tree/main/ex
 ## Related
 
 - [Think sub-agents and programmatic turns](./think/sub-agents.md) — Think's `chat()` RPC method for streaming from a parent to a Think-based child
+- [Agent Tools](./agent-tools.md) — run Think or `AIChatAgent` sub-agents as tools with inline streaming child timelines
 - [Long-running agents](./long-running-agents.md) — how sub-agents fit alongside `schedule`, `runFiber`, and workflows
 - [Callable methods](./callable-methods.md) — `@callable` methods work unchanged on sub-agents
 - [Scheduling](./scheduling.md) — scheduling primitives for top-level agents and sub-agents