Fix/audit findings by Wmaxlees · Pull Request #1314 · qwibitai/nanoclaw

Wmaxlees · 2026-03-21T19:31:16Z

Type of Change

Feature skill - adds a channel or integration (source code changes + SKILL.md)
Utility skill - adds a standalone tool (code files in .claude/skills/<name>/, no source changes)
Operational/container skill - adds a workflow or agent skill (SKILL.md only, no source changes)
Fix - bug fix or security fix to source code
Simplification - reduces or simplifies source code
Documentation - docs, README, or CONTRIBUTING changes only

Description

For Skills

SKILL.md contains instructions, not inline code (code goes in separate files)
SKILL.md is under 500 lines
I tested this skill on a fresh clone

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Adds reaction receiving, sending, storage, and search. Includes StatusTracker for message lifecycle signaling and react_to_message MCP tool for container agents. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Syncs with upstream main (on schedule, dispatch, or push), then merges main into all skill/* branches with build+test validation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…icts with core

….yml)

# Conflicts: # src/channels/whatsapp.test.ts # src/channels/whatsapp.ts

Add an evolving behavioral skills system where agents browse and apply relevant guidelines per-task, with automated evaluation and evolution. Phase 1: DB schema (6 tables), skill deployer, container mount, report_skills_used MCP tool, IPC handler, task run recording. Phase 2: Evaluator loop using direct Anthropic API (Sonnet) with 30-minute deadline before automated scoring. Phase 3: Evolution agent with cold start, candidate lifecycle, drift validation, auto-rollback, and version management. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Resolved conflicts in src/db.ts (kept both behavioral skills and reactions tables), src/index.ts (kept both runStartTime and firstOutputSeen, both onTasksChanged and statusHeartbeat), src/ipc.ts (kept both skills-used handling and status heartbeat, preserved update_task case). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Groups consecutive agent turns (up to 6) into rollouts that are evaluated as a unit. The evaluator now scores the full conversation window including tool call inputs/outputs and produces a reasoning field passed to the evolution agent. Key changes: - config.ts: ROLLOUT_SIZE (6) and ROLLOUT_INACTIVITY_MS (30min) - types.ts: ToolCall, Rollout interfaces; updated SkillTaskRun/SkillEvaluation - db.ts: rollouts table, rollout_id/tool_calls on runs, evaluator_reasoning on evaluations, rollout CRUD + getLowScoringRollouts - session-reader.ts: extract tool calls from Claude Code JSONL transcripts - rollout-manager.ts: getOrCreateRollout / closeStaleRollouts - evaluator.ts: evaluate closed rollouts with tool_selection dimension - evolution.ts: consume rollout context with per-turn tool calls + reasoning - index.ts: accumulate response text, assign rollout_id, extract tool calls - evaluator-prompt.md / evolution-prompt.md: updated for multi-turn format Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

feat: multi-turn rollout evaluation windows

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

- Add worker_tasks and wall DB tables - WorkerTask and WallEntry types - IPC handlers: create_worker_task, post_wall (with depth guard) - Worker manager: spawns containers for pending tasks, collects results, propagates completion up parent chain - Root task completion triggers orchestrator synthesis + user notification - ContainerInput extended with isWorkerTask / workerTaskId / workerDepth - container/worker-guide.md: agent instructions for delegating and acting as worker Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Host derives chatJid from registered groups by folder name when the agent omits it. Simplifies delegation — agents no longer need to know or pass their own chat JID. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Main agent enqueues all requests as worker tasks. Only exception: answering status questions about in-progress work. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Critical: - SQL operator precedence bug in getSkillVersionCount - JSON.parse crash on container_config in group lookup - Evolution skill modifications wrapped in transactions (atomic) - checkParentCompletion converted to iterative (no stack overflow) - Worker spawn failure now always marks task as failed High: - Silent migration failures now log warnings - getRootTaskId has cycle detection + iteration cap - Dynamic SQL UPDATE field names whitelisted - 30s timeout + 429 handling on evaluator/evolution API calls - LLM response structure validated before use - User content wrapped in code fences (prompt injection) - Streaming parse buffer capped at 10MB - Worker synthesis callback has full error handling - parentDepth read from DB not IPC message - Wall entries verify group ownership Medium: - foreign_keys = ON enabled in SQLite - MAX() on TEXT timestamps replaced with CASE WHEN - Evolution candidate/rollback ops in transactions - Missed selection logging uses skill ID not name - IPC write failures logged (group-queue) - tool_calls/dimensions parse errors logged Low: - Index on skill_task_runs.created_at - Unused _ipcDir param removed - Worker result capped at 500KB Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Workers now participate in the full skill feedback loop: - Each root task tree gets a worker rollout (rollout_type='worker', id=worker-{rootTaskId}), created idempotently when the first worker task is spawned - Each worker task completion records a skill_task_runs row linked to the rollout (worker_task_id FK, root_outcome_score propagated later) - After synthesis is sent, the synthesis is scored via claude-haiku and the score is propagated back to all contributing worker runs as root_outcome_score; the rollout is then closed - Evaluator processes closed worker rollouts separately using a worker-specific rubric (task_completion, accuracy, efficiency, decomposition_quality, result_quality) via claude-haiku - Evolution context now includes low-scoring worker task trees alongside conversation rollouts so skill evolution can address worker behavior Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

gavrielc and others added 30 commits March 8, 2026 22:59

skill/whatsapp: WhatsApp channel integration

8698fc8

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge remote-tracking branch 'origin/main' into skill/whatsapp

94b1ac6

skill/reactions: WhatsApp emoji reaction support

a23e372

Adds reaction receiving, sending, storage, and search. Includes StatusTracker for message lifecycle signaling and react_to_message MCP tool for container agents. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Merge remote-tracking branch 'origin/main' into skill/reactions

3d44b7a

Merge remote-tracking branch 'origin/main' into skill/whatsapp

ad5f24a

Merge remote-tracking branch 'origin/main' into skill/reactions

72aca59

Merge remote-tracking branch 'origin/main' into skill/whatsapp

2ef93b4

ci: add upstream sync and merge-forward workflow

604320e

Syncs with upstream main (on schedule, dispatch, or push), then merges main into all skill/* branches with build+test validation. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ci: rename sync workflow to fork-sync-skills.yml to avoid merge confl…

638ac3f

…icts with core

ci: remove old merge-forward-skills.yml (replaced by fork-sync-skills…

56f2373

….yml)

Merge remote-tracking branch 'whatsapp/main' into rebuild-fork

7579fdb

Merge commit '72aca59' into rebuild-fork

1b03799

# Conflicts: # src/channels/whatsapp.test.ts # src/channels/whatsapp.ts

fix: update sync condition to check repo name, not owner

bbdc1b5

fix: re-commit fork-sync workflow (clean encoding)

8b727e3

Merge branch 'main' into skill/reactions

b6c3600

fix: repair escaped newlines in fork-sync workflow

f770e36

Merge branch 'main' into skill/reactions

8cbee41

fix: use GitHub App token for fork-sync (workflows permission needed)

3edaddb

Merge branch 'main' into skill/reactions

fd222f1

Merge remote-tracking branch 'upstream/main'

d41d1bf

Merge branch 'main' into skill/reactions

d5e0497

fix: re-fetch before skill branch merges to avoid stale refs

afac3ff

Merge branch 'main' into skill/reactions

5489e3c

Merge remote-tracking branch 'upstream/main'

a30471d

Merge branch 'main' into skill/reactions

48b26a8

fix: add concurrency group to prevent parallel fork-sync races

0c5f7bd

Merge branch 'main' into skill/reactions

a8c2a25

Merge remote-tracking branch 'upstream/main'

3c2f30f

Merge branch 'main' into skill/reactions

5b5fbe1

Merge remote-tracking branch 'upstream/main'

4cab695

github-actions bot and others added 26 commits March 11, 2026 10:26

Merge remote-tracking branch 'upstream/main'

61ccfba

docs: update token count to 43.1k tokens · 22% of context window

aa81de7

Merge branch 'main' into skill/reactions

7be0a27

Merge remote-tracking branch 'upstream/main'

d955147

Merge branch 'main' into skill/reactions

0b099e7

Merge remote-tracking branch 'upstream/main'

5672548

Merge branch 'main' into skill/reactions

544e9cd

Merge remote-tracking branch 'upstream/main'

b6fb897

chore: bump version to 1.2.14

2a2ab2a

Merge branch 'main' into skill/reactions

9474448

Merge remote-tracking branch 'whatsapp/main'

b265fe4

style: apply prettier formatting

d89160d

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

style: apply prettier formatting after reactions merge

937cde7

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Fixed some issues

df39055

style: run prettier on multi-turn rollout files

2478ac9

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Merge pull request #2 from Wmaxlees/feat/multi-turn-rollouts

31a1a9b

feat: multi-turn rollout evaluation windows

chore: add debug SQL script for skills and evaluations

95395a5

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix: make chatJid optional in create_worker_task IPC

f64c230

Host derives chatJid from registered groups by folder name when the agent omits it. Simplifies delegation — agents no longer need to know or pass their own chat JID. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

feat: orchestrator always delegates — never executes tasks directly

7afbf1d

Main agent enqueues all requests as worker tasks. Only exception: answering status questions about in-progress work. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

fix: orchestrator delegates tasks but handles conversation directly

537742d

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Wmaxlees requested review from gabi-simons and gavrielc as code owners March 21, 2026 19:31

Wmaxlees closed this Mar 21, 2026

github-actions bot mentioned this pull request Mar 22, 2026

🦞 Bản tin hàng ngày hệ sinh thái OpenClaw 2026-03-22 compasify/agents-radar#71

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/audit findings#1314

Fix/audit findings#1314
Wmaxlees wants to merge 57 commits intoqwibitai:mainfrom
Wmaxlees:fix/audit-findings

Wmaxlees commented Mar 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Wmaxlees commented Mar 21, 2026

Type of Change

Description

For Skills

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants