Skip to content

feat: make overnight-dent testable — --test-task, merge tracking, --experiment (kaizen #322)#255

Merged
aviadr1 merged 2 commits intomainfrom
case/260321-1230-k322-overnight-testable
Mar 21, 2026
Merged

feat: make overnight-dent testable — --test-task, merge tracking, --experiment (kaizen #322)#255
aviadr1 merged 2 commits intomainfrom
case/260321-1230-k322-overnight-testable

Conversation

@aviadr1
Copy link
Copy Markdown

@aviadr1 aviadr1 commented Mar 21, 2026

Summary

  • --test-task flag: synthetic fast task that creates a trivial PR instead of running /make-a-dent. Completes in <2 min for pipeline iteration.
  • --experiment flag: extra diagnostics — main HEAD before/after pull, per-PR merge status tracking, auto-merge queue visibility.
  • checkMergeStatus(): new exported function that checks PR state via gh CLI, returns merged/auto_queued/open/closed/unknown.
  • buildPrompt() now exported and supports test_task mode.
  • BatchState extended with test_task and experiment optional fields.
  • 13 new tests (42 total, all passing).

Context

This is Step 0 of the overnight-dent merge pipeline investigation (PRD qwibitai#321). The overnight-dent batch runner creates PRs but doesn't merge them. Before fixing the pipeline, we need fast iteration tools to test hypotheses about why merges fail.

Test plan

  • 42/42 vitest tests pass (29 existing + 13 new)
  • 91/91 scripts/ tests pass (no regressions)
  • TypeScript build clean
  • buildPrompt with test_task=true produces synthetic prompt without /make-a-dent
  • buildPrompt with test_task=false still uses /make-a-dent
  • checkMergeStatus returns correct states for merged/queued/open/error cases
  • --test-task and --experiment flags parsed in trampoline, passed to state.json
  • Manual: ./scripts/overnight-dent.sh --test-task --max-runs 1 "probe" (deferred to Step 1)
scripts/overnight-dent.sh: bash script — tested indirectly via overnight-dent-run.test.ts (buildPrompt tests verify state.test_task handling)

Fixes Garsson-io/kaizen#322

🤖 Generated with Claude Code

aviadr1 and others added 2 commits March 21, 2026 12:12
…xperiment (kaizen qwibitai#322)

- --test-task flag: synthetic fast task that creates a trivial PR instead of
  running /make-a-dent. Completes in <2 min for pipeline iteration.
- --experiment flag: extra diagnostics — main HEAD before/after pull, per-PR
  merge status tracking, auto-merge queue visibility.
- checkMergeStatus(): new exported function that checks PR state via gh CLI,
  returns merged/auto_queued/open/closed/unknown.
- buildPrompt() now exported and supports test_task mode.
- BatchState extended with test_task and experiment optional fields.
- 13 new tests (42 total, all passing).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@aviadr1 aviadr1 merged commit c300ce3 into main Mar 21, 2026
3 checks passed
@aviadr1 aviadr1 deleted the case/260321-1230-k322-overnight-testable branch March 21, 2026 10:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[PRD #321] Step 0: Make overnight-dent testable — synthetic tasks + merge tracking

1 participant